Hyperparamter: Grid Serach from scratch

author:

date: 30. April 2021

download: https://github.com/StatNLP/empirical_methods


TASK: Predict SOFA Score for liver based on training data without circular features.

LEARNER: The system consist of two components. The first component is a Feed Forward Neural Network which is trained to output a real valued score (called raw prediction) based on 44 physiological status variables. This network is trained with a L2-loss function. The second component is an ordinal regression model which is used to derive threshold values to map the raw score to 0, 1, 2, 3 or 4. The output of this mapping i called the predicted sofa score and forms the basis of all further evaluations.

RESEARCH QUESTION: Determine a quantification of hyperparameter influence on summative evaluation metrics like corpus BLEU, Accuracy etc.

PROCEDURE: We identified a set of hyperparameters we would like to study and defined a range of relevant values for each of them. We then choose 3 to 6 values that reasonable cover this range for each of the chosen hyperparameters and trained the learner under each possible combination.

The chosen hyperparameter an values are:

We than evaluate the trained models on a test set (see circularity analysis for details) and calculated the accuracy for each of them. The results are briefly summarized by:

To get a first impression let us plot univariate summarizations (mean and sd) for each hyperparameter:

Now we calculate a random effect model to decompose the variance:

And summarize the results: