Evaluation
- multi-class logloss (cross-entropy loss or negative loglikelihood)
$$logloss = -\frac{1}{N} \sum_{i=1}^{N}\sum_{j=1}^{M}y_{ij} log(p_{ij})$$
- N is size of test set (20,000)
- M is number of class labels (121)
- yij is 1 if observation i is in class j and 0 otherwise.
- pij is our predicted probability that i belongs to j