If we have a model which outputs class probabilities for $K$ classes, e.g. a NN with softmax layer, how can we return an aggregate "confidence metric"?
Some intuitive ideas would be the probability of the highest class or the difference in probability between the top 2 classes.
Is there a standard choice here? If not, what are the options.