Questions tagged [threshold]

Used (1) for discrete classification (if an instance's predicted probability exceeds a threshold, classify as TRUE, otherwise FALSE), or (2) for discretizing/binning continuous data. If you are tempted to use this tag, PLEASE read the tag wiki!

Either usage of thresholds is usually an error for our purposes.

Using thresholds for discrete classification

The statistical part of modeling should extend to outputting predicted class membership probabilities. Deciding whether to treat a particular case as a member of a certain class is based on this probability, but must also consider the costs of misclassification. More information at Classification probability threshold.

Predicted probabilities and thresholds are often used to calculate accuracy. Related to the point above, accuracy is not a good measure for assessing classification models.

Using thresholds to discretize/bin continuous data

Discretizing or binning continuous data typically throws away a lot of information and introduces step changes that are rarely found in actual data generating processes. It is usually better to model nonlinearities using . For more information (despite the title), see What is the benefit of breaking up a continuous predictor variable? (Answer: none.)

263 questions
23
votes
1 answer

What is F1 Optimal Threshold? How to calculate it?

I've used h2o.glm() function in R which gives a contingency table in the result along with other statistics. The contingency table is headed "Cross Tab based on F1 Optimal Threshold" Wikipedia defines F1 Score or F Score as the harmonic mean of…
3
votes
1 answer

Find the ideal threshold with the test data?

I'm working with unbalanced data (2% of the class yes and 98% of the class no). Regardless of the evaluation metric chosen in the training, I have obtained low sensitivity and high specificity. For this reason I am working with the threshold…
0
votes
1 answer

Classification of data with statistics

I have rainfall data series for a particular city. With the help of statistics i would like to classify the data of rainfall into light rain , heavy and very heavy. Is it possible to know the threshold value of each classification with help of…
Deepa
  • 1