As far as i know theoretically our model tend to be drifting/shifting as time goes on and need to be retrained. i wonder if its acceptable that instead of retraining the classification model, we keep everything of the model as it was and only adjust the probability threshold based on latest/newest data?
Asked
Active
Viewed 46 times
1 Answers
1
Retraining the model accounts for changes in the relationship between features and outcomes, or probabilities of outcomes.
Changing the threshold(s) accounts for changes in the cost of decisions relative to outcomes: Reduce Classification Probability Threshold
The two are conceptually distinct. Your data generating process may change without changes in the cost structure: retrain your model. Or your costs may change without changes in the DGP: adapt your decision process. Or both: do both.
Stephan Kolassa
- 123,354
If I may, I'd like to validate my understanding. Assume a churn model with the expected cost of decisions to maintain the churn rate below 10% quarterly. After some time, the model consistently fails to achieve the unchanged expected cost result.
In this case, retraining is wiser than just adjusting the threshold. Am I on the right track?
– raffo Mar 06 '24 at 18:32