I have a dataset with a very skewed class distribution (roughly 90:10 between class 0 and class 1). I am considering undersampling to reduce the size of the majority class. What is the right order in which to undersample the majority class and run cross-validation?
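To make the ordering concrete, here is a minimal sketch of one common arrangement: split into folds first, then undersample only the training portion of each fold, so every held-out fold keeps the original class ratio. Everything in it is an assumption for illustration: the labels `y` are made up to match the 90:10 split, and `stratified_folds` and `undersample` are hypothetical helpers written with the standard library only, not functions from any package.

```python
import random


def stratified_folds(y, k, rng):
    """Deal the indices of each class round-robin into k folds."""
    folds = [[] for _ in range(k)]
    for label in sorted(set(y)):
        members = [i for i, v in enumerate(y) if v == label]
        rng.shuffle(members)
        for pos, i in enumerate(members):
            folds[pos % k].append(i)
    return folds


def undersample(indices, y, rng):
    """Randomly drop rows until every class matches the smallest class."""
    by_class = {}
    for i in indices:
        by_class.setdefault(y[i], []).append(i)
    n_min = min(len(members) for members in by_class.values())
    kept = []
    for members in by_class.values():
        kept.extend(rng.sample(members, n_min))
    return kept


rng = random.Random(0)          # fixed seed so the sketch is repeatable
y = [0] * 90 + [1] * 10         # hypothetical labels, 90:10 as in the question

folds = stratified_folds(y, 5, rng)
splits = []
for f, test_idx in enumerate(folds):
    # Training rows are all rows outside the current held-out fold.
    train_idx = [i for g, fold in enumerate(folds) if g != f for i in fold]
    # Undersample AFTER splitting, and only on the training rows.
    balanced_train = undersample(train_idx, y, rng)
    splits.append((balanced_train, test_idx))
    # A model would be fit on balanced_train and scored on the untouched test_idx.
```

With these made-up labels each training set ends up with 8 rows per class, while each held-out fold still has 18 rows of class 0 and 2 of class 1. Undersampling the whole dataset before splitting would instead evaluate on folds that no longer reflect the real class ratio.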
https://stats.stackexchange.com/questions/357466/are-unbalanced-datasets-problematic-and-how-does-oversampling-purport-to-he https://www.fharrell.com/post/class-damage/ https://www.fharrell.com/post/classification/ https://stats.stackexchange.com/a/359936/247274 https://stats.stackexchange.com/questions/464636/proper-scoring-rule-when-there-is-a-decision-to-make-e-g-spam-vs-ham-email https://twitter.com/f2harrell/status/1062424969366462473?lang=en
– Dave Mar 16 '21 at 09:30