I have an model that I want to test with my existing data to calculate precision, recall etc. The data is actually unbalanced dataset: Class A 70%/Class B 30%. I created a data set by undersampling class A, so that I get an equal distribution of both classes: Class A 50%/Class B 50%. When calculating the metrics for evaluation, do I have to weight the results? So would a false positive have a higher weight, due to the unbalance in the actual population?
Asked
Active
Viewed 80 times
0
-
In short, your metrics are not appropriate for your data, change the metrics not the data. – user2974951 Jan 25 '23 at 07:24
-
Please take some time to read what's already written about these topics on Cross Validated. Good places to start: When is unbalanced data really a problem in Machine Learning? and Why is accuracy not the best measure for assessing classification models?. – dipetkov Jan 25 '23 at 08:05