I am using re-sampling methods to address the imbalance between classes for my binary classification problem.
I am not sure how to measure the performance of my model on the test set:
- should I re-sample the test set to have an idea how my model is performing on the test set compared to the training set?
- or should I measure the performance on the original test set and, to compare it with the training set, also measure the performance on the original training set?
I would think that my second bullet point is the best methodological approach, but my first bullet point still makes some sense IMO (from a performance point of view, not a business point of view).
– Tanguy Aug 28 '17 at 08:43