I have two sets (small) of features that I need to compare against a class, and decide which set is better at classifying. The features are generated using ranking methods, and only the top 2% of the features are compared, to determine which method is better differentiating amongst the top ranking features. Both the small datasets achieve a low AUC: 0.2 and 0.15 to be exact?
Could I make a conclusion that Method 1 is better than Method 2 in differentiating amongst the top 2% basis on the scores if both AUC values are below 0.5?