How can I test whether classifiers' outputs are significantly different?
I have a relatively small data set which I'm trying to classify. My training set consists of 24 items from two different groups, 12 items from each group. Each item has two properties. I have 12 new items which I would like to classify based on this training set. (I'm not sure whether these numbers matter for my question.)
I have tested 5 different classifiers. Their classification results for the 12 new items are as follows (one line per classifier):
- 1, 1, 0, 0, 0, 1, 1, 0, 0, 1, 0, 0
- 0, 0, 0, 1, 0, 0, 1, 0, 0, 0, 1, 1
- 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1
- 0, 0, 1, 1, 0, 1, 1, 1, 0, 0, 0, 1
- 0, 0, 0, 1, 0, 0, 1, 0, 0, 0, 0, 1
My question is: how can I test whether these results are significantly different from each other?
Thanks!
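To make the question concrete, here is a minimal Python sketch of the kind of comparison being asked about. It tabulates how often each pair of classifiers disagrees on the 12 new items and computes Cochran's Q statistic, which is one candidate test for several paired binary outcomes and not necessarily the right choice here. The function names `pairwise_disagreements` and `cochran_q` are hypothetical helpers written for illustration, and treating each line above as one classifier's output is an assumption.

```python
from itertools import combinations

# Predictions copied from the list above.
# Assumption: one row per classifier, one column per new item.
predictions = [
    [1, 1, 0, 0, 0, 1, 1, 0, 0, 1, 0, 0],
    [0, 0, 0, 1, 0, 0, 1, 0, 0, 0, 1, 1],
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1],
    [0, 0, 1, 1, 0, 1, 1, 1, 0, 0, 0, 1],
    [0, 0, 0, 1, 0, 0, 1, 0, 0, 0, 0, 1],
]


def pairwise_disagreements(preds):
    """Count, for every pair of classifiers, the items they label differently."""
    return {
        (a, b): sum(x != y for x, y in zip(preds[a], preds[b]))
        for a, b in combinations(range(len(preds)), 2)
    }


def cochran_q(preds):
    """Cochran's Q statistic for k paired binary outcomes (here, k classifiers).

    Under the null hypothesis that all classifiers assign class 1 at the same
    rate, Q is approximately chi-square distributed with k - 1 degrees of freedom.
    The denominator is zero when no item is labeled differently by any two
    classifiers, so that case is not handled in this sketch.
    """
    k = len(preds)
    col_totals = [sum(row) for row in preds]          # class-1 count per classifier
    row_totals = [sum(item) for item in zip(*preds)]  # class-1 count per item
    n = sum(col_totals)
    numerator = (k - 1) * (k * sum(c * c for c in col_totals) - n * n)
    denominator = k * n - sum(r * r for r in row_totals)
    return numerator / denominator


disagreements = pairwise_disagreements(predictions)
q_statistic = cochran_q(predictions)

for (a, b), count in sorted(disagreements.items()):
    print(f"classifiers {a + 1} and {b + 1} disagree on {count} of 12 items")
print(f"Cochran's Q = {q_statistic:.3f} with {len(predictions) - 1} degrees of freedom")
```

With only 12 items the chi-square approximation behind Q is likely to be rough, so an exact or permutation-based version may be more appropriate. Note also that this compares how often each classifier assigns class 1, not which classifier is more accurate, since the true labels of the new items are not given.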