I need to compare means of two data sets that binary. For example:
a = [1,1,0,0,0,0,0,0,0,1]
b = [1,0,1,1,1,0,0,1,1,0]
All I need to know is whether the means are statistically significantly different between the two datasets, in other words the order in which 1 are arranged does not matter. And I do know that all values are either 0 or 1. Also in my case sizes of and be are fairly large, greater than 10,000 and number of 1 is about 10 to 100.
What is the best test to use in this case?
I know that I cannot use t-test because my data is not normally distributed.