I am trying to study the influence of a Treatment Model (denoted by 1) vs a control group (denote by 0). The goal of my statistical analysis is to be able to say with a certain confidence that the Treatment model has an influence in sales. In other words, I want to test for the null hypothesis that the Treatment does not have an effect in sales - the difference observed in Revenue from one method to the other is due to random causes, not the Treatment model. I have no a priori-knowledge about the distribution and also have little data to work with.
Hence, I have decided to perform a permutation test. My question is exactly at this point: If I use the median difference (in Revenue) between both model as my test statistic I get a p-value of about 0.15 - which is considerably high but the data is also very noisy in the industry under analysis. However, if I use the mean difference (in Revenue) between both model as my test statistic I get a p-value of about 0.05. This is a considerable difference and have huge impact in the conclusions that I want to take from this experiment. So my question are
- What is the best test statistic to use in this case?
- Additionally, given the fact that the median and mean have such different results this means that the data is skewed. What to do in such a case?
Please see an example of my sample data below.
Location MODEL Revenue
A 0 -200.73
A 1 -300.42
A 0 153.02
B 0 40.23
C 0 300.07
B 0 -599.10
C 0 323.47
D 1 14.37
Many thanks in advance.