For example,
\begin{array} {|r|r|}\hline & \text{Sample}\enspace X & \text{Sample}\enspace Y \\ \hline mean & 14 & 20 \\ \hline median & 5 & 5 \\ \hline \end{array}
How should we approach in order to test if $mean_Y > mean_X$ is statistical significant?
My thought is: It seems that the variables in the 2 groups don't follow a normal distribution (mean != median). But if the sample size is large enough, we can use the two-sample t-test (parametric). However, if our sample size is too small, we should use Wilcoxon-Mann-Whitney (or rank sum) test (non-parametric).
Is that a good approach? Not sure if I'm missing anything here.
Thank you!
1/ From your answer here: https://stats.stackexchange.com/questions/121852/how-to-choose-between-t-test-or-non-parametric-test-e-g-wilcoxon-in-small-sampl, it seems that we could use any of these 3 tests (t-test, Mann-Whitney, Permutation test) depending on the sample size? May I know how did you define if it's "n medium-large/moderate small/very small"?
2/ When the 2 samples have the same medians but different means, is it correct to say that the distribution isn't normal? If not, what else I should also consider?
– KatieN Sep 16 '20 at 00:51