If got data (value) of two types (cond true or false):
> str(test.df)
'data.frame': 3208 obs. of 2 variables:
$ cond : Factor w/ 2 levels "FALSE","TRUE": 1 1 1 1 1 1 1 1 1 1 ...
$ value: num 31 25 21 29 18 41 15 7 33 6 ...
The mean and the variance (and also the number of cases) are different:
> stats <- ddply(test.df, .(cond), summarize, mean=mean(value), var=var(value), n=length(value))
> stats
cond mean var n
1 FALSE 17.33918 141.3199 3137
2 TRUE 25.91549 177.4499 71
So different variances means I can't use wilcoxon-test, right?
If I plot the densities I see the following:

The Shapiro-test says they are not normal-distributed.
> shapiro.test(test.df[test.df$cond==TRUE, 'value'])
Shapiro-Wilk normality test
data: test.df[test.df$cond == TRUE, "value"]
W = 0.9583, p-value = 0.01901
> shapiro.test(test.df[test.df$cond==FALSE, 'value'])
Shapiro-Wilk normality test
data: test.df[test.df$cond == FALSE, "value"]
W = 0.9348, p-value < 2.2e-16
No normal distribtion means I can't use Welch-test, right?
So how can I test if the difference of the means is significant?
Also possibly related: http://stats.stackexchange.com/questions/88457/hypothesis-testing-wilcoxon-test-bootstrapping-or-something-else
– jona May 03 '14 at 13:28