
I have a two-way repeated-measures ANOVA, and I want to check for normality; my plot is shown below. The model is:

lmerabsolute <- aov(Proportionofundershoots ~ Target * Experiment + (ID / (Target * Experiment)), data = overallundershootproportion)

[Q-Q plot of the model residuals]

However, my shapiro test shows this:

shapiro.test(resid(lmerabsolute))

    Shapiro-Wilk normality test

data:  resid(lmerabsolute)
W = 0.98896, p-value = 0.001789

What do I do? I would be so grateful for some advice!

When I arcsine transform the data, my new shapiro test result looks like this:

Shapiro-Wilk normality test

data:  resid(lmerabsolute)
W = 0.98975, p-value = 0.003128

Here is the arcsine transformed normality plot:

[Q-Q plot of the residuals after arcsine transformation]

So a slight improvement! But I have not managed to get a larger (less significant) p-value than this!
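For reference, the "arcsine transformation" for proportions is usually the arcsine square-root transform. A minimal sketch with made-up values (the question's actual data are not shown):

```r
# Arcsine square-root transform for proportions bounded in [0, 1].
p <- c(0.10, 0.25, 0.50, 0.75, 0.90)  # made-up example proportions
asin(sqrt(p))                          # transformed values lie in [0, pi/2]
```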

user275189
  • Please say more about the details of your data and experimental design. If your outcome is a proportion, as the name Proportionofundershoots suggests, then a mixed-effects ANOVA such as you have done might not be the best analysis to pursue. Also, please say more about why you are doing normality testing; see this page for an introduction to why such testing often can be unnecessary or even misleading. – EdM Mar 10 '20 at 21:10
  • Why would you use a test when the conditional response is obviously not continuous? The null is immediately false. What could the test tell you that you don't already know for sure? The QQ plot addresses a somewhat more useful question ("how far from normal" in some sense), though there are still issues with choosing analyses by looking at your data. – Glen_b Mar 11 '20 at 05:39

1 Answer


Based on your first graph, your problem is granularity: there is a limited number of distinct values, each of which occurs often. It is impossible to (meaningfully) fix that with a transformation. So the second graph suggests to me that you made an error when applying the arcsine transformation. Apart from the granularity, the original distribution does not look bad, so I would stick with that.

Based on eyeballing your graph, the number of observations is large enough that a statistically "significant" result could easily be produced by a substantively insignificant deviation from the null hypothesis. So the tests are not that meaningful in your case.
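Both points can be illustrated with simulated data (a sketch, not the poster's data): granularity means only a handful of distinct values occur, and a monotone transformation such as arcsine square-root cannot remove it, because the transformed values still sit on a grid of the same size.

```r
# Simulated sketch: proportions rounded to a coarse grid, so only a
# handful of distinct values occur, each many times.
set.seed(42)
p <- round(rbeta(500, 5, 5) * 20) / 20   # proportions on a grid of width 0.05
length(unique(p))                        # far fewer distinct values than n = 500

# A strictly monotone transformation maps the grid to another grid of
# exactly the same size; the granularity is unchanged.
q <- asin(sqrt(p))
length(unique(q)) == length(unique(p))   # TRUE

# With n = 500, shapiro.test() has power to flag even such minor features.
shapiro.test(p - mean(p))
```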

Maarten Buis