I am modeling data from an experiment with a mixed model. The outcome variable is a percentage. There are three fixed effects, Condition: diseased and healthy, Time point: 1, 2 and 3 , Drug: A,B,C,D,E. Subject is taken as the random effect. I need to perform three tests:
- Check for a significant difference between disease and healthy for time point 1 and drug A, and so on for all the combinations of time points and drugs
- Check for significant difference between Time point 1 and 2 for all the combinations of drugs and conditions
- Check for a significant difference between Drug A and B for all the combinations of time points and conditions.
The data is unbalanced
This is what I did:
1. Build a linear mixed model
fit_1 <- lmer(y~Condition*Drug*Timepoint+(1|Subject))
2) Use lsmeans to perform tests
lsmeans(fit_1,pairwise~Condition | Drug * Timepoint,adjust="none")
lsmeans(fit_1,pairwise~Timepoint | Drug * Condition,adjust="none")
lsmeans(fit_1,pairwise~Drug | Condition * Timepoint,adjust="none")
However, none of the p-values were less than the nominal 0.05 alpha-level. The inference on these values was significant when Wilcoxon test was used. So I went back to check the residuals and they seemed to violate the assumptions of normal distribution and homoscedasticity. Should I use GLMM instead? If so, which family will be applicable when $y$ variable is a percentage of counts data?
xyplots can help visualize the experimental conditions better. Those assumptions are rarely met, but the inference is usually valid, as many posts on this site will tell you! – AdamO Apr 09 '18 at 17:29