I'd like to compare several distributions fitted to one dataset (of i.i.d. random variables) by AIC. Do there exist some specific rules of thumb for such a situation?
It seems that most of such rules are either for regression models or some extra conditions are needed (e.g. models are nested and sample sizes are large, as in book by Burnham and Anderson).
To be precise, I have a distribution $F$ and a distribution $G$, the sample size $n=100$, $$ AIC(G) - AIC(F) = 17.9. $$ Can I claim that there is substantial evidence that the fit of $F$ is better than that one of $G$?