Let's say we do a classical ANOVA F-test to reject or not reject the hypothesis:
H0: $µ_1 = µ_2 = ... = µ_k$
with $k$ classes, and $n$ observations per class
Are there ranges of $k$, $n$ for which the F-test is not applicable or sub-obptimal?
Example 1: $k=10$, $n=1$ impossible because only 1 observation per class!
Example 2: $k=100$, $n=2$ will probably give a poor test because we have only 2 observations per class
Example 3: $k=3$, $n=100$ seems reliable because we have many observations per class
Is there a rule of thumb when we apply this test, are there requirements such that $n$ should be at least $2 k$ or something like that, for the test to be useful?