2

As I understand it, the Welch–Satterthwaite equation says that if $s_i^2$ is the sample variance of group $i$ (with $n_i$ samples), then, assuming that the measurements in each group are iid's that are normally distributed, the statistic $\sum a_i s_i^2$ has roughly $\chi^2$ distribution with degrees of freedom $\frac{(\sum a_i s_i^2)^2}{\sum \frac{(a_is_i^2)^2}{n_i-1}}$.

This equation doesn't take into account the $\sigma_i$'s, and that makes me very suspicious...!

Here's a quick example to show how weird this formula is. Assume for a second that we apply this equation to the case that $a_1=1$ and $a_i=0$ for $i>1$. Then this equation says that $s_1^2$ has roughly $\chi^2$ distribution with $n_1-1$ degrees of freedom. But that's incredibly false! What if $\sigma_1^2$ is huge?

When you take more than one $a_i$ to be nonzero, more ridiculousness ensues.

So what, if anything, am I getting wrong?

Andrew NC
  • 319
  • 2
  • 8
  • 2
  • Typically $a_i$ is taken $ = 1/(n_i - 1)$, just FYI. 2. It is an approximation to the distribution of the sum, but if you only have one term in the sum, you cannot expect the approximation to be good. Consider, for comparison, the central limit theorem: clearly a Normal approximation to the distribution of a single draw from an Exponential distribution (i.e., sample mean with a sample size $=1$) will be poor, but it doesn't take a large sample for the Normal approximation to be halfway decent for the sample mean of an Exponential distribution.
  • – jbowman Aug 23 '17 at 22:21
  • 1
    Similar examples can be cooked up for more $a_i$'s that are nonzero, so I remain skeptical... – Andrew NC Aug 24 '17 at 01:38
  • 2
    You can usually come up with examples that break approximations like this, but it doesn't mean that the approximations are generally poor. (Sometimes they are, though, but those tend to get disappeared from applied work over the years.) It does, however, mean that the statistician who is contemplating using it needs to have some awareness of when it is likely to be poor and actually check the data for those conditions! So good for you for not accepting the universal applicability of the W-S equation! I may come up with an actual answer, with examples, later. – jbowman Aug 24 '17 at 02:01
  • @jbowman: In https://secure-media.collegeboard.org/apc/ap05_stats_allwood_fin4prod.pdf , on page 6, the (more specific) formulation of the WS formula, unlike the version I gave above, does take into account what they call $\sigma_B^2$, which accounts for the weird example that I mentioned in the body of the question.

    But the WS formula stated there is still weird for other reasons...

    – Andrew NC Aug 24 '17 at 14:31