28

In almost any textbook introducing the topic of frequentist statistics, null hypotheses of the form $H_0: \mu=\mu_0$ or similar are presented (the coin is unbiased, two measurement devices have identical behavior, etc.). Classic statistical tests such as the $Z$ or $T$ tests are then based on rejecting these null hypotheses.

As I see it, these types of equality hypotheses are uninteresting for a number of reasons:

  • In "real life" one is always only interested in some finite accuracy $\epsilon$, meaning the hypothesis of interest is actually of the form $H_0: |\mu-\mu_0|<\epsilon$.
  • The equality hypothesis is a priori known to be wrong when considering continuous variables (no coin is perfectly unbiased in reality!), and as a corollary,
  • The fact that the null hypothesis cannot be rejected is by definition temporary, and is an artifact of not enough data. Given enough data, any type of equality hypothesis on continuous variables will be rejected in a real world use case.

So, why are these hypotheses still used, both in text books and in applications, while it is difficult to find formulas for more interesting* "real" hypotheses?


* e.g. a question I recently asked regarding these types of hypotheses

  • 1
    This is a slight tangent from the main thrust of your question, but on the topic of biased coins, you may be interested in Gelman's paper "You Can Load a Die, But You Can't Bias a Coin" – Adrian Nov 05 '22 at 19:27
  • 3
    I don't think what I am about to say will answer your question, however, forming an interval estimate (e.g. $(1-\alpha)100%$ CI) basically avoids all of the shortcomings by focusing on what $\mu$ is likely to be, rather than focusing on what it isn't – jcken Nov 05 '22 at 20:03
  • @jcken, your interpretation of a CI sounds more like one of a credible interval (or perhaps a fiducial interpretation of confidence interval) than a frequentist confidence interval to me. I wonder if a particular realization of a confidence interval tells us much about what $\mu$ is likely to be. – Richard Hardy Nov 05 '22 at 21:01
  • @RichardHardy you're right. I guess the point I want to make is that concentrating on estimation (and in particular, interval estimates) rather than testing would, I think, solve/avoid each of the 3 problems posed in the question – jcken Nov 05 '22 at 21:13
  • 11
    This of course is a loaded question, and such questions are not usually very helpful to learn something that goes beyond already existing prejudices. – Christian Hennig Nov 06 '22 at 01:02
  • 16
  • Tests like "H0:|μ−μ0|<ϵ" are called equivalence tests; there's also noninferiority and superiority tests. https://en.wikipedia.org/wiki/Equivalence_test . They have not as been as widely used as they should, but I think that's in part because they require thinking (in coming up with $\varepsilon$) for more than a few seconds. 2. People often use tests in situations where their stated question of interest is clearly not a testing question. Neither of those issues is the fault of hypothesis tests, but of how they're being taught and used by people we have little influence over
  • – Glen_b Nov 06 '22 at 01:37
  • 2
    @Glen_b That’s been my observation, too, that people are reluctant to pick an epsilon. – Dave Nov 06 '22 at 04:33
  • 1
    @Glen_b Of course one can always compute a confidence interval and see whether something smaller than $\epsilon$ is in it, and without even specifying an $\epsilon$ one can see whether everything in the CI looks substantially big or not. – Christian Hennig Nov 06 '22 at 09:41
  • Indeed so; in many cases a CI serves perfectly well. – Glen_b Nov 06 '22 at 11:23
  • @Glen_b - TOTS was exactly what I was looking for. Thank you! Do you want to add this as an answer? – Nathaniel Bubis Nov 06 '22 at 14:49
  • This question seems to be in part about the falsification of scientific hypotheses. This topic has been discussed previously on CV; see for example: 1, 2, 3, 4, ... – dipetkov Nov 06 '22 at 15:18
  • @nbubis If that's really an answer, the question should probably close as a duplicate since there are multiple questions on site for which my comment is already an answer. – Glen_b Nov 06 '22 at 22:04
  • I'll think about how to frame an answer that doesn't seem like a duplicate. – Glen_b Nov 06 '22 at 22:18