9

In logistic regression there is Box-Tidwell but I know of nothing like that in linear regression. I use partial residual plots to look for this, a graphical feature, but would love to find a formal test (in honesty I doubt you can do a formal test of this, but I could be wrong).

Silverfish
  • 23,353
  • 27
  • 103
  • 201
user54285
  • 1,051
  • Related: https://stats.stackexchange.com/questions/70009/how-do-you-check-the-linearity-of-a-multiple-regression. – StubbornAtom May 01 '19 at 10:31
  • For the model $y=\beta_0+\sum_j\beta_jx_j+\varepsilon$, isn't a formal test $H_0:\beta_j=0$ for all $j$ vs $H_1: \text{not }H_0$? This is similar to an ANOVA F-test. – StubbornAtom May 01 '19 at 10:41

3 Answers3

10

Box-Tidwell was developed for ordinary least squares regression models.

So if you were inclined to use Box-Tidwell for this, that's actually what it's designed for.

It's not the only possible approach, but it sounds like an approach you're already familiar with.

However, I'm not convinced that (most times it's used) a formal test is appropriate - I believe it usually answers the wrong question, while the diagnostic plots you've been looking at come closer to answering a useful question. [I have a similar opinion of many other tests of regression assumptions]

Glen_b
  • 282,281
  • 4
    Would I be better off with a different specification is usually a good question, but one hard to tackle except very specifically. (Pun not really intended, but it seems to fit the case.) – Nick Cox May 01 '19 at 07:00
  • @Glen_b Could you state the "wrong" and "useful" questions you refer to? Thanks. – rolando2 May 01 '19 at 12:28
  • 4
    The hypothesis being tested is exact linearity -- which will almost never be the case. If we fail to reject, all we learned was that our sample was too small to detect the nonlinearity, not that its effect was small. If we do reject, we're no better off, we learned what we already knew, but if the nonlinearity is small, it is of little consequence. The test still doesn't tell us whether the nonlinearity actually matters; what we needed to know is how much difference the nonlinearity we have makes to our inference. – Glen_b May 01 '19 at 12:48
  • A problem I have, because I work with the entire population in question normally, is that I have thousands of data points. They tend to look like big blobs in the residuals so its hard to discern patterns in the regression; they don't represent very well what you see in text.books. – user54285 May 01 '19 at 20:40
  • 1
  • With a large sample, that would form the basis of a (readily answered) question (how to see nonlinearity in a residual plot in such circumstances); ideally adding an example plot that would give you difficulty. 2. If you're fitting to the entire population of interest, notions of testing go out the window (you certainly don't have a random sample!). You literally have the whole thing you want to make inferences about, just calculate what you need.
  • – Glen_b May 02 '19 at 00:07