14

I am an MBA student that is taking courses in statistics. Yesterday, I attended a statistics seminar in which some graduate students presented their research on some psychology experiments (e.g. response of mice to some stimulus).

The students presented some theory behind the models they used - in particular, they showed us about "General Linear Models" (GLM). I think I was able to follow the general ideas that were discussed. It seems to me that GLM's model some function of the mean instead of directly modelling the mean - this gives GLM's the advantage of being more flexible and versatile in modelling more complicated datasets.

  • The one thing that I couldn't understand however, is that why is the "Linear" part of GLM's so important? Why can't you just use/call it "General Models"?

  • When I tried to raise this point in the question and answer period, someone told me that the "Linear" aspect of GLM's allow for "ease in estimation" (i.e. easier "number crunching") compared to "Non-Linear" models and that the "Linear" aspect also allows for easier interpretation of the model compared to "Non-Linear" Models.

I am also not sure what exactly is a "Non-Linear" Model. I would have thought that a Logistic Regression is a "Non-Linear" Model because the output looks non-linear and it can model non-linear relationships, but it was explained to me that any model that can be written "linearly" like "x1b1 + x2b2 + ..." is called a "Linear Model". Therefore, is something like a Deep Neural Network a "Non-Linear Model"?

I couldn't really understand why this is true - can someone please help me with this?

stats_noob
  • 1
  • 3
  • 32
  • 105
  • "Linear" aspect is because they are linear in their parameters. – User1865345 Sep 17 '22 at 20:31
  • 1
    "In spite of the availability of highly innovative tools in statistics, the main tool of the applied statistician remains the linear model. The linear model involves the simplest & seemingly most restrictive statistical properties: independence, normality, constancy of variance, and linearity. However, the model and the statistical methods associated with it are surprisingly versatile and robust. More importantly, mastery of the linear model is a prerequisite to work with advanced statistical tools because most advanced tools are generalizations of the linear model." - Alvin C. Rencher. – User1865345 Sep 17 '22 at 20:51
  • 1
    Fun fact: I'm chemometrician, and most chemists would call a model/function linear if it is linear in the data (similar to OP). As others have said, in stats, the important linearity is in the parameters. To avoid ambiguity, in chemometrics we speak about bilinear models, which are both linear in data and in the parameters :-). (And they are of huge importance to us, because we often know from the physics underlying the data generation processes that bilinearity is at least a good approximation) – cbeleites unhappy with SX Sep 18 '22 at 12:28

2 Answers2

14
Tim
  • 138,066
  • Do GLM's allow for easier estimation of regression coefficients compared to non-linear models?
  • – stats_noob Sep 18 '22 at 02:40
  • 1
    @MBA_Grad_Student_2022 when response variable is related with the predictor variables through non-linear function, say, $y = \theta_1\exp(\theta_2x) +\varepsilon.$ The normal equations corresponding to non-linear least squares aren't that easy to solve compared to the linear counterparts. – User1865345 Sep 18 '22 at 05:32
  • @MBA_Grad_Student_2022 1) check the links I provided. Any model that is not linear in parameters is non-linear. – Tim Sep 18 '22 at 05:32
  • @Tim +1 for the compilation of those posts. – User1865345 Sep 18 '22 at 05:34
  • 2
  • Usually yes. But not always. For example, a decision tree is non-linear and easy to fit.
  • – Tim Sep 18 '22 at 05:34