0

Is there any benefit to doing a for loop so that a logistic regression model is fit separately for each independent variable (i.e. 100 models all using the same dependent variable)? As opposed to fitting one model with all 100 independent variables and the dependent variable.

Insu Q
  • 355

2 Answers2

1

There is a major disadvantage to fitting individual predictors separately in regression. This page points out a major difficulty in general and a difficulty specific to generalized models like logistic regression.

Even in ordinary least-squares regression, omitting a predictor that is both correlated to outcome and to the included predictors leads to omitted-variable bias in the estimates of the coefficients of the included predictors. In logistic regression, omitting any predictor associated with outcome, even one not correlated with the included predictors, leads to omitted-variable bias.

Thus your coefficient estimates from the single-predictor regressions would be biased in difficult-to-know ways. And as a comment points out, the single-predictor route does not allow for examining interactions. I can think of no offsetting benefit.

EdM
  • 92,183
  • 10
  • 92
  • 267
0

That would be similar to what naive bayes classifier is doing. There is no advantage for prediction. The only advantage is that this can be much faster to fit, while Ib singe situation not having too much worse performance.

rep_ho
  • 7,589
  • 1
  • 27
  • 50