4

In lasso regression, increasing the regularisation strength/shrinkage penalty eventually forces all of the regression coefficients to zero. In this instance the regression is logistic.

The plot shows the 'trajectory' of each coefficient as the regularisation strength increases along the x axis. The explanatory variables have been standardised.

Values of regression coefficients as shrinkage penalty increases

Coefficients 3 and 7 (7 is the upper green curve whose label is slightly obscured) for example 'survive' the longest, i.e. take the largest penalty values to force them to zero. Other coefficients, like 9, quickly die off to zero for smaller penalty values so don't affect the response for large enough penalty values.

(1) Can one interpret this as coefficients 3 and 7 are most important in terms of association with the response?

(2) What determines the rate of decay of the coefficients with shrinkage penalty? Why do some coefficients become zero quickly, and some remain non-zero longer?

PM.
  • 627
  • 1
  • 4
  • 15
  • 5
  • Yes. 2. the shrinkage basically pushes the coefficients to zero in the order of their magnitude of correlation with the response. Sometimes there are high correlation amongst the covariates (not with response), sometimes called collinearity. In this case, lasso selects only one variable
  • – TPArrow Apr 05 '19 at 14:46
  • This question and answer may be useful: https://stats.stackexchange.com/questions/154825/what-to-conclude-from-this-lasso-plot-glmnet – Matthew Drury Apr 05 '19 at 14:58
  • @MatthewDrury that does look useful. I will need some time to study it. Thanks for the link. – PM. Apr 05 '19 at 15:10
  • @TPArrow maybe you could change your comment to an answer. Do you have a reference to a text or paper that discuses this in more detail? I have Elements of Statistical Learning but it wasn't clear where this precise question was discussed. Thanks. – PM. Apr 07 '19 at 08:29