1

Could somebody explain why ridge regression does not perform feature selection although it makes use of regularization? So, it penalizes the regression coefficients like LASSO does, but how come we end up with using all features for all the lambda (penalty) values in range? Why don't we end up getting some zero coefficients in case of high penalization? I know this is a very basic question, but I would appreciate any response. Thanks.

user5054
  • 1,549

0 Answers0