1

I am performing group lasso and need to double check if I include a dummy variable for the reference answer or not. For example:

original question : no (0), Yes (1), Unknown (9).

If I create 3 dummy variables, no (reference) (0/1), yes (0/1), and unknown (0/1), would I include all three in the group lasso or just the 2 (yes and unknown).

Levi M
  • 75

1 Answers1

1

It doesn't really matter because you are using regularization. Without regularization, you would have too many parameters, but it's not a problem for a regularized model. So you can use either. Including all the categories is a popular choice for regularized models though, as you don't need to decide which category to drop.

Tim
  • 138,066