I remember reading that for certain kinds of splines, all basis functions have to be included together when using feature selection methods such as stepwise regression or shrinkage.
The argument was that the spline cannot be constructed without all of its basis functions, so selecting only some of them would be wrong. A similar argument might hold for selecting only some of the levels of a one-hot-encoded nominal feature, although I cannot think of a good reason why one should not be able to just "throw away" certain levels.
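To make the one-hot part concrete, here is a minimal sketch of what I mean by throwing a level away. The toy data and the helper names `design` and `fitted` are made up for illustration, and I am assuming plain least squares with an intercept. As far as I can tell, dropping the dummy for a level does not remove that level from the model; it only pools it with the reference level.

```python
import numpy as np

# Hypothetical toy data: three levels with clearly different group means.
levels = np.array(["A", "A", "B", "B", "C", "C"])
y = np.array([1.0, 3.0, 10.0, 12.0, 20.0, 22.0])

def design(levels, kept):
    """Intercept plus one dummy column for each level listed in `kept`."""
    cols = [np.ones(len(levels))]
    cols += [(levels == lv).astype(float) for lv in kept]
    return np.column_stack(cols)

def fitted(X, y):
    """Least-squares fitted values for design matrix X."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    return X @ beta

# Full encoding: A is the reference level, B and C get their own dummies.
full = fitted(design(levels, ["B", "C"]), y)

# Reduced encoding: the dummy for C is dropped, so C shares the intercept with A.
reduced = fitted(design(levels, ["B"]), y)

print(full)
print(reduced)
```

In the reduced fit, the rows with level A and level C get the same fitted value (their pooled mean), so whether dropping a level is acceptable seems to depend on whether that pooling is intended.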
Now I'm looking for a reference to read up on this again.