Huber loss function is widely used, because it combines the good properties of squared and absolute losses. Therefore, when I apply the penalized regressions, i.e. LASSO, Elastic net and Ridge, to make predictions, the Huber loss is used to tune the hyperparameter by cross validation method in training process, and the MAE or MSE is applied for evaluation in validation and test stages. In Kolassa(2020), the author calims that it makes no sense that a model to be fitted by minimizing the in-sample MSE, but for holdout forecasts to be evaluated using the MAPE. (see the third point in Section 4 "Takeways").
So my main question is - Does it make sense that I use Huber loss for training, but use other measures, such as MAE and MSE, to evaluate the forecasts?