My question is, after I have already performed CV and tuned my model is there a standard practice that data scientists use regarding fitting your final model to the entire data set versus only the training data? Would it potentially add some overfitting if fit to the whole data set? Please correct me if my logic is wrong as these are just my initial thoughts.
While I think it's probably ok to not fit the final model to ALL the data for small data sets (since that means you are only excluding a small portion?), if your data set was large I would think you would want to utilize all of your data otherwise it would almost be wasteful? We don't have to worry about any extra added "overfitting" by fitting to the entire data set because we are not going to go back and change our parameters anyway since that would be "peeking"?
This is my first question on this beautiful website :D Thanks!