Right now, I'm doing my thesis which use multilayer perceptron learning method to train a model. What I learned from my class is the purpose to partition the data which can be separated into 3 group
- Training dataset - This set is to train the model.
- Validation dataset - This set is to find the best model parameters.
- Testing dataset - This set is to evaluate the performance of the model.
But the problem is I cannot find the reference to support why I have to partition the samples in my thesis with certain ratio.
I try to google it but can't find those articles. It's probably because I googled it with the incorrect technical terms.
Back to the question, Can anyone here suggest the articles discussing the best ratio with supported evidence?