When training a neural network with gradient descent (or an analogous optimizer), the cost of the model changes throughout the training process; that is the point of the algorithm. However, the cost does not necessarily decrease monotonically, and at some points it may even increase.
So, is it OK to keep track of the parameters that produce the lowest cost and use those as the best parameters for the model? In the image, this would mean using the parameters that produced the lowest cost rather than the parameters from the final iteration.
Assume that the model corresponding to the lowest cost achieves acceptable accuracy.
Does doing this cause some kind of problem?
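To make the question concrete, here is a minimal sketch of what I mean (plain Python; `cost_fn`, `grad_fn`, and `train_keep_best` are placeholder names just for illustration, not from any particular framework):

```python
import copy

def train_keep_best(params, cost_fn, grad_fn, lr=0.1, steps=200):
    """Plain gradient descent that also snapshots the lowest-cost parameters seen."""
    best_params, best_cost = copy.deepcopy(params), cost_fn(params)
    for _ in range(steps):
        grads = grad_fn(params)
        params = [p - lr * g for p, g in zip(params, grads)]
        cost = cost_fn(params)
        if cost < best_cost:                    # new lowest cost -> keep a snapshot
            best_cost = cost
            best_params = copy.deepcopy(params)
    return best_params, best_cost               # return the snapshot, not the last iterate

# Toy usage: minimize (w - 3)^2 starting from w = 0
cost_fn = lambda p: (p[0] - 3.0) ** 2
grad_fn = lambda p: [2.0 * (p[0] - 3.0)]
best, cost = train_keep_best([0.0], cost_fn, grad_fn)
print(best, cost)
```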
