Training loss is decreasing while validation loss is fluctuating

Asked Jul 25 '22 at 20:49


I'm training an LSTM model architecture with the following hyperparameters: hidden_size = 128, num_layers = 1, batch_size = 64, learning_rate = 0.001.

The learning curve looks as shown below

I'm wondering why the training loss keeps decreasing while the validation loss oscillates around some mean value. Does this mean the model is overfitting, since it can't perform well on the test set?

asked Jul 25 '22 at 20:49

Eman.suradi

Yes, it is overfitting. – frank Jul 26 '22 at 04:39
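
One way to check this on your own curves, as a rough sketch rather than a definitive recipe: smooth the noisy validation loss and apply a patience-based early-stopping rule, so training halts once the validation loss has stopped improving. The snippet below is plain Python and does not depend on any framework. The function names, the `patience` and `min_delta` parameters, and the loss values are all hypothetical and made up for illustration; they are not taken from the question.

```python
from typing import List, Optional, Sequence


def moving_average(values: Sequence[float], window: int) -> List[float]:
    """Trailing moving average; the first entries average over fewer points."""
    smoothed = []
    for i in range(len(values)):
        start = max(0, i - window + 1)
        chunk = values[start:i + 1]
        smoothed.append(sum(chunk) / len(chunk))
    return smoothed


def early_stop_epoch(
    val_losses: Sequence[float],
    patience: int = 5,
    min_delta: float = 0.0,
) -> Optional[int]:
    """Return the epoch index at which training would stop, or None.

    Training stops once the validation loss has failed to beat the best
    value seen so far by more than `min_delta` for `patience` epochs in a row.
    """
    best = float("inf")
    epochs_without_improvement = 0
    for epoch, loss in enumerate(val_losses):
        if loss < best - min_delta:
            best = loss
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                return epoch
    return None


# Made-up validation losses that drop at first and then oscillate.
val_losses = [1.0, 0.8, 0.7, 0.72, 0.69, 0.71, 0.70, 0.72, 0.71, 0.73]
stop_at = early_stop_epoch(val_losses, patience=3)
smoothed = moving_average(val_losses, window=3)
```

If the smoothed validation curve is flat or rising while the training loss keeps falling, that is consistent with overfitting. Typical things to try in that case are stopping at the best validation epoch, adding dropout or weight decay, or using more training data. A small validation set can also make the curve noisy on its own, so the fluctuation alone is not conclusive.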
