I've read this regarding the difference between epoch and mini-batch.
To clarify:
- With an
epochvalue of1000andbatch sizeof50, does that mean that the model will use each data point exactly1000times in such an (random) order where at each iteration only50data points are used for optimization? (meaning a total of 50*1000 calculations?) - Is every data point used exactly
1000times?
epoch / batch-sizeand the number of total calculations isepoch * number-of-points(as each point is used to calculate the rate of change of errorepochamount of times?) – alexandrosangeli Feb 02 '21 at 17:35