I am training YOLO network consisting of resnet50 architecture.This problem is to find different text labels on the image and predict bounding boxes
During training, I am seeing very less change in both training and validation loss. What are different method / debugging techniques to know where exacly is the problem. I am training on 5000 images and have around 23M parameters to train. I am using batch size as 8 and training times as 5 and number of epochs as 50.
