I am handling a binary classification problem on an imbalanced dataset.
The goal is to create a system able to insert the returned score (probability to be in the positive class) in a bins between 1 and 10, where 1 means low probability to be in the positive class and 10 high prob.
The main problem is that I only have the training dataset, so I don't know any values in the test set. Moreover, the predictions will be done one by one, so I cannot analyze the whole test scores.
I tried many models, but in particular I use tree-based models (such as XG-Boost, RF). In these cases, considering also the imbalanced dataset, the output scores are in a very small range, much smaller than [0, 1]. The scores are necessary since I don't want to classify the instances directly into the 0-1 class, but I want to analyze the scores.
How should I build a method able to find the different thresholds in order to create the 10 classes?