I have ordinal predictions by experts: 1, 2, 3, 4, 5. The outcome to be predicted is, however, continuous: 0-36.
I don't know what threshold the experts use for the categories 1-5, but I know that the outcomes in 1 should be smaller than in 2, in 2 smaller than in 3, and so forth. How can I compute the accuracy of each expert?
Intuitively, I would like to allow for different thresholds for each expert. So I would basically count the ranking errors.
Are there any established methods to do this?