I have one numeric score S1 to (ascending) sort a dataset. Now I have developed another method that returns a numeric score S2 that should sort the dataset in similar way. My question: with what methods can I evaluate the sorted results to compare performance of S2 to the "ground truth" S1?
If S1 returns ABCDEFG and S2 returns ADEFGBC , how can I compare the result of this permutation to the ground truth?