Hi, I'm quite new to statistics and have been tasked to evaluate if there is a difference in accuracy between 2 subpopulations in a logistic model.
The credit scoring company's model calculates the risk credit score (0-1 or 0-100%) of an individual defaulting and then they send this score to the banks who decide if they want to hand out a loan or not. Then a year later we get back info if the individual that had a loan approved have recieved a payment remark (binary 1 or 0). (We have no way of seeing for what risk scores they decide to give out loans)
Since we have one continous dependent variable and a binary outcome variable I've found calculating the acutal accuracy impossible and are now turning to Proper scoring rules to determine which model have the most "loss" in prediction compared to outcome.
I've found Brier score quite interesting for my occasion but there are so many scoring rules and few places document when it's best to use which scoring rule.
Would love an explanation, suggestion of alterative approach or link me to a place where this is discussed.
Thank you for your time and consideration.