I've built a model that tries to evaluate the percentage chance of winning for all NHL games. For example:
- Ana vs Bos (Bos- 57% chance of winning)
- Chi vs Mtl (Mtl - 51% chance of winning)
- LA vs Det (Det- 80% chance of winning)
The model is updated daily to take into account what happened in the latest games. At the end of the season, how can I validate my model against the actual results?
If my percentages were a fixed set (55%, 75%, 95%), it would be easy to validate. My model should be right 55% of time for all games that have a 55% chance of winning. My model should be right 95% of time for all games that have a 95% chance of winning.
But since my percentages are all different I don't know how to deal with them. For example, I have predicted percentages like: 51.5%, 53,2%, 54.9%, 62.4%, 88.1%, etc. These came from my model. I'm using a lot of historical data to predict the winner (past games, goalie stats, active players, home vs away, etc., it is similar to the odd column on this page).