Suppose I have a large (~10M) group of games, played by a large (~100K) group of players. Games are 1v1 and scored, so each player has a rating (analogous to chess in these regards; the real dataset is this).
Player rating distribution is kinda normally distributed, but the rating of players in games is biased:
- Strong players play a lot more than typical or weak players
- The game-matching algorithm matches strong players with strong players
I'd like to check whether strong players exhibit certain characteristics. For example, keeping the chess analogy, is it true that strong players tend to move the queen more. This is a hobby project, but I work enough around statisticians to know that the sampling approach matters. Alas, not enough to know what to use in this case :)
Naively, my plan would be:
- bin players into 'rating buckets'
- uniformly sample
Mplayers from each bucket - uniformly sample
Ngames for each player - scatter plot
(rating, ratio-of-queen-moves)for these games - do a Pearson correlation (to compare strategies, e.g. queen vs rook moves)
Does this sound plausible? Any particular terms or topics you recommend I read up on to better decide this myself? (e.g., AFAICT, what I described is neither 'stratified' nor 'clustered', but I don't know what to call it).