Inter-annotator agreement when the number of annotators is not consistent across samples

Question

I have a sample of around 100 items, each annotated as 0, 0.5, or 1 by 5 annotators. I found that some of the annotations needed to be deleted, so for some items we now have fewer than 5 annotations.

If there were no missing annotations, I'd use Fleiss' kappa to measure inter-annotator agreement. But since some annotations are missing, is there a way to measure agreement anyway?

score 0 · Answer 1 · answered May 09 '22 at 21:42


You just need to use a formulation of Fleiss' kappa (or another chance-adjusted index of categorical agreement) that allows for missing data. If you want to use Python, see the irrCAC library.
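To make the idea concrete, here is a minimal pure-Python sketch of one such formulation. It assumes the generalization described by Gwet: observed agreement is averaged over items with at least two ratings, and category prevalence is averaged over items with at least one rating. The function name `fleiss_kappa_missing` and the use of `None` for a deleted annotation are illustrative choices, not part of irrCAC, and the labels are treated as unordered categories (no weighting for 0 / 0.5 / 1 being ordinal).

```python
from collections import Counter


def fleiss_kappa_missing(ratings):
    """Fleiss-style kappa for items with varying numbers of ratings.

    ratings: list of items; each item is a list of labels, with None
    marking a missing (deleted) annotation. Hypothetical helper.
    """
    # Count observed labels per item; drop items with no ratings at all.
    counts = [Counter(r for r in row if r is not None) for row in ratings]
    counts = [c for c in counts if c]
    if not counts:
        raise ValueError("no ratings found")

    # Observed agreement: proportion of agreeing rater pairs per item,
    # averaged over items that have at least two ratings.
    per_item = []
    for c in counts:
        r = sum(c.values())
        if r >= 2:
            per_item.append(sum(v * (v - 1) for v in c.values()) / (r * (r - 1)))
    if not per_item:
        raise ValueError("need at least one item with two or more ratings")
    p_a = sum(per_item) / len(per_item)

    # Chance agreement: squared average within-item share of each category.
    categories = set().union(*counts)
    n = len(counts)
    p_e = sum(
        (sum(c[k] / sum(c.values()) for c in counts) / n) ** 2
        for k in categories
    )

    if p_e == 1.0:
        return float("nan")  # only one category was ever used
    return (p_a - p_e) / (1 - p_e)


# Toy data: 4 items, up to 3 annotators, None where an annotation was deleted.
example = [
    [0, 1, None],
    [0, 0, 0],
    [1, 1, None],
    [0.5, None, None],
]
print(round(fleiss_kappa_missing(example), 4))
```

With no missing annotations this reduces to the usual Fleiss' kappa. For real analyses a maintained implementation is preferable, since it will also provide standard errors and weighted variants.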


Jeffrey Girard

