I am using GAMs to model the relationship between a binary response variable (0 or 1) and several continuous fixed and random explanatory variables. It seems that a binomial distribution is the standard choice, but the data are quite sparse, there are 21,000 0's and only about 800 1's.
I'm curious if there are any other distributions I should consider that might be better equipped to handle these data? I'm intrigued by beta-binomial, but it's my understanding that it's for data bound between 0 and 1.