My data set includes 400 records. Each record comprises values for the binary outcome variable $y$ and 12 categorical predictor variables $x_1, ..., x_{12}$, most of which are binary too. The records originate from 10 different studies, of which one is much larger than the remaining ones (it contributes nearly three-third of the records, while each of the remaining studies contributes 4 to 30 records).
My plan is to fit a logistic regression model. Since I cannot rule out that records from the same study are correlated, I consider using GEEs instead of ordinary logistic regression modeling. The records from the same study would form the clusters. As working correlation structure I would use 'exchangable'. However, I am still unsure if GEEs are a suitable approach in my case, which is distinguished by
- the low number of clusters (viz., ten)
- the large differences in cluster size.
Are there any advices/experiences on this? I found in the literature that actually 40, 50 or even more clusters would be needed (opinions seem to diverge) in order to get reliable 'sandwich' errors for the model parameter estimates. For smaller numbers of clusters, some correction would be needed. However, I neither found much on how to precisely apply this correction, nor on the second issue, if differences in cluster size affects the results of GEEs. I would highly appreciate any advice.