Say we have a number of different geographic areas, $A_1, A_2, ...$, and a quantity $X$ that follows a gamma distribution in each area but, we expect, with different parameters: $(k_1,\theta_1), (k_2,\theta_2),...$. If we take a sample from the full population across all areas (call this the 'top-level' sample), we can fit a gamma distribution to it and get its parameters.
If, from some other source, we are told the mean value of $X$ in each geographic area, what can we say about the most likely values of the distribution's parameters for each area (call these the 'lower-level')?
I have built a simple example in Excel in which I take a sample from 5 randomly generated samples from different gamma distributions. From this, it looks like if the lower-level distributions all have the same shape parameter, then the top-level distribution also has that shape parameter and the mean of the top-level distribution is the mean of the lower-level means. If the lower-level shape parameters are different, though, then things are not so simple.
Is it ok to assume that if my top-level distribution has parameters $k,\theta$, then the best estimates of the parameters of the lower-level distributions are for them all to have $\theta$ as their shape parameter and choose $k_1, k_2, ...$ to match the lower-level means?
Is there a general result for what I am looking for?
Edit: A complicating point is that the top-level data that I have is likely to only be available after it has been binned into ranges.