I have a sales data for a certain type of grocery products at stores' transaction-level (sales data gathered through cashiers' scanners). As you can imagine, for the most part, the number of units sold per transaction is rather small, which results in this non-normal distribution of sales:
I do have infrequent extremes (not shown here) that go into hundreds of units and (rarely) thousands.
Since the number of units sold is my dependent variable, I seek to find the best data distribution to fit the model. In the absence of normality, OLS is not an option, so I wonder if the use of a count model like Poisson, negative binomial or gamma could be justified here? (I compared the three in Stata, and the Gamma model fits best).
Any suggestions on the appropriateness of the gamma distribution for the transaction-level sales data? Other options I could consider?
