1

I have a data set for a PhD paper in which I need to look for differences between 3 groups of observations. The issue is that ~95% are in one group and the rest are in the other two. The data table is attached below. Will any research on the differences between the three groups is valid from a formal statistical point of view?

    group 1  group 2
  1 402286   383596
  2   8523   6107
  3  15112   8535 
user49422
  • 213

1 Answers1

1

Typically, you can measure treatment differences for two or three treatments in unbalanced data. Correct me, if I am wrong but theoretically I don't see a problem. However, it is worth pondering over if comparing the treatment difference between when the data is not imbalanced, qualitatively makes sense.

Also, do you know what specific statistical method do you plan to use for your problem?

  • This data is about measuring basic statistics metrics like quantiles, mean, median etc,.Later on I'm going to use CART to look for variable importance for my main research question. – user49422 Jul 21 '17 at 17:15