2

I'm dealing with this big dataset which has:

  • 1 categorical variable with 90 levels that represent some sort of "geographical area"

  • 3 continuous variables

What I'm trying to do is to "aggregate" the levels of the categorical variable, to a maximum of 10.

I don't know exactly which technique I should use to accomplish this, for example whether I should use factor analysis or some clustering (unsupervised)

mariob6
  • 540

0 Answers0