
I'm new to data analysis and data mining. The papers I'm reading often use the term "high dimensional multivariate data set." My current task is to detect outliers in a large, complex data set and visualize them. How can I tell whether my data set is multivariate and high dimensional?

Sycorax
beeCoder

1 Answer


A high dimensional multivariate data set is simply a data set with lots of variables. These days, most data sets qualify. Exactly how many variables make it "high" is not, as far as I know, generally agreed upon.
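As a practical starting point, one can simply count the observations (rows) and variables (columns) in the data. The sketch below is only an illustration: the function name `describe_dimensions` and the toy data are hypothetical, and reporting the ratio of variables to observations is an assumption added here as a rough gauge, not a threshold taken from this answer. A data set with more than one variable is multivariate; whether the variable count is "high" remains a judgment call.

```python
def describe_dimensions(rows):
    """Return (n_observations, n_variables, variables_per_observation).

    `rows` is a list of equal-length lists, one inner list per observation.
    """
    n = len(rows)
    p = len(rows[0]) if n else 0
    if any(len(r) != p for r in rows):
        raise ValueError("rows have inconsistent lengths")
    # The p/n ratio is only a rough gauge (an assumption of this sketch);
    # there is no agreed cutoff for "high dimensional".
    ratio = p / n if n else float("nan")
    return n, p, ratio


# Hypothetical toy data: 3 observations, 5 variables each.
data = [
    [1.0, 2.0, 3.0, 4.0, 5.0],
    [2.0, 1.0, 0.0, 4.0, 6.0],
    [9.0, 8.0, 7.0, 6.0, 5.0],
]
n, p, ratio = describe_dimensions(data)
print(f"{n} observations, {p} variables, p/n = {ratio:.2f}")
```

If the number of variables is comparable to or larger than the number of observations, many standard methods become harder to apply, which is one common reason a data set gets called high dimensional.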

Peter Flom
    "High" presumably means no more than "challenging" to one or more of the hardware, software, programmer or user. I agree with Peter's implication that there is not an agreed technical definition. – Nick Cox Nov 20 '13 at 14:30