I have a dataset with a handful of predictors and one categorical outcome variable that can only take the values C or N.
I am working in R, using the plsda function from the mixOmics package. When I plot the PLSDA results with the plotIndiv function for the first two variates, the classes are not well separated. It looks like this:
The plotIndiv function takes an argument called rep.space which, according to the manual, determines the subspace in which the individuals are projected. Its possible values are X-variate, Y-variate and XY-variate. When the object being plotted is the result of a PLSDA, rep.space defaults to X-variate.
But if I set rep.space to XY-variate, then it looks like this:
This looks amazing. But is it too good to be true? In other words, is the plot using the outcome variable to artificially separate the individuals? What does XY-variate even mean? I understand that the X-variates are the new variables, analogous to the PCs in PCA. But what are the XY-variates? Are my predictors actually separating the two classes, or is the separation artificial?
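For reference, here is a minimal sketch of the two calls I am comparing. The data are simulated and only stand in for my real dataset; the names X, Y and res, the sample size and the number of predictors are placeholders I made up for this example.

```r
library(mixOmics)

set.seed(1)
n <- 40

# Simulated stand-in for my data (assumption): 5 predictors, two classes C and N
Y <- factor(rep(c("C", "N"), each = n / 2))
X <- matrix(rnorm(n * 5), nrow = n,
            dimnames = list(NULL, paste0("x", 1:5)))
X[Y == "C", 1] <- X[Y == "C", 1] + 0.5  # weak class signal in one predictor

# Fit the PLSDA with two components
res <- plsda(X, Y, ncomp = 2)

# Default for a PLSDA object: individuals projected on the X-variates
plotIndiv(res, comp = c(1, 2), rep.space = "X-variate",
          ind.names = FALSE, legend = TRUE)

# Same fitted model, individuals projected on the XY-variates
plotIndiv(res, comp = c(1, 2), rep.space = "XY-variate",
          ind.names = FALSE, legend = TRUE)
```

Both plots come from the same fitted object; the only thing that changes between them is rep.space.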

