I recently ran a PCA on a dataset of self-report data from 226 subjects to zoom in on which specific individual differences might account for participants’ predicted choices in a separate task we have them complete. Here are the resulting loading scores for PCs 1,2, and 3:
I added another 67 subjects to the dataset, reran everything exactly the same, and out came these loading scores:
As you can see, many of the self-report loading scores that were positive or negative for PC1 and PC2 in the first run (n=226) are now negative for the same PCs in the updated run (n=293), yet the magnitude stays more or less the same. This is puzzling to me, seeing as we have only added 67 subjects. How can we explain the switching of signs of the loading scores with just the addition of 67 subjects?
Let me know if I should elaborate on any of this. Any and all guidance would be much appreciated!
Thank you so much in advance.
