I did a PCA on some data as a data reduction technique. I had 8 original items. The results showed a single component accounted for most of the variance in the data, with all items loading onto a single component with loadings all $>.8$.
What I would like to do now is to use this component as an outcome in a regression analysis. My confusion is over what to use as the variable itself. Do I extract the component somehow (akin to factor scores in a CFA)? or do I just go back to raw items and create an index (by averaging raw items, for example).
This is an outcome variable for a multivariate regression model. We have 5 items. Before running the analyses, we wanted to see the underlying dimensionality of the data as a data reduction technique. What we found was that one component fit the data. All loadings were high.
Since I found one component, I"m wondering what the added benefit is of using the component - which I would define as the linear combination of the variable * their loading - versus the raw averaged scores as the measured outcome? Does that question make sense?
– user1638567 Mar 16 '16 at 15:20Do I extract the component somehow (akin to factor scores in a CFA)Why do you mention Confirmatory FA at all, I wonder. The immediate idea is to compute component scores of PC1. (If you find it make sense in the subsequent regression you're speaking about.) Averaging the highly loaded items is possible, too. Think of potential differences: see, see. – ttnphns Mar 17 '16 at 20:36