I want to find a single metric to assess how spread (or how much variance) a multidimensional dataset (a large number of features) is. I learned that the determinant (or pseudo-determinant) of the covariance matrix of features could be a good measurement (the volume intuition). However, taking pseudo-determinant as an example, it requires the computation of all the non-zero eigenvalues (with svd). For a large covariance matrix, the full-svd is usually slow, but the partial-svd (e.g. get only a few biggest eigenvalues) is fast. I wonder if it is fair to only use the first few biggest eigenvalues to evaluate the total spreads/variance, e.g., by calculating a production of them. If so, does it also have a geometric intuition (such as the volume of a subspace)? After extensive searching, I haven't found discussion/usage along this idea. Thanks.
I have read the questions posted (very helpful already):