I've been using Cox for survival analysis with the goal of identifying the effect of 5 different levels of a treatment on survival. My dataset is not small and has 130 covariates after dummy encoding.
About 15 of these - including one level of my treatment - violate the PH assumption.
I think in my situation the best extended Cox option would be adding time interactions to those that violate the PH assumption meaningfully. Another option, of course, could be an AFT, if the assumptions there are also satisfied.
My question is: If I were to try each of these approaches, how could I compare which these are preferred given my objective. That is, how would I know which is better? It's not like I'm making a prediction about survival in which case I could use CI or something more precise. Thus, comparing which is best and knowing how to know this is not obvious to me.
cox.zph. You can run Spearman's correlation of survival time vsr <- resid(fit, 'scaledsch')to get it. I think you can get the survival time fromas.numeric(dimnames(r)[[1]]). $\rho$ gives you a unitness index of how correlated $\beta(t)$ is with $t$, which is a sample-size-less index of non-PH. – Frank Harrell Aug 10 '22 at 14:36Small enough to ignore, then? Feel free to link anything. You have one example in your book in Chapter 20 in which ρ is -0.23 and highly significant but you do not comment on what -0.23 means in terms of magnitude
– JED HK Aug 11 '22 at 08:01https://www.fharrell.com/post/impactpo and even better to embed models inside more general models putting Bayesian priors on parameters you hope you don't need (in your case, parameters allowing for non-PH). We need a linear rather than a dichotomous modeling process. – Frank Harrell Aug 11 '22 at 11:47