Briefly, 184 patients are included in my analysis. I have one variable that seperates 184 patients into two groups. 173 are in group 0 and 11 are in group 1. I need to compare the survival between these two groups. (This variable is something like gene mutation status, where 0 represents no mutation, 1 represents mutation)
Univariate Cox regression seems to be the choice. However, such unbalanced sample size between these two groups makes me worried about the power of test.
I didn't find much literature discussing this kind of situation. Two topics discussed this case first one second one. But my case is a little bit different from them.
My questions are:
- Is it correct to directly use univariate cox regression to test survival difference between these two groups?
- Downsampling might be a solution? But randomly sampling 11 objects from 173 patients results in millions possible combination. How can I achieve this in R?
caretpackage has the downsampling function, but it does not include cox model. After downsampling, it is impossible to perform k-fold cross-validation usingtrain()function. Could anyone tell me how to create a downsampling Cox regression model?
Thank you.
Surv(month,status)~stage+A. ), Cox model found significance in A and stage. But it didn't tell the significant difference between A0 and A1 groups is resulted from covariatestage(like more low stage patients have A0). How to evaluate it? By testing their interaction? – tomasz Mar 23 '22 at 14:49powerSurvEpito calculate the power of Cox regression based on exprected number of events and HR (as mentioned in functionpowerCT.default0). But later I found it is retrospective power as you mentioned "for design not analysis". I don't know if it is meaningless to use a power larger than 75% as threshold to filter results. – tomasz Mar 23 '22 at 14:49stageandAare significant in your 2-predictor model, then there's evidence that each has an association with outcome that goes beyond things like "more low stage patients have A0." The model adjusts for that type of thing, at least in part. (You still should report whether there is an association betweenAandstagevalues.) An interaction term betweenAandstagein the model would further evaluate whether the association ofstagewith outcome depends onA(and vice-versa). – EdM Mar 23 '22 at 16:17