I am building a Cox PH model in r to analyze my data and I am unsure what the best way to do it is. It is observational data and the idea is that we want to know the effect of a treatment A used at two different hospitals, site 1 and 2. We examine time-to-death and we have a few covariates (binary and continous) that we would like to include as well. However, due to differences in standard of care it is not given that often at site 2 whereas it is more common at site 1. This means that when it is NOT given at site 1 there is something very different going on with the patient. What we want is therefore to compare all patients treated with A (site 1 and 2) with patients not treated with A at site 2 only. So, we want to exclude patients from site 1 not receiving the treatment, because we believe there is another reason for not recieving treatment A. How would you build this model? Should I construct a dummy variable with a binary for TreatmentA vs. Site2_NoTreatmentA and exclude the Site1_NoTreatmentA or should I build the model with a strata or cluster for site?
Thank you in advance.
If we just consider the problem with having two sites. If I include a cluster(site) It will be the same as including the dummy variable as interaction term, right?
– User LG Oct 10 '22 at 05:48clusterterm doesn't affect coefficient estimates, it only adjusts standard error estimates, and it doesn't allow for different treatment effects between sites. Astrata(site)term with atreatmentinteraction or asitedummy coefficient with atreatmentinteraction is needed to allow for different treatment effects between sites. – EdM Oct 10 '22 at 13:08