I am trying to compare the impact of different censoring proportions on different survival analysis models. For that, I plan to do a simulation study and then apply the best model on real-life child mortality data to find the determinants of mortality.
The simulated data should reflect the pattern of effects seen in child mortality data. My data set consists of 32 covariates. I’m having trouble understanding how to simulate the data using the simsurv package in R. I have gone through so many research papers but none of them actually gave me a clear picture of how the data must be simulated. I failed to understand how to include the covariates in the simulation.
Can someone please explain to me how I should simulate the data to reflect the covariates and the censoring proportions? From my understanding, the 2-component Weibull mixture model is a good fit for complex data like this.
I was told that I should first decide on a survival function that best fits the data by fitting a parametric survival model (Weibull). Then compare with a 2-component Weibull mixture model to see which model better fits the data. But where do I incorporate covariates and censoring proportions? Is my understanding wrong?