1

I'm conducting a study on mandatory reports in the healthcare sector. I've got a sample of 760 visits (690 individual patients ). I will use a binary logistic regression model to see if my independent variables will affect if a child is reported or not (yes/no). Reported will be set ass dependent variable. I've checked for multicollinearity using VIF.

I have two questions:

  1. Is it okay to use VIF when the variables are categorical (1/0)? If no, is there another way to check for multicollinearity?
  2. Can I use backward elimination to exclude variables that are not significant? Or is there a better way to select the model?

Really appreciate if someone could help me!

mkt
  • 18,245
  • 11
  • 73
  • 172
  • 3
  • It's awkward to try to give direct answers to "can I?" and "do I have to?" questions on this site. It's preferable to talk about what the consequences of a certain decision are likely to be. The indicator of VIF will be more "crude", less sensitive, when using only binary variables, but I don't see what other choice you have. Maybe lower your threshold for a VIF that you would consider too high. 2. This has been treated extensively on this site and elsewhere: you could search for "stepwise problems", etc., but be prepared not so much for a quick answer as for a whole course of study.
  • – rolando2 Apr 25 '18 at 12:08