I have been wondering about this: do control variables need to be correlated with both the independent variable and the dependent variable?
For example, I want to estimate the effect of Education (independent variable) on Income (dependent variable) using a regression. Does it make sense to include control variables that clearly have a relationship with the dependent variable but not necessarily with the independent variable?
If not, why not? My friend argues that yes, they do need to be correlated with both the independent and the dependent variable; otherwise they just end up in the error term of the regression.
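To make the scenario concrete, here is a minimal simulation sketch of what I have in mind. Everything in it is an assumption made for illustration: the data are synthetic, the true Education coefficient is set to 1.0, and `other_factor` is a hypothetical control that affects Income but is drawn independently of Education. It fits the regression with and without that control so the Education coefficient and its standard error can be compared.

```python
import numpy as np


def ols(X, y):
    """Return OLS coefficients and classical standard errors.

    X must already contain a constant column.
    """
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    dof = len(y) - X.shape[1]              # residual degrees of freedom
    sigma2 = resid @ resid / dof           # estimated error variance
    cov = sigma2 * np.linalg.inv(X.T @ X)  # covariance of the estimates
    return beta, np.sqrt(np.diag(cov))


rng = np.random.default_rng(0)
n = 5000

# Synthetic data (assumed, not real): the control is independent of Education.
education = rng.normal(size=n)
other_factor = rng.normal(size=n)  # hypothetical control variable
income = 1.0 * education + 2.0 * other_factor + rng.normal(size=n)

const = np.ones(n)

# Regression without the control: other_factor is absorbed by the error term.
beta_short, se_short = ols(np.column_stack([const, education]), income)

# Regression with the control included.
beta_long, se_long = ols(
    np.column_stack([const, education, other_factor]), income
)

print("without control: coef=%.3f se=%.3f" % (beta_short[1], se_short[1]))
print("with control:    coef=%.3f se=%.3f" % (beta_long[1], se_long[1]))
```

In this setup both regressions should put the Education coefficient near the assumed value of 1.0, while the standard error should be smaller when the control is included. That is only one artificial case, though, so I am not sure how far it generalizes.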