14

I work in Political Economy, and a lot of the models include "innocent" control variables such as population, inequality, colonial legacy, etc. so that the author can claim unbiasedness on their independent variable of interest.

But if any of these control variables are endogenous to some omitted variable, doesn't this contaminate the unbiasedness of ALL the independent variables?

If that's true, then what can we do? Leave those control variables out and they lead to omitted variable bias themselves. Include those in and they will contaminate everything in the model.

Example: A researcher wants to know if inequality leads to violence, and he controls for a few things: \begin{equation} Violence = Inequality + Growth + Development + \epsilon \end{equation} Seeing that Inequality is likely to be endogenous (because of the omitted variable Level of altruism), he will try to find a instrumental variable for Inequality. But aren't Growth and Development likely to be endogenous (i.e. correlated with Level of altruism) too?

This example may look silly, but my point is in Political Economy / Development work, there are so many factors at play (yet omitted) that I'm afraid many variables included on the LHS are endogenous. Yet often, the researcher only looks for an instrument for his pet independent variable only.

Heisenberg
  • 714
  • 2
  • 6
  • 14
  • Yet another thing to consider is the so-called "bad control" issue - a situation when the control is an outcome variable itself. I would suggest you to read Section 3.2.3 in Angrist and Pischke's celebrated "Mostly Harmless Econometrics" to get a grasp of this topic and why it matters if you want to have a better understanding of your question. – MauOlivares Apr 18 '18 at 00:34

5 Answers5

10

"But if any of these control variables are endogenous to some omitted variable, doesn't this contaminate the unbiasedness of ALL the independent variables?"

I don't want to emphasize this too much, but it's worth mentioning that this is not true in general. The following derivation will hopefully provide some understanding of the "contamination" you mention. As a simple counterexample, suppose that the data generating process is given by $$ Y = X_1 \beta_1 + X_2 \beta_2 + Z \gamma + \varepsilon, $$ where $Z$ is unobserved. Let $Cov(X_1,Z) = 0$, $Cov(X_2, Z) \neq 0$, and $Cov(X_1,X_2) = 0$. Then, it is clear that $X_2$ is "endogenous." But notice that because $Cov(X_1,Z) = 0$, our estimate of $\beta_1$ will still be ok: $$ \text{plim}\, \hat \beta_{1} = \beta_1 + \gamma \frac{Cov(X_1^*, Z)}{Var(X_1^*)} = \beta_1, $$ where $X_1^* = M_2 X_1$ and $M_2 = [I - X_2(X_2'X_2)^{-1}X_2']$. Because $Cov(X_1,X_2) = 0$, $X_1^* = X_1$. So $Cov(X_1^*,Z)=0$.

"What can we do?"

One of the mains challenges of doing good econometrics is thinking of potential identification strategies. In the type of situation you describe, there is probably nothing you can do but to try to approach the problem a different way.

jmbejara
  • 9,345
  • 5
  • 30
  • 80
  • While you're technically right, I would not emphasize this point. I'd rather say that in general, we cannot rule out biasedness of any of the variables, instead of saying in some scenarios its ok, well, because we usually don't know the DGP. – FooBar Jan 28 '15 at 14:30
  • Could you point me to a reference where the $\hat\beta$ is derived this way? I wasn't taught this in my econometrics. 2) Where do you use $Cov(X_1, Z)=0$ in the proof? It seems like $Cov(X_1, X-2)=0$ is sufficient. 3) I agree with @FooBar that $Cov(X_1, X_2)=0$ are the exception, not the norm. Indeed, if $Cov(X_1, X_2)=0$ we wouldn't bother to control for $X_2$ in the first place (except to increase precision).
  • – Heisenberg Jan 28 '15 at 14:38
  • @FooBar, I agree. I've updated the post to emphasize that this is a special case. As far as the point about not knowing the DGP, that is true. But that's not the point. Any analysis has to make assumptions about the DGP and the quality of the analysis depends on the quality of the assumptions. The derivation I gave just serves to illustrate an example of the assumptions (albeit, very strong assumptions) that could get you where you'd want to go. – jmbejara Jan 28 '15 at 17:52
  • @Heisenberg: 1) Could you open a new question in main about this? If you just copy and paste the derivation and present your question, that would be best. 2) $Cov(X_1, Z) = 0$ is needed when I say that $Cov(X_1^*,Z) = 0$. 3) You're right. If we we're interested in predicting $Y$, it would be important. But, yeah, that's a good point. On the other hand, it's maybe useful to note that the size of the bias depends on how correlated you believe $X_1$ and $X_2$ to be. – jmbejara Jan 28 '15 at 17:59
  • 1
    @jmbejara I posted 1) as a separate question. Please feel free to edit my question / title, since I don't know how to phrase the title intelligently and useful for Googler in this case. – Heisenberg Jan 28 '15 at 19:03
  • You're missing a subscript on the first $\beta$ in the DGP. – dimitriy Jan 29 '15 at 02:12