What, exactly, is a random variable in a simple ols model? ($Y = B_0 + B_1*X + U$)

Question

If our population regression function is ($Y = B_0 + B_1 \times X + U$) where $U$ is a disturbance term with $Var[U]$ = $\sigma^2$...

How would one classify the following (random variable or not):

$X_i$
$\hat{B}_0$
$\hat{B}_1$
$U$
$\sigma^2$
$Y_i$

It seems clear to me that $Y_i$ is a random variable because it has a probability distribution for what values it could take on. What is least clear is $\hat{B}_1$ and $\hat{B}_0$? Those have some definite value for our sample, but would change if we took a new sample. The same is true for the $X_i$th value of $X_i$.

This is a well asked question, but we do require questions like the have the self-study tag. I've added it for you. — Matthew Drury, Oct 04 '17 at 00:28

score 2 · Answer 1 · answered Mar 26 '18 at 01:34

Now that this is several months old, I am going to assume it is safe to give a full answer, without unfairly advantaging you in any academic assessment. So, have a look again at the model form:

$$Y_i = B_0 + B_1 X_i + U_i \quad \quad \quad \mathbb{V}(U_i) = \sigma^2.$$

In a classical linear regression model, the explanatory variables $X_i$ are considered to be fixed (since the problem is to find the conditional behaviour of the response) and the parameter values $B_0$, $B_1$ and $\sigma$ are considered to be "unknown constants", meaning they are also treated as fixed.$^\dagger$ So the only thing on the right-hand-side of the equation that is random is the error term $U_i$.

Since $U_i$ is a random variable, this means that anything that depends on $U_i$ is also a random variable. Hence, the response $Y_i$ is also a random variable (when not conditioning on it having been observed) and the estimators $\hat{B}_0$ and $\hat{B}_0$, which depend on these response values, are also random variables.

Hence, in this question, items 2, 3, 4 and 6 would all be considered random variables.

$^\dagger$ This assumes you are doing your analysis in the classical frequentist statistical framework. Using Bayesian analysis the parameters would be treated as random variables.

What, exactly, is a random variable in a simple ols model? ($Y = B_0 + B_1*X + U$)

1 Answers1

Linked