1

The aim of my question can be better illustrated by this quote extracted from the third chapter of Elements of Statistical Learning (link to book): Quote from ESTL book

I'm trying to understand why, given that the error term follows a Gaussian distribution with mean 0 and constant and finite variance, do the coefficients also follow a Gaussian.

I understand how to derive the expected values of the beta estimate as well as its variance. I'm only having trouble proving its sampling distribution (the Normal).

Also (and please let me know if I should be asking this question in a new post)

  • how would the sampling distribution of the beta estimates be affected if the error term is not normally distributed - would they, for example, also tend to follow a normal distribution if our sample size is sufficiently large, under the CLT?

1 Answers1

0

Quoting almost verbatim from Wooldridge...

Under the assumptions of the classical linear model (namely, that the errors are gaussian with mean 0 and finite variance), each of the estimated coefficients can be written as

$$ \hat{\beta}_j = \beta_j + \sum_{i=1}^n w_{i, j}u_i$$

where $w_{i, j} = \hat{r}_{i,j}/SSR_j$ is the ratio between the ith residual from the regression of $x_j$ onto the other variances and the sum of swaured residuals from said regression. Because the $wz$ depend only on the independent variables, we can treat them as nonrandom and observe the estimated coefficients are linear combinations of iid gaussians. Since, by assumption, the $u$ were iid Gaussian, so to is the estimated coefficient.

  • Thank you I'll take a look at it! In the meantime I also found a post that mostly answers my question (this post question did not pop up in the 'possibly related' section when writing a new post). It talks about affine transformations of multivariate normal distributions – Frederico Portela Oct 12 '23 at 10:58