The aim of my question can be better illustrated by this quote extracted from the third chapter of Elements of Statistical Learning (link to book):

I'm trying to understand why, given that the error term follows a Gaussian distribution with mean 0 and constant and finite variance, do the coefficients also follow a Gaussian.
I understand how to derive the expected values of the beta estimate as well as its variance. I'm only having trouble proving its sampling distribution (the Normal).
Also (and please let me know if I should be asking this question in a new post)
- how would the sampling distribution of the beta estimates be affected if the error term is not normally distributed - would they, for example, also tend to follow a normal distribution if our sample size is sufficiently large, under the CLT?