This is with regards to the example given in the wikipedia article on statistical models. In the example it is claimed that "Each possible value of $\theta = (b_0, b_1, σ^2)$ determines a distribution on S," but no further explanation is given. I understand that $(b_0, b_1, σ^2)$ gives us a height probability distribution for any given age, but I do not understand how this can be extended to a probability distribution on the space of all (age, height) pairs.
The most straightforward idea is to consider y-axis aligned bell curves tracking along the line $y = b_0x + b_1$, but this of course does not integrate to 1 and so does not constitute a probability distribution on S.
Edit for clarification: A probability distribution on $R^2$ is a positive function $p$, such that $\int_{R^2} p(x,y) dxdy = 1$. Given a linear model $y = b_0 x + b_1 + \epsilon$, where $\epsilon \sim N(0, \sigma^2)$, how does a fixed $(b_0, b_1, \sigma^2)$ induce such a function, as claimed in the linked article?