I would like to know why in the below formula the prior distribution of theta is not conditioned on X (observations):
$$P(\theta|X, y)=\frac{P(y|X, \theta)P(\theta)}{P(y|X)}$$
In my understanding, the correct formula should be:
$$P(\theta | X, y) = \frac{P(y| X, \theta) P(\theta| X)}{P(y|X)}$$
But I think I am missing something.
We added new condition to the formula P(theta | y, X). The entire formula has changed but the peior P(theta). Why does this make sense? If we condition to X, then both likelihood and prior should be conditioned to X.
– Amir Jalilifard Sep 09 '20 at 17:23