Carlos Cinelli in this great post https://stats.stackexchange.com/a/384460/198058 gives an example of 3 different Data Generating Processes/Causal Models giving rise to the same joint distribution $(X,Y)$. Below is a snapshot of the 3 models taken from his post.
I was able to work out that for each of the 3 models $P(X)=P(Y)=N(0,1)$ but there is still quite a bit of work left to do to show that in these 3 models $(X,Y)$ have the same joint distribution. In model 1: $P(X,Y)=P(X)*P(Y|X)$ but what is $P(Y|X)$? In model 2: $P(X,Y)=P(Y)*P(X|Y)$ but what is $P(X|Y)$? Model 3 is even more complicated. I am hoping someone can explain all the steps we go through in order to show that these 3 models have the same joint distribution.
