I am trying to understand the reparameterization trick. I got some intuition while looking at this popular question, but I am still largely confused. I am putting my understanding and doubts here and would appreciate the community's help. Let's assume:
$y = 10$
$\hat{y} = w_3*z + b$
$z \sim N(\mu, \sigma^2)$
$\mu = w_2*4$
$\sigma = w_1*3$
Now, if I had to compute $de/d\mu$ (where $e$ is the error, i.e. the difference between $y$ and $\hat{y}$), I wouldn't be able to apply the chain rule, because that requires $dz/d\mu$. Here $z$ is a sample from a Normal distribution with parameters $\mu$ and $\sigma$, so it is stochastic, and it is not clear to me how to differentiate a random sample with respect to those parameters. So we come up with the following $z$:
$z = \mu + \sigma*\epsilon$ (Apparently $z = \mu + \sigma*\epsilon$ is the same as $z \sim N(\mu, \sigma^2)$)
$\epsilon \sim N(0,1)$
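To check the "apparently" part for myself, here is a minimal NumPy sketch that draws $z$ both ways and compares the sample mean and standard deviation. The values of `w1` and `w2` are hypothetical, picked only for illustration; nothing above fixes them.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical weights, chosen only for illustration (not given in the setup).
w1, w2 = 0.5, 0.25
mu = w2 * 4      # mu = 1.0
sigma = w1 * 3   # sigma = 1.5

n = 200_000

# Version 1: sample z directly from N(mu, sigma^2).
z_direct = rng.normal(loc=mu, scale=sigma, size=n)

# Version 2: sample eps from N(0, 1), then shift and scale it.
eps = rng.standard_normal(n)
z_reparam = mu + sigma * eps

print("direct:          mean, std =", z_direct.mean(), z_direct.std())
print("reparameterized: mean, std =", z_reparam.mean(), z_reparam.std())
```

Both versions should report a mean close to `mu` and a standard deviation close to `sigma`, which is consistent with the two definitions of $z$ having the same distribution.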
However, I don't see why this $z$ is much different from the previous one. This $z$ is still stochastic, due to the presence of $\epsilon$. Perhaps my understanding is wrong.
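To make the doubt concrete, here is a small sketch of what I think the trick is supposed to buy: once $\epsilon$ has been drawn, I treat it as a fixed number and differentiate $e$ with respect to $\mu$ numerically. The values of `w3`, `b`, `sigma`, and `mu` are hypothetical, and `error` is just a helper name I made up for this example.

```python
import numpy as np

y = 10.0
# Hypothetical values, chosen only for illustration.
w3, b = 2.0, 1.0
sigma = 1.5

def error(mu, eps):
    """e = y - y_hat, with z written as mu + sigma * eps."""
    z = mu + sigma * eps
    y_hat = w3 * z + b
    return y - y_hat

rng = np.random.default_rng(0)
eps = rng.standard_normal()   # drawn once, then held fixed

# Central finite difference of e with respect to mu, eps held fixed.
mu, h = 1.0, 1e-6
grad_fd = (error(mu + h, eps) - error(mu - h, eps)) / (2 * h)

# Chain rule by hand: de/dy_hat = -1, dy_hat/dz = w3, dz/dmu = 1.
grad_chain = -1.0 * w3 * 1.0

print("finite difference:", grad_fd)
print("chain rule:       ", grad_chain)
```

If I read this correctly, the two numbers agree for any fixed draw of `eps`, so $dz/d\mu = 1$ is well defined even though $z$ itself is still random. Whether this is really the point of the trick is exactly what I would like confirmed.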