Questions about that section of the book (see image below) were already asked and they cover how to get
from 2.9 to 2.11, see here
and from 2.12 to 2.13, see here
but do not cover how to get from 2.11 to 2.12!
2.11 can be written as
\begin{align*} & E_{X}E_{Y|X}([Y - f(X)]^2| X) \\ &= \int_x \left( E_{Y|X}([Y - f(X)]^2|X = x) \right) p(x)dx\\ &= \int_x\left( \int_y [y - f(x)]^2p(y|x)dy \right)p(x)dx \end{align*}
They just say "we see that it is sufficient to minimize EPE point-wise" and drop the integration over x and THEN do a point-wise minimization.
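For reference, the point-wise problem in 2.12, as I read it in the notation above, is the following. Here $c$ is just a name for the candidate value of $f(x)$ at one fixed $x$, and the second line assumes the conditional density $p(y|x)$ exists, as in my rewriting of 2.11:
\begin{align*} f(x) &= \operatorname{argmin}_c E_{Y|X}([Y - c]^2 | X = x) \\ &= \operatorname{argmin}_c \int_y [y - c]^2 p(y|x)dy \end{align*}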
But how come they can just drop the whole integration over $x$? Why is it sufficient to minimize 2.11 point-wise?
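To make the claim concrete, here is a small numerical sketch of what "minimizing point-wise" would mean. It is a toy check, not the book's argument: I assume a discrete $X$ with three values and a discrete $Y$ with four values, so the integrals in 2.11 become sums, and all numbers and names (`p_x`, `p_y_given_x`, `inner_loss`, `epe`) are made up for illustration.

```python
import numpy as np

# Hypothetical toy distribution (assumption, not from the book).
p_x = np.array([0.2, 0.5, 0.3])            # p(x) for three values of x
y_vals = np.array([0.0, 1.0, 2.0, 3.0])    # possible values of y
p_y_given_x = np.array([                   # rows: x, columns: y; each row sums to 1
    [0.70, 0.10, 0.10, 0.10],
    [0.10, 0.20, 0.30, 0.40],
    [0.25, 0.25, 0.25, 0.25],
])

def inner_loss(f):
    """Discrete analogue of the inner integral: sum_y (y - f(x))^2 p(y|x), one value per x."""
    return ((y_vals[None, :] - f[:, None]) ** 2 * p_y_given_x).sum(axis=1)

def epe(f):
    """Discrete analogue of 2.11: the inner losses averaged with weights p(x)."""
    return float(inner_loss(f) @ p_x)

# Point-wise minimizer: for squared error this is the conditional mean at each x.
f_pointwise = p_y_given_x @ y_vals

# Compare against many other choices of f, i.e. other triples (f(x1), f(x2), f(x3)).
rng = np.random.default_rng(0)
candidates = rng.uniform(-1.0, 4.0, size=(10000, 3))
best_random = min(epe(f) for f in candidates)

print("EPE of point-wise minimizer:", epe(f_pointwise))
print("best EPE among random f:    ", best_random)
```

In this toy setting `f` is just a vector with one free entry per value of $x$, and each entry only appears in its own inner term, which is then multiplied by a non-negative weight $p(x)$. Whether that observation is the whole justification in the continuous case is exactly what I am asking.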
