I was reading this great answer: What are the advantages/disadvantages of these approaches to deal with volatility surface?
And I have the following question:
How to show that the forward volatility smile of Local Volatility flattens while the one from Stochastic Volatility does not?
Could you please provide a mathematical demonstration of this fact?
EDIT
This answer answers my question. But in the following equation from the answer
$C(t,S;T\to T+\theta,K) := E^Q[(\frac{S_{T+\theta}}{S_{T}}-K)_+] =: C_{BS}(S=1,\theta,K;\Sigma(t,S;T\to T+\theta,K))$
I don't understand why the payoff of the forward start option is not rather
$ E^Q[(S_{T+\theta}-S_{T})_+] $
and why it is set $S=1$