I would like to forecast the return volatility in a financial market. I am using symbolic regression/genetic programming to generate models with a good in-sample fit. I would like to compute predictive R Squared for each model to enable me to select the model to make an out-of-sample forecast.
The brute force method of computing PRESS for a model involves removing one observation from the data set, finding the values of the model's parameters that minimize the sum of squared residuals, and then making a forecast using the resulting model for the observation removed earlier. Then we repeat this for each observation in the data set.
I am aware of a shortcut that ought to generate the same value of PRESS as the procedure described above. The shortcut is described on PRESS statistic for ridge regression and in the answer to How can one compute the PRESS diagnostic?
All of the sources that describe that shortcut mention that it is valid for "ordinary least squares".
My question is - can the models that I am working with be described as ordinary least squares?
The models generated by my symbolic regression algorithm are of the form Y = a + bf(A,B,C,D,E,G) + cg(A,B,C,D,E,G) + dh(A,B,C,D,E,G) + error term, where the functions f(), g(), and h() are nonlinear products like A*(C^2)DE*(G^3).
For the models of the form above, would the PRESS statistic computed using the full method be the same as the PRESS statistic computed using that shortcut?
Thank you for your kind help!