Notation - gradients

Question

I am reading Friedman's paper on stochastic gradient boosting - and saw as per below the following set of formulas. Purely from a notation perspective - why do we write $F(x)$ = $F_{m-1}(x)$ outside the brackets in step 3 when we take the gradient?

Matthew Drury · Accepted Answer · 2015-10-07T12:16:50.300

6

It's saying: compute the gradient of this expression with respect the variable $F(x)$ , and then substitute the value of $F_{n-1}(x)$ for $F(x)$ into the result.

The "take the derivative with respect to the variable $F(x)$" is a bit awkward conceptually, it may help to introduce a dummy for $F(x)$, like $\zeta = F(x)$, and then take the derivative with respect to $\zeta$.

edited Oct 07 '15 at 12:16

answered Jul 24 '15 at 18:25

Matthew Drury

35,629

Notation - gradients

1 Answers1