I have the following output from a boosted regression trees model and I would like to calculate the total deviance explained.
mean total deviance = 1.283
mean residual deviance = 0.107
estimated cv deviance = 0.212 ; se = 0.045
training data correlation = 0.97
cv correlation = 0.937 ; se = 0.016
training data AUC score = 1
cv AUC score = 0.996 ; se = 0.002
I have come across two methods to do this which give me a different answer.
1) D2 = 1 – (residual deviance/total deviance) (Nieto and Mélin, 2017)
With my results this equation is
D2 = 1 - (0.107/1.283) = 0.92
2) D2 = (total deviance - cross validated residual deviance)/total deviance (Leathwick et al., 2006)
With my results this equation is
D2 = (1.283-0.212)/1.283 = 0.83
The paper for method 2 does not provide the equation in their text but it is clear given their model results (Table 3, pp.272) that this is how it was calculated. Also, there is a question here that discusses this method.
Which of these methods is correct?
residual devianceis what, in the first question, is denoted astotal deviance. Can you please clarify that these are meant to be the same (i.e. the total deviance)? – usεr11852 May 17 '19 at 09:20