I've started reading about Random Forests and one of the attributes that appeals to me is that they are good at dealing with independent variables that interact with one another. Does Random Forest also "automatically" transform variables into, for example, the square of the variable? Is there a good reason to square or perform other transformations of a variable (of course dependent on the situation)? For example if I have y = x1 + x2 + x3 is there anything to be gained by running the model y = x1 + x1^2 + x2 + x2^2 + x3 + x3^2?
Asked
Active
Viewed 3,094 times
7
Dean MacGregor
- 1,138
1 Answers
5
Random Forests are Decision Tree - based. So all they do is comparisons of your individual variables with some thresholds. Squaring your parameters simply will shift these thresholds, introducing no change to the actual output. So, in theory, RF's are not sensitive to rescaling.
sashkello
- 2,244
(0,0.1)for a group and on(0.95,1.05)for the other, now take the non-linear transformationx^200and there is a high chance those guys between0.95and1.00will be classified with the first group . – VFreguglia Sep 08 '18 at 21:58