I currently have a quite ok model fitting my data with Regression Tree and a Random Forest. However, while trying to move on to a Neural Network (to evaluate other models), I have been struggling to put above average R2_score results. Even though my RF model could put good results.
So far I:
- Tested several combinations of activation (relu, tanh etc.)
- Tested different layers-size combinations
- Normalizing/Standardizing my Data
- Removed a couple features (which actually performed OK in the Random Forest)
- Tried different random samplings by changing the stratification and the RandomSeed values
As I'm pretty much getting a bit confused by those approaches, my question might be a bit elementary, but: Does a model that was able to be fit by a regression tree, should also be able to be fit by a NN? Is there any restriction I'm unaware of?
hidden_layer_sizesand the optimisers you have tried which are rather significant influences to the networks performance. – usεr11852 Mar 23 '18 at 10:13For the hidden_layer_sizes, I've been iterating in a range fashion and checking if any combination performs best (hidden_layer_sizes=(j, ), with j iterating from 10 to 100 in steps of 10). As for the optimizer I've been trying mostly with the stochastic gradient-based 'ADAM'. Will have a more look on some others. It is already a heads up! thanks!
– Nicolas Vieira Mar 23 '18 at 10:21