One small doubt I may be novice Please help me with your guidance.
I have dependent variable Log(Y) and a set of several independent variables say x1,x2,x3,x4,x5,x6,x7,x8,x9,x10,...... around 50 variables. I have tried many transformations like normalization , mean subtraction , reciprocals, sqrt and many other but I was not able improve my F value at all above 15 max.
But if I divide Log(Y) by one of independent variables elementwise say by x1 then I able a get a good fit of (Log(Y)/x1)= ax_i+bx_j+c*x_k+d with good regression values.
Is their anyway I can interpret this statistically any advice to improve. I am at a novice level when it comes to deep thing in statistics.
Please help with your suggestion it will help me.
By data is more biological in nature and it be difficult to share the data directly kind help
ALL X are some properties of proteins
Y is some activity of it.
50 variables are the main effects After including many other transformations it has become around 600 in totals with the transformation
Their 36 rows of data in this case As getting more may difficult in certain cases in biological studies based on the nature of the problem in use
I used standard model, best subsets method from automatic linear regression in SPSS to get this
The same has been verified via linear regression through Enter
My doubt here is on this mainly
But if I divide Log(Y) by one of independent variables elementwise say by x1 then I able a get a good fit of (Log(Y)/x1)= ax_i+bx_j+c*x_k+d with good regression values.
How can I interpret kind help
Yvalue and thex1value that you are dividing by. Also, please add to the question how many rows of data that you are trying to fit with 50 predictors and why you care so much about the value of F. If you have developed this model with stepwise regression, as seems to be the case from this question, and you also tried several different types of transformations before you came up with this model, all your measures of the quality of model fit will be wrong. – EdM Dec 10 '22 at 20:30