1

I already referred these posts here. Please don't mark it as duplicate. This post is continuation of my other post here. So, am not providing the full context here. You can refer the link here

a) Is there any other regression model that can take raw input (without any data standardization/transformation using log,sqrt etc) and give us positive predictions only? or any other relevant regression model for customer revenue prediction problem?

The Great
  • 3,272

1 Answers1

3

YES

Generalized linear models with appropriate “link” functions can force predictions to be positive. For instance, logistic and probit regression squeeze into $(0,1)$ the values given by the linear prediction. Other link functions can allow for different allowed prediction values, such as a $\log$ link forcing values to be positive (but any positive number is possible).

Notably, this is different from transforming the outcome variable. Remember that you are (probably) estimating an expected value. For a nonlinear link function $g$:

$$ \mathbb E\left[g\left( Y\vert X=x\right)\right] \ne g\left( \mathbb E\left[ Y\vert X=x \right] \right) $$

Therefore, using a link function is not equivalent to transforming the outcome.

This idea can be extended to models such as generalized additive models and neural networks. If $R$ is the regression function that has unconstrained outputs (so $R(X)=X\beta$ for a generalized linear model), you can take $ g(R(X)) $ to assure yourself of positive values.

Dave
  • 62,186
  • 2
    Maybe a better example (in this case) would be gamma regression? – kjetil b halvorsen Nov 20 '22 at 17:50
  • Would tweedie, huber all fall under this category (like gamma)? – The Great Nov 21 '22 at 00:05
  • @TheGreat Gamma regression refers to using a gamma likelihood with a logarithm link function that precludes predictions less than zero. Huber regression uses a particular loss function but puts no restrictions on the predictions (unless you structure the model to do so, such as a link function). – Dave Nov 21 '22 at 00:37
  • @Dave - I was trying gamma regression and encountered this error which am not able to figure out why. Do you have any idea on why is this happening? https://stats.stackexchange.com/questions/596414/gamma-regressor-some-values-of-y-are-out-of-the-valid-range-of-the-loss-hal – The Great Nov 21 '22 at 08:14