1

I have a dependent variable ($Y$) whose distribution is as shown below. In particular, the variable exhibits positive skewness, has a few negative values and many positive values (generally less than one), and peaks near zero. The distribution after log transformation ($\ln(1+Y)$) is very similar too.

enter image description here

Would OLS models be a good choice in this case, or are there any alternatives that would better fit the nature of the dependent variable?

  • 4
    That is the marginal distribution of the response, yes? But regression models do not make assumptions on the marginal distributions, only on the conditional distribution given the regressors! That can only be tested after fitting a model, by, say, plotting the residuals. So show us the residuals after a linear regression. – kjetil b halvorsen Nov 20 '17 at 13:29
  • @kjetilbhalvorsen Hey can you expand a bit more on marginal distribution and conditional distribution? Maybe some links or articles? This is the first time I am hearing about it and before this I was only familiar of the distribution of variable like when we plot a distplot using Seaborn. – spectre Jan 06 '22 at 11:21
  • @spectre: This will be treated in any introduction to probability, but see many posts on this site – kjetil b halvorsen Jan 09 '22 at 14:30

1 Answers1

1

Have you considered a Skewed Normal Distribution?

I believe that using this distribution you'll be able to account for the skewness of your variable, but keeping the parametric space of a Normal Distribution, which seems adequate here.

Also, one part of your question was already answered here: Skewness in dependent variable (OLS, Gauss Markov, non-normality)

Bruna w
  • 541
  • 1
    Many thanks for the suggestion, worked very well. In fact, a skewed-t distribution fits better than a skewed normal distribution. – financial theory Nov 20 '17 at 13:47