0

My model consists of a dependent variable $y$ that can take values 0 and 1. The independent variable $x$ is a categorical variable that can take values, 1, 2 and 3. The logit model looks like this: $$L=\beta x$$ where $L$ are the log-odds of event $y=1$.

Now when I see my data graphically, I can clearly see the probability of $y=1$ increasing as $x$ increases from 1 to 3. However, when I run the regression above in Stata, I don't get significant values for $\beta$.

Given that I know that $x$ changing causes the probability of $y=1$ to change, can these insignificant results be interpreted? Are they useful at all?

Edit:

The sample size is very small, only 273 observations. $y=1$ occurs 204 times whereas $y=0$ occurs 69 times. This is a bar graph showing the percentage of $x=1, 2, 3$ that also have $y=1$:

enter image description here

Regression results:

enter image description here

  • The coefs are mislabeled above. 1,2 should be 2,3.
Parth
  • 103
  • @user158565 edited – Parth Jul 31 '19 at 20:55
  • Given Yes/no = 204/69, the sample size is not too small. If there is no other covariate in the model, i can say that maybe your visually difference is kind of illusion. maybe you could add graph in your question. How do you use the $x$, continuous or categorical. – user158565 Jul 31 '19 at 21:03
  • @user158565 thanks for the input. I've uploaded the graph and the regression results. x is categorical. – Parth Jul 31 '19 at 21:24
  • It is really difficult to say there is difference between three groups based on graph and results of logistic regression. – user158565 Jul 31 '19 at 21:35
  • I think your logit coefficients are mislabeled. invlogit(1.176321)=.76428567 (category 1), invlogit(1.176321+.463422) =.83749996 (category 3, not 2), and invlogit(1.176321+.4106438)=.83018864 (category 2, not 1). – dimitriy Jul 31 '19 at 22:09
  • You might want to report the output of margins your_cat_var or margins r.your_cat_var after the logit to make the comparison to your bar chart easier. – dimitriy Jul 31 '19 at 22:12
  • Given LRT p value = 0.3463, no way to decrease p value < 0.05, and no way to argue that x has effect on y. – user158565 Aug 01 '19 at 01:21
  • @DimitriyV.Masterov yes you're right. They should be 2 and 3, not 1 and 2. Thanks. – Parth Aug 01 '19 at 04:56