My GLM is as follows:
logit.final <- glm(Claim_Occurrence ~ Sum.Insured100kto200k + Sum.Insured200kto300k +
Sum.Insured30kto50k + Sum.Insured50kto100k +
Sum.Insured300Kplus,
family = binomial(link = "logit"), offset = Exposure.Years.Earned)
I am trying to predict whether a claim will be reported in a vehicle or not, based on sum insured. The base level of the Sum.Insured categorical variable is Sum.Insured0to30K. Exposure years is the offset term, which is between 0 and 1. For example, a 0.5 would mean 6 months and 1 would mean a year.
If the fitted intercept is -2.64997, does this mean the odds of a claim occurring in a vehicle with sum insured 0 to 30K is 7.07% (i.e., $\exp -2.64997)$)? Would the offset term have any influence on this odds / interpretation?
EDIT:
I read somewhere that the coefficient of an offset is 1. So to incorporate the offset in my interpretation, would the odds be $\exp(-2.64997 + 1) = 19\%$?
EDIT 2:
Okay, as per advise in the answer, I have removed Exposure Years Earned from offset term, and included it as a predictor.
My revised glm model is now as follows:
logit.final <- glm(Claim_Occurrence ~ Sum.Insured100kto200k + Sum.Insured200kto300k + Sum.Insured30kto50k + Sum.Insured50kto100k + Sum.Insured300Kplus + Exposure.Years.Earned, family = binomial(link = "logit"))
My intercept is now -3.6464, and coeff estimate of Exposure years earned is 2.0046.
So if I want to find probability of claim occurrence of a vehicle with sum insured 0 to 30K, and exposure years earned worth of 1.083, would it be Exp(-3.6464) x Exp(2.0046) x 1.083 = 20.98% ?
Intercept + coefficient * predictorValue) before you do any exponentials. The value I get for the linear predictor is then -1.476, which exponentiated is 0.228. Your value of 0.2098 came from exponentiating the coefficient for exposure-years before multiplying by the number of exposure-years. Second, the exponentiated result is the estimated odds of the outcome, not the probability. – EdM Sep 20 '20 at 16:15