Questions tagged [rare-events]

Situations where a level of a categorical variable occurs very rarely (eg, a rare disease). This can be a problem especially when the variable is the response variable in a model.

In statistics, "rare events" refers to situations where a level of a categorical variable occurs very rarely. Such cases can create special problems. The prototypical situation would be predicting a rare disease with a logistic regression model. However, a particular level of a multilevel categorical variable can also be rare, and there can be adverse consequences even when the variable is a predictor as well.

177 questions
1
vote
1 answer

Is there a R package or SAS proc for modeling rare events?

Does there exist packages in R or SAS that are designed specially for logistic regression, tree model or other binary classifiers for rare events? What is the simplest way to deal with rare events if no package is available? Adding weights?…
John
  • 283
0
votes
0 answers

Mitigating cell count <5 or almost zero

Mitigating problem when cell count $<5$ or almost zero a. 2x2 table with cell count zero b. When event rate is small in logistic regression: such that when seen further by an independent categorical variable it leads cell counts less than 5 or…
SR1
  • 31
0
votes
0 answers

Estimating the true frequency of rare events

I have a dataset containing some hundreds of thousands of observations, out of which some small number contain an event of interest x. Let's say that my total dataset is large enough that I have a decent confidence in the overall frequency of x. But…
JSBձոգչ
  • 101
  • 1