I have a population which can be thought of as infinitely large. I take a large sample (with replacement) of $n\approx10^{11}$, and observe a small probability of success ($\hat{p}\approx10^{-10}$).
I see that this answer states that to use the normal distribution approximation, the conditions should be:
$n\hat{p}>5$ and $n(1−\hat{p})>5$
The answer also lists the following alternative methods:
- Wilson score
- Clopper-Pearson
- Agresti-Coull
Given $n\hat{p}=10$, and $n(1−\hat{p})\gg 5$, is it okay for me to use the normal distribution approximation or should I use another method?
R:plot(0:24, dbinom(0:24, 1e11, 1e-10), type="h")It may help to overplot the Normal reference curve,curve(dnorm(x, 1e1, sqrt(1e1*(1-1e-11))), add=TRUE)The Poisson approximation will, of course, be excellent. – whuber Sep 08 '20 at 14:50