Questions tagged [confidence-interval]

A confidence interval is an interval that covers an unknown parameter with $100(1-\alpha)%$ confidence. Confidence intervals are a frequentist concept. They are often confused with credible intervals which is the Bayesian analog.

A confidence interval is an interval that covers an unknown parameter of interest (e.g., the mean) with $100(1-\alpha)\%$ confidence. Confidence intervals are a frequentist concept. A credible interval is a related concept in Bayesian statistics. People often incorrectly ascribe the meaning of credible intervals to confidence intervals.

In frequentist statistics, a confidence interval for a parameter, $\theta$, is an interval computed from a set of data whose distribution depends on that parameter in some way. The interval is computed such that, if the process of drawing a sample and computing the interval were repeated identically ad infinitum, the proportion of the intervals that included the true value of the parameter would converge to $(1-\alpha)$. This does not mean that the probability of a given interval including the true value of the parameter is $(1-\alpha)$. Each interval either does include the true value or it does not include the true value. The 'confidence' is a property of the procedure used to compute the interval and pertains to the theoretical infinite set of such intervals.

Some additional notes:

  1. The confidence interval is a function of the data, $X$. Since the data are conceptualized as a random sample from a population, confidence intervals are random variables (although the confidence interval computed on a particular set of data is a realization).
  2. Often one can only compute approximate confidence intervals, which may have the nominal coverage asymptotically.
  3. It may not be possible to compute any exact confidence interval that might otherwise be preferred if the data or the parameter can only take discrete or otherwise limited values.
  4. The same ideas can be applied to a set of parameters, e.g., $\vec{\theta} = [\mu\ \ \sigma^2]^T$. In that case, it is more correct to refer to the confidence region.
  5. In a regression context, the set of confidence intervals for all possible conditional means ($\mu_Y|X=x$) is called a confidence band.
4581 questions
64
votes
4 answers

Are all values within a 95% confidence interval equally likely?

I have found discordant information on the question: "If one constructs a 95% confidence interval (CI) of a difference in means or a difference in proportions, are all values within the CI equally likely? Or, is the point estimate the most likely,…
pmgjones
  • 5,773
  • 8
  • 38
  • 36
49
votes
14 answers

Clarification on interpreting confidence intervals?

My current understanding of the notion "confidence interval with confidence level $1 - \alpha$" is that if we tried to calculate the confidence interval many times (each time with a fresh sample), it would contain the correct parameter $1 - \alpha$…
Elliott
  • 543
31
votes
4 answers

Confidence interval of RMSE

I have taken a sample of $n$ data points from a population. Each of these points has a true value (known from ground truth) and an estimated value. I then calculate the error for each sampled point and then calculate the RMSE of the sample. How can…
robintw
  • 2,117
24
votes
4 answers

What is a rigorous, mathematical way to obtain the shortest confidence interval given a confidence level?

After reading the great answer for this question by @Ben, I am a bit confused by the part " set the relative tail sizes as a control variable, and then you find the formula for the length of the confidence interval conditional on that variable". I…
24
votes
7 answers

How to interpret confidence interval of the difference in means in one sample T-test?

SPSS provides the output "confidence interval of the difference means." I have read in some places that it means "95 times out of 100, our sample mean difference will be between between these bounds" I find this unclear. Can anyone suggest clearer…
Anne
  • 2,227
22
votes
1 answer

Confidence interval around the ratio of two proportions

I have two proportions (e.g., clickthrough rate (CTR) on a link in a control layout, and CTR on a link in an experimental layout), and I want to calculate a 95% confidence interval around the ratio of these proportions. How do I do this? I know I…
raegtin
  • 9,930
22
votes
5 answers

Narrow confidence interval -- higher accuracy?

I have two questions about confidence intervals: Apparently a narrow confidence interval implies that there is a smaller chance of obtaining an observation within that interval, therefore, our accuracy is higher. Also a 95% confidence interval is…
upabove
  • 3,137
17
votes
3 answers

Confidence interval for the product of two parameters

Let us assume we have two parameters, $p_1$ and $p_2$. We also have two maximum likelihood estimators $\hat{p}_1$ and $\hat{p}_2$ and two confidence intervals for these parameters. Is there a way to build a confidence interval for $p_1p_2?$
guest
  • 173
14
votes
4 answers

How to calculate confidence intervals for ratios?

Consider an experiment that outputs a ratio $X_i$ between 0 and 1. How this ratio is obtained should not be relevant in this context. It was elaborated in a previous version of this question, but removed for clarity after a discussion on meta. This…
koalo
  • 298
13
votes
3 answers

Interpreting a 95% confidence interval

I originally posted the following as a partial answer to a question asking why a 95% confidence interval does not imply that there is a 95% chance that the interval contains the true mean (see: Why does a 95% Confidence Interval (CI) not imply a 95%…
13
votes
1 answer

Does invertible monotone transformation of a confidence interval give you a confidence interval (at the same level) in the transformed space?

Suppose $$ (a,b) $$ is a $(1-\alpha)$ level confidence interval for a parameter $\theta$. Suppose $\eta$ is a monotone invertible transformation. Then, is $$ \left (\eta(a), \eta(b) \right ) $$ a $(1-\alpha)$ level confidence interval for…
SASsy
  • 221
10
votes
2 answers

Confused about confidence interval

I am confused about the concept of confidence interval. Specifically, assume there is a Gaussian variable $X \sim N(\mu, \sigma)$ with $\sigma$ known, and I am interested in the lower-bound $\mu_L$ of the mean with $95\%$ confidence level. I will do…
calbear
  • 165
9
votes
1 answer

Calculating 95% CI of a proportion of 0%

I have a sample of 72 individuals. 0 of these had the trait I'm investigating. How do I calculate the 95% CI when the proportion is 0%?
Rebekka
  • 91
8
votes
2 answers

The importance of a correct interpretation of a confidence interval

Given a confidence interval with 95% confidence level, it is incorrect to state that the probability of the estimated parameter being included in the confidence interval is 95%. The reason is that the sample has already been taken; the real value…
Sam
  • 537
8
votes
3 answers

Confidence intervals for a polynomial

I have a random variable $Z$ which takes values in the nonnegative integers $\{ 0,1,2,\dots \}$, call the probabilities for each outcome $z_k:=P[Z=k]$. I can sample from $Z$'s distribution independently and cheaply; I currently have a sample size…
1
2 3
20 21