Questions tagged [variance]

The expected squared deviation of a random variable from its mean; or, the average squared deviation of data about their mean.

The variance of a random variable $X$, sometimes called a population variance and often denoted by $\sigma^2$, is the expected squared deviation from its mean:

$$\mbox{Var}\left[X\right] = \mbox{E}\left[\left(X - \mbox{E}\left[X\right]\right)^2\right] = \mbox{E}\left[X^2\right] - \left(\mbox{E}\left[X\right]\right)^2.$$

As such, the variance measures the spread of a random variable around its expected value. The square root of the variance, is called the standard deviation.

Variance is a parameter, i.e. a property of a random variable or of the population. One way to estimate that property from data, is by means of the sample variance. Concretely, assuming the population is infinite and given an independently and identically distributed random sample $X_1,\ldots,X_n$, the sample variance is

$$S^2 = (n-1)^{-1}\sum_{i=1}^n (X_i-\bar X)^2,$$ where $\bar X = n^{-1}\sum_i X_i$ is the sample average.

Not all random variables have finite variance. This occurs when $\mbox{E}\left[X^2\right] $ diverges. For example, the Cauchy distribution (Student's $t$ distribution with 1 degree of freedom) does not have a finite variance.

4239 questions
30
votes
3 answers

If teachers account for 30% of variance of student achievement, can a teacher have 30% increase in achievement by teaching better?

My professor wrote: Research using sophisticated statistical techniques indicates that teaching expertise accounts for about 30 percent of the variance in student achievement (Hattie, 2003). Think about what your student test scores would look like…
Mike
  • 419
  • 4
  • 5
25
votes
2 answers

Why does the variance of a sample change if the observations are duplicated?

The variance is said to be a measure of spread. So, I had thought that the variance of 3,5 is equal to the variance of 3,3,5,5 since the numbers are equally spread. But this is not the case, the variance of 3,5 is 2 while the variance of 3,3,5,5 is…
20
votes
8 answers

Why isn't variance defined as the difference between every value following each other?

This may be a simple question for many but here it is: Why isn't variance defined as the difference between every value following each other instead of the difference to the average of the values? This would be the more logical choice to me, I guess…
19
votes
2 answers

Explaining The Variance of a Regression Model

This may be a simple explanation (I'm hoping anyway). I have done some regression analysis in Matlab using the regression toolbox. However, I have come across a study that states this: "Using regression analysis, it was possible to set up a…
19
votes
3 answers

How to calculate the variance of a partition of variables

I'm running an experiment where I'm gathering (independent) samples in parallel, I compute the variance of each group of samples and now I want to combine then all to find the total variance of all the samples. I'm having a hard time finding a…
gallamine
  • 373
16
votes
1 answer

How to find variance between multidimensional points?

Suppose I have a matrix X which is n by p, i.e. it has n observations, with each observation in p-dimensional space. How do I find the variance of these n observations? In the case where p = 1, I just need to use the regular variance formula. What…
statnub
  • 841
16
votes
5 answers

Statistical variation in two Formula 1 qualifying formats

I've just read this BBC article about the qualifying format in Formula 1. Organisers wish to make qualifying less predictable, i.e. to increase statistical variation in the result. Glossing over a few irrelevant details, at the moment the drivers…
innisfree
  • 1,480
15
votes
4 answers

What is the practical application of variance?

I'm teaching myself probability theory, and I'm not sure I understand any use for variance, as opposed to standard deviation. In the practice situations I'm looking at, the variance is larger than the range, so it doesn't seem intuitively useful.
Andrew
  • 193
9
votes
2 answers

What are the minimum and maximum values of variance?

I am new to statistics. I am getting my hands dirty on VarianceThreshold. I am having a single dimensional array, containing N values. What's the maximum and minimum values of a variance for any values present in array? I guess the minimum value…
9
votes
1 answer

What's the minimum of $\mu (1-\mu)/ \sigma^2$ over all continuous unimodal distributions on a bounded interval $[0,1]$?

All distributions on a bounded interval $[0,1]$ satisfy: $$\sigma^2 \le \mu (1-\mu)$$ where $\mu$ is the mean and $\sigma^2$ the variance. Now suppose that the distribution is unimodal, in the sense that it has at most one local maximum. What's the…
a06e
  • 4,410
  • 1
  • 22
  • 50
8
votes
1 answer

why sample variance has has n-1 in the denominator?

Sample variance is calculated according to: $s^2=\frac{\sum{(x-\bar{x})^2}}{n-1}$ Population variance is calculated according to: $\sigma^2=\frac{\sum{(x-\mu)^2}}{n}$ Why denominator for sample variance is $n-1$ and not $n$? Any help is appreciated.
flashburn
  • 311
8
votes
1 answer

Does the uniform distribution have the greatest variance among all concave distributions on a bounded interval?

The uniform distribution on a bounded interval $[a,b]$ has variance $(b-a)^2/12$. Consider any concave distribution on the same interval (concave in the sense that the graph of the pdf lies above any line segment joining two points of the graph).…
a06e
  • 4,410
  • 1
  • 22
  • 50
7
votes
6 answers

How to express error as a percentage?

I have data about how many unique users do a certain thing for each day of a month. I can average it, and i would like to display the variation in a intuitive format (such as % of something). Is there a standard way of doing this? I've found…
7
votes
1 answer

k-subset with maximal variance

I have two versions of the same question: Given a list of numbers (with possible duplicates), how to find a k-subset (with possible duplicates) that maximize the variance? is there a more efficient way than the obvious "check-all-k-subsets"? Given…
GreyGeek
  • 171
6
votes
1 answer

Variance of $X_i / \sum\limits_{j=1}^n X_j$

Suppose that $X_1 ,X_2,...X_N$ are independent identically distributed positive random variables. How to find the variance of $\frac{X_i}{\sum\limits_{j=1}^n X_j}$, $i \in 1, 2, \ldots, n$?
1
2 3 4 5 6 7 8