Questions tagged [distributions]

A distribution is a mathematical description of probabilities or frequencies.

Overview

A distribution is a mathematical description of probabilities or frequencies. It can be applied to observed frequencies, estimated probabilities or frequencies, and theoretically hypothesized probabilities or frequencies. Distributions can be univariate, describing outcomes written with a single number, or multivariate, describing outcomes requiring ordered tuples of numbers.

Two devices are in common use to present univariate distributions. The cumulative form, or "cumulative distribution function" (CDF), gives--for every real number $x$--the chance (or frequency) of a value less than or equal to $x$. The "density" form, or "probability density function" (PDF), is the derivative (rate of change) of the CDF. The PDF might not exist (in this restricted sense), but a CDF always will exist. The CDF for a set of observations is called the "empirical density function" (EDF). Thus, its value at any number $x$ is the proportion of observations in the dataset less than or equal to $x$.

References

The following questions contain references to resources about probability distributions:

9560 questions
59
votes
4 answers

How to identify a bimodal distribution?

I understand that once we plot the values as a chart, we can identify a bimodal distribution by observing the twin-peaks, but how does one find it programmatically? (I am looking for an algorithm.)
venkasub
  • 723
36
votes
7 answers

How to generate numbers based on an arbitrary discrete distribution?

How do I generate numbers based on an arbitrary discrete distribution? For example, I have a set of numbers that I want to generate. Say they are labelled from 1-3 as follows. 1: 4%, 2: 50%, 3: 46% Basically, the percentages are probabilities that…
14
votes
3 answers

What is the difference between exponential and geometric distribution?

I don't really understand the difference between exponential and geometric distribution.
11
votes
2 answers

Why are distributions important?

This may as well go down as the silliest questions ever asked on this forum, but having received sound and meaningful answers to a previous question, i thought i will stretch my luck again. I have been very confused for some time on the importance…
Bloodline
  • 133
10
votes
1 answer

Which distribution to use to model web page read time?

I have a function that returns an average wait time for a web user. That is, it gives an average time that an average user may stay on a web page, given the web resource length in words. I want to use this function (and the resulting average) in…
10
votes
1 answer

Can full conditionals determine the joint distribution?

I heard that all the full conditionals (as used in Gibbs sampling) can determine the joint distribution. But I don't understand why and how. Or did I mis-hear? Thanks!
Tim
  • 19,445
10
votes
1 answer

Transforming a distribution into another one?

Say I have two vectors X1 and X2, and they form two distributions. Is there any way to transform X1 so after the transformation the new_X1 will have a similar distribution with X2? As we can transform some distribution into normal ones, I am curious…
mommomonthewind
  • 987
  • 2
  • 11
  • 21
10
votes
1 answer

The distribution of the linear combination of Gamma random variables

If $X_i\sim\Gamma(\alpha_i,\beta_i)$ for $1\leq i\leq n$, let $Y = \sum_{i=1}^n c_iX_i$ where $c_i$ are positive real numbers. Assume all the parameters $\alpha_i$'s and $\beta_i$'s are all known, what is $Y$'s distribution ?
10
votes
1 answer

How to sum two variables that are on different scales?

If I have two variables following two different distributions and having different standard deviations... How do I need to transform two variables so that when I sum the two result is not "driven" by more volatile one. For example... Variable A is…
user333
  • 7,211
9
votes
1 answer

Sum of two independent Student t variables with same dof is t distributed?

What is the distribution of the difference of two-t-distributions suggests that the sum of two t-distributions is never t distributed. With t distribution I mean the (non-standardized) t distribution with location and scale parameter. Now, let…
9
votes
2 answers

Does this distribution have a name? Or what is a stochastic process that could generate it?

A discrete distribution with mass function $$p(x;k) = \frac{k}{(x+k)(x+k-1)},\quad x = 1,2,\ldots$$ arises on page 9 of this paper. For $k=1$ it is a Yule-Simon distribution with $\rho=1$, but I haven't found any other examples. Does it have a name?…
Simon Byrne
  • 3,486
  • 19
  • 31
8
votes
2 answers

An example of a SYMMETRIC distribution with finite mean but infinite/undefined variance?

Is there a known symmetric distribution with finite 1st moment but undefined or infinite for moments>1?
Ab YZ
  • 141
8
votes
1 answer

What is the distribution of the value in a sample closest to a given value?

For a fixed value $m$, draw $k$ samples from a normal distribution, and select one, say $X$, which is closest to $m$. Then what distribution will $X$ follow? It is kind of similar to an extreme value distribution but I can not figure it out.
8
votes
2 answers

Transforming arbitrary distributions to distributions on $[0,1]$

I was reading Robert Serfling's 1980 book "Approximation Theorems of Mathematical Statistics" and came across the following construction of the Dvoretzky–Kiefer–Wolfowitz inequality for arbitrary distributions $F$, which DKW prove for distributions…
blueberry
  • 163
8
votes
3 answers

What's the distribution of these data?

I got the data, and plot the distribution of the data, and use the qqnorm function, but is seems doesn't follow a normal distribution, so which distribution should I use to discribe the data? Empirical cumulative distribution function
PepsiCo
  • 241
1
2 3
16 17