I'm trying to determine the "special" population out of n populations.
I have measurements with n labels.
label measurement
a 1.4
b 1.2
a 1.2
c 1.3
...
- Measurements with
n-1of the labels are drawn iid from the same normal distribution with mean $\mu_{default}$ and variance $\sigma^2_{default}$. - One label is "special" and its measurements are drawn from a different normal distribution with mean $\mu_{special}$ and variance $\sigma^2_{special}$.
- $\mu_{special}$ > $\mu_{default}$ but I don't de novo know anything about the variances. (In practice $\sigma^2_{special}$ is probably close to $\sigma^2_{default}$ but I'd prefer not to make that assumption if possible.)
My goal is to find probabilities that each label is special; that is, if the labels are a, b, and c, I can compute
p(a is special | data)
p(b is special | data)
p(c is special | data)
such that all probabilities add to 1.
I can run Welch's t-test for each label against the rest of the data to find p-values for each label – but these aren't likelihoods! This gives me good results to determine by inspection which label is special, but I don't know of a justifiable way to generate the probabilities I want above.
The above may be a bit confusing due to bad terminology, so here's an example that I hope helps motivate the problem:
Consider a game with a 6-sided die. The die is hidden from the player. Every turn, the die is rolled.
- If the die rolls on value 1 (the "special" value), a random measurement
mis taken from a normal distribution with parameters $\mu_{special}$ and $\sigma^2_{special}$.(1, m)is reported to the plauyer. - If the die rolls on any other value $v \in [2,6]$, a random measurement
mis taken from a normal distribution with the parameters $\mu_{default}$ and variance $\sigma^2_{default}$ and(v, m)is reported.
The player can see all of the (die outcome, measurement) pairs but does not know anything about the underlying distributions other than that they are normal and $\mu_{special}$ > $\mu_{default}$
The player's goal is to derive the probabilities that each value is special, that is, derive p(1 is special), p(2 is special)... p(6 is special) given a long list of (die outcome, measurement) pairs. The player has is a uniform prior (the special value is chosen uniformly at random before the game starts.)
