I am currently working on a not-that-easy problem involving order statistics. As I am unsure as to how I could solve it, I thought it might already possess a solution. So here I am, my questions is: do you know of a pre-existing solution/approximation to (or do you see an easy way of solving) the probability below? $$\pi = \Pr\left[G > c\right],$$ where $$ G_1 = \max_{1 \leq j \leq K} \left(Z_{(j)} - Z_{(j - 1)}\right), Z_{(0)} = 0, Z_j \sim N(\mu, \sigma^2) \text{ and } Z_i \perp Z_j, \forall i \ne j. $$ Just to clarify things up, $Z_{(j)}$ is the $j$th order statistics of $Z_1, Z_2, \ldots, Z_K$. Also, if it helps, the problem can actually be reformulated this way (but I would prefer the one above):
$$G_2 = \max_{1 \leq j \leq K} \left(\tfrac{Z_{(j)}}{Z_{(j - 1)}}\right), Z_{(0)} = 1, Z_j \sim N(\mu, \sigma^2) \text{ and } Z_i \perp Z_j, \forall i \ne j.$$
I understand that mathematically speaking, those expressions are not equivalent, but for $\mu \in (0, 1)$ and $\sigma$ small enough, it should not matter.
//=======================================================================//
ETA: OK, worked quite a bit on this damned problem, and found that for $\mu \in (0, 1)$, e.g. $\mu = 0.1$, and $\sigma$ small enough, e.g. $\sigma = 0.01$, $G$ actually simplifies to
$$G = \max_{1 \leq j \leq n} \left(Z_{(j)} - Z_{(j - 1)}\right) \approx Z_{(1)}.$$
Thus, the probability $\pi$ is easily computed
$$\pi = \Pr\left[G > c\right] \approx \Pr\left[Z_{(1)} > c\right] = {\left(1 - \Phi\left(\tfrac{c - \mu}{\sigma}\right)\right)}^K$$
The critical value $c$ is also easily extracted
$${\left(1 - \Phi\left(\tfrac{c - \mu}{\sigma}\right)\right)}^K = \alpha \quad \Leftrightarrow \quad c = \mu + \Phi^{-1} \left(1 - \alpha^{\tfrac{1}{K}}\right)\sigma.$$
All I got to do now (other than finding a more general solution to the above problem) is find values of $\mu$ and $\sigma$, which are context-dependent.
If anyone feels I should pile on and add some more details, please let me know. I am, after all, new to this forum.
Also, I included the variable $Z_{(0)}$ because it was easier this way. Otherwise, I would have had to state $G$ as:
$$G = \max \left(Z_{(1)}, \max_{2 \leq j \leq K} \left(Z_{(j)} - Z_{(j - 1)} \right)\right).$$
Hope this clears things up.
– RSMax Jan 25 '14 at 02:12I understand that $G_1$ (or $G_2$) could be negative under the normal approximation, but that's a cross we all must bear when using the central limit theorem.
The exact underlying distributions would be nightmarish to compute and I know what my limits are.
Of course, one could argue that the two first moments of the underlying proportions (upon which the variables $Z_j$ are by derived) are by no means 0 and 1, but, hey, maybe I still got some thinking to do... OK, coming right up!
– RSMax Jan 25 '14 at 04:11I hear you, but still am confused about what I should do to remedy the situation. Should I start all over again, and in terms of Cross Validated, does that mean starting a new thread?
Were I to remove $G_2$ from the OP, would that dissipate the ambiguity that seems to linger around?
The truth now, the variables $Z_i$ are not normally distributed (they are proportion estimators, thus contained between $0$ and $1$), but, as stated by the central limit theorem, are asymptotically normal.
– RSMax Jan 25 '14 at 20:06Hope this was a tiny bit more informative and precise.
– RSMax Jan 25 '14 at 20:10