10

Suppose a random sample of $n$ variables from $N(\mu,1)$, $n$ odd. The sample median is $M=X_{(n+1)/2}$, the order statistic of the middle of the distribution.

How to prove that sample median is not a sufficient statistic of $\mu$? Do we need the probability density function of M?

I would like any help.

Note 1: I'm studying for the final exam of my course, and take previous exams to practice. This question appeared in a 2015 exam, that was given by other professor than mine, so it is possible that covers things that wasn't given by my professor. This question includes others two exercises: (1) Is $M-\bar{X}$ an ancillary statistic for $\mu$? (2) Prove that if $m$ is a unique point, such as $F_X(m)=1/2$, then $M \overset{p}{\rightarrow}m$. I couldn't answer these two exercises, because ancillary statistics were not covered by my professor.

So, to see wheter $M$ is a sufficient statistic for $\mu$, I derived the probability density (pdf) function of $M$, that I bring below. But I think there is a more inteligent way to do, without getting the pdf first. So this is the reason I'm asking this question, to see if there is a better and faster way of solving this question.

Note 2: things I know about sufficient statistics:

$T$ is a sufficient statistic if $$ \frac{f_X(x|\theta)}{q(t|\theta)} $$ does not depend on $\theta$, where $f_X$ is the probability density function of $X$ and $q$ is the probability density function of $T$. Also, $T$ is a sufficient statistic if, and only if, there exist functions $g(t|\theta)$ and $h(x)$ such that $$f_X(x|\theta)=g(T(x)|\theta)h(x)$$ for all points of $\theta$

Note 3: The probability density function of M is

$$f_{M}(m)=\frac{n!}{\left ( \frac{n-1}{2} \right) ! \left ( \frac{n-1}{2} \right) !}[1-F_{X}(m)]^{\frac{n-1}{2}}f_{X}(m)[F_X(m)]^{\frac{n-1}{2}}$$

I divided the probability density functions of $(X_1,...,X_n)$ and $M$: $$\frac{f_{X}(x|\mu)}{f_{M}(m)}=\frac{f_{X1}(x_{1}|\mu)\cdot ...\cdot f_{Xn}(x_{n}|\mu) }{\frac{n!}{\left ( \frac{n-1}{2} \right) ! \left ( \frac{n-1}{2} \right) !}[1-F_{X}(m)]^{\frac{n-1}{2}}f_{X}(m)[F_X(m)]^{\frac{n-1}{2}}}$$

So we have (I don't know if is wright): $$\frac{f_{X}(x|\mu)}{f_{M}(m)}=\frac{f_{X1}(x_{1}|\mu)\cdot ...\cdot f_{X_{n-1}}(x_{n-1}|\mu) }{\frac{n!}{\left ( \frac{n-1}{2} \right) ! \left ( \frac{n-1}{2} \right) !}[1-F_{X}(m)]^{\frac{n-1}{2}}[F_X(m)]^{\frac{n-1}{2}}}$$ It seems that this expression depends of $\mu.$

User1865345
  • 8,202
Diorne
  • 101
  • 3
  • This appears to be a homework-like question; please see https://stats.stackexchange.com/tags/self-study/info . 2. What things do you know about sufficient statistics? e.g. definition/facts/results/theorems etc? 3. What have you tried in order to use the things you know to answer the question?
  • – Glen_b Feb 22 '23 at 01:53
  • 1
    $T$ is a sufficient statistic if $$\frac{f_X(x|\theta)}{q(t|\theta)}$$ does not depend on $\theta$, where $f_X$ is the probability density function of $X$ and $q$ is the probability density function of $T$ Also, $T$ is a sufficient statistic if, and only if, there exist functions $g(t|\theta)$ and $h(x)$ such that $$f_X(x|\theta)=g(T(x)|\theta)h(x)$$ for all points of $\theta$ – Diorne Feb 22 '23 at 02:00
  • 2
    That's definitely a good thing to know. Did you try to use it? However, is that the only thing you have? 4. You should consider how $M$ is related to the parameter of the distribution of the $X_i$, $\mu$. i.e. note that $F$ and $f$ involve $\mu$ – Glen_b Feb 22 '23 at 02:01
  • 1
    I'm not saying you need to use that fact, but if you just write it in terms of F and f without thinking about how those things relate to $\mu$ you may not even see that $\mu$ is present there and trick yourself into thinking it isn't. – Glen_b Feb 22 '23 at 02:07
  • I divided the probability density functions of $(X_1,...,X_n)$ and $M$: $$\frac{f_{X}(x|\mu)}{f_{M}(m)}=\frac{f_{X1}(x_{1}|\mu)\cdot ...\cdot f_{Xn}(x_{n}|\mu) }{\frac{n!}{\left ( \frac{n-1}{2} \right) ! \left ( \frac{n-1}{2} \right) !}[1-F_{X}(m)]^{\frac{n-1}{2}}f_{X}(m)[F_X(m)]^{\frac{n-1}{2}}}$$ So we have (I don't know if is wright): $$\frac{f_{X}(x|\mu)}{f_{M}(m)}=\frac{f_{X1}(x_{1}|\mu)\cdot ...\cdot f_{X_{n-1}}(x_{n-1}|\mu) }{\frac{n!}{\left ( \frac{n-1}{2} \right) ! \left ( \frac{n-1}{2} \right) !}[1-F_{X}(m)]^{\frac{n-1}{2}}[F_X(m)]^{\frac{n-1}{2}}}$$ that depends of $\mu$ – Diorne Feb 22 '23 at 02:16
  • Anyway, in this case, is it possible to know if M is a sufficient statistic without calculating its probability density function? – Diorne Feb 22 '23 at 02:18
  • Sure, potentially, yes. One of those ways involves using a theorem you haven't mentioned, which is why I fished for additional knowledge. – Glen_b Feb 22 '23 at 02:21
  • What theorem? Might you share, please? – Diorne Feb 22 '23 at 02:23
  • If it wasn't covered in your course it clearly wasn't intended that you use it; hence the question. – Glen_b Feb 22 '23 at 02:31
  • Further, you don't seem to have responded to the things that stats.stackexchange.com/tags/self-study/info would have you do (including being candid about the source of the question) leaving me more hesitant about the circumstances. – Glen_b Feb 22 '23 at 02:45
  • It might be useful to review some of the many questions on site on the topic of sufficiency. – Glen_b Feb 22 '23 at 02:51
  • @Glen_b I updated my question with more information that you ask for. – Diorne Feb 23 '23 at 01:44
  • Rigorously speaking, showing $f(x|\mu)/f_M(m)$ does not depend on $\mu$ is not sufficient (check the factorization theorem more carefully). – Zhanxiong Feb 23 '23 at 01:47
  • Thanks, that all helps. To clarify; the theorem I was referring to was the Pitman–Koopman–Darmois theorem (which makes it very easy to establish), but I'm pretty sure you haven't covered it. You'll need to rely on the tools you do have, like the definition and the factorization theorem. – Glen_b Feb 23 '23 at 02:59
  • Look at some asymmetric distributions: The medians of $LN(0,1)$ and $LN(0,2)$ are both $1$, but their means are $\sqrt{e}$ and $e^2$. The medians of $B(1,\frac23)$ and $B(1,\frac34)$ are both $1$, but their means are $\frac23$ and $\frac34$. So obviously the medians don’t tell you everything about the means. – Matt F. Feb 23 '23 at 17:11