Estimating "population p-value" $\Pi$ using an observed p-value

Question

I asked a similar question last month, but from the responses, I see how the question can be asked more precisely.

Let's suppose a population of the form

$$X \sim \mathcal{N}(100 + t_{n-1} \times \sigma / \sqrt{n}, \sigma)$$

in which $t_{n-1}$ is the student $t$ quantile based on a specific value of a parameter $\Pi$ ($0<\Pi<1)$. For the sake of the illustration, we could suppose that $\Pi$ is 0.025.

When performing a one-sided $t$ test of the null hypothesis $H_0: \mu = 100$ on a sample taken from that population, the expected $p$ value is $\Pi$, irrespective of sample size (as long as simple randomized sampling is used).

I have 4 questions:

Is the $p$ value a maximum likelihood estimator (MLE) of $\Pi$? (Conjecture: yes, because it is based on a $t$ statistic which is based on a likelihood ratio test);
Is the $p$ value a biased estimator of $\Pi$? (Conjecture: yes because (i) MLE tend to be biased, and (2) based on simulations, I noted that the median value of many $p$s is close to $\Pi$ but the mean value of many $p$s is much larger);
Is the $p$ value a minimum variance estimate of $\Pi$? (Conjecture: yes in the asymptotic case but no guarantee for a given sample size)
Can we get a confidence interval around a given $p$ value by using the confidence interval of the observed $t$ value (this is done using the non-central student $t$ distribution with degree of freedom $n-1$ and non-centrality parameter $t$) and computing the $p$ values of the lower and upper bound $t$ values? (Conjecture: yes because both the non-central student $t$ quantiles and the $p$ values of a one-sided test are continuous increasing functions)

Out of curiosity, based upon your simulations, does $\hat{p}$ maintain a constant distribution, and if so, what distribution best models the results of $\hat{p}$? I'm also curious if the distribution parameters for $\hat{p}$ can be determined from the data used for the $t$-test and/or the $t$-test. — Tavrock, Mar 20 '17 at 20:12

score 1 · Answer 1 · edited Jun 11 '20 at 14:32

I think I may have found a possible answer for you.

In Computational Statistics Handbook with MATLAB by Wendy L. Martinez and Angel R. Martinez, they state:

Let $\theta$ represent a population parameter that we wish to estimate, and let $T$ denote a statistic that we will use as a point estimate for $\theta$. The observed value of the statistic is denoted as $\hat{\theta}$. An interval estimate for $\theta$ will be of the form $$\hat{\theta_{Lo}}<\theta<\hat{\theta_{Up}}$$ where $\hat{\theta_{Lo}}$ and $\hat{\theta_{Up}}$ depend on the observed value $\hat{\theta}$ and the distribution of the statistic $T$.

If we know the sampling distribution of $T$, then we are able to determine values for $\hat{\theta_{Lo}}$ and $\hat{\theta_{Up}}$ such that $$P\left(\hat{\theta_{Lo}}<\theta<\hat{\theta_{Up}}\right)=1-\alpha$$ where $0<\alpha<1$. [The preceding equation] indicates that we have a probability of $1-\alpha$ that we will select a random sample that produces and interval that contains $\theta$. [$\hat{\theta_{Lo}}<\theta<\hat{\theta_{Up}}$] is called a $\left(1-\alpha\right)\cdot100\%$ confidence interval. \dots It should be noted that one-sided confidence intervals can be defined similarly [Mood, Graybill and Boes, 1974].

$\dots$

the procedure for Monte Carlo hypothesis testing using the $p$-value approach is similar. Instead of finding the critical value from the simulated distribution of the test statistic, we use it to estimate the $p$-value.

Procedure—Monte Carlo Hypothesis Testing (P-Value)

For a random sample of size $n$ to be used in a statistical hypothesis test, calculate the observed value of the test statistic $t_0$.

Decide on a pseudo-population that reflects the characteristics of the population under the null hypothesis.

Obtain a random sample of size $n$ from the pseudo-population.

Calculate the value of the test statistic using the random sample in step 3 and record it as $t_i$.

Repeat steps 3 and 4 for $M$ trials. We now have values $t_i=1,\dots,t_M$, that serve as an estimate of the distribution of the test statistic, $T$, when the null hypothesis is true.

Estimate the $p$-value using the distribution $\dots$, using the following.

Lower Tail Test$$\hat{p}-value=\frac{\left(t_i\leq t_0\right)}{M}$$ for $i=1,\dots,M$

UpperTail Test$$\hat{p}-value=\frac{\left(t_i\geq t_0\right)}{M}$$ for $i=1,\dots,M$

It seems reasonable then, that you could use this same method to report the limits of the sampled $p$-values in some meaningful way to represent a confidence interval of the test statistic.

http://stats.stackexchange.com/questions/51304/confidence-interval-on-a-p-value also references a R package which reports the confidence interval for a $p$-value. — Tavrock, Mar 17 '17 at 16:31
Thanks for the research. It is exactly what I did in the second sub-point of point 2. in my question above. However, it does not answer any of my questions. Is this approach a biased estimate of $\Pi$? Is it a minimum variance estimate of $\Pi$? Here, because I know the sampling distribution of the statistic $\theta$, can I use the noncentral t distribution to get bounds instead of a simulation as suggested in your response? — Denis Cousineau, Mar 19 '17 at 23:59
To be honest, I don't know. I stumbled across this information while looking for some completely different information in the book. The book really didn't elaborate beyond what is shown here, it simply covered that this could be done and provided the above information as an example. It does seem to point to your fourth conjecture, as the Mote Carlo simulation isn't predicting $\Pi$ or the confidence interval for $\hat{p}$ as much as it is providing a means of a confidence interval for the $t$-test, using the conversion to $p$-values to make the results of the test intuitively meaningful. — Tavrock, Mar 20 '17 at 20:08

Estimating "population p-value" $\Pi$ using an observed p-value

1 Answers1

Procedure—Monte Carlo Hypothesis Testing (P-Value)

Linked