I have two p-values from independent test which I wish to combine. I do this using fisher's method, but I get some results that I don't quite understand. If I have two p-values, and combine them, this can result in a larger p-value than one of the original ones:
from scipy.stats import combine_pvalues
p1 = 0.25
p2 = 0.04
statistic,p_val = combine_pvalues([p1,p2])
print(statistic)
print(p_val)
>>> 9.210340371976182
>>> 0.05605170185988095
From wikipedia: a p-value "is the probability of obtaining test results at least as extreme as the result actually observed, under the assumption that the null hypothesis is correct". I probably am miss understanding something, but if the probability of obtaining the result of the second test is only 0.04, then I don't see how the probability of obtaining BOTH results can now be larger, i.e. 0.056.
How can this be?