Suppose, I have three time-points, Ta, Tb and Tc. Let Ta be control and Tb and Tc be the effect of a drug 4 hours and 8 hours after treatment. For each pair of time-points, I compare about 15000 observations (genes, for differential expression, to be precise). So, for each pair of time-point, for each gene, I obtain a p-value.
My objective is to obtain ALL those genes that might have had an effect due to the drug treatment. Now, in order to correct for multiple testing, I use BH method. However, I have a confusion regarding the way to apply FDR.
Case A: I could pool ALL the p-values from ALL pairs of time-points together (Ta.Tb, Tb.Tc, Tc.Ta, each gene occuring thrice) and correct for multiple-testing once on this pooled set.
Case B: I could correct for multiple testing within each pair of time-point (separately for Ta.Tb, Tb.Tc, Tc.Ta). I am not sure if there is a consensus as to which one to employ and why.
My understanding: For case A, suppose there are too many "highly" significant events between Ta and Tb, then, you "might" lose the events that are otherwise statistically significant (with p-values computed), i.e., they become insignificant after FDR correction due to low p-values in other time-point pairs. For example, if the overall effect of drug (meaning for most of the genes) is NOT as strong in Tb.Tc compared to Ta.Tb, then we might not see that there is an effect of the drug at Tb.Tc at all.
And in case B, you would probably get more significant events, meaning more false positives as well?
I'd greatly appreciate it if someone could clarify this.
Thank you very much.