I am trying to find a good method to test the null hypothesis(H0) on two unpaired samples. Those samples come from two different HTTP Servers and the unit I'm using is req/30s (requests concluded in 30 seconds).
Even a well-established server is expected to have a standard deviation of 50 requests. I have tried to use the Student's t-test to validate the null hypothesis, however, the following dataset it shows a p-value lower than the threshold of 0.05
A1 = [
4670, 4646, 4612, 4618, 4646,
4609, 4623, 4629, 4566, 4628,
4582, 4636, 4621, 4574, 4624,
4563, 4651, 4642, 4586, 4621,
4606, 4628, 4575, 4631, 4646,
4600, 4594, 4661, 4568, 4611
]
B1 = [
4630, 4655, 4652, 4633, 4637,
4661, 4625, 4680, 4647, 4639,
4633, 4661, 4638, 4621, 4630,
4682, 4703, 4665, 4652, 4648,
4673, 4651, 4669, 4646, 4612,
4654, 4651, 4619, 4637, 4620
]
st.ttest_ind(A1, B1)
Ttest_indResult(statistic=-4.855056212284194, pvalue=9.47100493260572e-06)
In my previous question Student's t-test on "high" magnitude numbers, I could understand that the data are clustered a low variance means statically significant. However, for my use case, this variance shouldn't reject the null hypothesis. I thought about comparing the means between two samples and if the difference was less than X, the hypothesis would be true. However, since the data is volatile I can't trust the mean, because sometimes, I can get results in a large range (4000 ~ 10000) for instance.
What methods the recommended to test if those two samples are statically significant considering those limitations?