1

The Wilcoxon signed-rank test is generally used for non-parametric data (i.e. not normally distributed). When the sample gets large, the data will be approximately normally distributed. Therefore there is no need to use the Wilcoxon signed-rank test, and a parametric test would be preferred.

Considering this, the Wilcoxon signed-rank test would be most appropriate where we cannot get a large sample, and the sample is not normally distributed. (Please correct me if I am wrong).

Could you suggest to me a couple of business use cases for the use of the Wilcoxon signed-rank test?

S. Tiss
  • 11
  • 2
    You have a common misconception about the central limit theorem. A question of mine has several nice answers explaining why it is false. – Dave Sep 22 '22 at 15:59
  • 1
    Your main point " the Wilcoxon signed-rank test would be most appropriate where we cannot get a large sample, and the sample is not normally distributed" is absolutely correct. It is also useful when we have large samples and the data is very not normally distributed. The first use case which jumps out at me would be A/B testing. – John Madden Sep 22 '22 at 16:00
  • @JohnMadden the reason I said that it is most appropriate in instances where we cannot get a large sample is because of the central limit theorem which I seem to have misunderstood according to the other comments. What is the actual reason for that? – S. Tiss Sep 22 '22 at 16:04
  • @S.Tiss Nah it is indeed because of the CLT; your misunderstanding (or perhaps simply a misstatement given your comment on Eoin's post, which my colleagues here seemed maybe just a little bit too excited to jump on?) does not affect the validity of your conclusion in this case (though it's true the reasoning you used is slightly off). – John Madden Sep 22 '22 at 16:18
  • 1
  • Data are neither parametric nor nonparametric $-$ data don't have parameters at all, models do. Models can be parametric or not, and the term 'nonparametric' does not mean 'not normally distributed'. There's an infinite variety of models that are parametric but not normal. 2. "When the sample gets large, the data will be approximately normally distributed" ... no. The sample size doesn't change the distribution you're sampling from, and it's that distribution you need to worry about for assumptions. The empirical distribution of data (sampled at random) will eventually approach the ...ctd
  • – Glen_b Sep 22 '22 at 23:18
  • ctd... population distribution. 3. "Therefore there is no need to use the Wilcoxon signed-rank test, and a parametric test would be preferred." the reasoning here is mistaken, both in justifying the use or not of a nonparametric test and in the specific choice of a particular test instead of relating the test (were one needed at all) to the question of interest. 4. "the Wilcoxon signed-rank test would be most appropriate where we cannot get a large sample" I see no good justification for this in your question. 5. It's hard to answer the Q because it seems to rely on a series of misconceptions – Glen_b Sep 22 '22 at 23:25
  • ... Those misconceptions may not be your fault at all. Sadly, many stats subjects taught outside of statistics itself seem to promulgate a large number of very seriously mistaken ideas, serving the students very poorly. (Edit: I should clarify my point "2." above; typically the assumptions being made apply under the null, so some of the assumptions might not need to be satisfied in the population the data were sampled from at all; we can be dealing with assumptions about a counterfactual situation) – Glen_b Sep 22 '22 at 23:29