5

I have used Kruskal-Wallis and Wilcoxon's rank-sum test for some analysis. For calculating the corresponding effect sizes, I have used:

  • epsilon-squared for the Kruskal-Wallis test

  • r-squared for Wilcoxon's rank-sum test

I'm not getting any definitive source to interpret the effect size values. Does anybody know of any definitive source?

Edit - A rule of thumb range for weak, moderate, and strong effect size would also be useful (like the one available for Cohen's d).

Outsider
  • 161
  • 7
  • 4
    Have you tried inspecting the formulas for the effect size measures you're using, to make sense of what they're actually trying to do? – user3716267 May 26 '21 at 12:25
  • @user3716267 No. I don't have a strong enough background in statistics. Any leads are welcome. – Outsider May 26 '21 at 12:28
  • 1
    Here are two websites offering interpretations for epsilon squared: https://peterstatistics.com/CrashCourse/3-TwoVarUnpair/NomOrd/NomOrd3c.html and https://rcompanion.org/handbook/F_08.html – Federico Tedeschi Nov 27 '22 at 18:55

2 Answers2

1

Rules of thumbs for effect sizes have been widely criticized (see this discussion and this discussion for some examples). As you mention Cohen's d, be aware that Cohen himself advised against using the rules of thumb he suggested (for $d$ and other effect sizes). He considered it as a last resort when you have no other way of determining yourself the importance of a given value for an effect size. Unfortunately, his caveats are routinely ignored. In addition, I wonder if there are circumstances where you absolutely can't determine yourself the importance of an effect size; the context and goals of the study should give you the information you need for that. (Even if you were in a situation where you can't use context to make a judgement, it may be more advisable to simply report the value of the effect size without attaching any qualitative adjective to it.)

It is evident if, for example, you reason in terms of cost and benefits. If you have an effect size measuring the positive impact of an intervention on a population, you still have to take context into account to judge if the intervention is worth it. If the effect size is 1.5, but despite its positive impact it also results in huge unintended financial or social costs, the intervention may be much less desirable than another intervention with a smaller positive impact (e.g. an effect size of 0.1) but with very limited costs.

Using a general rule of thumb in this context does not help at all to judge the relevance of the effect size, and may even prevent you from thinking and drawing relevant conclusions from your study. Who cares if the effect size in the first intervention is labelled as "strong", if the second intervention has a smaller effect size but is more adequate? Labels are not really helpful here.

J-J-J
  • 4,098
0

I finally followed the recommendations in the book SPSS survival guide by Julie Pallant.

It recommended calculating the effect size as

r=z/sqrt(N)

r=effect-size z=test statistic N=total number of observations.

for pairwise groups for the Kruskal-Wallis test and between the two groups for Wilcoxon's rank-sum test.

It recommended Cohen's rule of thumb threshold for reporting small, medium, or strong effect sizes.

I ended up following these guidelines, but this still doesn't answer the original question. So, I welcome other answers.

Outsider
  • 161
  • 7