*39*

A Kruskal-Wallis test is used to determine whether or not there is a statistically significant difference between the medians of three or more independent groups. It is considered to be the non-parametric equivalent of the One-Way ANOVA.

If the results of a Kruskal-Wallis test are statistically significant, then it’s appropriate to conduct **Dunn’s Test** to determine exactly which groups are different.

Dunn’s Test performs pairwise comparisons between each independent group and tells you which groups are statistically significantly different at some level of α.

For example, suppose a researcher wants to know whether three different drugs have different effects on back pain. He recruits 30 subjects for the study and randomly assigns them to use Drug A, Drug B, or Drug C for one month and then measures their back pain at the end of the month.

The researcher can perform a Kruskal-Wallis test to determine if the median back pain is equal among the three drugs. If the p-value of the Kruskal-Wallis test is below a certain threshold, it can be said that the three drugs produce different effects.

Following this, the researcher could then perform Dunn’s Test to determine *which *drugs produce statistically significant effects.

**Dunn’s Test: The Formula**

You will likely never have to perform Dunn’s Test by hand since it can be performed using statistical software (like R, Python, Stata, SPSS, etc.) but the formula to calculate the z-test statistic for the difference between two groups is:

z_{i} = y_{i} / σ_{i}

where *i *is one of the 1 to *m *comparisons, y_{i} =W_{A} – W_{B} (where W_{A} is the average of the sum of the ranks for the i^{th} group) and σ_{i }is calculated as:

σ_{i }= √((N(N+1)/12) – (ΣT^{3}_{s} – T_{s}/(12(N-1)) / ((1/n_{A})+(1/n_{B}))

where *N *is the total number of observations across all groups, *r *is the number of tied ranks, and T_{s} is the number of observations tied at the *s*th specific tied value.

**How to Control the Family-wise Error Rate**

Whenever we make multiple comparisons at once, it’s important that we control the family-wise error rate. One way to do so is to adjust the p-values that results from the multiple comparisons.

There are several ways to adjust the p-values, but the two most common adjustment methods are:

**1. The Bonferroni Adjustment**

Adjusted p-value = p*m

where:

**p:**The original p-value**m:**The total number of comparisons being made

**2. The Sidak Adjustment**

Adjusted p-value = 1 – (1-p)^{m}

where:

**p:**The original p-value**m:**The total number of comparisons being made

By using one of these p-value adjustments, we can dramatically reduce the probability of committing a type I error among the set of multiple comparisons.

**Additional Resources**

How to Perform Dunn’s Test in R

How to Perform Dunn’s Test in Python