Home » How to Perform a Cramer-Von Mises Test in R (With Examples)

How to Perform a Cramer-Von Mises Test in R (With Examples)

by Tutor Aspire

The Cramer-Von Mises test is used to determine whether or not a sample comes from a normal distribution.

This type of test is useful for determining whether or not a given dataset comes from a normal distribution, which is a common assumption used in many statistical tests including regression, ANOVA, t-tests, and many others.

We can easily perform a Cramer-Von Mises test using the cvm.test() function from the goftest package in R.

The following example shows how to use this function in practice.

Example 1: Cramer-Von Mises Test on Normal Data

The following code shows how to perform a Cramer-Von Mises test on a dataset with a sample size n=100:

library(goftest)

#make this example reproducible
set.seed(0)

#create dataset of 100 random values generated from a normal distribution
data #perform Cramer-Von Mises test for normality
cvm.test(data, 'pnorm')

	Cramer-von Mises test of goodness-of-fit
	Null hypothesis: Normal distribution
	Parameters assumed to be fixed

data:  data
omega2 = 0.078666, p-value = 0.7007

The p-value of the test turns out to be 0.7007.

Since this value is not less than .05, we can assume the sample data comes from a population that is normally distributed.

This result shouldn’t be surprising since we generated the sample data using the rnorm() function, which generates random values from a standard normal distribution.

Related: A Guide to dnorm, pnorm, qnorm, and rnorm in R

We can also produce a histogram to visually verify that the sample data is normally distributed:

hist(data, col='steelblue')

Shapiro-Wilk test for normality in R

We can see that the distribution is fairly bell-shaped with one peak in the center of the distribution, which is typical of data that is normally distributed.

Example 2: Cramer-Von Mises Test on Non-Normal Data

The following code shows how to perform a Cramer-Von Mises test on a dataset with a sample size of 100 in which the values are randomly generated from a Poisson distribution:

library(goftest)

#make this example reproducible
set.seed(0)

#create dataset of 100 random values generated from a Poisson distribution
data #perform Cramer-Von Mises test for normality
cvm.test(data, 'pnorm')

	Cramer-von Mises test of goodness-of-fit
	Null hypothesis: Normal distribution
	Parameters assumed to be fixed

data:  data
omega2 = 27.96, p-value 

The p-value of the test turns out to be extremely small.

Since this value is less than .05, we have sufficient evidence to say that the sample data does not come from a population that is normally distributed.

This result shouldn’t be surprising since we generated the sample data using the rpois() function, which generates random values from a Poisson distribution.

Related: A Guide to dpois, ppois, qpois, and rpois in R

We can also produce a histogram to visually see that the sample data is not normally distributed:

hist(data, col='coral2')

Shapiro-Wilk test histogram in R

We can see that the distribution is right-skewed and doesn’t have the typical “bell-shape” associated with a normal distribution.

Thus, our histogram matches the results of the Cramer-Von Mises test and confirms that our sample data does not come from a normal distribution.

What to Do with Non-Normal Data

If a given dataset is not normally distributed, we can often perform one of the following transformations to make it more normal:

1. Log Transformation: Transform the response variable from y to log(y).

2. Square Root Transformation: Transform the response variable from y to √y.

3. Cube Root Transformation: Transform the response variable from y to y1/3.

By performing these transformations, the response variable typically becomes closer to normally distributed.

Refer to this tutorial to see how to perform these transformations in practice.

Additional Resources

The following tutorials explain how to perform other normality tests in R:

How to Perform a Shapiro-Wilk Test in R
How to Conduct an Anderson-Darling Test in R
How to Conduct a Kolmogorov-Smirnov Test in R

You may also like