How does it work?

Sign up, learn at your own pace, and obtain your internationally recognized certificate. With personal guidance from our experts whenever you need it.

How does it work?

Sign up, learn at your own pace, and obtain your internationally recognized certificate. With personal guidance from our experts whenever you need it.

5s

5s

Shapiro–Wilk Test

Introduction: Shapiro–Wilk Test

The Shapiro–Wilk Test is a statistical method used to determine whether a dataset follows a normal distribution. It provides a test statistic and a p-value that indicate whether deviations from normality are statistically significant. This test is widely used in research, data analysis, and quality management to verify assumptions before applying parametric tests such as t-tests or ANOVA.

Background

Developed in 1965 by Samuel Shapiro and Martin Wilk, the Shapiro–Wilk Test quickly gained popularity due to its high statistical power, especially for small datasets. Compared to other normality tests, such as the Kolmogorov–Smirnov and Anderson–Darling tests, it is considered one of the most accurate and sensitive tools for detecting non-normality in data.

Key Elements / Features

  • Test Statistic (W): The value of W ranges from 0 to 1. Values close to 1 indicate that the data closely follow a normal distribution.
  • p-Value: If the p-value is less than the chosen significance level (commonly 0.05), the null hypothesis of normality is rejected, meaning the data likely deviate from normality.
  • Sample Size: The test performs best with small to medium sample sizes (typically n<2,000). For very large datasets, even small deviations may lead to significant results.
  • Null Hypothesis (H0): The data are normally distributed.
  • Alternative Hypothesis (H1: The data are not normally distributed.

Applications / Examples

  • Medicine: Assessing whether patient recovery times follow a normal distribution before performing t-tests.
  • Education: Testing the normality of exam scores before comparing class averages.
  • Business: Checking if sales data are normally distributed before applying forecasting or regression models.

Example: A dataset of 30 exam scores gives W=0.94 and p=0.04. Since p<0.05, the data are not normally distributed, and a non-parametric test should be used instead.

Relevance / Impact

The Shapiro–Wilk Test is one of the most reliable methods for testing normality, particularly with smaller datasets. By determining whether the assumption of normality holds, it helps analysts choose the correct statistical method, ensuring valid and trustworthy conclusions.

See also

Start today. Join 4,125 professionals.

Guidance from experienced Lean specialists
One fixed price, no hidden costs
Pass your exam with a 100% guarantee
Receive an internationally recognized certificate
Learn where and when you want, at your own pace.
Start for free with a realistic demo
Guidance from experienced Lean specialists
One fixed price, no hidden costs
Pass your exam with a 100% guarantee
Receive an internationally recognized certificate
Learn where and when you want, at your own pace.
Start for free with a realistic demo
HomeWikiShapiro–Wilk Test