Knowledge base

Central Limit Theorem (CLT)

Introduction: CLT

The Central Limit Theorem (CLT) is one of the most important principles in statistics. It states that when independent random samples are taken from any population with a finite mean and variance, the distribution of the sample means will approximate a normal distribution as the sample size increases, regardless of the original population’s shape.

Background

The theorem was developed through the work of mathematicians like Abraham de Moivre, Pierre-Simon Laplace, and Carl Friedrich Gauss. It provides the theoretical foundation for why normal distributions appear so frequently in real-world data and statistical analyses. The CLT supports many statistical methods, including hypothesis testing and confidence intervals, by allowing researchers to apply normal distribution assumptions to sample means.

Key Elements / Features

  • Sampling Distribution of the Mean: As the sample size n increases, the sampling distribution of the mean approaches normality.
  • Mean and Standard Error: The mean of the sampling distribution equals the population mean (μ), and its standard deviation (standard error) equals  \frac{\sigma}{\sqrt{n}}.
  • Independence: Samples must be independent for the CLT to hold.
  • Sample Size: The approximation to normality improves with larger n (commonly n ≥ 30 is sufficient).
  • Universality: Works even if the population is not normally distributed, as long as it has a finite variance.

Formula

Z = \frac{\bar{X} - \mu}{\sigma / \sqrt{n}}

Where: 

\bar{X} = sample mean 

\mu = population mean 

\sigma = population standard deviation 

n = sample size 

Applications / Examples

The CLT underpins many statistical analyses. For instance, in quality control or Six Sigma, process averages are assumed to follow a normal distribution to calculate process capability. In finance, it supports portfolio return modelling, and in healthcare, it helps estimate population health metrics from sample data.

Relevance / Impact

The Central Limit Theorem bridges the gap between non-normal real-world data and the mathematical convenience of the normal distribution. It enables practical use of inferential statistics across Lean Six Sigma, scientific research, and business analytics.

See also

Anend Harkhoe
Lean Consultant & Trainer | MBA in Lean & Six Sigma | Founder of Dmaic.com & Lean.nl
With extensive experience in healthcare (hospitals, elderly care, mental health, GP practices), banking and insurance, manufacturing, the food industry, consulting, IT services, and government, Anend is eager to guide you into the world of Lean and Six Sigma. He believes in the power of people, action, and experimentation. At Dmaic.com and Lean.nl, everything revolves around practical knowledge and hands-on training. Lean is not just a theory—it’s a way of life that you need to experience. From Tokyo’s karaoke bars to Toyota’s lessons—Anend makes Lean tangible and applicable. Lean.nl organises inspiring training sessions and study trips to Lean companies in Japan, such as Toyota. Contact: info@dmaic.com

Online Lean courses
100% Lean, at your own pace

Most popular article