In statistics and data analysis, the Span refers to the difference between two specified percentiles of a dataset, most commonly the 5th and 95th percentiles. It provides a measure of spread that excludes extreme values, making it useful for understanding variability without the distortion of outliers.
While the traditional range is calculated as the maximum minus the minimum value, this can be heavily influenced by extreme values. The Span addresses this limitation by focusing on percentiles, typically capturing the middle 90% of the data. This makes it a more robust measure of variability in many real-world applications.
Span is widely applied across disciplines:
For example, in quality control, if the P95 of a process measurement is 110 units and the P5 is 90 units, the Span is 20 units, representing the variation of the central 90% of outcomes.
Span provides a simple yet effective way to measure variability. It is more resilient to outliers than traditional range calculations and supports better decision-making in data-driven fields.
See also