Method in practice
- Collect process data (e.g., CI/CD build time, API latency, MRR churn).
- Compute the mean (μ):Example: builds = 10, 12, 11, 9, 10 min → μ = 10.4 min
- Compute the standard deviation (σ):For the same builds σ ≈ 1.02 min
- Build control limits:
- UCL = ≈ 13.46 min
- LCL = ≈ 7.34 min
What to do with normal variation (inside limits)
- Don’t panic. This is expected noise, not a problem.
- Keep statistics. Watch for sharp changes in the mean and σ.
- Optimize systematically. If you see a trend in the mean or σ increasing, improve the whole process rather than reacting to single points.
- Automate monitoring. Alert only on out-of-control signals — the team won’t be distracted by noise.
Avoid
- Reacting to every spike that stays within limits.
- Filing a “bug” in Jira or touching code if the value is within control.
- Calling panic meetings over single data points within limits.
Examples
1. SaaS / Business
- Average MRR churn = 5%, σ = 0.4%
- UCL = 6.2%, LCL = 3.8%
- 5.9% → normal → keep observing
- 8.2% → anomaly → investigate the cause (outside 6σ)
2. Engineering / CI/CD
- Average build time = 10 min, σ = 0.5 min
- UCL = 11.5 min, LCL = 8.5 min
- 11 min → normal → nothing to fix
- 13.2 min → anomaly → find the specific cause (outside 6σ)
3. DevOps / API latency
- Average latency = 200 ms, σ = 10 ms
- UCL = 230 ms, LCL = 170 ms
- 220 ms → normal → keep monitoring
- 265 ms → anomaly → investigate (outside 6σ)
Note: how to handle outliers (±3σ)
In process control and Shewhart statistics, values outside (or “6σ” in Six Sigma terms) are usually considered anomalies and do not reflect normal process variation. Therefore:
- When building control limits, do not include such outliers in the calculation of the mean (μ) and σ, otherwise the limits will shift and become less sensitive to real anomalies.
- Exception: if outliers repeat frequently, this is a signal that the process has changed — include them when analyzing the new “system variation”.
Standard practice:
- First compute preliminary μ and σ on historical data without obvious outliers.
- Build the limits.
- Any value outside the limits → anomaly → investigate, and do not use it to recompute μ and σ until you understand the cause.
Tip: treat values inside the limits as normal noise, and outliers as an “alarm signal” — they shouldn’t distort the chart.
Why this matters
Without control limits teams tend to:
- Fix noise, wasting energy on normal fluctuations.
- Miss true anomalies.
- Burn out from constant alert fatigue.
- Slow product progress by deferring systemic improvements.
With control limits teams can:
- Respond only to real problems.
- Preserve focus and energy.
- Trust metrics for decision-making.
Conclusion
Shewhart control limits are a tool for calm and effectiveness. Knowing the ±3σ boundaries helps you stop “fixing noise” and start improving the process systematically.