SlideShare a Scribd company logo
1 of 4
Section & Lesson #:
Pre-Requisite Lessons:
Complex Tools + Clear Teaching = Powerful Results
Distributions: Normal
Six Sigma-Measure – Lesson 9
A review of normal distributions and how to test their normality using a
normality test.
Six Sigma-Measure #08 – Distributions: Overview
Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted by any means
(electronic, mechanical, photographic, photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
Normal Distributions (bell curve)
o Why is it called a “normal” distribution?
• “Normal” implies the typical randomness that we expect to occur in life.
• If there was no randomness, then we can presume there is some influence (bias or skewness).
• We want the data that we analyze to be unbiased, therefore we need to ensure it reflects the
“normal” randomness we would expect.
 Otherwise if the data is biased, then why analyze it if we can’t be confident we’ll find the right root cause?
o A normal distribution is bell-shaped.
• The bell shape is created because most
of the data points fall in the middle.
• The shape of the bell is influenced by
the mean and standard deviation.
o Characteristics of a normal distribution.
• Completely described by its mean and standard deviation.
• The tails on either end of the curve extend +/- infinity.
• The area under the curve represents 100% of possible observations.
• The curve is symmetrical where 50% of the data points fall on either side of the mean.
• The mean (average) will be relatively equal to the median (50th percentile).
Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be
reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic,
photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
Normality Testing
o A distribution with a bell curve doesn’t necessarily mean it’s “normal”.
• It’s not uncommon for a distribution to appear normal, but it really isn’t.
• The normality of the distribution should be statistically tested.
o Use the Anderson-Darling test of a Normality Test or Probability Plot.
• A normality test or probability plot will plot the data on a logarithmic scale.
 In Minitab, go to Stat > Basic Statistics > Normality Test or go to Graph > Probability Plot
• Normal data will appear like a straight line; Minitab will try to fit a line along the data points.
 A “fat pencil” test is if a fat pencil can lay
across and cover all the data points, then
it’s probably a normal distribution.
• A better test is to examine the p-value
of the Anderson-Darling (AD) test.
 If p-value > 0.05, then it’s normal.
 If p-value < 0.05, then it’s not normal.
– In these examples, 0.05 is referring to the
alpha risk which is the % chance of being
right when concluding the data is normal.
5% is a commonly acceptable threshold.
Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be
reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic,
photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
3
This p-value is > 0.05
which means it’s a
normal distribution
Practical Application
o Open the “Minitab Sample Data.MPJ” file and try to do the following:
• Run a normality test on each continuous metric.
• Which metrics are normally distributed? How can you prove it?
o Next, pull some historical data for at least 2 continuous metrics used by your
organization and try following the same steps described above.
Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be
reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic,
photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
4

More Related Content

What's hot

Carma internet research module sample size considerations
Carma internet research module   sample size considerationsCarma internet research module   sample size considerations
Carma internet research module sample size considerations
Syracuse University
 
East Infosheet
East InfosheetEast Infosheet
East Infosheet
Cytel
 

What's hot (20)

MSA – Overview
MSA – OverviewMSA – Overview
MSA – Overview
 
The Necessity of the Measure Phase with Matt Hansen at StatStuff
The Necessity of the Measure Phase with Matt Hansen at StatStuffThe Necessity of the Measure Phase with Matt Hansen at StatStuff
The Necessity of the Measure Phase with Matt Hansen at StatStuff
 
Statistical Process Control (SPC)
Statistical Process Control (SPC)Statistical Process Control (SPC)
Statistical Process Control (SPC)
 
Measure Phase Roadmap (Level 3) with Matt Hansen at StatStuff
Measure Phase Roadmap (Level 3) with Matt Hansen at StatStuffMeasure Phase Roadmap (Level 3) with Matt Hansen at StatStuff
Measure Phase Roadmap (Level 3) with Matt Hansen at StatStuff
 
Variation Over Time (Short/Long Term Data)
Variation Over Time (Short/Long Term Data)Variation Over Time (Short/Long Term Data)
Variation Over Time (Short/Long Term Data)
 
MSA – Improving the Measurement System
MSA – Improving the Measurement SystemMSA – Improving the Measurement System
MSA – Improving the Measurement System
 
Identify Root Causes – Building the DCP
Identify Root Causes – Building the DCPIdentify Root Causes – Building the DCP
Identify Root Causes – Building the DCP
 
Rational Sub-Grouping
Rational Sub-GroupingRational Sub-Grouping
Rational Sub-Grouping
 
Process Capability: Steps 1 to 3
Process Capability: Steps 1 to 3Process Capability: Steps 1 to 3
Process Capability: Steps 1 to 3
 
MSA – Attribute ARR Test
MSA – Attribute ARR TestMSA – Attribute ARR Test
MSA – Attribute ARR Test
 
Data drift and machine learning
Data drift and machine learningData drift and machine learning
Data drift and machine learning
 
ML Drift - How to find issues before they become problems
ML Drift - How to find issues before they become problemsML Drift - How to find issues before they become problems
ML Drift - How to find issues before they become problems
 
Carma internet research module sample size considerations
Carma internet research module   sample size considerationsCarma internet research module   sample size considerations
Carma internet research module sample size considerations
 
How to calculate power in statistics
How to calculate power in statisticsHow to calculate power in statistics
How to calculate power in statistics
 
A predictive analytics primer
A predictive analytics primerA predictive analytics primer
A predictive analytics primer
 
Top Five Ideas -- Statistics for Project Management
Top Five Ideas -- Statistics for Project ManagementTop Five Ideas -- Statistics for Project Management
Top Five Ideas -- Statistics for Project Management
 
ICAR - IFPRI- Power Calculation
ICAR - IFPRI- Power CalculationICAR - IFPRI- Power Calculation
ICAR - IFPRI- Power Calculation
 
What Is a Model, Anyhow?
What Is a Model, Anyhow?What Is a Model, Anyhow?
What Is a Model, Anyhow?
 
Process Capability: Step 4 (Normal Distributions)
Process Capability: Step 4 (Normal Distributions)Process Capability: Step 4 (Normal Distributions)
Process Capability: Step 4 (Normal Distributions)
 
East Infosheet
East InfosheetEast Infosheet
East Infosheet
 

Similar to Distributions: Normal with Matt Hansen at StatStuff

Similar to Distributions: Normal with Matt Hansen at StatStuff (20)

Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive Statistics
 
Overview of Statistical Terms and Concepts with Matt Hansen at StatStuff
Overview of Statistical Terms and Concepts with Matt Hansen at StatStuffOverview of Statistical Terms and Concepts with Matt Hansen at StatStuff
Overview of Statistical Terms and Concepts with Matt Hansen at StatStuff
 
Hypothesis Testing: Central Tendency – Non-Normal (Nonparametric Overview)
Hypothesis Testing: Central Tendency – Non-Normal (Nonparametric Overview)Hypothesis Testing: Central Tendency – Non-Normal (Nonparametric Overview)
Hypothesis Testing: Central Tendency – Non-Normal (Nonparametric Overview)
 
Hypothesis Testing: Spread (Compare 1:Standard)
Hypothesis Testing: Spread (Compare 1:Standard)Hypothesis Testing: Spread (Compare 1:Standard)
Hypothesis Testing: Spread (Compare 1:Standard)
 
Hypothesis Testing: Statistical Laws and Confidence Intervals
Hypothesis Testing: Statistical Laws and Confidence IntervalsHypothesis Testing: Statistical Laws and Confidence Intervals
Hypothesis Testing: Statistical Laws and Confidence Intervals
 
Hypothesis Testing: Central Tendency – Normal (Compare 1:Standard)
Hypothesis Testing: Central Tendency – Normal (Compare 1:Standard)Hypothesis Testing: Central Tendency – Normal (Compare 1:Standard)
Hypothesis Testing: Central Tendency – Normal (Compare 1:Standard)
 
Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:Standard)
Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:Standard)Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:Standard)
Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:Standard)
 
Hypothesis Testing: Proportions (Compare 1:Standard)
Hypothesis Testing: Proportions (Compare 1:Standard)Hypothesis Testing: Proportions (Compare 1:Standard)
Hypothesis Testing: Proportions (Compare 1:Standard)
 
Hypothesis Testing: Relationships (Overview)
Hypothesis Testing: Relationships (Overview)Hypothesis Testing: Relationships (Overview)
Hypothesis Testing: Relationships (Overview)
 
Hypothesis Testing: Spread (Compare 1:1)
Hypothesis Testing: Spread (Compare 1:1)Hypothesis Testing: Spread (Compare 1:1)
Hypothesis Testing: Spread (Compare 1:1)
 
Hypothesis Testing: Central Tendency – Normal (Compare 2+ Factors)
Hypothesis Testing: Central Tendency – Normal (Compare 2+ Factors)Hypothesis Testing: Central Tendency – Normal (Compare 2+ Factors)
Hypothesis Testing: Central Tendency – Normal (Compare 2+ Factors)
 
Hypothesis Testing: Formal and Informal Sub-Processes
Hypothesis Testing: Formal and Informal Sub-ProcessesHypothesis Testing: Formal and Informal Sub-Processes
Hypothesis Testing: Formal and Informal Sub-Processes
 
Hypothesis Testing: Proportions (Compare 2+ Factors)
Hypothesis Testing: Proportions (Compare 2+ Factors)Hypothesis Testing: Proportions (Compare 2+ Factors)
Hypothesis Testing: Proportions (Compare 2+ Factors)
 
Lec 5 - Normality Testing.pptx
Lec 5 - Normality Testing.pptxLec 5 - Normality Testing.pptx
Lec 5 - Normality Testing.pptx
 
Hypothesis Testing: Central Tendency – Normal (Compare 1:1)
Hypothesis Testing: Central Tendency – Normal (Compare 1:1)Hypothesis Testing: Central Tendency – Normal (Compare 1:1)
Hypothesis Testing: Central Tendency – Normal (Compare 1:1)
 
Testing for Multicollinearity
Testing for MulticollinearityTesting for Multicollinearity
Testing for Multicollinearity
 
Hypothesis Testing: Overview
Hypothesis Testing: OverviewHypothesis Testing: Overview
Hypothesis Testing: Overview
 
Hypothesis Testing: Central Tendency – Non-Normal (Compare 2+ Factors)
Hypothesis Testing: Central Tendency – Non-Normal (Compare 2+ Factors)Hypothesis Testing: Central Tendency – Non-Normal (Compare 2+ Factors)
Hypothesis Testing: Central Tendency – Non-Normal (Compare 2+ Factors)
 
Hypothesis testing 1.0
Hypothesis testing 1.0Hypothesis testing 1.0
Hypothesis testing 1.0
 
Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:1)
Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:1)Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:1)
Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:1)
 

More from Matt Hansen

More from Matt Hansen (20)

Getting Feedback with a Plus/Delta Tool
Getting Feedback with a Plus/Delta ToolGetting Feedback with a Plus/Delta Tool
Getting Feedback with a Plus/Delta Tool
 
Closing a Project
Closing a ProjectClosing a Project
Closing a Project
 
Documenting a New Process with SOPs
Documenting a New Process with SOPsDocumenting a New Process with SOPs
Documenting a New Process with SOPs
 
Building a Control Plan
Building a Control PlanBuilding a Control Plan
Building a Control Plan
 
Control Charts: Recalculating Control Limits
Control Charts: Recalculating Control LimitsControl Charts: Recalculating Control Limits
Control Charts: Recalculating Control Limits
 
Control Charts: U Chart
Control Charts: U ChartControl Charts: U Chart
Control Charts: U Chart
 
Control Charts: P Chart
Control Charts: P ChartControl Charts: P Chart
Control Charts: P Chart
 
Control Charts: Xbar-S Chart
Control Charts: Xbar-S ChartControl Charts: Xbar-S Chart
Control Charts: Xbar-S Chart
 
Control Charts: I-MR Chart
Control Charts: I-MR ChartControl Charts: I-MR Chart
Control Charts: I-MR Chart
 
Control Charts: Finding the Right Control Chart
Control Charts: Finding the Right Control ChartControl Charts: Finding the Right Control Chart
Control Charts: Finding the Right Control Chart
 
Building a Scorecard
Building a ScorecardBuilding a Scorecard
Building a Scorecard
 
Control Phase Roadmap (Level 3)
Control Phase Roadmap (Level 3)Control Phase Roadmap (Level 3)
Control Phase Roadmap (Level 3)
 
Piloting Solutions: Build the Pilot Plan
Piloting Solutions: Build the Pilot PlanPiloting Solutions: Build the Pilot Plan
Piloting Solutions: Build the Pilot Plan
 
Piloting Solutions: The Process
Piloting Solutions: The ProcessPiloting Solutions: The Process
Piloting Solutions: The Process
 
Risk Assessment with a FMEA Tool
Risk Assessment with a FMEA ToolRisk Assessment with a FMEA Tool
Risk Assessment with a FMEA Tool
 
Prioritize Solutions with an Impact Matrix
Prioritize Solutions with an Impact MatrixPrioritize Solutions with an Impact Matrix
Prioritize Solutions with an Impact Matrix
 
Brainstorm Solutions with an Affinity Diagram
Brainstorm Solutions with an Affinity DiagramBrainstorm Solutions with an Affinity Diagram
Brainstorm Solutions with an Affinity Diagram
 
Brainstorm & Prioritize Solutions with a Workout
Brainstorm & Prioritize Solutions with a WorkoutBrainstorm & Prioritize Solutions with a Workout
Brainstorm & Prioritize Solutions with a Workout
 
Compiling Analysis Results
Compiling Analysis ResultsCompiling Analysis Results
Compiling Analysis Results
 
Improve Phase Roadmap (Level 3)
Improve Phase Roadmap (Level 3)Improve Phase Roadmap (Level 3)
Improve Phase Roadmap (Level 3)
 

Recently uploaded

NewBase 24 May 2024 Energy News issue - 1727 by Khaled Al Awadi_compresse...
NewBase   24 May  2024  Energy News issue - 1727 by Khaled Al Awadi_compresse...NewBase   24 May  2024  Energy News issue - 1727 by Khaled Al Awadi_compresse...
NewBase 24 May 2024 Energy News issue - 1727 by Khaled Al Awadi_compresse...
Khaled Al Awadi
 
zidauu _business communication.pptx /pdf
zidauu _business  communication.pptx /pdfzidauu _business  communication.pptx /pdf
zidauu _business communication.pptx /pdf
zukhrafshabbir
 
Constitution of Company Article of Association
Constitution of Company Article of AssociationConstitution of Company Article of Association
Constitution of Company Article of Association
seri bangash
 

Recently uploaded (20)

Meaningful Technology for Humans: How Strategy Helps to Deliver Real Value fo...
Meaningful Technology for Humans: How Strategy Helps to Deliver Real Value fo...Meaningful Technology for Humans: How Strategy Helps to Deliver Real Value fo...
Meaningful Technology for Humans: How Strategy Helps to Deliver Real Value fo...
 
NewBase 24 May 2024 Energy News issue - 1727 by Khaled Al Awadi_compresse...
NewBase   24 May  2024  Energy News issue - 1727 by Khaled Al Awadi_compresse...NewBase   24 May  2024  Energy News issue - 1727 by Khaled Al Awadi_compresse...
NewBase 24 May 2024 Energy News issue - 1727 by Khaled Al Awadi_compresse...
 
MichaelStarkes_UncutGemsProjectSummary.pdf
MichaelStarkes_UncutGemsProjectSummary.pdfMichaelStarkes_UncutGemsProjectSummary.pdf
MichaelStarkes_UncutGemsProjectSummary.pdf
 
Elevate Your Online Presence with SEO Services
Elevate Your Online Presence with SEO ServicesElevate Your Online Presence with SEO Services
Elevate Your Online Presence with SEO Services
 
FEXLE- Salesforce Field Service Lightning
FEXLE- Salesforce Field Service LightningFEXLE- Salesforce Field Service Lightning
FEXLE- Salesforce Field Service Lightning
 
Unveiling Gemini: Traits and Personality of the Twins
Unveiling Gemini: Traits and Personality of the TwinsUnveiling Gemini: Traits and Personality of the Twins
Unveiling Gemini: Traits and Personality of the Twins
 
zidauu _business communication.pptx /pdf
zidauu _business  communication.pptx /pdfzidauu _business  communication.pptx /pdf
zidauu _business communication.pptx /pdf
 
New Product Development.kjiy7ggbfdsddggo9lo
New Product Development.kjiy7ggbfdsddggo9loNew Product Development.kjiy7ggbfdsddggo9lo
New Product Development.kjiy7ggbfdsddggo9lo
 
How to refresh to be fit for the future world
How to refresh to be fit for the future worldHow to refresh to be fit for the future world
How to refresh to be fit for the future world
 
A Brief Introduction About Jacob Badgett
A Brief Introduction About Jacob BadgettA Brief Introduction About Jacob Badgett
A Brief Introduction About Jacob Badgett
 
Team-Spandex-Northern University-CS1035.
Team-Spandex-Northern University-CS1035.Team-Spandex-Northern University-CS1035.
Team-Spandex-Northern University-CS1035.
 
Constitution of Company Article of Association
Constitution of Company Article of AssociationConstitution of Company Article of Association
Constitution of Company Article of Association
 
How to Maintain Healthy Life style.pptx
How to Maintain  Healthy Life style.pptxHow to Maintain  Healthy Life style.pptx
How to Maintain Healthy Life style.pptx
 
The Inspiring Personality To Watch In 2024.pdf
The Inspiring Personality To Watch In 2024.pdfThe Inspiring Personality To Watch In 2024.pdf
The Inspiring Personality To Watch In 2024.pdf
 
8 Questions B2B Commercial Teams Can Ask To Help Product Discovery
8 Questions B2B Commercial Teams Can Ask To Help Product Discovery8 Questions B2B Commercial Teams Can Ask To Help Product Discovery
8 Questions B2B Commercial Teams Can Ask To Help Product Discovery
 
Revolutionizing Industries: The Power of Carbon Components
Revolutionizing Industries: The Power of Carbon ComponentsRevolutionizing Industries: The Power of Carbon Components
Revolutionizing Industries: The Power of Carbon Components
 
Innomantra Viewpoint - Building Moonshots : May-Jun 2024.pdf
Innomantra Viewpoint - Building Moonshots : May-Jun 2024.pdfInnomantra Viewpoint - Building Moonshots : May-Jun 2024.pdf
Innomantra Viewpoint - Building Moonshots : May-Jun 2024.pdf
 
The Truth About Dinesh Bafna's Situation.pdf
The Truth About Dinesh Bafna's Situation.pdfThe Truth About Dinesh Bafna's Situation.pdf
The Truth About Dinesh Bafna's Situation.pdf
 
Unlock Your TikTok Potential: Free TikTok Likes with InstBlast
Unlock Your TikTok Potential: Free TikTok Likes with InstBlastUnlock Your TikTok Potential: Free TikTok Likes with InstBlast
Unlock Your TikTok Potential: Free TikTok Likes with InstBlast
 
Creative Ideas for Interactive Team Presentations
Creative Ideas for Interactive Team PresentationsCreative Ideas for Interactive Team Presentations
Creative Ideas for Interactive Team Presentations
 

Distributions: Normal with Matt Hansen at StatStuff

  • 1. Section & Lesson #: Pre-Requisite Lessons: Complex Tools + Clear Teaching = Powerful Results Distributions: Normal Six Sigma-Measure – Lesson 9 A review of normal distributions and how to test their normality using a normality test. Six Sigma-Measure #08 – Distributions: Overview Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic, photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
  • 2. Normal Distributions (bell curve) o Why is it called a “normal” distribution? • “Normal” implies the typical randomness that we expect to occur in life. • If there was no randomness, then we can presume there is some influence (bias or skewness). • We want the data that we analyze to be unbiased, therefore we need to ensure it reflects the “normal” randomness we would expect.  Otherwise if the data is biased, then why analyze it if we can’t be confident we’ll find the right root cause? o A normal distribution is bell-shaped. • The bell shape is created because most of the data points fall in the middle. • The shape of the bell is influenced by the mean and standard deviation. o Characteristics of a normal distribution. • Completely described by its mean and standard deviation. • The tails on either end of the curve extend +/- infinity. • The area under the curve represents 100% of possible observations. • The curve is symmetrical where 50% of the data points fall on either side of the mean. • The mean (average) will be relatively equal to the median (50th percentile). Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic, photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
  • 3. Normality Testing o A distribution with a bell curve doesn’t necessarily mean it’s “normal”. • It’s not uncommon for a distribution to appear normal, but it really isn’t. • The normality of the distribution should be statistically tested. o Use the Anderson-Darling test of a Normality Test or Probability Plot. • A normality test or probability plot will plot the data on a logarithmic scale.  In Minitab, go to Stat > Basic Statistics > Normality Test or go to Graph > Probability Plot • Normal data will appear like a straight line; Minitab will try to fit a line along the data points.  A “fat pencil” test is if a fat pencil can lay across and cover all the data points, then it’s probably a normal distribution. • A better test is to examine the p-value of the Anderson-Darling (AD) test.  If p-value > 0.05, then it’s normal.  If p-value < 0.05, then it’s not normal. – In these examples, 0.05 is referring to the alpha risk which is the % chance of being right when concluding the data is normal. 5% is a commonly acceptable threshold. Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic, photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher. 3 This p-value is > 0.05 which means it’s a normal distribution
  • 4. Practical Application o Open the “Minitab Sample Data.MPJ” file and try to do the following: • Run a normality test on each continuous metric. • Which metrics are normally distributed? How can you prove it? o Next, pull some historical data for at least 2 continuous metrics used by your organization and try following the same steps described above. Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic, photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher. 4