SlideShare a Scribd company logo
1 of 6
Section & Lesson #:
Pre-Requisite Lessons:
Complex Tools + Clear Teaching = Powerful Results
Hypothesis Testing: Statistical Laws
and Confidence Intervals
Six Sigma-Analyze – Lesson 11
An extension on a series about hypothesis testing, this lesson introduces
some statistical concepts that are fundamental to most hypothesis testing.
Six Sigma-Analyze #10 – Hypothesis Testing – Formal and
Informal Sub-Processes
Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted by any means
(electronic, mechanical, photographic, photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
Some Statistical Laws
o There are 3 general laws commonly associated with statistics:
• The Law of Averages, the Law of Large Numbers (LLN), and the Central Limit Theorem (CLT).
o Law of Averages
• This is actually not a law but a lay term generally applied by people unfamiliar with statistics.
• It is a belief that the outcome of random events will eventually “even out”.
 Example 1: Belief an event is “due” to happen.
– If you flip a coin 10 times and get heads all 10 times, this belief presumes that you’re “due”
to get tails next. However, the probability of getting tails doesn’t change; it’s still 50/50.
– However, since getting heads 10 straight times is unexpected, then it’s more likely that the
coin is unevenly weighted (bias). If so, there’s a greater probability of getting heads again.
 Example 2: Belief a sample’s average must equal its expected value.
– If you flip a coin 100 times, there’s only an 8% chance there will be exactly 50 heads.
 Example 3: Belief that a rare occurrence will happen given enough time.
– "If I send my résumé to enough places, someone will eventually hire me."
– This may actually be true assuming nonzero probabilities and that the number of trials is
really large enough; the law of averages is then simply the Law of Large Numbers (LLN).
o Law of Large Numbers (LLN)
• This is a legitimate theorem that the average of the results obtained from a large number of
samples will be close to the expected value and will get closer as you obtain more samples.
 Example: The expected average value of a six-sided die is (1 + 2 + 3 + 4 + 5 + 6) ÷ 6 = 3.5.
– Roll the die 10 times, the average of those 10 rolls will be close to 3.5.
– The more trials or sets of 10 rolls you record, then the closer the cumulative average will be to 3.5.
• This law is the premise for the Central Limit Theorem (CLT) applied in statistical testing.
Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be
reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic,
photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
2
Central Limit Theorem (CLT)
o The statistical tests used in hypothesis testing are founded on the CLT.
• Understanding this theorem can help you understand the basis for most statistical tests.
o What is the Central Limit Theorem (CLT)?
• Technically, the means of random samples from any distribution (normal or non-normal) with a
mean of μ and a variance of σ2 will have the following:
 An approximately normal distribution.
 A mean equal to μ.
 A variance equal to σ2/n.
o Example: Rolling Dice
• Suppose you roll a pair of dice 10 times and write down the average of all the rolls. If you
continue rolling the dice and writing down the average value after each set of 10 rolls, then:
 The averages of each of those sets of rolls will be about the same.
 The more you roll, the variance between the values will become more & more narrow (less variation).
• What if the dice are weighted (bias)?
 The theorem results are the same, except the average will shift.
o What can we conclude from the CLT?
• The more samples you collect, the more confident
you can be about what the mean is for all
samples of the population.
Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be
reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic,
photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
3
N = 5 sets of dice rolls
N = 10 sets of dice rolls
N = 20 sets of dice rolls
Confidence Intervals (CI)
o What is a confidence interval (CI)?
• It represents the range (lower & upper bounds) in which the population mean should reside
based on the data in the sample.
• Let’s look at the impact of the CI using the prior dice example:
• The spread of the range is dependent on the size of the sample (more samples will reduce the
interval) and the desired confidence (95% confidence will be more narrow than 99%).
 Remember the Standard Error of the Mean (SE Mean)? It is the descriptive statistic used to calculate the
confidence interval. To get 95% CI, calculate SE Mean by 2, or by 3 to get 99%.
• Remember, statistics are intended to help you make inferences about a population.
 A mean of 57 with a 95% confidence interval of 55 and 59 implies that although the sample mean is only
57, you can be 95% confident that the population mean will fall somewhere between 55 and 59.
Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be
reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic,
photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
4
Based on 5 sets of dice rolls, the population
mean should fall within this interval.
Based on 10 sets of dice rolls, the population
man should fall within this more narrow interval.
Based on 20 sets of dice rolls, the population
man should fall within this more narrow interval.
Using CIs in Statistical Tests
o How do statistical tests use the confidence intervals (CI)?
• Statistical tests generally compare the different CIs between factors to see if a difference exists.
o Example: Compare efficiency between two systems
• Scenario: 50 similar transactions were each run through two different systems. System A’s time
was 180 seconds and System B’s time was 160 seconds (assume standard deviation of about 1/3).
 Is System B statistically faster? What is the best/worst case difference in efficiency for System B?
o Bottom Line: Increase your sample size to decrease (narrow) your confidence interval.
5
Employees
Contractors
0 5 10 15 20
Large overlap indicates the population
mean may be the same for both
groups; therefore there is no difference
between them (high P-value).
N = 50
Employees
Contractors
0 5 10 15 20
Small overlap indicates a smaller chance
the population mean may be the same for
both groups; therefore there may be no
difference between them (small P-value).
N = 100
Employees
Contractors
0 5 10 15 20
No overlap indicates the population
mean is different for both groups;
therefore there is a difference between
them (low or no P-value).
N = 200
200195190185180175170165160155150145
System B
This overlap suggests both may have the same speed.
System A
Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be
reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic,
photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
Practical Application
o Scenario:
• You have two sets of continuous data that you’re comparing; each set has 100 values. The first
dataset has an average value of 127 with a confidence interval of 117 to 137, and the second
dataset has an average value of 132 with a confidence interval of 123 and 140.
o Try to answer the following questions based on the above scenario:
• Would you consider the confidence intervals to be wide or narrow?
• Would you expect the P-value to be high, medium, or low?
• Is there a statistical difference between these two datasets?
 How confident (roughly) would you be in your conclusion?
• How would the analysis change if you were only able to collect 50 samples per dataset?
• How would the analysis change if you collect 300 samples per dataset?
Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be
reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic,
photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
6

More Related Content

What's hot

Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:Standard)
Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:Standard)Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:Standard)
Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:Standard)Matt Hansen
 
Hypothesis Testing: Proportions (Compare 1:Standard)
Hypothesis Testing: Proportions (Compare 1:Standard)Hypothesis Testing: Proportions (Compare 1:Standard)
Hypothesis Testing: Proportions (Compare 1:Standard)Matt Hansen
 
Hypothesis Testing: Central Tendency – Non-Normal (Nonparametric Overview)
Hypothesis Testing: Central Tendency – Non-Normal (Nonparametric Overview)Hypothesis Testing: Central Tendency – Non-Normal (Nonparametric Overview)
Hypothesis Testing: Central Tendency – Non-Normal (Nonparametric Overview)Matt Hansen
 
Hypothesis Testing: Spread (Compare 1:Standard)
Hypothesis Testing: Spread (Compare 1:Standard)Hypothesis Testing: Spread (Compare 1:Standard)
Hypothesis Testing: Spread (Compare 1:Standard)Matt Hansen
 
Hypothesis Testing: Finding the Right Statistical Test
Hypothesis Testing: Finding the Right Statistical TestHypothesis Testing: Finding the Right Statistical Test
Hypothesis Testing: Finding the Right Statistical TestMatt Hansen
 
Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:1)
Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:1)Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:1)
Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:1)Matt Hansen
 
Hypothesis Testing: Overview
Hypothesis Testing: OverviewHypothesis Testing: Overview
Hypothesis Testing: OverviewMatt Hansen
 
Hypothesis Testing: Spread (Compare 2+ Factors)
Hypothesis Testing: Spread (Compare 2+ Factors)Hypothesis Testing: Spread (Compare 2+ Factors)
Hypothesis Testing: Spread (Compare 2+ Factors)Matt Hansen
 
Hypothesis Testing: Proportions (Compare 2+ Factors)
Hypothesis Testing: Proportions (Compare 2+ Factors)Hypothesis Testing: Proportions (Compare 2+ Factors)
Hypothesis Testing: Proportions (Compare 2+ Factors)Matt Hansen
 
Hypothesis Testing: Formal and Informal Sub-Processes
Hypothesis Testing: Formal and Informal Sub-ProcessesHypothesis Testing: Formal and Informal Sub-Processes
Hypothesis Testing: Formal and Informal Sub-ProcessesMatt Hansen
 
Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive StatisticsMatt Hansen
 
Hypothesis Testing: Relationships (Compare 2+ Factors)
Hypothesis Testing: Relationships (Compare 2+ Factors)Hypothesis Testing: Relationships (Compare 2+ Factors)
Hypothesis Testing: Relationships (Compare 2+ Factors)Matt Hansen
 
Testing for Multicollinearity
Testing for MulticollinearityTesting for Multicollinearity
Testing for MulticollinearityMatt Hansen
 
2016 Symposium Poster - statistics - Final
2016 Symposium Poster - statistics - Final2016 Symposium Poster - statistics - Final
2016 Symposium Poster - statistics - FinalBrian Lin
 
Dowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inferenceDowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inferenceAmit Sharma
 
MAT80 - White paper july 2017 - Prof. P. Irwing
MAT80 - White paper july 2017 - Prof. P. IrwingMAT80 - White paper july 2017 - Prof. P. Irwing
MAT80 - White paper july 2017 - Prof. P. IrwingPaul Irwing
 
Practical Statistical Testing
Practical Statistical TestingPractical Statistical Testing
Practical Statistical TestingAdrian Cuyugan
 
Causal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine LearningCausal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine LearningBill Liu
 
Basic Statistical Concepts & Decision-Making
Basic Statistical Concepts & Decision-MakingBasic Statistical Concepts & Decision-Making
Basic Statistical Concepts & Decision-MakingPenn State University
 
8. testing of hypothesis for variable & attribute data
8. testing of hypothesis for variable & attribute  data8. testing of hypothesis for variable & attribute  data
8. testing of hypothesis for variable & attribute dataHakeem-Ur- Rehman
 

What's hot (20)

Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:Standard)
Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:Standard)Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:Standard)
Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:Standard)
 
Hypothesis Testing: Proportions (Compare 1:Standard)
Hypothesis Testing: Proportions (Compare 1:Standard)Hypothesis Testing: Proportions (Compare 1:Standard)
Hypothesis Testing: Proportions (Compare 1:Standard)
 
Hypothesis Testing: Central Tendency – Non-Normal (Nonparametric Overview)
Hypothesis Testing: Central Tendency – Non-Normal (Nonparametric Overview)Hypothesis Testing: Central Tendency – Non-Normal (Nonparametric Overview)
Hypothesis Testing: Central Tendency – Non-Normal (Nonparametric Overview)
 
Hypothesis Testing: Spread (Compare 1:Standard)
Hypothesis Testing: Spread (Compare 1:Standard)Hypothesis Testing: Spread (Compare 1:Standard)
Hypothesis Testing: Spread (Compare 1:Standard)
 
Hypothesis Testing: Finding the Right Statistical Test
Hypothesis Testing: Finding the Right Statistical TestHypothesis Testing: Finding the Right Statistical Test
Hypothesis Testing: Finding the Right Statistical Test
 
Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:1)
Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:1)Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:1)
Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:1)
 
Hypothesis Testing: Overview
Hypothesis Testing: OverviewHypothesis Testing: Overview
Hypothesis Testing: Overview
 
Hypothesis Testing: Spread (Compare 2+ Factors)
Hypothesis Testing: Spread (Compare 2+ Factors)Hypothesis Testing: Spread (Compare 2+ Factors)
Hypothesis Testing: Spread (Compare 2+ Factors)
 
Hypothesis Testing: Proportions (Compare 2+ Factors)
Hypothesis Testing: Proportions (Compare 2+ Factors)Hypothesis Testing: Proportions (Compare 2+ Factors)
Hypothesis Testing: Proportions (Compare 2+ Factors)
 
Hypothesis Testing: Formal and Informal Sub-Processes
Hypothesis Testing: Formal and Informal Sub-ProcessesHypothesis Testing: Formal and Informal Sub-Processes
Hypothesis Testing: Formal and Informal Sub-Processes
 
Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive Statistics
 
Hypothesis Testing: Relationships (Compare 2+ Factors)
Hypothesis Testing: Relationships (Compare 2+ Factors)Hypothesis Testing: Relationships (Compare 2+ Factors)
Hypothesis Testing: Relationships (Compare 2+ Factors)
 
Testing for Multicollinearity
Testing for MulticollinearityTesting for Multicollinearity
Testing for Multicollinearity
 
2016 Symposium Poster - statistics - Final
2016 Symposium Poster - statistics - Final2016 Symposium Poster - statistics - Final
2016 Symposium Poster - statistics - Final
 
Dowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inferenceDowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inference
 
MAT80 - White paper july 2017 - Prof. P. Irwing
MAT80 - White paper july 2017 - Prof. P. IrwingMAT80 - White paper july 2017 - Prof. P. Irwing
MAT80 - White paper july 2017 - Prof. P. Irwing
 
Practical Statistical Testing
Practical Statistical TestingPractical Statistical Testing
Practical Statistical Testing
 
Causal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine LearningCausal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine Learning
 
Basic Statistical Concepts & Decision-Making
Basic Statistical Concepts & Decision-MakingBasic Statistical Concepts & Decision-Making
Basic Statistical Concepts & Decision-Making
 
8. testing of hypothesis for variable & attribute data
8. testing of hypothesis for variable & attribute  data8. testing of hypothesis for variable & attribute  data
8. testing of hypothesis for variable & attribute data
 

Similar to Hypothesis Testing: Statistical Laws and Confidence Intervals

Overview of Statistical Terms and Concepts with Matt Hansen at StatStuff
Overview of Statistical Terms and Concepts with Matt Hansen at StatStuffOverview of Statistical Terms and Concepts with Matt Hansen at StatStuff
Overview of Statistical Terms and Concepts with Matt Hansen at StatStuffMatt Hansen
 
law of large number and central limit theorem
 law of large number and central limit theorem law of large number and central limit theorem
law of large number and central limit theoremlovemucheca
 
Normal and standard normal distribution
Normal and standard normal distributionNormal and standard normal distribution
Normal and standard normal distributionAvjinder (Avi) Kaler
 
Module-2_Notes-with-Example for data science
Module-2_Notes-with-Example for data scienceModule-2_Notes-with-Example for data science
Module-2_Notes-with-Example for data sciencepujashri1975
 
Hypothesis testing: A single sample test
Hypothesis testing: A single sample testHypothesis testing: A single sample test
Hypothesis testing: A single sample testUmme Salma Tuli
 
03-Data-Analysis-Final.pdf
03-Data-Analysis-Final.pdf03-Data-Analysis-Final.pdf
03-Data-Analysis-Final.pdfSugumarSarDurai
 
Frequentist inference only seems easy By John Mount
Frequentist inference only seems easy By John MountFrequentist inference only seems easy By John Mount
Frequentist inference only seems easy By John MountChester Chen
 
The%20 Minimum%20 Daily%20 Adult%20 %20 Ca Cmg
The%20 Minimum%20 Daily%20 Adult%20 %20 Ca CmgThe%20 Minimum%20 Daily%20 Adult%20 %20 Ca Cmg
The%20 Minimum%20 Daily%20 Adult%20 %20 Ca Cmgdahirf
 
Clinical Trials Versus Health Outcomes Research: SAS/STAT Versus SAS Enterpri...
Clinical Trials Versus Health Outcomes Research: SAS/STAT Versus SAS Enterpri...Clinical Trials Versus Health Outcomes Research: SAS/STAT Versus SAS Enterpri...
Clinical Trials Versus Health Outcomes Research: SAS/STAT Versus SAS Enterpri...cambridgeWD
 
Clinical Trials Versus Health Outcomes Research: SAS/STAT Versus SAS Enterpri...
Clinical Trials Versus Health Outcomes Research: SAS/STAT Versus SAS Enterpri...Clinical Trials Versus Health Outcomes Research: SAS/STAT Versus SAS Enterpri...
Clinical Trials Versus Health Outcomes Research: SAS/STAT Versus SAS Enterpri...cambridgeWD
 
Confidence intervals
Confidence intervalsConfidence intervals
Confidence intervalsTanay Tandon
 
Theory of estimation
Theory of estimationTheory of estimation
Theory of estimationTech_MX
 
Introduction to Statistics - Part 2
Introduction to Statistics - Part 2Introduction to Statistics - Part 2
Introduction to Statistics - Part 2Damian T. Gordon
 
1 statistical analysis notes
1 statistical analysis notes1 statistical analysis notes
1 statistical analysis notescartlidge
 
1. complete stats notes
1. complete stats notes1. complete stats notes
1. complete stats notesBob Smullen
 

Similar to Hypothesis Testing: Statistical Laws and Confidence Intervals (20)

Overview of Statistical Terms and Concepts with Matt Hansen at StatStuff
Overview of Statistical Terms and Concepts with Matt Hansen at StatStuffOverview of Statistical Terms and Concepts with Matt Hansen at StatStuff
Overview of Statistical Terms and Concepts with Matt Hansen at StatStuff
 
law of large number and central limit theorem
 law of large number and central limit theorem law of large number and central limit theorem
law of large number and central limit theorem
 
Normal and standard normal distribution
Normal and standard normal distributionNormal and standard normal distribution
Normal and standard normal distribution
 
Module-2_Notes-with-Example for data science
Module-2_Notes-with-Example for data scienceModule-2_Notes-with-Example for data science
Module-2_Notes-with-Example for data science
 
Statistics.pdf
Statistics.pdfStatistics.pdf
Statistics.pdf
 
Sampling
SamplingSampling
Sampling
 
Hypothesis testing: A single sample test
Hypothesis testing: A single sample testHypothesis testing: A single sample test
Hypothesis testing: A single sample test
 
03-Data-Analysis-Final.pdf
03-Data-Analysis-Final.pdf03-Data-Analysis-Final.pdf
03-Data-Analysis-Final.pdf
 
Frequentist inference only seems easy By John Mount
Frequentist inference only seems easy By John MountFrequentist inference only seems easy By John Mount
Frequentist inference only seems easy By John Mount
 
The%20 Minimum%20 Daily%20 Adult%20 %20 Ca Cmg
The%20 Minimum%20 Daily%20 Adult%20 %20 Ca CmgThe%20 Minimum%20 Daily%20 Adult%20 %20 Ca Cmg
The%20 Minimum%20 Daily%20 Adult%20 %20 Ca Cmg
 
Clinical Trials Versus Health Outcomes Research: SAS/STAT Versus SAS Enterpri...
Clinical Trials Versus Health Outcomes Research: SAS/STAT Versus SAS Enterpri...Clinical Trials Versus Health Outcomes Research: SAS/STAT Versus SAS Enterpri...
Clinical Trials Versus Health Outcomes Research: SAS/STAT Versus SAS Enterpri...
 
Clinical Trials Versus Health Outcomes Research: SAS/STAT Versus SAS Enterpri...
Clinical Trials Versus Health Outcomes Research: SAS/STAT Versus SAS Enterpri...Clinical Trials Versus Health Outcomes Research: SAS/STAT Versus SAS Enterpri...
Clinical Trials Versus Health Outcomes Research: SAS/STAT Versus SAS Enterpri...
 
Confidence intervals
Confidence intervalsConfidence intervals
Confidence intervals
 
Theory of estimation
Theory of estimationTheory of estimation
Theory of estimation
 
Cognitive Bias in Risk-Reward Analysis
Cognitive Bias in Risk-Reward AnalysisCognitive Bias in Risk-Reward Analysis
Cognitive Bias in Risk-Reward Analysis
 
Introduction to Statistics - Part 2
Introduction to Statistics - Part 2Introduction to Statistics - Part 2
Introduction to Statistics - Part 2
 
1 statistical analysis notes
1 statistical analysis notes1 statistical analysis notes
1 statistical analysis notes
 
1. complete stats notes
1. complete stats notes1. complete stats notes
1. complete stats notes
 
Sampling Size
Sampling SizeSampling Size
Sampling Size
 
226 lec9 jda
226 lec9 jda226 lec9 jda
226 lec9 jda
 

More from Matt Hansen

Getting Feedback with a Plus/Delta Tool
Getting Feedback with a Plus/Delta ToolGetting Feedback with a Plus/Delta Tool
Getting Feedback with a Plus/Delta ToolMatt Hansen
 
Closing a Project
Closing a ProjectClosing a Project
Closing a ProjectMatt Hansen
 
Documenting a New Process with SOPs
Documenting a New Process with SOPsDocumenting a New Process with SOPs
Documenting a New Process with SOPsMatt Hansen
 
Building a Control Plan
Building a Control PlanBuilding a Control Plan
Building a Control PlanMatt Hansen
 
Control Charts: Recalculating Control Limits
Control Charts: Recalculating Control LimitsControl Charts: Recalculating Control Limits
Control Charts: Recalculating Control LimitsMatt Hansen
 
Control Charts: U Chart
Control Charts: U ChartControl Charts: U Chart
Control Charts: U ChartMatt Hansen
 
Control Charts: P Chart
Control Charts: P ChartControl Charts: P Chart
Control Charts: P ChartMatt Hansen
 
Control Charts: Xbar-S Chart
Control Charts: Xbar-S ChartControl Charts: Xbar-S Chart
Control Charts: Xbar-S ChartMatt Hansen
 
Control Charts: I-MR Chart
Control Charts: I-MR ChartControl Charts: I-MR Chart
Control Charts: I-MR ChartMatt Hansen
 
Control Charts: Finding the Right Control Chart
Control Charts: Finding the Right Control ChartControl Charts: Finding the Right Control Chart
Control Charts: Finding the Right Control ChartMatt Hansen
 
Building a Scorecard
Building a ScorecardBuilding a Scorecard
Building a ScorecardMatt Hansen
 
Control Phase Roadmap (Level 3)
Control Phase Roadmap (Level 3)Control Phase Roadmap (Level 3)
Control Phase Roadmap (Level 3)Matt Hansen
 
Piloting Solutions: Build the Pilot Plan
Piloting Solutions: Build the Pilot PlanPiloting Solutions: Build the Pilot Plan
Piloting Solutions: Build the Pilot PlanMatt Hansen
 
Piloting Solutions: The Process
Piloting Solutions: The ProcessPiloting Solutions: The Process
Piloting Solutions: The ProcessMatt Hansen
 
Risk Assessment with a FMEA Tool
Risk Assessment with a FMEA ToolRisk Assessment with a FMEA Tool
Risk Assessment with a FMEA ToolMatt Hansen
 
Prioritize Solutions with an Impact Matrix
Prioritize Solutions with an Impact MatrixPrioritize Solutions with an Impact Matrix
Prioritize Solutions with an Impact MatrixMatt Hansen
 
Brainstorm Solutions with an Affinity Diagram
Brainstorm Solutions with an Affinity DiagramBrainstorm Solutions with an Affinity Diagram
Brainstorm Solutions with an Affinity DiagramMatt Hansen
 
Brainstorm & Prioritize Solutions with a Workout
Brainstorm & Prioritize Solutions with a WorkoutBrainstorm & Prioritize Solutions with a Workout
Brainstorm & Prioritize Solutions with a WorkoutMatt Hansen
 
Compiling Analysis Results
Compiling Analysis ResultsCompiling Analysis Results
Compiling Analysis ResultsMatt Hansen
 
Improve Phase Roadmap (Level 3)
Improve Phase Roadmap (Level 3)Improve Phase Roadmap (Level 3)
Improve Phase Roadmap (Level 3)Matt Hansen
 

More from Matt Hansen (20)

Getting Feedback with a Plus/Delta Tool
Getting Feedback with a Plus/Delta ToolGetting Feedback with a Plus/Delta Tool
Getting Feedback with a Plus/Delta Tool
 
Closing a Project
Closing a ProjectClosing a Project
Closing a Project
 
Documenting a New Process with SOPs
Documenting a New Process with SOPsDocumenting a New Process with SOPs
Documenting a New Process with SOPs
 
Building a Control Plan
Building a Control PlanBuilding a Control Plan
Building a Control Plan
 
Control Charts: Recalculating Control Limits
Control Charts: Recalculating Control LimitsControl Charts: Recalculating Control Limits
Control Charts: Recalculating Control Limits
 
Control Charts: U Chart
Control Charts: U ChartControl Charts: U Chart
Control Charts: U Chart
 
Control Charts: P Chart
Control Charts: P ChartControl Charts: P Chart
Control Charts: P Chart
 
Control Charts: Xbar-S Chart
Control Charts: Xbar-S ChartControl Charts: Xbar-S Chart
Control Charts: Xbar-S Chart
 
Control Charts: I-MR Chart
Control Charts: I-MR ChartControl Charts: I-MR Chart
Control Charts: I-MR Chart
 
Control Charts: Finding the Right Control Chart
Control Charts: Finding the Right Control ChartControl Charts: Finding the Right Control Chart
Control Charts: Finding the Right Control Chart
 
Building a Scorecard
Building a ScorecardBuilding a Scorecard
Building a Scorecard
 
Control Phase Roadmap (Level 3)
Control Phase Roadmap (Level 3)Control Phase Roadmap (Level 3)
Control Phase Roadmap (Level 3)
 
Piloting Solutions: Build the Pilot Plan
Piloting Solutions: Build the Pilot PlanPiloting Solutions: Build the Pilot Plan
Piloting Solutions: Build the Pilot Plan
 
Piloting Solutions: The Process
Piloting Solutions: The ProcessPiloting Solutions: The Process
Piloting Solutions: The Process
 
Risk Assessment with a FMEA Tool
Risk Assessment with a FMEA ToolRisk Assessment with a FMEA Tool
Risk Assessment with a FMEA Tool
 
Prioritize Solutions with an Impact Matrix
Prioritize Solutions with an Impact MatrixPrioritize Solutions with an Impact Matrix
Prioritize Solutions with an Impact Matrix
 
Brainstorm Solutions with an Affinity Diagram
Brainstorm Solutions with an Affinity DiagramBrainstorm Solutions with an Affinity Diagram
Brainstorm Solutions with an Affinity Diagram
 
Brainstorm & Prioritize Solutions with a Workout
Brainstorm & Prioritize Solutions with a WorkoutBrainstorm & Prioritize Solutions with a Workout
Brainstorm & Prioritize Solutions with a Workout
 
Compiling Analysis Results
Compiling Analysis ResultsCompiling Analysis Results
Compiling Analysis Results
 
Improve Phase Roadmap (Level 3)
Improve Phase Roadmap (Level 3)Improve Phase Roadmap (Level 3)
Improve Phase Roadmap (Level 3)
 

Recently uploaded

Call Girls Miyapur 7001305949 all area service COD available Any Time
Call Girls Miyapur 7001305949 all area service COD available Any TimeCall Girls Miyapur 7001305949 all area service COD available Any Time
Call Girls Miyapur 7001305949 all area service COD available Any Timedelhimodelshub1
 
Investment in The Coconut Industry by Nancy Cheruiyot
Investment in The Coconut Industry by Nancy CheruiyotInvestment in The Coconut Industry by Nancy Cheruiyot
Investment in The Coconut Industry by Nancy Cheruiyotictsugar
 
Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737Riya Pathan
 
NewBase 19 April 2024 Energy News issue - 1717 by Khaled Al Awadi.pdf
NewBase  19 April  2024  Energy News issue - 1717 by Khaled Al Awadi.pdfNewBase  19 April  2024  Energy News issue - 1717 by Khaled Al Awadi.pdf
NewBase 19 April 2024 Energy News issue - 1717 by Khaled Al Awadi.pdfKhaled Al Awadi
 
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,noida100girls
 
Future Of Sample Report 2024 | Redacted Version
Future Of Sample Report 2024 | Redacted VersionFuture Of Sample Report 2024 | Redacted Version
Future Of Sample Report 2024 | Redacted VersionMintel Group
 
8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCR8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCRashishs7044
 
Market Sizes Sample Report - 2024 Edition
Market Sizes Sample Report - 2024 EditionMarket Sizes Sample Report - 2024 Edition
Market Sizes Sample Report - 2024 EditionMintel Group
 
2024 Numerator Consumer Study of Cannabis Usage
2024 Numerator Consumer Study of Cannabis Usage2024 Numerator Consumer Study of Cannabis Usage
2024 Numerator Consumer Study of Cannabis UsageNeil Kimberley
 
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...lizamodels9
 
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
Keppel Ltd. 1Q 2024 Business Update  Presentation SlidesKeppel Ltd. 1Q 2024 Business Update  Presentation Slides
Keppel Ltd. 1Q 2024 Business Update Presentation SlidesKeppelCorporation
 
Global Scenario On Sustainable and Resilient Coconut Industry by Dr. Jelfina...
Global Scenario On Sustainable  and Resilient Coconut Industry by Dr. Jelfina...Global Scenario On Sustainable  and Resilient Coconut Industry by Dr. Jelfina...
Global Scenario On Sustainable and Resilient Coconut Industry by Dr. Jelfina...ictsugar
 
Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Kirill Klimov
 
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...lizamodels9
 
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort ServiceCall US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort Servicecallgirls2057
 
Kenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby AfricaKenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby Africaictsugar
 
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deckPitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deckHajeJanKamps
 
Kenya Coconut Production Presentation by Dr. Lalith Perera
Kenya Coconut Production Presentation by Dr. Lalith PereraKenya Coconut Production Presentation by Dr. Lalith Perera
Kenya Coconut Production Presentation by Dr. Lalith Pereraictsugar
 
Lean: From Theory to Practice — One City’s (and Library’s) Lean Story… Abridged
Lean: From Theory to Practice — One City’s (and Library’s) Lean Story… AbridgedLean: From Theory to Practice — One City’s (and Library’s) Lean Story… Abridged
Lean: From Theory to Practice — One City’s (and Library’s) Lean Story… AbridgedKaiNexus
 
MAHA Global and IPR: Do Actions Speak Louder Than Words?
MAHA Global and IPR: Do Actions Speak Louder Than Words?MAHA Global and IPR: Do Actions Speak Louder Than Words?
MAHA Global and IPR: Do Actions Speak Louder Than Words?Olivia Kresic
 

Recently uploaded (20)

Call Girls Miyapur 7001305949 all area service COD available Any Time
Call Girls Miyapur 7001305949 all area service COD available Any TimeCall Girls Miyapur 7001305949 all area service COD available Any Time
Call Girls Miyapur 7001305949 all area service COD available Any Time
 
Investment in The Coconut Industry by Nancy Cheruiyot
Investment in The Coconut Industry by Nancy CheruiyotInvestment in The Coconut Industry by Nancy Cheruiyot
Investment in The Coconut Industry by Nancy Cheruiyot
 
Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737
 
NewBase 19 April 2024 Energy News issue - 1717 by Khaled Al Awadi.pdf
NewBase  19 April  2024  Energy News issue - 1717 by Khaled Al Awadi.pdfNewBase  19 April  2024  Energy News issue - 1717 by Khaled Al Awadi.pdf
NewBase 19 April 2024 Energy News issue - 1717 by Khaled Al Awadi.pdf
 
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
 
Future Of Sample Report 2024 | Redacted Version
Future Of Sample Report 2024 | Redacted VersionFuture Of Sample Report 2024 | Redacted Version
Future Of Sample Report 2024 | Redacted Version
 
8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCR8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCR
 
Market Sizes Sample Report - 2024 Edition
Market Sizes Sample Report - 2024 EditionMarket Sizes Sample Report - 2024 Edition
Market Sizes Sample Report - 2024 Edition
 
2024 Numerator Consumer Study of Cannabis Usage
2024 Numerator Consumer Study of Cannabis Usage2024 Numerator Consumer Study of Cannabis Usage
2024 Numerator Consumer Study of Cannabis Usage
 
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
 
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
Keppel Ltd. 1Q 2024 Business Update  Presentation SlidesKeppel Ltd. 1Q 2024 Business Update  Presentation Slides
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
 
Global Scenario On Sustainable and Resilient Coconut Industry by Dr. Jelfina...
Global Scenario On Sustainable  and Resilient Coconut Industry by Dr. Jelfina...Global Scenario On Sustainable  and Resilient Coconut Industry by Dr. Jelfina...
Global Scenario On Sustainable and Resilient Coconut Industry by Dr. Jelfina...
 
Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024
 
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
 
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort ServiceCall US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
 
Kenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby AfricaKenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby Africa
 
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deckPitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
 
Kenya Coconut Production Presentation by Dr. Lalith Perera
Kenya Coconut Production Presentation by Dr. Lalith PereraKenya Coconut Production Presentation by Dr. Lalith Perera
Kenya Coconut Production Presentation by Dr. Lalith Perera
 
Lean: From Theory to Practice — One City’s (and Library’s) Lean Story… Abridged
Lean: From Theory to Practice — One City’s (and Library’s) Lean Story… AbridgedLean: From Theory to Practice — One City’s (and Library’s) Lean Story… Abridged
Lean: From Theory to Practice — One City’s (and Library’s) Lean Story… Abridged
 
MAHA Global and IPR: Do Actions Speak Louder Than Words?
MAHA Global and IPR: Do Actions Speak Louder Than Words?MAHA Global and IPR: Do Actions Speak Louder Than Words?
MAHA Global and IPR: Do Actions Speak Louder Than Words?
 

Hypothesis Testing: Statistical Laws and Confidence Intervals

  • 1. Section & Lesson #: Pre-Requisite Lessons: Complex Tools + Clear Teaching = Powerful Results Hypothesis Testing: Statistical Laws and Confidence Intervals Six Sigma-Analyze – Lesson 11 An extension on a series about hypothesis testing, this lesson introduces some statistical concepts that are fundamental to most hypothesis testing. Six Sigma-Analyze #10 – Hypothesis Testing – Formal and Informal Sub-Processes Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic, photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
  • 2. Some Statistical Laws o There are 3 general laws commonly associated with statistics: • The Law of Averages, the Law of Large Numbers (LLN), and the Central Limit Theorem (CLT). o Law of Averages • This is actually not a law but a lay term generally applied by people unfamiliar with statistics. • It is a belief that the outcome of random events will eventually “even out”.  Example 1: Belief an event is “due” to happen. – If you flip a coin 10 times and get heads all 10 times, this belief presumes that you’re “due” to get tails next. However, the probability of getting tails doesn’t change; it’s still 50/50. – However, since getting heads 10 straight times is unexpected, then it’s more likely that the coin is unevenly weighted (bias). If so, there’s a greater probability of getting heads again.  Example 2: Belief a sample’s average must equal its expected value. – If you flip a coin 100 times, there’s only an 8% chance there will be exactly 50 heads.  Example 3: Belief that a rare occurrence will happen given enough time. – "If I send my résumé to enough places, someone will eventually hire me." – This may actually be true assuming nonzero probabilities and that the number of trials is really large enough; the law of averages is then simply the Law of Large Numbers (LLN). o Law of Large Numbers (LLN) • This is a legitimate theorem that the average of the results obtained from a large number of samples will be close to the expected value and will get closer as you obtain more samples.  Example: The expected average value of a six-sided die is (1 + 2 + 3 + 4 + 5 + 6) ÷ 6 = 3.5. – Roll the die 10 times, the average of those 10 rolls will be close to 3.5. – The more trials or sets of 10 rolls you record, then the closer the cumulative average will be to 3.5. • This law is the premise for the Central Limit Theorem (CLT) applied in statistical testing. Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic, photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher. 2
  • 3. Central Limit Theorem (CLT) o The statistical tests used in hypothesis testing are founded on the CLT. • Understanding this theorem can help you understand the basis for most statistical tests. o What is the Central Limit Theorem (CLT)? • Technically, the means of random samples from any distribution (normal or non-normal) with a mean of μ and a variance of σ2 will have the following:  An approximately normal distribution.  A mean equal to μ.  A variance equal to σ2/n. o Example: Rolling Dice • Suppose you roll a pair of dice 10 times and write down the average of all the rolls. If you continue rolling the dice and writing down the average value after each set of 10 rolls, then:  The averages of each of those sets of rolls will be about the same.  The more you roll, the variance between the values will become more & more narrow (less variation). • What if the dice are weighted (bias)?  The theorem results are the same, except the average will shift. o What can we conclude from the CLT? • The more samples you collect, the more confident you can be about what the mean is for all samples of the population. Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic, photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher. 3 N = 5 sets of dice rolls N = 10 sets of dice rolls N = 20 sets of dice rolls
  • 4. Confidence Intervals (CI) o What is a confidence interval (CI)? • It represents the range (lower & upper bounds) in which the population mean should reside based on the data in the sample. • Let’s look at the impact of the CI using the prior dice example: • The spread of the range is dependent on the size of the sample (more samples will reduce the interval) and the desired confidence (95% confidence will be more narrow than 99%).  Remember the Standard Error of the Mean (SE Mean)? It is the descriptive statistic used to calculate the confidence interval. To get 95% CI, calculate SE Mean by 2, or by 3 to get 99%. • Remember, statistics are intended to help you make inferences about a population.  A mean of 57 with a 95% confidence interval of 55 and 59 implies that although the sample mean is only 57, you can be 95% confident that the population mean will fall somewhere between 55 and 59. Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic, photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher. 4 Based on 5 sets of dice rolls, the population mean should fall within this interval. Based on 10 sets of dice rolls, the population man should fall within this more narrow interval. Based on 20 sets of dice rolls, the population man should fall within this more narrow interval.
  • 5. Using CIs in Statistical Tests o How do statistical tests use the confidence intervals (CI)? • Statistical tests generally compare the different CIs between factors to see if a difference exists. o Example: Compare efficiency between two systems • Scenario: 50 similar transactions were each run through two different systems. System A’s time was 180 seconds and System B’s time was 160 seconds (assume standard deviation of about 1/3).  Is System B statistically faster? What is the best/worst case difference in efficiency for System B? o Bottom Line: Increase your sample size to decrease (narrow) your confidence interval. 5 Employees Contractors 0 5 10 15 20 Large overlap indicates the population mean may be the same for both groups; therefore there is no difference between them (high P-value). N = 50 Employees Contractors 0 5 10 15 20 Small overlap indicates a smaller chance the population mean may be the same for both groups; therefore there may be no difference between them (small P-value). N = 100 Employees Contractors 0 5 10 15 20 No overlap indicates the population mean is different for both groups; therefore there is a difference between them (low or no P-value). N = 200 200195190185180175170165160155150145 System B This overlap suggests both may have the same speed. System A Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic, photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
  • 6. Practical Application o Scenario: • You have two sets of continuous data that you’re comparing; each set has 100 values. The first dataset has an average value of 127 with a confidence interval of 117 to 137, and the second dataset has an average value of 132 with a confidence interval of 123 and 140. o Try to answer the following questions based on the above scenario: • Would you consider the confidence intervals to be wide or narrow? • Would you expect the P-value to be high, medium, or low? • Is there a statistical difference between these two datasets?  How confident (roughly) would you be in your conclusion? • How would the analysis change if you were only able to collect 50 samples per dataset? • How would the analysis change if you collect 300 samples per dataset? Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic, photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher. 6