SlideShare a Scribd company logo
1 of 11
Section & Lesson #:
Pre-Requisite Lessons:
Complex Tools + Clear Teaching = Powerful Results
Hypothesis Testing: Central Tendency –
Non-Normal (Compare 2+ Factors)
Six Sigma-Analyze – Lesson 22
An extension on hypothesis testing, this lesson reviews the Mood’s Median &
Kruskal-Wallis tests as central tendency measurements for non-normal dist.
Six Sigma-Analyze #21 – Hypothesis Testing: Central Tendency –
Non-Normal (Compare 1:1)
Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted by any means
(electronic, mechanical, photographic, photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
Why do we need hypothesis testing?
o Remember, our project goal is to resolve a problem by first building a transfer function.
• We don’t want to just alleviate symptoms, we want to resolve the root cause.
 Remember Hannah? We don’t want to alleviate the arthritis pain in her leg, but heal the strep throat.
• If we don’t know what the root cause is, then we need to build a transfer function.
 By building a transfer function, we can know what changes (improvements) should fix the root cause.
o Remember, the Transfer Function is defined as Y = f(X).
• This is described as “output response Y
is a function of one or more input X’s”.
• It’s part of the IPO flow model where we
described the IPO flow model as one or
more inputs feeding into a process that
transforms it to create a new output.
o How does a transfer function fit with hypothesis testing?
• Hypothesis testing tells us which X’s (inputs) are independently influencing the Y (output).
 When we reject a null hypothesis, we’re building evidence proving which X’s are “guilty” of driving the Y.
 We’ll compile all the evidence in the Improve phase of DMAIC and begin to fix those root causes.
Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be
reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic,
photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
Y = f(X)
Input (X) > Process > Output (Y)
Review Hypothesis Testing: 4 Step Process
o Remember, the 4 high-level steps for hypothesis testing begin/end with being practical:
o As the heart of hypothesis testing, steps 2 & 3 can be drilled to the following 6 steps:
Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be
reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic,
photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
3
Practical Statistical
Problem Problem
Solution Solution
 

Practical Problem
State the problem as a
practical Yes/No question.
 Statistical Problem
Convert the problem to an
analytical question identifying
the statistical tool/method.

Practical Solution
Interpret the analytical
answer in a practical way.
Statistical Solution
Interpret the results of the
hypothesis test with an
analytical answer.

1. Define the objective.
2. State the Null Hypothesis (H0) and Alternative Hypothesis (Ha).
3. Define the confidence (1-α) and power (beta or 1-β).
4. Collect the sample data.
5. Calculate the P-value.
6. Interpret the results: accept or reject the null hypothesis (H0).
Is the data type for both values discrete?
What are you measuring?
Is the data normal?
SpreadCentral Tendency
Compare 1:Standard
1 Proportion Test
Compare 1:1
2 Proportion Test
Compare 2+ Factors
Chi2 Test
Compare 1:Standard
1 Sample T Test
Compare 1:1
2 Sample T Test
or Paired T Test
Compare 2+ Factors
One-way ANOVA Test
Compare 1:Standard
1 Sample Wilcoxon
or 1 Sample Sign
Compare 1:1
Mann-Whitney Test
Compare 2+ Factors
Mood’s Median Test
or Kruskal-Wallis Test
Compare 1:Standard
1 Variance Test
Compare 1:1
2 Variance Test
Compare 2+ Factors
Test for Equal Variances
Yes No
Yes No
Proportions
Compare 1:1
Pearson Correlation
or Fitted Line Plot
Compare 2+ Factors
Multiple Regression or
General Linear Model
Is the data type for both values continuous?
Relationships
No Yes
Review Finding the Right Statistical Test
o What statistical test do I use for my hypothesis testing?
• The type of statistical test depends on the data to be tested, as described in the chart below:
Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be
reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic,
photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
4
Lesson
Current
Confidence Intervals (CI) Redefined
o Remember, confidence intervals (CI) are an estimated upper/lower range of the mean.
• The CI narrows when you add more samples:
• Remember, statistics are intended to help you make inferences about a population.
 A mean of 57 with a 95% confidence interval of 55 and 59 implies that although the sample mean is only
57, you can be 95% confident that the population mean will fall somewhere between 55 and 59.
• The first step in hypothesis testing is to define the objective by asking “is there a difference…”
 Statistical tests generally compare the different CIs between factors to see if a difference exists.
Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be
reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic,
photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
5
With fewer samples, the population
mean falls within a wide interval.
Add more samples and the population
mean falls in a more narrow interval.
With more samples, the more confident
(narrow) the mean interval becomes.
Factor A
Factor B
2.5 2.75 3.0 3.25 3.5
Large overlap indicates the population
mean may be the same for both
groups; therefore there is no difference
between them (high P-value).
N = 20
Factor A
Factor B
2.5 2.75 3.0 3.25 3.5
Small overlap indicates a smaller chance
the population mean may be the same for
both groups; therefore there is may be no
difference between them (small P-value).
N = 50
Factor A
Factor B
2.5 2.75 3.0 3.25 3.5
No overlap indicates the population
mean is different for both groups;
therefore there is a difference between
them (low or no P-value).
N = 100
Compare 2+ Factors Tests: Introduction
o When should I use it?
• To compare median values for multiple factors across different populations.
o There are two types of tests to compare medians for multiple factors:
• Mood’s Median is a test when the population contains many outliers.
• Kruskal-Wallis Test is a stronger test, except when there are many outliers.
o How do you know if there are many outliers?
• Use a boxplot to identify possible outliers (at right).
 To create a boxplot, go to Graph > Boxplot…
o What are the inputs for the test?
• The inputs for both tests are identical:
Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be
reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic,
photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
6
Now
350000
300000
250000
200000
150000
100000
50000
0
MetricC
Boxplot of MetricC
These outliers may be influencing the
distribution. Therefore, you’re better
off using the Mood’s Median test.
The columns containing the
continuous data to test.
The factor to be tested
against the response.
Mood’s Median: Interpreting Results
o What do these results from the Mood’s Median test mean?
o How do I interpret these results?
• If the P-value is less than the α risk, there’s a statistical difference between the factor’s values.
• The IQR is a measurement of the spread for each factor, like a variance. A large difference in IQR
between the values should be validated with a test for equal variances.
Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be
reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic,
photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
7
Now
The amount of overlap
in these CIs are what
affect the p-value.
Factor’s
unique
values
Count below
& above the
median
Median per
unique value
Interquartile
Range (IQR) per
unique value
Illustration of
Confidence Intervals
per unique value
P-Value
Mood’s Median: MetricB Example
o Example: MetricB & CategoryA sample values
• Background:
 Use the arbitrary values in the “MetricB” and “CategoryA” columns of the
Minitab Sample Data file.
• Practical Problem:
 Is there a difference in medians of MetricB for each CategoryA value?
• Statistical Problem:
 State the null (H0) and alternative (Ha) hypotheses:
– H0: ηCategoryA1 = ηCategoryA2 = ηCategoryA3 etc., and Ha: ηCategoryA1 ≠ ηCategoryA2 ≠ ηCategoryA3 etc.
 Define the confidence (1-α) and power (1-β):
– For confidence, we’ll accept the default of 95% (which means α = 5%) and power of 90% (which means β = 10%).
 Type the statistical problem into Minitab:
– In Minitab, go to Stat > Nonparametrics > Mood’s Median Test…
– In the “Response” box, select MetricB from the list of columns.
– In the “Factor” box, select CategoryA from the list of columns.
• Statistical Solution:
 Refer to the session window results.
– Since P-value is > 0.05 (α), then we fail to reject H0.
• Practical Solution:
 The sample is insufficient to prove that the
medians of MetricB are different between the
CatetoryA values.
 If we assumed power=90% (β=10%), then how do
we apply that to this test? What does it mean?
 Which CategoryA values appear to drive what
difference there is? How do you know it?
Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be
reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic,
photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
8
Now
Kruskal-Wallis: Interpreting Results
o What do these results from the Kruskal-Wallis test mean?
o How do I interpret these results?
• If the P-value is less than the α risk, there’s a statistical difference between the factor’s values.
• The factor having a Z-score furthest from zero (positive or negative) has the most difference (in
mean rank) among all observations.
 That is, if there is a statistical difference from this test, it’s most likely affected by the factor(s) having a Z-
value furthest from zero.
Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be
reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic,
photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
9
Now
Factor’s
unique
values
Count
per value
Median per
unique value
Mean rank & Z-
score per value
P-Value
Kruskal-Wallis: MetricB Example
o Example: MetricB & CategoryA sample values
• Background:
 Use the arbitrary values in the “MetricB” and “CategoryA” columns of the
Minitab Sample Data file.
• Practical Problem:
 Is there a difference in medians of MetricB for each CategoryA value?
• Statistical Problem:
 State the null (H0) and alternative (Ha) hypotheses:
– H0: ηCategoryA1 = ηCategoryA2 = ηCategoryA3 etc., and Ha: ηCategoryA1 ≠ ηCategoryA2 ≠ ηCategoryA3 etc.
 Define the confidence (1-α) and power (1-β):
– For confidence, we’ll accept the default of 95% (which means α = 5%) and power of 90% (which means β = 10%).
 Type the statistical problem into Minitab:
– In Minitab, go to Stat > Nonparametrics > Kruskal-Wallis…
– In the “Response” box, select MetricB from the list of columns.
– In the “Factor” box, select CategoryA from the list of columns.
• Statistical Solution:
 Refer to the session window results.
– Since P-value is > 0.05 (α), then we fail to reject H0.
• Practical Solution:
 The sample is insufficient to prove that the medians of
MetricB are different between the CatetoryA values.
 If we assumed power=90% (β=10%), then how do we
apply that to this test? What does it mean?
 Which CategoryA values appear to drive what difference
there is? How do you know it?
 Why is this so different from the Mood’s Median test?
Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be
reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic,
photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
10
Now
Practical Application
o Refer to the critical metric (output Y) and at least 5 factors (input X’s) you identified in
a previous lesson for applying to this hypothesis testing.
• For any factor that is a continuous value, try applying both the Mood’s Median and Kruskal-Wallis
tests.
 To do this, you’ll need a discrete factor that has 2 or more sets of values (e.g., across multiple periods of
time, or different locations, or different groups, etc.).
 Non-parametric tests like these are ideal for non-normal distributions, but you can still run them even if
your continuous value has a normal distribution.
 Other factors in your organization can be used for this exercise.
• Before running either test, do the medians of the factors appear to be different?
• After running either test, are the medians of the factors statistically different?
• If the answers to the above 2 questions are different, then how does that affect how you’d
typically measure and communicate that factor in the organization?
 For example, does the difference between the compared factors affect financial decisions (e.g., how people
are compensated), or process changes (e.g., how the process may be modified), or other critical actions?
 If so, then how should the results from either test be used to influence your organization?
– Should they change how the factors are compared (e.g., across different times, locations, groups, etc.)?
– Should they change how the factor is measured?
– Should they change how they react when they compare the metric this way?
Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be
reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic,
photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
11

More Related Content

What's hot

Hypothesis Testing: Proportions (Compare 2+ Factors)
Hypothesis Testing: Proportions (Compare 2+ Factors)Hypothesis Testing: Proportions (Compare 2+ Factors)
Hypothesis Testing: Proportions (Compare 2+ Factors)Matt Hansen
 
Hypothesis Testing: Central Tendency – Non-Normal (Nonparametric Overview)
Hypothesis Testing: Central Tendency – Non-Normal (Nonparametric Overview)Hypothesis Testing: Central Tendency – Non-Normal (Nonparametric Overview)
Hypothesis Testing: Central Tendency – Non-Normal (Nonparametric Overview)Matt Hansen
 
Hypothesis Testing: Spread (Compare 2+ Factors)
Hypothesis Testing: Spread (Compare 2+ Factors)Hypothesis Testing: Spread (Compare 2+ Factors)
Hypothesis Testing: Spread (Compare 2+ Factors)Matt Hansen
 
Hypothesis Testing: Central Tendency – Normal (Compare 1:Standard)
Hypothesis Testing: Central Tendency – Normal (Compare 1:Standard)Hypothesis Testing: Central Tendency – Normal (Compare 1:Standard)
Hypothesis Testing: Central Tendency – Normal (Compare 1:Standard)Matt Hansen
 
Hypothesis Testing: Statistical Laws and Confidence Intervals
Hypothesis Testing: Statistical Laws and Confidence IntervalsHypothesis Testing: Statistical Laws and Confidence Intervals
Hypothesis Testing: Statistical Laws and Confidence IntervalsMatt Hansen
 
Hypothesis Testing: Spread (Compare 1:1)
Hypothesis Testing: Spread (Compare 1:1)Hypothesis Testing: Spread (Compare 1:1)
Hypothesis Testing: Spread (Compare 1:1)Matt Hansen
 
Hypothesis Testing: Finding the Right Statistical Test
Hypothesis Testing: Finding the Right Statistical TestHypothesis Testing: Finding the Right Statistical Test
Hypothesis Testing: Finding the Right Statistical TestMatt Hansen
 
Hypothesis Testing: Relationships (Overview)
Hypothesis Testing: Relationships (Overview)Hypothesis Testing: Relationships (Overview)
Hypothesis Testing: Relationships (Overview)Matt Hansen
 
Hypothesis Testing: Overview
Hypothesis Testing: OverviewHypothesis Testing: Overview
Hypothesis Testing: OverviewMatt Hansen
 
Hypothesis Testing: Formal and Informal Sub-Processes
Hypothesis Testing: Formal and Informal Sub-ProcessesHypothesis Testing: Formal and Informal Sub-Processes
Hypothesis Testing: Formal and Informal Sub-ProcessesMatt Hansen
 
Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive StatisticsMatt Hansen
 
Hypothesis Testing: Relationships (Compare 2+ Factors)
Hypothesis Testing: Relationships (Compare 2+ Factors)Hypothesis Testing: Relationships (Compare 2+ Factors)
Hypothesis Testing: Relationships (Compare 2+ Factors)Matt Hansen
 
2016 Symposium Poster - statistics - Final
2016 Symposium Poster - statistics - Final2016 Symposium Poster - statistics - Final
2016 Symposium Poster - statistics - FinalBrian Lin
 
Testing for Multicollinearity
Testing for MulticollinearityTesting for Multicollinearity
Testing for MulticollinearityMatt Hansen
 
Dowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inferenceDowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inferenceAmit Sharma
 
MAT80 - White paper july 2017 - Prof. P. Irwing
MAT80 - White paper july 2017 - Prof. P. IrwingMAT80 - White paper july 2017 - Prof. P. Irwing
MAT80 - White paper july 2017 - Prof. P. IrwingPaul Irwing
 
8. testing of hypothesis for variable & attribute data
8. testing of hypothesis for variable & attribute  data8. testing of hypothesis for variable & attribute  data
8. testing of hypothesis for variable & attribute dataHakeem-Ur- Rehman
 
Basic Statistical Concepts & Decision-Making
Basic Statistical Concepts & Decision-MakingBasic Statistical Concepts & Decision-Making
Basic Statistical Concepts & Decision-MakingPenn State University
 
Causal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine LearningCausal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine LearningBill Liu
 
Lecture 7
Lecture 7Lecture 7
Lecture 7butest
 

What's hot (20)

Hypothesis Testing: Proportions (Compare 2+ Factors)
Hypothesis Testing: Proportions (Compare 2+ Factors)Hypothesis Testing: Proportions (Compare 2+ Factors)
Hypothesis Testing: Proportions (Compare 2+ Factors)
 
Hypothesis Testing: Central Tendency – Non-Normal (Nonparametric Overview)
Hypothesis Testing: Central Tendency – Non-Normal (Nonparametric Overview)Hypothesis Testing: Central Tendency – Non-Normal (Nonparametric Overview)
Hypothesis Testing: Central Tendency – Non-Normal (Nonparametric Overview)
 
Hypothesis Testing: Spread (Compare 2+ Factors)
Hypothesis Testing: Spread (Compare 2+ Factors)Hypothesis Testing: Spread (Compare 2+ Factors)
Hypothesis Testing: Spread (Compare 2+ Factors)
 
Hypothesis Testing: Central Tendency – Normal (Compare 1:Standard)
Hypothesis Testing: Central Tendency – Normal (Compare 1:Standard)Hypothesis Testing: Central Tendency – Normal (Compare 1:Standard)
Hypothesis Testing: Central Tendency – Normal (Compare 1:Standard)
 
Hypothesis Testing: Statistical Laws and Confidence Intervals
Hypothesis Testing: Statistical Laws and Confidence IntervalsHypothesis Testing: Statistical Laws and Confidence Intervals
Hypothesis Testing: Statistical Laws and Confidence Intervals
 
Hypothesis Testing: Spread (Compare 1:1)
Hypothesis Testing: Spread (Compare 1:1)Hypothesis Testing: Spread (Compare 1:1)
Hypothesis Testing: Spread (Compare 1:1)
 
Hypothesis Testing: Finding the Right Statistical Test
Hypothesis Testing: Finding the Right Statistical TestHypothesis Testing: Finding the Right Statistical Test
Hypothesis Testing: Finding the Right Statistical Test
 
Hypothesis Testing: Relationships (Overview)
Hypothesis Testing: Relationships (Overview)Hypothesis Testing: Relationships (Overview)
Hypothesis Testing: Relationships (Overview)
 
Hypothesis Testing: Overview
Hypothesis Testing: OverviewHypothesis Testing: Overview
Hypothesis Testing: Overview
 
Hypothesis Testing: Formal and Informal Sub-Processes
Hypothesis Testing: Formal and Informal Sub-ProcessesHypothesis Testing: Formal and Informal Sub-Processes
Hypothesis Testing: Formal and Informal Sub-Processes
 
Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive Statistics
 
Hypothesis Testing: Relationships (Compare 2+ Factors)
Hypothesis Testing: Relationships (Compare 2+ Factors)Hypothesis Testing: Relationships (Compare 2+ Factors)
Hypothesis Testing: Relationships (Compare 2+ Factors)
 
2016 Symposium Poster - statistics - Final
2016 Symposium Poster - statistics - Final2016 Symposium Poster - statistics - Final
2016 Symposium Poster - statistics - Final
 
Testing for Multicollinearity
Testing for MulticollinearityTesting for Multicollinearity
Testing for Multicollinearity
 
Dowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inferenceDowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inference
 
MAT80 - White paper july 2017 - Prof. P. Irwing
MAT80 - White paper july 2017 - Prof. P. IrwingMAT80 - White paper july 2017 - Prof. P. Irwing
MAT80 - White paper july 2017 - Prof. P. Irwing
 
8. testing of hypothesis for variable & attribute data
8. testing of hypothesis for variable & attribute  data8. testing of hypothesis for variable & attribute  data
8. testing of hypothesis for variable & attribute data
 
Basic Statistical Concepts & Decision-Making
Basic Statistical Concepts & Decision-MakingBasic Statistical Concepts & Decision-Making
Basic Statistical Concepts & Decision-Making
 
Causal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine LearningCausal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine Learning
 
Lecture 7
Lecture 7Lecture 7
Lecture 7
 

Similar to Hypothesis Testing: Central Tendency – Non-Normal (Compare 2+ Factors)

What So Funny About Proportion Testv3
What So Funny About Proportion Testv3What So Funny About Proportion Testv3
What So Funny About Proportion Testv3ChrisConnors
 
Quantitative_analysis.ppt
Quantitative_analysis.pptQuantitative_analysis.ppt
Quantitative_analysis.pptmousaderhem1
 
CORE: May the “Power” (Statistical) - Be with You!
CORE: May the “Power” (Statistical) - Be with You!CORE: May the “Power” (Statistical) - Be with You!
CORE: May the “Power” (Statistical) - Be with You!Trident University
 
Assignment 2 Tests of SignificanceThroughout this assignment yo.docx
Assignment 2 Tests of SignificanceThroughout this assignment yo.docxAssignment 2 Tests of SignificanceThroughout this assignment yo.docx
Assignment 2 Tests of SignificanceThroughout this assignment yo.docxrock73
 
Review Z Test Ci 1
Review Z Test Ci 1Review Z Test Ci 1
Review Z Test Ci 1shoffma5
 
Hypothesis Testing Definitions A statistical hypothesi.docx
Hypothesis Testing  Definitions A statistical hypothesi.docxHypothesis Testing  Definitions A statistical hypothesi.docx
Hypothesis Testing Definitions A statistical hypothesi.docxwilcockiris
 
STATISTICS : Changing the way we do: Hypothesis testing, effect size, power, ...
STATISTICS : Changing the way we do: Hypothesis testing, effect size, power, ...STATISTICS : Changing the way we do: Hypothesis testing, effect size, power, ...
STATISTICS : Changing the way we do: Hypothesis testing, effect size, power, ...Musfera Nara Vadia
 
Quantitative analysis
Quantitative analysisQuantitative analysis
Quantitative analysisRajesh Mishra
 
Error Control and Severity
Error Control and SeverityError Control and Severity
Error Control and Severityjemille6
 
Novelties in social science statistics
Novelties in social science statisticsNovelties in social science statistics
Novelties in social science statisticsJiri Haviger
 
Statistics pres 3.31.2014
Statistics pres 3.31.2014Statistics pres 3.31.2014
Statistics pres 3.31.2014tjcarter
 

Similar to Hypothesis Testing: Central Tendency – Non-Normal (Compare 2+ Factors) (16)

What So Funny About Proportion Testv3
What So Funny About Proportion Testv3What So Funny About Proportion Testv3
What So Funny About Proportion Testv3
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
Quantitative_analysis.ppt
Quantitative_analysis.pptQuantitative_analysis.ppt
Quantitative_analysis.ppt
 
CORE: May the “Power” (Statistical) - Be with You!
CORE: May the “Power” (Statistical) - Be with You!CORE: May the “Power” (Statistical) - Be with You!
CORE: May the “Power” (Statistical) - Be with You!
 
Assignment 2 Tests of SignificanceThroughout this assignment yo.docx
Assignment 2 Tests of SignificanceThroughout this assignment yo.docxAssignment 2 Tests of SignificanceThroughout this assignment yo.docx
Assignment 2 Tests of SignificanceThroughout this assignment yo.docx
 
Review Z Test Ci 1
Review Z Test Ci 1Review Z Test Ci 1
Review Z Test Ci 1
 
T test
T test T test
T test
 
Hypothesis Testing Definitions A statistical hypothesi.docx
Hypothesis Testing  Definitions A statistical hypothesi.docxHypothesis Testing  Definitions A statistical hypothesi.docx
Hypothesis Testing Definitions A statistical hypothesi.docx
 
STATISTICS : Changing the way we do: Hypothesis testing, effect size, power, ...
STATISTICS : Changing the way we do: Hypothesis testing, effect size, power, ...STATISTICS : Changing the way we do: Hypothesis testing, effect size, power, ...
STATISTICS : Changing the way we do: Hypothesis testing, effect size, power, ...
 
Causal Inference in Marketing
Causal Inference in MarketingCausal Inference in Marketing
Causal Inference in Marketing
 
Spss session 1 and 2
Spss session 1 and 2Spss session 1 and 2
Spss session 1 and 2
 
Quantitative analysis
Quantitative analysisQuantitative analysis
Quantitative analysis
 
Error Control and Severity
Error Control and SeverityError Control and Severity
Error Control and Severity
 
Meta analysis with R
Meta analysis with RMeta analysis with R
Meta analysis with R
 
Novelties in social science statistics
Novelties in social science statisticsNovelties in social science statistics
Novelties in social science statistics
 
Statistics pres 3.31.2014
Statistics pres 3.31.2014Statistics pres 3.31.2014
Statistics pres 3.31.2014
 

More from Matt Hansen

Getting Feedback with a Plus/Delta Tool
Getting Feedback with a Plus/Delta ToolGetting Feedback with a Plus/Delta Tool
Getting Feedback with a Plus/Delta ToolMatt Hansen
 
Closing a Project
Closing a ProjectClosing a Project
Closing a ProjectMatt Hansen
 
Documenting a New Process with SOPs
Documenting a New Process with SOPsDocumenting a New Process with SOPs
Documenting a New Process with SOPsMatt Hansen
 
Building a Control Plan
Building a Control PlanBuilding a Control Plan
Building a Control PlanMatt Hansen
 
Control Charts: Recalculating Control Limits
Control Charts: Recalculating Control LimitsControl Charts: Recalculating Control Limits
Control Charts: Recalculating Control LimitsMatt Hansen
 
Control Charts: U Chart
Control Charts: U ChartControl Charts: U Chart
Control Charts: U ChartMatt Hansen
 
Control Charts: P Chart
Control Charts: P ChartControl Charts: P Chart
Control Charts: P ChartMatt Hansen
 
Control Charts: Xbar-S Chart
Control Charts: Xbar-S ChartControl Charts: Xbar-S Chart
Control Charts: Xbar-S ChartMatt Hansen
 
Control Charts: I-MR Chart
Control Charts: I-MR ChartControl Charts: I-MR Chart
Control Charts: I-MR ChartMatt Hansen
 
Control Charts: Finding the Right Control Chart
Control Charts: Finding the Right Control ChartControl Charts: Finding the Right Control Chart
Control Charts: Finding the Right Control ChartMatt Hansen
 
Building a Scorecard
Building a ScorecardBuilding a Scorecard
Building a ScorecardMatt Hansen
 
Control Phase Roadmap (Level 3)
Control Phase Roadmap (Level 3)Control Phase Roadmap (Level 3)
Control Phase Roadmap (Level 3)Matt Hansen
 
Piloting Solutions: Build the Pilot Plan
Piloting Solutions: Build the Pilot PlanPiloting Solutions: Build the Pilot Plan
Piloting Solutions: Build the Pilot PlanMatt Hansen
 
Piloting Solutions: The Process
Piloting Solutions: The ProcessPiloting Solutions: The Process
Piloting Solutions: The ProcessMatt Hansen
 
Risk Assessment with a FMEA Tool
Risk Assessment with a FMEA ToolRisk Assessment with a FMEA Tool
Risk Assessment with a FMEA ToolMatt Hansen
 
Prioritize Solutions with an Impact Matrix
Prioritize Solutions with an Impact MatrixPrioritize Solutions with an Impact Matrix
Prioritize Solutions with an Impact MatrixMatt Hansen
 
Brainstorm Solutions with an Affinity Diagram
Brainstorm Solutions with an Affinity DiagramBrainstorm Solutions with an Affinity Diagram
Brainstorm Solutions with an Affinity DiagramMatt Hansen
 
Brainstorm & Prioritize Solutions with a Workout
Brainstorm & Prioritize Solutions with a WorkoutBrainstorm & Prioritize Solutions with a Workout
Brainstorm & Prioritize Solutions with a WorkoutMatt Hansen
 
Compiling Analysis Results
Compiling Analysis ResultsCompiling Analysis Results
Compiling Analysis ResultsMatt Hansen
 
Improve Phase Roadmap (Level 3)
Improve Phase Roadmap (Level 3)Improve Phase Roadmap (Level 3)
Improve Phase Roadmap (Level 3)Matt Hansen
 

More from Matt Hansen (20)

Getting Feedback with a Plus/Delta Tool
Getting Feedback with a Plus/Delta ToolGetting Feedback with a Plus/Delta Tool
Getting Feedback with a Plus/Delta Tool
 
Closing a Project
Closing a ProjectClosing a Project
Closing a Project
 
Documenting a New Process with SOPs
Documenting a New Process with SOPsDocumenting a New Process with SOPs
Documenting a New Process with SOPs
 
Building a Control Plan
Building a Control PlanBuilding a Control Plan
Building a Control Plan
 
Control Charts: Recalculating Control Limits
Control Charts: Recalculating Control LimitsControl Charts: Recalculating Control Limits
Control Charts: Recalculating Control Limits
 
Control Charts: U Chart
Control Charts: U ChartControl Charts: U Chart
Control Charts: U Chart
 
Control Charts: P Chart
Control Charts: P ChartControl Charts: P Chart
Control Charts: P Chart
 
Control Charts: Xbar-S Chart
Control Charts: Xbar-S ChartControl Charts: Xbar-S Chart
Control Charts: Xbar-S Chart
 
Control Charts: I-MR Chart
Control Charts: I-MR ChartControl Charts: I-MR Chart
Control Charts: I-MR Chart
 
Control Charts: Finding the Right Control Chart
Control Charts: Finding the Right Control ChartControl Charts: Finding the Right Control Chart
Control Charts: Finding the Right Control Chart
 
Building a Scorecard
Building a ScorecardBuilding a Scorecard
Building a Scorecard
 
Control Phase Roadmap (Level 3)
Control Phase Roadmap (Level 3)Control Phase Roadmap (Level 3)
Control Phase Roadmap (Level 3)
 
Piloting Solutions: Build the Pilot Plan
Piloting Solutions: Build the Pilot PlanPiloting Solutions: Build the Pilot Plan
Piloting Solutions: Build the Pilot Plan
 
Piloting Solutions: The Process
Piloting Solutions: The ProcessPiloting Solutions: The Process
Piloting Solutions: The Process
 
Risk Assessment with a FMEA Tool
Risk Assessment with a FMEA ToolRisk Assessment with a FMEA Tool
Risk Assessment with a FMEA Tool
 
Prioritize Solutions with an Impact Matrix
Prioritize Solutions with an Impact MatrixPrioritize Solutions with an Impact Matrix
Prioritize Solutions with an Impact Matrix
 
Brainstorm Solutions with an Affinity Diagram
Brainstorm Solutions with an Affinity DiagramBrainstorm Solutions with an Affinity Diagram
Brainstorm Solutions with an Affinity Diagram
 
Brainstorm & Prioritize Solutions with a Workout
Brainstorm & Prioritize Solutions with a WorkoutBrainstorm & Prioritize Solutions with a Workout
Brainstorm & Prioritize Solutions with a Workout
 
Compiling Analysis Results
Compiling Analysis ResultsCompiling Analysis Results
Compiling Analysis Results
 
Improve Phase Roadmap (Level 3)
Improve Phase Roadmap (Level 3)Improve Phase Roadmap (Level 3)
Improve Phase Roadmap (Level 3)
 

Recently uploaded

Catalogue ONG NUOC PPR DE NHAT .pdf
Catalogue ONG NUOC PPR DE NHAT      .pdfCatalogue ONG NUOC PPR DE NHAT      .pdf
Catalogue ONG NUOC PPR DE NHAT .pdfOrient Homes
 
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...Dave Litwiller
 
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Dipal Arora
 
Monte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMMonte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMRavindra Nath Shukla
 
/:Call Girls In Jaypee Siddharth - 5 Star Hotel New Delhi ➥9990211544 Top Esc...
/:Call Girls In Jaypee Siddharth - 5 Star Hotel New Delhi ➥9990211544 Top Esc.../:Call Girls In Jaypee Siddharth - 5 Star Hotel New Delhi ➥9990211544 Top Esc...
/:Call Girls In Jaypee Siddharth - 5 Star Hotel New Delhi ➥9990211544 Top Esc...lizamodels9
 
Call Girls in Gomti Nagar - 7388211116 - With room Service
Call Girls in Gomti Nagar - 7388211116  - With room ServiceCall Girls in Gomti Nagar - 7388211116  - With room Service
Call Girls in Gomti Nagar - 7388211116 - With room Servicediscovermytutordmt
 
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...lizamodels9
 
Vip Female Escorts Noida 9711199171 Greater Noida Escorts Service
Vip Female Escorts Noida 9711199171 Greater Noida Escorts ServiceVip Female Escorts Noida 9711199171 Greater Noida Escorts Service
Vip Female Escorts Noida 9711199171 Greater Noida Escorts Serviceankitnayak356677
 
Vip Dewas Call Girls #9907093804 Contact Number Escorts Service Dewas
Vip Dewas Call Girls #9907093804 Contact Number Escorts Service DewasVip Dewas Call Girls #9907093804 Contact Number Escorts Service Dewas
Vip Dewas Call Girls #9907093804 Contact Number Escorts Service Dewasmakika9823
 
Pharma Works Profile of Karan Communications
Pharma Works Profile of Karan CommunicationsPharma Works Profile of Karan Communications
Pharma Works Profile of Karan Communicationskarancommunications
 
Progress Report - Oracle Database Analyst Summit
Progress  Report - Oracle Database Analyst SummitProgress  Report - Oracle Database Analyst Summit
Progress Report - Oracle Database Analyst SummitHolger Mueller
 
Call Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine ServiceCall Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine Serviceritikaroy0888
 
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,noida100girls
 
M.C Lodges -- Guest House in Jhang.
M.C Lodges --  Guest House in Jhang.M.C Lodges --  Guest House in Jhang.
M.C Lodges -- Guest House in Jhang.Aaiza Hassan
 
Cash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call GirlsCash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call GirlsApsara Of India
 
Grateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfGrateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfPaul Menig
 
Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023Neil Kimberley
 
Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999
Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999
Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999Tina Ji
 
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdfRenandantas16
 

Recently uploaded (20)

Forklift Operations: Safety through Cartoons
Forklift Operations: Safety through CartoonsForklift Operations: Safety through Cartoons
Forklift Operations: Safety through Cartoons
 
Catalogue ONG NUOC PPR DE NHAT .pdf
Catalogue ONG NUOC PPR DE NHAT      .pdfCatalogue ONG NUOC PPR DE NHAT      .pdf
Catalogue ONG NUOC PPR DE NHAT .pdf
 
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
 
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
 
Monte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMMonte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSM
 
/:Call Girls In Jaypee Siddharth - 5 Star Hotel New Delhi ➥9990211544 Top Esc...
/:Call Girls In Jaypee Siddharth - 5 Star Hotel New Delhi ➥9990211544 Top Esc.../:Call Girls In Jaypee Siddharth - 5 Star Hotel New Delhi ➥9990211544 Top Esc...
/:Call Girls In Jaypee Siddharth - 5 Star Hotel New Delhi ➥9990211544 Top Esc...
 
Call Girls in Gomti Nagar - 7388211116 - With room Service
Call Girls in Gomti Nagar - 7388211116  - With room ServiceCall Girls in Gomti Nagar - 7388211116  - With room Service
Call Girls in Gomti Nagar - 7388211116 - With room Service
 
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
 
Vip Female Escorts Noida 9711199171 Greater Noida Escorts Service
Vip Female Escorts Noida 9711199171 Greater Noida Escorts ServiceVip Female Escorts Noida 9711199171 Greater Noida Escorts Service
Vip Female Escorts Noida 9711199171 Greater Noida Escorts Service
 
Vip Dewas Call Girls #9907093804 Contact Number Escorts Service Dewas
Vip Dewas Call Girls #9907093804 Contact Number Escorts Service DewasVip Dewas Call Girls #9907093804 Contact Number Escorts Service Dewas
Vip Dewas Call Girls #9907093804 Contact Number Escorts Service Dewas
 
Pharma Works Profile of Karan Communications
Pharma Works Profile of Karan CommunicationsPharma Works Profile of Karan Communications
Pharma Works Profile of Karan Communications
 
Progress Report - Oracle Database Analyst Summit
Progress  Report - Oracle Database Analyst SummitProgress  Report - Oracle Database Analyst Summit
Progress Report - Oracle Database Analyst Summit
 
Call Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine ServiceCall Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine Service
 
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
 
M.C Lodges -- Guest House in Jhang.
M.C Lodges --  Guest House in Jhang.M.C Lodges --  Guest House in Jhang.
M.C Lodges -- Guest House in Jhang.
 
Cash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call GirlsCash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call Girls
 
Grateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfGrateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdf
 
Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023
 
Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999
Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999
Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999
 
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
 

Hypothesis Testing: Central Tendency – Non-Normal (Compare 2+ Factors)

  • 1. Section & Lesson #: Pre-Requisite Lessons: Complex Tools + Clear Teaching = Powerful Results Hypothesis Testing: Central Tendency – Non-Normal (Compare 2+ Factors) Six Sigma-Analyze – Lesson 22 An extension on hypothesis testing, this lesson reviews the Mood’s Median & Kruskal-Wallis tests as central tendency measurements for non-normal dist. Six Sigma-Analyze #21 – Hypothesis Testing: Central Tendency – Non-Normal (Compare 1:1) Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic, photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher.
  • 2. Why do we need hypothesis testing? o Remember, our project goal is to resolve a problem by first building a transfer function. • We don’t want to just alleviate symptoms, we want to resolve the root cause.  Remember Hannah? We don’t want to alleviate the arthritis pain in her leg, but heal the strep throat. • If we don’t know what the root cause is, then we need to build a transfer function.  By building a transfer function, we can know what changes (improvements) should fix the root cause. o Remember, the Transfer Function is defined as Y = f(X). • This is described as “output response Y is a function of one or more input X’s”. • It’s part of the IPO flow model where we described the IPO flow model as one or more inputs feeding into a process that transforms it to create a new output. o How does a transfer function fit with hypothesis testing? • Hypothesis testing tells us which X’s (inputs) are independently influencing the Y (output).  When we reject a null hypothesis, we’re building evidence proving which X’s are “guilty” of driving the Y.  We’ll compile all the evidence in the Improve phase of DMAIC and begin to fix those root causes. Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic, photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher. Y = f(X) Input (X) > Process > Output (Y)
  • 3. Review Hypothesis Testing: 4 Step Process o Remember, the 4 high-level steps for hypothesis testing begin/end with being practical: o As the heart of hypothesis testing, steps 2 & 3 can be drilled to the following 6 steps: Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic, photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher. 3 Practical Statistical Problem Problem Solution Solution    Practical Problem State the problem as a practical Yes/No question.  Statistical Problem Convert the problem to an analytical question identifying the statistical tool/method.  Practical Solution Interpret the analytical answer in a practical way. Statistical Solution Interpret the results of the hypothesis test with an analytical answer.  1. Define the objective. 2. State the Null Hypothesis (H0) and Alternative Hypothesis (Ha). 3. Define the confidence (1-α) and power (beta or 1-β). 4. Collect the sample data. 5. Calculate the P-value. 6. Interpret the results: accept or reject the null hypothesis (H0).
  • 4. Is the data type for both values discrete? What are you measuring? Is the data normal? SpreadCentral Tendency Compare 1:Standard 1 Proportion Test Compare 1:1 2 Proportion Test Compare 2+ Factors Chi2 Test Compare 1:Standard 1 Sample T Test Compare 1:1 2 Sample T Test or Paired T Test Compare 2+ Factors One-way ANOVA Test Compare 1:Standard 1 Sample Wilcoxon or 1 Sample Sign Compare 1:1 Mann-Whitney Test Compare 2+ Factors Mood’s Median Test or Kruskal-Wallis Test Compare 1:Standard 1 Variance Test Compare 1:1 2 Variance Test Compare 2+ Factors Test for Equal Variances Yes No Yes No Proportions Compare 1:1 Pearson Correlation or Fitted Line Plot Compare 2+ Factors Multiple Regression or General Linear Model Is the data type for both values continuous? Relationships No Yes Review Finding the Right Statistical Test o What statistical test do I use for my hypothesis testing? • The type of statistical test depends on the data to be tested, as described in the chart below: Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic, photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher. 4 Lesson Current
  • 5. Confidence Intervals (CI) Redefined o Remember, confidence intervals (CI) are an estimated upper/lower range of the mean. • The CI narrows when you add more samples: • Remember, statistics are intended to help you make inferences about a population.  A mean of 57 with a 95% confidence interval of 55 and 59 implies that although the sample mean is only 57, you can be 95% confident that the population mean will fall somewhere between 55 and 59. • The first step in hypothesis testing is to define the objective by asking “is there a difference…”  Statistical tests generally compare the different CIs between factors to see if a difference exists. Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic, photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher. 5 With fewer samples, the population mean falls within a wide interval. Add more samples and the population mean falls in a more narrow interval. With more samples, the more confident (narrow) the mean interval becomes. Factor A Factor B 2.5 2.75 3.0 3.25 3.5 Large overlap indicates the population mean may be the same for both groups; therefore there is no difference between them (high P-value). N = 20 Factor A Factor B 2.5 2.75 3.0 3.25 3.5 Small overlap indicates a smaller chance the population mean may be the same for both groups; therefore there is may be no difference between them (small P-value). N = 50 Factor A Factor B 2.5 2.75 3.0 3.25 3.5 No overlap indicates the population mean is different for both groups; therefore there is a difference between them (low or no P-value). N = 100
  • 6. Compare 2+ Factors Tests: Introduction o When should I use it? • To compare median values for multiple factors across different populations. o There are two types of tests to compare medians for multiple factors: • Mood’s Median is a test when the population contains many outliers. • Kruskal-Wallis Test is a stronger test, except when there are many outliers. o How do you know if there are many outliers? • Use a boxplot to identify possible outliers (at right).  To create a boxplot, go to Graph > Boxplot… o What are the inputs for the test? • The inputs for both tests are identical: Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic, photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher. 6 Now 350000 300000 250000 200000 150000 100000 50000 0 MetricC Boxplot of MetricC These outliers may be influencing the distribution. Therefore, you’re better off using the Mood’s Median test. The columns containing the continuous data to test. The factor to be tested against the response.
  • 7. Mood’s Median: Interpreting Results o What do these results from the Mood’s Median test mean? o How do I interpret these results? • If the P-value is less than the α risk, there’s a statistical difference between the factor’s values. • The IQR is a measurement of the spread for each factor, like a variance. A large difference in IQR between the values should be validated with a test for equal variances. Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic, photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher. 7 Now The amount of overlap in these CIs are what affect the p-value. Factor’s unique values Count below & above the median Median per unique value Interquartile Range (IQR) per unique value Illustration of Confidence Intervals per unique value P-Value
  • 8. Mood’s Median: MetricB Example o Example: MetricB & CategoryA sample values • Background:  Use the arbitrary values in the “MetricB” and “CategoryA” columns of the Minitab Sample Data file. • Practical Problem:  Is there a difference in medians of MetricB for each CategoryA value? • Statistical Problem:  State the null (H0) and alternative (Ha) hypotheses: – H0: ηCategoryA1 = ηCategoryA2 = ηCategoryA3 etc., and Ha: ηCategoryA1 ≠ ηCategoryA2 ≠ ηCategoryA3 etc.  Define the confidence (1-α) and power (1-β): – For confidence, we’ll accept the default of 95% (which means α = 5%) and power of 90% (which means β = 10%).  Type the statistical problem into Minitab: – In Minitab, go to Stat > Nonparametrics > Mood’s Median Test… – In the “Response” box, select MetricB from the list of columns. – In the “Factor” box, select CategoryA from the list of columns. • Statistical Solution:  Refer to the session window results. – Since P-value is > 0.05 (α), then we fail to reject H0. • Practical Solution:  The sample is insufficient to prove that the medians of MetricB are different between the CatetoryA values.  If we assumed power=90% (β=10%), then how do we apply that to this test? What does it mean?  Which CategoryA values appear to drive what difference there is? How do you know it? Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic, photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher. 8 Now
  • 9. Kruskal-Wallis: Interpreting Results o What do these results from the Kruskal-Wallis test mean? o How do I interpret these results? • If the P-value is less than the α risk, there’s a statistical difference between the factor’s values. • The factor having a Z-score furthest from zero (positive or negative) has the most difference (in mean rank) among all observations.  That is, if there is a statistical difference from this test, it’s most likely affected by the factor(s) having a Z- value furthest from zero. Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic, photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher. 9 Now Factor’s unique values Count per value Median per unique value Mean rank & Z- score per value P-Value
  • 10. Kruskal-Wallis: MetricB Example o Example: MetricB & CategoryA sample values • Background:  Use the arbitrary values in the “MetricB” and “CategoryA” columns of the Minitab Sample Data file. • Practical Problem:  Is there a difference in medians of MetricB for each CategoryA value? • Statistical Problem:  State the null (H0) and alternative (Ha) hypotheses: – H0: ηCategoryA1 = ηCategoryA2 = ηCategoryA3 etc., and Ha: ηCategoryA1 ≠ ηCategoryA2 ≠ ηCategoryA3 etc.  Define the confidence (1-α) and power (1-β): – For confidence, we’ll accept the default of 95% (which means α = 5%) and power of 90% (which means β = 10%).  Type the statistical problem into Minitab: – In Minitab, go to Stat > Nonparametrics > Kruskal-Wallis… – In the “Response” box, select MetricB from the list of columns. – In the “Factor” box, select CategoryA from the list of columns. • Statistical Solution:  Refer to the session window results. – Since P-value is > 0.05 (α), then we fail to reject H0. • Practical Solution:  The sample is insufficient to prove that the medians of MetricB are different between the CatetoryA values.  If we assumed power=90% (β=10%), then how do we apply that to this test? What does it mean?  Which CategoryA values appear to drive what difference there is? How do you know it?  Why is this so different from the Mood’s Median test? Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic, photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher. 10 Now
  • 11. Practical Application o Refer to the critical metric (output Y) and at least 5 factors (input X’s) you identified in a previous lesson for applying to this hypothesis testing. • For any factor that is a continuous value, try applying both the Mood’s Median and Kruskal-Wallis tests.  To do this, you’ll need a discrete factor that has 2 or more sets of values (e.g., across multiple periods of time, or different locations, or different groups, etc.).  Non-parametric tests like these are ideal for non-normal distributions, but you can still run them even if your continuous value has a normal distribution.  Other factors in your organization can be used for this exercise. • Before running either test, do the medians of the factors appear to be different? • After running either test, are the medians of the factors statistically different? • If the answers to the above 2 questions are different, then how does that affect how you’d typically measure and communicate that factor in the organization?  For example, does the difference between the compared factors affect financial decisions (e.g., how people are compensated), or process changes (e.g., how the process may be modified), or other critical actions?  If so, then how should the results from either test be used to influence your organization? – Should they change how the factors are compared (e.g., across different times, locations, groups, etc.)? – Should they change how the factor is measured? – Should they change how they react when they compare the metric this way? Copyright © 2011-2019 by Matthew J. Hansen. All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted by any means (electronic, mechanical, photographic, photocopying, recording or otherwise) without prior permission in writing by the author and/or publisher. 11