SlideShare a Scribd company logo
1 of 24
Download to read offline
HYPOTHESIS TESTING
HYPOTHESIS TESTING: THE BASICS
• Why hypothesis testing?
• To investigate whether a statement about the population average (based on a sample
derived from that population) is valid.
• Assume that someone believes that the average annual income for data analysts (μ) is
BDT600K based on past evidence or an educated opinion.
• Assume that we derive a sample from the population
• If the sample average ( ҧ
𝑥) is BDT550K, does this mean that the population mean is
different from BDT600K and therefore the original statement is wrong ?
• Could it be that the statement is correct and that we simply derived a sample out of the
many possible quite different from the population mean?
• Or can we use it to tell us whether the population average cannot be equal to the
proposed average value?
HYPOTHESIS TESTING: THE REASONING
• The reasoning is based on the central limit theorem.
• We are testing a suggestion about the population mean.
• If it was to be true then it would have been at the centre of the normal distribution of all
the sample averages and every sample average would have been distributed around it.
We assume that this is the case.
• We will therefore estimate the z value by using the hypothesised population mean.
• By stating our level of significance (e.g. a=5%/2 = 2.5 %) and its corresponding critical
values (1.96, -1.96) we define a threshold for the ‘rejection’ or ‘not rejection’ of the null
hypothesis.
• That means if we estimate a z value greater than 1.96 or smaller than -1.96 (the critical
values) then we will reject the Null Hypothesis.
HYPOTHESIS TESTING: THE REASONING
• Why? Because assuming that the suggested population average is correct and it is at the
centre of our distribution the probability of selecting a sample that is more than 1.96
standard deviations away from the suggested population average is less than 2.5 %.
• Therefore we cannot believe that we were so unlucky that we selected a sample that
gives an average value that is less that 2.5 % likely of being observed, ASSUMING
that the population average that has been suggested is the real one.
• So instead we believe that we actually picked up a sample that has more than 2.5 %
chance of being selected, and the only way that this can be true is if the population mean
has a different value than the one being suggested.
• A different population mean (that had a value that is closer to the one derived from our
sample) would have shown that the probability of observing our sample average would
have been more than 2.5 %.
HYPOTHESIS TESTING: THE STEPS
1. Set up the Null Hypothesis (H0)
2. Set up the Alternative Hypothesis (Ha)
3. Specify the Significance Level (α) and identify its corresponding critical values
4. Look at your sample data
5. Calculate the statistics for the test (test statistic)
6. Technical conclusion (Reject or not reject)
7. Draw your conclusion in non-technical terms referring to the original hypothesis
NULL & ALTERNATIVE HYPOTHESIS
• When we conduct hypothesis testing, we identify two mutually exclusive
hypotheses:
• Null Hypothesis (H0)
• Alternative Hypothesis (Ha)
• In hypothesis testing, we always assume that H0 is true until proven
otherwise
• We can develop hypotheses using a single population parameter or more
than one population parameter
REJECTION REGION
Two-tail test: When our null hypothesis is of the form (e.g. μ = BDT600K) and the alternative hypothesis of
the form (e.g. μ  BDT600K) then we have a two-tail test.
One-tail test: When our null hypothesis is of the form (e.g. μ ≥ BDT600K or μ ≤ BDT600K ) and the alternative
hypothesis of the form (e.g. μ < BDT600K or μ > BDT600K respectively) then we have a one-tail test.
Reject Ho Reject Ho
2.5% 2.5%
Accept Ho
+ z/t value
- z/t value
5%
Reject
Ho
- z/t value
Accept
Ho
5%
Reject
Ho
Accept
Ho
+ z/t value
✓ Type 2 Error
β
✓
Type 1 Error
α
Accept Null Hypothesis
Reject Null Hypothesis
Null Hypothesis
True
Null Hypothesis
False
Decision
The guilty go
free
The innocent
are punished
Free
Punish
Innocent Guilty
TYPE 1 & 2 ERRORS
PARAMETRIC HYPOTHESIS TESTS
• Hypothesis tests that we want to carry out could be based on sample statistics
(sample average, proportion) and could be used to provide estimates about
population parameters (population average, proportion).
• They are therefore called parametric tests.
• However they cannot be applied on all types of data.
• They can be applied on interval or ratio data scales but not on ordinal or
nominal. Why?
• Because although in the former group of scales, statistical measures such as the mean
or standard deviation can be estimated, in the latter group they cannot.
ONE-SAMPLE HYPOTHESIS TESTS
Three types of one sample tests:
1. H0: parameter ≤ constant
Ha: parameter > constant
2. H0: parameter ≥ constant
Ha: parameter < constant
3. H0: parameter = constant
Ha: parameter ≠ constant
It is not correct to formulate a null hypothesis using >, <, or ≠.
EXAMPLE - 1
Assume that the average time it takes to answer a call on a switchboard is
taken for a random sample of 81 calls. The resulting sample mean is found
to be 28 seconds with a standard deviation of 9 seconds. Previously it has
been estimated that the average time to answer a call was 25 seconds.
Test this argument at the 5% significance level.
EXAMPLE 1 - SOLVED
1. Set up the null hypothesis (Ho): The average time it takes for a phone call to be
answered is 25. H0: μ = 25
2. Set up the alternative hypothesis (Ha): The average time that it takes for a phone
call to be answered is not 25. Ha: μ  25
3. Specify the significance level (α) and identify the theoretical value: 5%
significance level with a critical value of za/2 = 1.96
4. Look at your sample data: The sample data suggests that the average time it takes
to answer a call is more than 25 seconds
5. Calculate the statistics for the test: 𝑧 =
𝑥
−
−𝜇
Τ
𝜎 𝑛
=
28−25
Τ
9 81
= 𝟑
6. Technical conclusion: The calculated value (3) is greater than the critical value
(1.96) and therefore we reject the null hypothesis
7. Conclusion in non-technical terms: There isn’t sufficient evidence to suggest that
the mean time to answer a call is 25 seconds
EXAMPLE - 2
The average time it takes to answer a call on the switchboard is taken for a
random sample of 25 calls. The resulting sample mean is found to be 28
seconds with a standard deviation of 9 seconds. Previously it has been
estimated that the average time to answer the call was 25. Test at the 5%
significance level the hypothesis that the average time for all phone calls to
be answered is still μ = 25.
EXAMPLE 2 - SOLVED
1. Set up the null hypothesis (Ho): The average time it takes for a phone call to be
answered is 25. H0: μ = 25
2. Set up the alternative hypothesis (Ha): The average time that it takes for a phone
call to be answered is not 25. Ha: μ  25
3. Specify the significance level (α) and identify the theoretical value: 5%
significance level with a critical value of t(0.025,24) = 2.064
4. Look at your sample data: The sample data suggests that the average time it takes
to answer a call is more than 25 seconds
5. Calculate the statistics for the test: 𝑡 =
𝑥
−
−𝜇
Τ
𝑠 𝑛
=
28−25
Τ
9 25
= 𝟏. 𝟔𝟔𝟔
6. Technical conclusion: The calculated value is lower than the critical value (|1.666|<
|2.064|) and therefore we cannot reject the null hypothesis.
7. Conclusion in non-technical terms: We cannot reject the claim that the average
time it takes to answer a phone call is 25 seconds
EXERCISE - CADSOFT
• CadSoft, a producer of computer-aided design software for the aerospace
industry receives numerous calls for technical support. In the past, the
average response time has been at least 25 minutes. The company has
upgraded its information systems and believes that this will help reduce
response time to less than 25 minutes. The company collected a sample
of 44 response times in the Excel worksheet “CadSoft Response Times”.
Check whether the company is right in their belief.
EXERCISE - VACATION SURVEY
• The Excel worksheet “Vacation Survey” shows a portion of data collected
in a survey of 34 respondents by a travel agency. Suppose that the travel
agency wanted to target individuals who were approximately 35 years
old. Test whether the average age of respondents is equal to 35.
ONE-SAMPLE HYPOTHESIS TESTS
Many important business measures, such as market share or the fraction
of deliveries received on time, are expressed as proportions. We may
conduct a test of hypothesis about a population proportion in a similar
fashion as we did for means. The test statistic for a one-sample test for
proportions is:
EXERCISE - CADSOFT
• CadSoft also sampled 44 customers and asked them to rate the overall quality
of the company’s software product using a scale of 0-very poor, 1-poor, 2-good,
3-very good, 4-excellent. The company collected a sample of 44 response
times in the Excel worksheet “CadSoft Product Satisfaction”. The firm tracks
customer satisfaction of quality by measuring the proportion of responses in the
top two categories. Over the past, this proportion has averaged about 75%. Is
there sufficient evidence to conclude that this satisfaction measure has
significantly exceeded 75% using a significance level of 0.05?
TWO-SAMPLE HYPOTHESIS TESTS
We can also have two-sample tests in which we compare means of two samples
H0: μ1 – μ2 = 0 (can also be ≤ or ≥)
Ha: μ1 – μ2 ≠ 0 (can also be < or >)
The samples may be matched (a.k.a. paired) or may be independent
PAIRED TWO-SAMPLE T-TEST FOR MEANS
A matched-pairs t-test are samples that are paired or related in some
fashion.
For example:
• If you wished to compare prices of groceries across grocery stores. Would need
to compare the same products at two (or more) different stores
• If you wished to measure the effectiveness of a new diet you would weigh the
dieters at the start and at the finish of the program.
EXERCISE - PILE FOUNDATION
• The Excel worksheet “Pile Foundation” contains the estimates used in a bid
and actual auger-cast pile lengths that engineers ultimately had to use for a
foundation engineering project. The contractor’s past experience suggested
that the bid information was generally accurate, so the average of the paired
differences between the actual pile lengths and estimated lengths should be
close to zero. After this project was completed, the contractor found that the
average difference between the actual lengths and the estimated lengths
was 6.38. Could the contractor conclude that the bid information was poor?
INDEPENDENT SAMPLES: TWO-SAMPLE T-TEST
• The prior example assumed we had matched or dependent data (i.e., the data
from one population corresponds to data from another population)
• Sometimes we have data taken independently from two populations
• Independence means that the observations from one population have no influence on
the observations from the other population
• We use different hypothesis tests based on whether the variances between the
samples are statistically the same
• Test for equality of variances between two samples using the F-test.
EXERCISE - COMPARING SUPPLIER PERFORMANCE
• The last two columns in the “Purchase Orders” worksheet provide the order
date and arrival date of all orders placed with each supplier. The time
between placement of an order and its arrival is commonly called the lead
time (computed by subtracting the dates from each other). Purchasing
managers have noted that they order many of the same types of items from
Alum Sheeting and Durrable Products and are considering dropping Alum
Sheeting from its supplier base if its lead time is significantly longer than that
of Durrable Products. Should they?
Hypothesis Testing on bus. analytics 101

More Related Content

Similar to Hypothesis Testing on bus. analytics 101

Statistical inference concept, procedure of hypothesis testing
Statistical inference   concept, procedure of hypothesis testingStatistical inference   concept, procedure of hypothesis testing
Statistical inference concept, procedure of hypothesis testingAmitaChaudhary19
 
Topic 8 Hypothesis Testing One Sample Test (Z ).pptx
Topic  8  Hypothesis Testing One Sample Test (Z ).pptxTopic  8  Hypothesis Testing One Sample Test (Z ).pptx
Topic 8 Hypothesis Testing One Sample Test (Z ).pptxCallplanetsDeveloper
 
TOPIC- HYPOTHESIS TESTING RMS.pptx
TOPIC- HYPOTHESIS TESTING  RMS.pptxTOPIC- HYPOTHESIS TESTING  RMS.pptx
TOPIC- HYPOTHESIS TESTING RMS.pptxanamikamishra29
 
Statistics pres 3.31.2014
Statistics pres 3.31.2014Statistics pres 3.31.2014
Statistics pres 3.31.2014tjcarter
 
Hypothesis Testing Lesson 1
Hypothesis Testing Lesson 1Hypothesis Testing Lesson 1
Hypothesis Testing Lesson 1yhchung
 
Lecture 7 Hypothesis testing.pptx
Lecture 7 Hypothesis testing.pptxLecture 7 Hypothesis testing.pptx
Lecture 7 Hypothesis testing.pptxshakirRahman10
 
Hypothesis Testing techniques in social research.ppt
Hypothesis Testing techniques in social research.pptHypothesis Testing techniques in social research.ppt
Hypothesis Testing techniques in social research.pptSolomonkiplimo
 
Lecture no 04 mspt 1st semester research methods in rehabilitation by abdul g...
Lecture no 04 mspt 1st semester research methods in rehabilitation by abdul g...Lecture no 04 mspt 1st semester research methods in rehabilitation by abdul g...
Lecture no 04 mspt 1st semester research methods in rehabilitation by abdul g...riphah college of rehabilitation sciences
 
Statr session 17 and 18
Statr session 17 and 18Statr session 17 and 18
Statr session 17 and 18Ruru Chowdhury
 
Statr session 17 and 18 (ASTR)
Statr session 17 and 18 (ASTR)Statr session 17 and 18 (ASTR)
Statr session 17 and 18 (ASTR)Ruru Chowdhury
 
Sample Size Calculation One Sample Case.pdf
Sample Size Calculation One Sample Case.pdfSample Size Calculation One Sample Case.pdf
Sample Size Calculation One Sample Case.pdfRajJirel
 
Basic Statistical Concepts & Decision-Making
Basic Statistical Concepts & Decision-MakingBasic Statistical Concepts & Decision-Making
Basic Statistical Concepts & Decision-MakingPenn State University
 
Statr session 15 and 16
Statr session 15 and 16Statr session 15 and 16
Statr session 15 and 16Ruru Chowdhury
 
Estimation and hypothesis testing 1 (graduate statistics2)
Estimation and hypothesis testing 1 (graduate statistics2)Estimation and hypothesis testing 1 (graduate statistics2)
Estimation and hypothesis testing 1 (graduate statistics2)Harve Abella
 
HYPOTHESIS TESTING
HYPOTHESIS TESTINGHYPOTHESIS TESTING
HYPOTHESIS TESTINGAmna Sheikh
 

Similar to Hypothesis Testing on bus. analytics 101 (20)

RESEARCH METHODS LESSON 3
RESEARCH METHODS LESSON 3RESEARCH METHODS LESSON 3
RESEARCH METHODS LESSON 3
 
Statistical inference concept, procedure of hypothesis testing
Statistical inference   concept, procedure of hypothesis testingStatistical inference   concept, procedure of hypothesis testing
Statistical inference concept, procedure of hypothesis testing
 
Topic 8 Hypothesis Testing One Sample Test (Z ).pptx
Topic  8  Hypothesis Testing One Sample Test (Z ).pptxTopic  8  Hypothesis Testing One Sample Test (Z ).pptx
Topic 8 Hypothesis Testing One Sample Test (Z ).pptx
 
Hypothesis Testing
Hypothesis TestingHypothesis Testing
Hypothesis Testing
 
TOPIC- HYPOTHESIS TESTING RMS.pptx
TOPIC- HYPOTHESIS TESTING  RMS.pptxTOPIC- HYPOTHESIS TESTING  RMS.pptx
TOPIC- HYPOTHESIS TESTING RMS.pptx
 
Statistics pres 3.31.2014
Statistics pres 3.31.2014Statistics pres 3.31.2014
Statistics pres 3.31.2014
 
ANSWERS
ANSWERSANSWERS
ANSWERS
 
Hypothesis Testing Lesson 1
Hypothesis Testing Lesson 1Hypothesis Testing Lesson 1
Hypothesis Testing Lesson 1
 
Lecture 7 Hypothesis testing.pptx
Lecture 7 Hypothesis testing.pptxLecture 7 Hypothesis testing.pptx
Lecture 7 Hypothesis testing.pptx
 
Hypothesis Testing techniques in social research.ppt
Hypothesis Testing techniques in social research.pptHypothesis Testing techniques in social research.ppt
Hypothesis Testing techniques in social research.ppt
 
Lecture no 04 mspt 1st semester research methods in rehabilitation by abdul g...
Lecture no 04 mspt 1st semester research methods in rehabilitation by abdul g...Lecture no 04 mspt 1st semester research methods in rehabilitation by abdul g...
Lecture no 04 mspt 1st semester research methods in rehabilitation by abdul g...
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
Statr session 17 and 18
Statr session 17 and 18Statr session 17 and 18
Statr session 17 and 18
 
Statr session 17 and 18 (ASTR)
Statr session 17 and 18 (ASTR)Statr session 17 and 18 (ASTR)
Statr session 17 and 18 (ASTR)
 
Sample Size Calculation One Sample Case.pdf
Sample Size Calculation One Sample Case.pdfSample Size Calculation One Sample Case.pdf
Sample Size Calculation One Sample Case.pdf
 
Basic Statistical Concepts & Decision-Making
Basic Statistical Concepts & Decision-MakingBasic Statistical Concepts & Decision-Making
Basic Statistical Concepts & Decision-Making
 
Chapter 10
Chapter 10Chapter 10
Chapter 10
 
Statr session 15 and 16
Statr session 15 and 16Statr session 15 and 16
Statr session 15 and 16
 
Estimation and hypothesis testing 1 (graduate statistics2)
Estimation and hypothesis testing 1 (graduate statistics2)Estimation and hypothesis testing 1 (graduate statistics2)
Estimation and hypothesis testing 1 (graduate statistics2)
 
HYPOTHESIS TESTING
HYPOTHESIS TESTINGHYPOTHESIS TESTING
HYPOTHESIS TESTING
 

More from Abdullah Amin

Chapter 1 - Introduction to Management and Organizations
Chapter 1 - Introduction to Management and OrganizationsChapter 1 - Introduction to Management and Organizations
Chapter 1 - Introduction to Management and OrganizationsAbdullah Amin
 
Spotify - international marketing culture
Spotify - international marketing cultureSpotify - international marketing culture
Spotify - international marketing cultureAbdullah Amin
 
International market n cultural dynamics
International market n cultural dynamicsInternational market n cultural dynamics
International market n cultural dynamicsAbdullah Amin
 
Linear Programming.ppt
Linear Programming.pptLinear Programming.ppt
Linear Programming.pptAbdullah Amin
 
Location Planning.pptx
Location Planning.pptxLocation Planning.pptx
Location Planning.pptxAbdullah Amin
 

More from Abdullah Amin (7)

Chapter 1 - Introduction to Management and Organizations
Chapter 1 - Introduction to Management and OrganizationsChapter 1 - Introduction to Management and Organizations
Chapter 1 - Introduction to Management and Organizations
 
Spotify - international marketing culture
Spotify - international marketing cultureSpotify - international marketing culture
Spotify - international marketing culture
 
International market n cultural dynamics
International market n cultural dynamicsInternational market n cultural dynamics
International market n cultural dynamics
 
AXE_draft.pptx
AXE_draft.pptxAXE_draft.pptx
AXE_draft.pptx
 
Linear Programming.ppt
Linear Programming.pptLinear Programming.ppt
Linear Programming.ppt
 
Location Planning.pptx
Location Planning.pptxLocation Planning.pptx
Location Planning.pptx
 
Presentation 3.pptx
Presentation 3.pptxPresentation 3.pptx
Presentation 3.pptx
 

Recently uploaded

UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdfUGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdfNirmal Dwivedi
 
QUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lesson
QUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lessonQUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lesson
QUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lessonhttgc7rh9c
 
Spellings Wk 4 and Wk 5 for Grade 4 at CAPS
Spellings Wk 4 and Wk 5 for Grade 4 at CAPSSpellings Wk 4 and Wk 5 for Grade 4 at CAPS
Spellings Wk 4 and Wk 5 for Grade 4 at CAPSAnaAcapella
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17Celine George
 
Economic Importance Of Fungi In Food Additives
Economic Importance Of Fungi In Food AdditivesEconomic Importance Of Fungi In Food Additives
Economic Importance Of Fungi In Food AdditivesSHIVANANDaRV
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxCeline George
 
dusjagr & nano talk on open tools for agriculture research and learning
dusjagr & nano talk on open tools for agriculture research and learningdusjagr & nano talk on open tools for agriculture research and learning
dusjagr & nano talk on open tools for agriculture research and learningMarc Dusseiller Dusjagr
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jisc
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - Englishneillewis46
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxDr. Ravikiran H M Gowda
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxannathomasp01
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfDr Vijay Vishwakarma
 
Details on CBSE Compartment Exam.pptx1111
Details on CBSE Compartment Exam.pptx1111Details on CBSE Compartment Exam.pptx1111
Details on CBSE Compartment Exam.pptx1111GangaMaiya1
 
What is 3 Way Matching Process in Odoo 17.pptx
What is 3 Way Matching Process in Odoo 17.pptxWhat is 3 Way Matching Process in Odoo 17.pptx
What is 3 Way Matching Process in Odoo 17.pptxCeline George
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxmarlenawright1
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Jisc
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxPooja Bhuva
 

Recently uploaded (20)

UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdfUGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
 
QUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lesson
QUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lessonQUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lesson
QUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lesson
 
Spellings Wk 4 and Wk 5 for Grade 4 at CAPS
Spellings Wk 4 and Wk 5 for Grade 4 at CAPSSpellings Wk 4 and Wk 5 for Grade 4 at CAPS
Spellings Wk 4 and Wk 5 for Grade 4 at CAPS
 
VAMOS CUIDAR DO NOSSO PLANETA! .
VAMOS CUIDAR DO NOSSO PLANETA!                    .VAMOS CUIDAR DO NOSSO PLANETA!                    .
VAMOS CUIDAR DO NOSSO PLANETA! .
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
Economic Importance Of Fungi In Food Additives
Economic Importance Of Fungi In Food AdditivesEconomic Importance Of Fungi In Food Additives
Economic Importance Of Fungi In Food Additives
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
dusjagr & nano talk on open tools for agriculture research and learning
dusjagr & nano talk on open tools for agriculture research and learningdusjagr & nano talk on open tools for agriculture research and learning
dusjagr & nano talk on open tools for agriculture research and learning
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
OS-operating systems- ch05 (CPU Scheduling) ...
OS-operating systems- ch05 (CPU Scheduling) ...OS-operating systems- ch05 (CPU Scheduling) ...
OS-operating systems- ch05 (CPU Scheduling) ...
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
Our Environment Class 10 Science Notes pdf
Our Environment Class 10 Science Notes pdfOur Environment Class 10 Science Notes pdf
Our Environment Class 10 Science Notes pdf
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
 
Details on CBSE Compartment Exam.pptx1111
Details on CBSE Compartment Exam.pptx1111Details on CBSE Compartment Exam.pptx1111
Details on CBSE Compartment Exam.pptx1111
 
What is 3 Way Matching Process in Odoo 17.pptx
What is 3 Way Matching Process in Odoo 17.pptxWhat is 3 Way Matching Process in Odoo 17.pptx
What is 3 Way Matching Process in Odoo 17.pptx
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
 

Hypothesis Testing on bus. analytics 101

  • 2. HYPOTHESIS TESTING: THE BASICS • Why hypothesis testing? • To investigate whether a statement about the population average (based on a sample derived from that population) is valid. • Assume that someone believes that the average annual income for data analysts (μ) is BDT600K based on past evidence or an educated opinion. • Assume that we derive a sample from the population • If the sample average ( ҧ 𝑥) is BDT550K, does this mean that the population mean is different from BDT600K and therefore the original statement is wrong ? • Could it be that the statement is correct and that we simply derived a sample out of the many possible quite different from the population mean? • Or can we use it to tell us whether the population average cannot be equal to the proposed average value?
  • 3. HYPOTHESIS TESTING: THE REASONING • The reasoning is based on the central limit theorem. • We are testing a suggestion about the population mean. • If it was to be true then it would have been at the centre of the normal distribution of all the sample averages and every sample average would have been distributed around it. We assume that this is the case. • We will therefore estimate the z value by using the hypothesised population mean. • By stating our level of significance (e.g. a=5%/2 = 2.5 %) and its corresponding critical values (1.96, -1.96) we define a threshold for the ‘rejection’ or ‘not rejection’ of the null hypothesis. • That means if we estimate a z value greater than 1.96 or smaller than -1.96 (the critical values) then we will reject the Null Hypothesis.
  • 4. HYPOTHESIS TESTING: THE REASONING • Why? Because assuming that the suggested population average is correct and it is at the centre of our distribution the probability of selecting a sample that is more than 1.96 standard deviations away from the suggested population average is less than 2.5 %. • Therefore we cannot believe that we were so unlucky that we selected a sample that gives an average value that is less that 2.5 % likely of being observed, ASSUMING that the population average that has been suggested is the real one. • So instead we believe that we actually picked up a sample that has more than 2.5 % chance of being selected, and the only way that this can be true is if the population mean has a different value than the one being suggested. • A different population mean (that had a value that is closer to the one derived from our sample) would have shown that the probability of observing our sample average would have been more than 2.5 %.
  • 5. HYPOTHESIS TESTING: THE STEPS 1. Set up the Null Hypothesis (H0) 2. Set up the Alternative Hypothesis (Ha) 3. Specify the Significance Level (α) and identify its corresponding critical values 4. Look at your sample data 5. Calculate the statistics for the test (test statistic) 6. Technical conclusion (Reject or not reject) 7. Draw your conclusion in non-technical terms referring to the original hypothesis
  • 6. NULL & ALTERNATIVE HYPOTHESIS • When we conduct hypothesis testing, we identify two mutually exclusive hypotheses: • Null Hypothesis (H0) • Alternative Hypothesis (Ha) • In hypothesis testing, we always assume that H0 is true until proven otherwise • We can develop hypotheses using a single population parameter or more than one population parameter
  • 7. REJECTION REGION Two-tail test: When our null hypothesis is of the form (e.g. μ = BDT600K) and the alternative hypothesis of the form (e.g. μ  BDT600K) then we have a two-tail test. One-tail test: When our null hypothesis is of the form (e.g. μ ≥ BDT600K or μ ≤ BDT600K ) and the alternative hypothesis of the form (e.g. μ < BDT600K or μ > BDT600K respectively) then we have a one-tail test. Reject Ho Reject Ho 2.5% 2.5% Accept Ho + z/t value - z/t value 5% Reject Ho - z/t value Accept Ho 5% Reject Ho Accept Ho + z/t value
  • 8. ✓ Type 2 Error β ✓ Type 1 Error α Accept Null Hypothesis Reject Null Hypothesis Null Hypothesis True Null Hypothesis False Decision The guilty go free The innocent are punished Free Punish Innocent Guilty TYPE 1 & 2 ERRORS
  • 9. PARAMETRIC HYPOTHESIS TESTS • Hypothesis tests that we want to carry out could be based on sample statistics (sample average, proportion) and could be used to provide estimates about population parameters (population average, proportion). • They are therefore called parametric tests. • However they cannot be applied on all types of data. • They can be applied on interval or ratio data scales but not on ordinal or nominal. Why? • Because although in the former group of scales, statistical measures such as the mean or standard deviation can be estimated, in the latter group they cannot.
  • 10. ONE-SAMPLE HYPOTHESIS TESTS Three types of one sample tests: 1. H0: parameter ≤ constant Ha: parameter > constant 2. H0: parameter ≥ constant Ha: parameter < constant 3. H0: parameter = constant Ha: parameter ≠ constant It is not correct to formulate a null hypothesis using >, <, or ≠.
  • 11. EXAMPLE - 1 Assume that the average time it takes to answer a call on a switchboard is taken for a random sample of 81 calls. The resulting sample mean is found to be 28 seconds with a standard deviation of 9 seconds. Previously it has been estimated that the average time to answer a call was 25 seconds. Test this argument at the 5% significance level.
  • 12. EXAMPLE 1 - SOLVED 1. Set up the null hypothesis (Ho): The average time it takes for a phone call to be answered is 25. H0: μ = 25 2. Set up the alternative hypothesis (Ha): The average time that it takes for a phone call to be answered is not 25. Ha: μ  25 3. Specify the significance level (α) and identify the theoretical value: 5% significance level with a critical value of za/2 = 1.96 4. Look at your sample data: The sample data suggests that the average time it takes to answer a call is more than 25 seconds 5. Calculate the statistics for the test: 𝑧 = 𝑥 − −𝜇 Τ 𝜎 𝑛 = 28−25 Τ 9 81 = 𝟑 6. Technical conclusion: The calculated value (3) is greater than the critical value (1.96) and therefore we reject the null hypothesis 7. Conclusion in non-technical terms: There isn’t sufficient evidence to suggest that the mean time to answer a call is 25 seconds
  • 13. EXAMPLE - 2 The average time it takes to answer a call on the switchboard is taken for a random sample of 25 calls. The resulting sample mean is found to be 28 seconds with a standard deviation of 9 seconds. Previously it has been estimated that the average time to answer the call was 25. Test at the 5% significance level the hypothesis that the average time for all phone calls to be answered is still μ = 25.
  • 14. EXAMPLE 2 - SOLVED 1. Set up the null hypothesis (Ho): The average time it takes for a phone call to be answered is 25. H0: μ = 25 2. Set up the alternative hypothesis (Ha): The average time that it takes for a phone call to be answered is not 25. Ha: μ  25 3. Specify the significance level (α) and identify the theoretical value: 5% significance level with a critical value of t(0.025,24) = 2.064 4. Look at your sample data: The sample data suggests that the average time it takes to answer a call is more than 25 seconds 5. Calculate the statistics for the test: 𝑡 = 𝑥 − −𝜇 Τ 𝑠 𝑛 = 28−25 Τ 9 25 = 𝟏. 𝟔𝟔𝟔 6. Technical conclusion: The calculated value is lower than the critical value (|1.666|< |2.064|) and therefore we cannot reject the null hypothesis. 7. Conclusion in non-technical terms: We cannot reject the claim that the average time it takes to answer a phone call is 25 seconds
  • 15. EXERCISE - CADSOFT • CadSoft, a producer of computer-aided design software for the aerospace industry receives numerous calls for technical support. In the past, the average response time has been at least 25 minutes. The company has upgraded its information systems and believes that this will help reduce response time to less than 25 minutes. The company collected a sample of 44 response times in the Excel worksheet “CadSoft Response Times”. Check whether the company is right in their belief.
  • 16. EXERCISE - VACATION SURVEY • The Excel worksheet “Vacation Survey” shows a portion of data collected in a survey of 34 respondents by a travel agency. Suppose that the travel agency wanted to target individuals who were approximately 35 years old. Test whether the average age of respondents is equal to 35.
  • 17. ONE-SAMPLE HYPOTHESIS TESTS Many important business measures, such as market share or the fraction of deliveries received on time, are expressed as proportions. We may conduct a test of hypothesis about a population proportion in a similar fashion as we did for means. The test statistic for a one-sample test for proportions is:
  • 18. EXERCISE - CADSOFT • CadSoft also sampled 44 customers and asked them to rate the overall quality of the company’s software product using a scale of 0-very poor, 1-poor, 2-good, 3-very good, 4-excellent. The company collected a sample of 44 response times in the Excel worksheet “CadSoft Product Satisfaction”. The firm tracks customer satisfaction of quality by measuring the proportion of responses in the top two categories. Over the past, this proportion has averaged about 75%. Is there sufficient evidence to conclude that this satisfaction measure has significantly exceeded 75% using a significance level of 0.05?
  • 19. TWO-SAMPLE HYPOTHESIS TESTS We can also have two-sample tests in which we compare means of two samples H0: μ1 – μ2 = 0 (can also be ≤ or ≥) Ha: μ1 – μ2 ≠ 0 (can also be < or >) The samples may be matched (a.k.a. paired) or may be independent
  • 20. PAIRED TWO-SAMPLE T-TEST FOR MEANS A matched-pairs t-test are samples that are paired or related in some fashion. For example: • If you wished to compare prices of groceries across grocery stores. Would need to compare the same products at two (or more) different stores • If you wished to measure the effectiveness of a new diet you would weigh the dieters at the start and at the finish of the program.
  • 21. EXERCISE - PILE FOUNDATION • The Excel worksheet “Pile Foundation” contains the estimates used in a bid and actual auger-cast pile lengths that engineers ultimately had to use for a foundation engineering project. The contractor’s past experience suggested that the bid information was generally accurate, so the average of the paired differences between the actual pile lengths and estimated lengths should be close to zero. After this project was completed, the contractor found that the average difference between the actual lengths and the estimated lengths was 6.38. Could the contractor conclude that the bid information was poor?
  • 22. INDEPENDENT SAMPLES: TWO-SAMPLE T-TEST • The prior example assumed we had matched or dependent data (i.e., the data from one population corresponds to data from another population) • Sometimes we have data taken independently from two populations • Independence means that the observations from one population have no influence on the observations from the other population • We use different hypothesis tests based on whether the variances between the samples are statistically the same • Test for equality of variances between two samples using the F-test.
  • 23. EXERCISE - COMPARING SUPPLIER PERFORMANCE • The last two columns in the “Purchase Orders” worksheet provide the order date and arrival date of all orders placed with each supplier. The time between placement of an order and its arrival is commonly called the lead time (computed by subtracting the dates from each other). Purchasing managers have noted that they order many of the same types of items from Alum Sheeting and Durrable Products and are considering dropping Alum Sheeting from its supplier base if its lead time is significantly longer than that of Durrable Products. Should they?