SlideShare a Scribd company logo
1 of 49
ANOVA
Samprit Chakrabarti
Topics for Discussion
• ANOVA
Example
• Gulfstream Aerospace Company produced three different
prototypes as candidates for mass production as the company’s
newest large-cabin business jet, the Gulfstream IV. Each of the
three prototypes has slightly different features, which may bring
about differences in performance. Therefore, as part of the
decision-making process concerning which model to produce,
company engineers are interested in determining whether the
three proposed models have about the same average flight range.
Each of the models is assigned a random choice of 10 flight routes
and departure times, and the flight range on a full standard fuel
tank is measured (the planes carry additional fuel on the test
flights, to allow them to land safely at certain destination points).
Range data for the three prototypes, in nautical miles (measured to
the nearest 10 miles), are as follows.
Data
Prototype A Prototype B Prototype C
4,420 4,230 4,110
4,540 4,220 4,090
4,380 4,100 4,070
4,550 4,300 4,160
4,210 4,420 4,230
4,330 4,110 4,120
4,400 4,230 4,000
4,340 4,280 4,200
4,390 4,090 4,150
4,510 4,320 4,220
Do all three prototypes have the same average range?
Introduction
• ANOVA is a technique of testing hypotheses
about the significance in several population
means
• Was developed by R A Fisher
• Total variation in a data set accounted for two
components: between and within
ANOVA – What does it tell us?
• ANOVA = Analysis of Variance
• ANOVA will tell us whether we have sufficient
evidence to say that measurements from at
least one sample differ significantly from at
least one other.
– It will not tell us which ones differ, or how many
differ
Relationship Amongst Test, Analysis of Variance, Analysis of
Covariance, & Regression
One Independent One or More
Metric Dependent Variable
t Test
Binary
Variable
One-Way Analysis
of Variance
One Factor
N-Way Analysis
of Variance
More than
One Factor
Analysis of
Variance
Categorical:
Factorial
Analysis of
Covariance
Categorical
and Interval
Regression
Interval
Independent Variables
One-Way Analysis of Variance
Marketing researchers are often interested in
examining the differences in the mean values of the
dependent variable for several categories of a single
independent variable or factor. For example:
• Do the various segments differ in terms of their
volume of product consumption?
• Do the brand evaluations of groups exposed to
different commercials vary?
• What is the effect of consumers' familiarity with the
store (measured as high, medium, and low) on
preference for the store?
ANOVA vs. t-test
• ANOVA is like a t-test among multiple data
sets simultaneously
– t-tests can only be done between two data
sets, or between one set and a “true” value
• ANOVA uses the F distribution instead of
the t-distribution
• ANOVA assumes that all of the data sets
have equal variances
Conducting One-Way ANOVA
Interpret the Results
Identify the Dependent and Independent Variables
Decompose the Total Variation
Measure the Effects
Test the Significance
Completely randomized design
Population 1 Population 2….. Population k
Mean = 1 Mean = 2 …. Mean = k
Variance=1
2 Variance=2
2 … Variance = k
2
We want to know something about how the
populations compare. Do they have the same mean?
We can collect random samples from each
population, which gives us the following data.
Completely randomized design
Mean = M1 Mean = M2 ..… Mean = Mk
Variance=s1
2 Variance=s2
2 …. Variance = sk
2
N1 cases N2 cases …. Nk cases
Suppose we want to compare 3 college majors in a
business school by the average annual income
people make after post graduation. We collect the
following data (in Rs 1000s) based on random
surveys.
Completely randomized design
Accounting Marketing Finance
27 23 48
22 36 35
33 27 46
25 44 36
38 39 28
29 32 29
Completely randomized design
Can the dean conclude that there are
differences among the major’s incomes?
Ho: 1 = 2 = 3
HA: 1  2  3
In this problem we must take into account:
1) The variance between samples, or the actual
differences by major. This is called the sum of
squares for treatment (SST) (columns)
Completely randomized design
2) The variance within samples, or the variance
of incomes within a single major. This is called
the sum of squares for error (SSE).
Recall that when we sample, there will always
be a chance of getting something different
than the population. We account for this
through #2, or the SSE.
F-Statistic
For this test, we will calculate a F statistic, which
is used to compare variances.
F = SST/(k-1)
SSE/(n-k)
SST=sum of squares for treatment (columns)
SSE=sum of squares for error
k = the number of populations
n = total sample size
F-statistic
Intuitively, the F statistic is:
F = explained variance
unexplained variance
Explained variance is the difference between
majors
Unexplained variance is the difference based on
random sampling for each group
Decision Based on ANOVA
• F (calculated) > F (critical or theoretical)
• Reject H0
• P < 0.05 (chosen significance level)
• Reject H0
Calculating SST
SST = ni(Mi - )2
 = grand mean or =  Mi/k or the sum of all
values for all groups divided by total sample
size
Mi = mean for each sample
k= the number of populations
Calculating SST
By major
Accounting M1=29, n1=6
Marketing M2=33.5, n2=6
Finance M3=37, n3=6
 = (29+33.5+37)/3 = 33.17
SST = (6)(29-33.17)2 + (6)(33.5-33.17)2 + (6)(37-
33.17)2 = 193
Calculating SST
Note that when M1 = M2 = M3, then SST=0 which
would support the null hypothesis.
In this example, the samples are of equal size,
but we can also run this analysis with samples
of varying size also.
Calculating SSE
SSE = (Xit – Mi)2
In other words, it is just the variance for each sample
added together.
SSE = (X1t – M1)2 + (X2t – M2)2 +
(X3t – M3)2
SSE = [(27-29)2 + (22-29)2 +…+ (29-29)2]
+ [(23-33.5)2 + (36-33.5)2 +…]
+ [(48-37)2 + (35-37)2 +…+ (29-37)2]
SSE = 819.5
Statistical Output
When you estimate this information in a computer
program, it will typically be presented in a table as
follows:
Source of df Sum of Mean F-ratio
Variation squares squares
Treatment k-1 SST MST=SST/(k-1) F=MST
Error n-k SSE MSE=SSE/(n-k) MSE
Total n-1 SS=SST+SSE
Calculating F for our example
F = 193/2
819.5/15
F = 1.77
Our calculated F is compared to the critical value
using the F-distribution with
F, k-1, n-k degrees of freedom
k-1 (numerator df)
n-k (denominator df)
The Results
For 95% confidence (=.05), our critical F is 3.68
In this case, 1.77 < 3.68 so we must accept the
null hypothesis.
The dean is puzzled by these results because
just by eyeballing the data, it looks like finance
majors make more money.
New Data
Many other factors may determine the salary
level, such as GPA. The dean decides to
collect new data selecting one student
randomly from each major with the following
average grades.
New data
Average Accounting Marketing Finance M(b)
A+ 41 45 51 M(b1)=45.67
A 36 38 45 M(b2)=39.67
B+ 27 33 31 M(b3)=30.83
B 32 29 35 M(b4)=32
C+ 26 31 32 M(b5)=29.67
C 23 25 27 M(b6)=25
M(t)1=30.83 M(t)2=33.5 M(t)3=36.83
 = 33.72
Randomized Block Design
Now the data in the 3 samples are not
independent, they are matched by GPA levels.
Just like before, matched samples are superior
to unmatched samples because they provide
more information. In this case, we have
added a factor that may account for some of
the SSE.
Two way ANOVA
Now SS(total) = SST + SSB + SSE
Where SSB = the variability among blocks,
where a block is a matched group of
observations from each of the populations
We can calculate a two-way ANOVA to test our
null hypothesis.
Example: Solving with Excel
• Vishal Foods Ltd is a leading manufacturer of biscuits. The
company has launched a new brand in the four metros;
Delhi, Mumbai, Kolkata and Chennai. After one month, the
company realizes that there is a difference in the retail
price per pack of biscuits across cities. Before the launch,
the company had promised its employees and newly-
appointed retailers that the biscuits would be sold at a
uniform price in the country. The difference in price can
tarnish the image of the company. In order to make a quick
inference, the company collected data about the price from
six randomly selected stores across the four cities. Based on
the sample information, the price per pack of the biscuits,
in rupees, is given in the table
Data
Delhi Mumbai Kolkata Chennai
22 19 18 21
22.5 19.5 17 20
21.5 19 18.5 21.5
22 20 17 20
22.5 19 18.5 21
21.5 21 17 20
Hypotheses
• Null hypothesis is all the means are all equal
• Alternative hypothesis is all the means are
unequal
Excel Path
• Select Data Analysis
• From Data Analysis dialogue box, select Anova:
Single Factor
• Click OK
• In the Anova: Single Factor dialogue box, enter
the location of the samples in the variable Input
Range box. Select Grouped by Columns.
• Place the value of α
• Click OK
Minitab Path
• Select Stat from the menu bar
• A pull down menu will appear- select ANOVA
• Another pull down menu will appear- select One
Way Unstacked
• One Way dialogue box will appear
• By using select, place samples in the Responses
(in separate columns) box and place the
confidence level
• Click OK
• F and p values will appear in the output box
Output Sheet: Excel
df for SST is (4-1) = 3, df for SSE is (24-4) = 20, df for total is (24-1) =
23. F theoretical is 3.098 where as F calculated is 43.03. As F
calculated greater than F theoretical, F falls in the rejection region.
Null hypothesis is rejected.
Anova: Single Factor
SUMMARY
Groups Count Sum Average Variance
Column 1 6 132 22 0.2
Column 2 6 117.5 19.58333 0.641667
Column 3 6 106 17.66667 0.566667
Column 4 6 123.5 20.58333 0.441667
ANOVA
Source of Variation SS df MS F P-value F crit
Between Groups 59.70833 3 19.90278 43.03303 6.54E-09 3.098391
Within Groups 9.25 20 0.4625
Total 68.95833 23
Conclusion
• There is enough evidence to believe that there
is a significant difference in the prices across
four cities
Two Way ANOVA
• A company which produces stationery items wants to
diversify into the photocopy paper manufacturing business.
The company has decided to first test market the product in
three areas termed as the north area, central area, and the
south area. The company takes a random sample of five
salesmen S1, S2, S3, S4 and S5 for this purpose. The sales
volume generated by these five salesmen, in thousand
rupees, and total sales in different regions are given in the
table.
• Use a randomized block design analysis to examine:
– Whether the salesmen significantly differ in performance?
– Whether there is a significant difference in terms of sales
capacity between the regions?
– Take 95% confidence level
Data
Region Salesmen
S1 S2 S3 S4 S5 Region’s
Total
North 24 30 26 23 32 135
Central 22 32 27 25 31 137
South 23 28 25 22 32 130
Salesmen’s
Total
69 90 78 70 95 402
Hypotheses
• Divided into two parts
• For treatments (columns)
• For blocks (rows)
• For treatments: Null hypothesis: all the treatment
means are equal; Alternative hypothesis: all the
treatment means are unequal
• For blocks: Null hypothesis: all the block means
are equal; Alternative hypothesis: all the block
means are unequal
Excel Path
• Select Data Analysis
• From Data Analysis dialogue box, select Anova:
Two Factor Without Replication
• Click OK
• In the Anova: Two Factor dialogue box, enter the
location of the samples in the variable Input
Range box
• Place the value of α
• Click OK
Output Sheet: Excel
The F calculated for columns is 30.17, which is greater than F theoretical
of 3.84, null hypothesis is rejected.
The F calculated for rows is 1.71, which is less than F theoretical of 4.46,
null hypothesis is accepted.
Anova: Two-Factor Without Replication
SUMMARY Count Sum Average Variance
Row 1 5 135 27 15
Row 2 5 137 27.4 17.3
Row 3 5 130 26 16.5
Column 1 3 69 23 1
Column 2 3 90 30 4
Column 3 3 78 26 1
Column 4 3 70 23.33333 2.333333
Column 5 3 95 31.66667 0.333333
ANOVA
Source of Variation SS df MS F P-value F crit
Rows 5.2 2 2.6 1.714286 0.2401 4.45897
Columns 183.0667 4 45.76667 30.17582 7.09E-05 3.837853
Error 12.13333 8 1.516667
Total 200.4 14
Conclusion
• There is enough evidence to believe that there
is a significant difference in the performance
of five salesmen in terms of generation of
results. On the other hand, there is no
significant difference in the capacity of
generating sales for the three regions
Illustrative Applications of One-Way
Analysis of Variance
We illustrate the concepts discussed in this chapter
using the data presented in the next table
The department store is attempting to determine the
effect of in-store promotion (X) on sales (Y)
The null hypothesis is that the category means are
equal:
H0: µ1 = µ2 = µ3.
16-44
Effect of Promotion and Clientele on Sales
Store Number Coupon Level In-Store Promotion Sales Clientel Rating
1 1.00 1.00 10.00 9.00
2 1.00 1.00 9.00 10.00
3 1.00 1.00 10.00 8.00
4 1.00 1.00 8.00 4.00
5 1.00 1.00 9.00 6.00
6 1.00 2.00 8.00 8.00
7 1.00 2.00 8.00 4.00
8 1.00 2.00 7.00 10.00
9 1.00 2.00 9.00 6.00
10 1.00 2.00 6.00 9.00
11 1.00 3.00 5.00 8.00
12 1.00 3.00 7.00 9.00
13 1.00 3.00 6.00 6.00
14 1.00 3.00 4.00 10.00
15 1.00 3.00 5.00 4.00
16 2.00 1.00 8.00 10.00
17 2.00 1.00 9.00 6.00
18 2.00 1.00 7.00 8.00
19 2.00 1.00 7.00 4.00
20 2.00 1.00 6.00 9.00
21 2.00 2.00 4.00 6.00
22 2.00 2.00 5.00 8.00
23 2.00 2.00 5.00 10.00
24 2.00 2.00 6.00 4.00
25 2.00 2.00 4.00 9.00
26 2.00 3.00 2.00 4.00
27 2.00 3.00 3.00 6.00
28 2.00 3.00 2.00 10.00
29 2.00 3.00 1.00 9.00
30 2.00 3.00 2.00 8.00
16-45
One-Way ANOVA: Effect of In-store Promotion on
Store Sales
Cell means
Level of Count Mean
Promotion
High (1) 10 8.300
Medium (2) 10 6.200
Low (3) 10 3.700
TOTAL 30 6.067
Source of Sum of df Mean F ratio F
prob.
Variation squares square
Between groups 106.067 2 53.033 17.944 3.35
(Promotion)
Within groups 79.800 27 2.956
(Error)
TOTAL 185.867 29 6.409
• From table we see that for 2 and 27 degrees of freedom,
the critical value of F is 3.35 for 95% level. Because the
calculated value of F is greater than the critical value, we
reject the null hypothesis.
Illustrative Applications of One-Way
Analysis of Variance
SPSS Windows
One-way ANOVA can be efficiently performed using the
program COMPARE MEANS and then One-way ANOVA.
To select this procedure using SPSS for Windows click:
Analyze>Compare Means>One-Way ANOVA …
N-way analysis of variance and analysis of covariance
can be performed using GENERAL LINEAR MODEL. To
select this procedure using SPSS for Windows click:
Analyze>General Linear Model>Univariate …
SPSS Windows: One-Way ANOVA
1. Select ANALYZE from the SPSS menu bar.
2. Click COMPARE MEANS and then ONE-WAY ANOVA.
3. Move “Sales [sales]” in to the DEPENDENT LIST box.
4. Move “In-Store Promotion[promotion]” to the FACTOR
box.
5. Click OPTIONS
6. Click Descriptive.
7. Click CONTINUE.
8. Click OK.
• Thanks

More Related Content

Similar to ANOVA.pptx

Analysis of Variance
Analysis of Variance Analysis of Variance
Analysis of Variance jyothimonc
 
Application of ANOVA
Application of ANOVAApplication of ANOVA
Application of ANOVARohit Patidar
 
Mb0040 statistics for management
Mb0040   statistics for managementMb0040   statistics for management
Mb0040 statistics for managementsmumbahelp
 
data analysis in research.pptx
data analysis in research.pptxdata analysis in research.pptx
data analysis in research.pptxAbdulRashidAdams
 
Penggambaran Data dengan Grafik
Penggambaran Data dengan GrafikPenggambaran Data dengan Grafik
Penggambaran Data dengan Grafikanom0164
 
test_using_one-way_analysis_of_varianceANOVA_063847.pptx
test_using_one-way_analysis_of_varianceANOVA_063847.pptxtest_using_one-way_analysis_of_varianceANOVA_063847.pptx
test_using_one-way_analysis_of_varianceANOVA_063847.pptxRaquelMaacap
 
Analysis of variance
Analysis of varianceAnalysis of variance
Analysis of varianceRavi Rohilla
 
A study on the ANOVA ANALYSIS OF VARIANCE.pptx
A study on the ANOVA ANALYSIS OF VARIANCE.pptxA study on the ANOVA ANALYSIS OF VARIANCE.pptx
A study on the ANOVA ANALYSIS OF VARIANCE.pptxjibinjohn140
 
Tugasan kumpulan anova
Tugasan kumpulan anovaTugasan kumpulan anova
Tugasan kumpulan anovapraba karan
 
Analysis of variance
Analysis of varianceAnalysis of variance
Analysis of varianceHemant Sharma
 
Analysis of data in research
Analysis of data in researchAnalysis of data in research
Analysis of data in researchAbhijeet Birari
 
3.3 Measures of Variation
3.3 Measures of Variation3.3 Measures of Variation
3.3 Measures of Variationmlong24
 
Analysis of variance(menong)
Analysis of variance(menong)Analysis of variance(menong)
Analysis of variance(menong)Maria Theresa
 
Statr session 19 and 20
Statr session 19 and 20Statr session 19 and 20
Statr session 19 and 20Ruru Chowdhury
 

Similar to ANOVA.pptx (20)

Analysis of Variance
Analysis of Variance Analysis of Variance
Analysis of Variance
 
Application of ANOVA
Application of ANOVAApplication of ANOVA
Application of ANOVA
 
Friedman's test
Friedman's testFriedman's test
Friedman's test
 
Mb0040 statistics for management
Mb0040   statistics for managementMb0040   statistics for management
Mb0040 statistics for management
 
Anova.ppt
Anova.pptAnova.ppt
Anova.ppt
 
data analysis in research.pptx
data analysis in research.pptxdata analysis in research.pptx
data analysis in research.pptx
 
Penggambaran Data dengan Grafik
Penggambaran Data dengan GrafikPenggambaran Data dengan Grafik
Penggambaran Data dengan Grafik
 
test_using_one-way_analysis_of_varianceANOVA_063847.pptx
test_using_one-way_analysis_of_varianceANOVA_063847.pptxtest_using_one-way_analysis_of_varianceANOVA_063847.pptx
test_using_one-way_analysis_of_varianceANOVA_063847.pptx
 
QM Unit II.pptx
QM Unit II.pptxQM Unit II.pptx
QM Unit II.pptx
 
Data analysis
Data analysisData analysis
Data analysis
 
Analysis of variance
Analysis of varianceAnalysis of variance
Analysis of variance
 
A study on the ANOVA ANALYSIS OF VARIANCE.pptx
A study on the ANOVA ANALYSIS OF VARIANCE.pptxA study on the ANOVA ANALYSIS OF VARIANCE.pptx
A study on the ANOVA ANALYSIS OF VARIANCE.pptx
 
Tugasan kumpulan anova
Tugasan kumpulan anovaTugasan kumpulan anova
Tugasan kumpulan anova
 
Analysis of variance
Analysis of varianceAnalysis of variance
Analysis of variance
 
Factorial Experiments
Factorial ExperimentsFactorial Experiments
Factorial Experiments
 
Analysis of data in research
Analysis of data in researchAnalysis of data in research
Analysis of data in research
 
3.3 Measures of Variation
3.3 Measures of Variation3.3 Measures of Variation
3.3 Measures of Variation
 
Analysis of variance(menong)
Analysis of variance(menong)Analysis of variance(menong)
Analysis of variance(menong)
 
Statr session 19 and 20
Statr session 19 and 20Statr session 19 and 20
Statr session 19 and 20
 
Ch11
Ch11Ch11
Ch11
 

Recently uploaded

Design Portfolio - 2024 - William Vickery
Design Portfolio - 2024 - William VickeryDesign Portfolio - 2024 - William Vickery
Design Portfolio - 2024 - William VickeryWilliamVickery6
 
Call Girls in Okhla Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Okhla Delhi 💯Call Us 🔝8264348440🔝Call Girls in Okhla Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Okhla Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Fashion trends before and after covid.pptx
Fashion trends before and after covid.pptxFashion trends before and after covid.pptx
Fashion trends before and after covid.pptxVanshNarang19
 
Captivating Charm: Exploring Marseille's Hillside Villas with Our 3D Architec...
Captivating Charm: Exploring Marseille's Hillside Villas with Our 3D Architec...Captivating Charm: Exploring Marseille's Hillside Villas with Our 3D Architec...
Captivating Charm: Exploring Marseille's Hillside Villas with Our 3D Architec...Yantram Animation Studio Corporation
 
Abu Dhabi Call Girls O58993O4O2 Call Girls in Abu Dhabi`
Abu Dhabi Call Girls O58993O4O2 Call Girls in Abu Dhabi`Abu Dhabi Call Girls O58993O4O2 Call Girls in Abu Dhabi`
Abu Dhabi Call Girls O58993O4O2 Call Girls in Abu Dhabi`dajasot375
 
NO1 Famous Amil Baba In Karachi Kala Jadu In Karachi Amil baba In Karachi Add...
NO1 Famous Amil Baba In Karachi Kala Jadu In Karachi Amil baba In Karachi Add...NO1 Famous Amil Baba In Karachi Kala Jadu In Karachi Amil baba In Karachi Add...
NO1 Famous Amil Baba In Karachi Kala Jadu In Karachi Amil baba In Karachi Add...Amil baba
 
SCRIP Lua HTTP PROGRACMACION PLC WECON CA
SCRIP Lua HTTP PROGRACMACION PLC  WECON CASCRIP Lua HTTP PROGRACMACION PLC  WECON CA
SCRIP Lua HTTP PROGRACMACION PLC WECON CANestorGamez6
 
3D Printing And Designing Final Report.pdf
3D Printing And Designing Final Report.pdf3D Printing And Designing Final Report.pdf
3D Printing And Designing Final Report.pdfSwaraliBorhade
 
Cheap Rate Call girls Malviya Nagar 9205541914 shot 1500 night
Cheap Rate Call girls Malviya Nagar 9205541914 shot 1500 nightCheap Rate Call girls Malviya Nagar 9205541914 shot 1500 night
Cheap Rate Call girls Malviya Nagar 9205541914 shot 1500 nightDelhi Call girls
 
The_Canvas_of_Creative_Mastery_Newsletter_April_2024_Version.pdf
The_Canvas_of_Creative_Mastery_Newsletter_April_2024_Version.pdfThe_Canvas_of_Creative_Mastery_Newsletter_April_2024_Version.pdf
The_Canvas_of_Creative_Mastery_Newsletter_April_2024_Version.pdfAmirYakdi
 
Call Us ✡️97111⇛47426⇛Call In girls Vasant Vihar༒(Delhi)
Call Us ✡️97111⇛47426⇛Call In girls Vasant Vihar༒(Delhi)Call Us ✡️97111⇛47426⇛Call In girls Vasant Vihar༒(Delhi)
Call Us ✡️97111⇛47426⇛Call In girls Vasant Vihar༒(Delhi)jennyeacort
 
PORTAFOLIO 2024_ ANASTASIYA KUDINOVA
PORTAFOLIO   2024_  ANASTASIYA  KUDINOVAPORTAFOLIO   2024_  ANASTASIYA  KUDINOVA
PORTAFOLIO 2024_ ANASTASIYA KUDINOVAAnastasiya Kudinova
 
Kindergarten Assessment Questions Via LessonUp
Kindergarten Assessment Questions Via LessonUpKindergarten Assessment Questions Via LessonUp
Kindergarten Assessment Questions Via LessonUpmainac1
 
VIP Call Girls Service Bhagyanagar Hyderabad Call +91-8250192130
VIP Call Girls Service Bhagyanagar Hyderabad Call +91-8250192130VIP Call Girls Service Bhagyanagar Hyderabad Call +91-8250192130
VIP Call Girls Service Bhagyanagar Hyderabad Call +91-8250192130Suhani Kapoor
 
WAEC Carpentry and Joinery Past Questions
WAEC Carpentry and Joinery Past QuestionsWAEC Carpentry and Joinery Past Questions
WAEC Carpentry and Joinery Past QuestionsCharles Obaleagbon
 
NATA 2024 SYLLABUS, full syllabus explained in detail
NATA 2024 SYLLABUS, full syllabus explained in detailNATA 2024 SYLLABUS, full syllabus explained in detail
NATA 2024 SYLLABUS, full syllabus explained in detailDesigntroIntroducing
 
VIP Call Girl Amravati Aashi 8250192130 Independent Escort Service Amravati
VIP Call Girl Amravati Aashi 8250192130 Independent Escort Service AmravatiVIP Call Girl Amravati Aashi 8250192130 Independent Escort Service Amravati
VIP Call Girl Amravati Aashi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
Cheap Rate ➥8448380779 ▻Call Girls In Iffco Chowk Gurgaon
Cheap Rate ➥8448380779 ▻Call Girls In Iffco Chowk GurgaonCheap Rate ➥8448380779 ▻Call Girls In Iffco Chowk Gurgaon
Cheap Rate ➥8448380779 ▻Call Girls In Iffco Chowk GurgaonDelhi Call girls
 
A level Digipak development Presentation
A level Digipak development PresentationA level Digipak development Presentation
A level Digipak development Presentationamedia6
 
ARt app | UX Case Study
ARt app | UX Case StudyARt app | UX Case Study
ARt app | UX Case StudySophia Viganò
 

Recently uploaded (20)

Design Portfolio - 2024 - William Vickery
Design Portfolio - 2024 - William VickeryDesign Portfolio - 2024 - William Vickery
Design Portfolio - 2024 - William Vickery
 
Call Girls in Okhla Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Okhla Delhi 💯Call Us 🔝8264348440🔝Call Girls in Okhla Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Okhla Delhi 💯Call Us 🔝8264348440🔝
 
Fashion trends before and after covid.pptx
Fashion trends before and after covid.pptxFashion trends before and after covid.pptx
Fashion trends before and after covid.pptx
 
Captivating Charm: Exploring Marseille's Hillside Villas with Our 3D Architec...
Captivating Charm: Exploring Marseille's Hillside Villas with Our 3D Architec...Captivating Charm: Exploring Marseille's Hillside Villas with Our 3D Architec...
Captivating Charm: Exploring Marseille's Hillside Villas with Our 3D Architec...
 
Abu Dhabi Call Girls O58993O4O2 Call Girls in Abu Dhabi`
Abu Dhabi Call Girls O58993O4O2 Call Girls in Abu Dhabi`Abu Dhabi Call Girls O58993O4O2 Call Girls in Abu Dhabi`
Abu Dhabi Call Girls O58993O4O2 Call Girls in Abu Dhabi`
 
NO1 Famous Amil Baba In Karachi Kala Jadu In Karachi Amil baba In Karachi Add...
NO1 Famous Amil Baba In Karachi Kala Jadu In Karachi Amil baba In Karachi Add...NO1 Famous Amil Baba In Karachi Kala Jadu In Karachi Amil baba In Karachi Add...
NO1 Famous Amil Baba In Karachi Kala Jadu In Karachi Amil baba In Karachi Add...
 
SCRIP Lua HTTP PROGRACMACION PLC WECON CA
SCRIP Lua HTTP PROGRACMACION PLC  WECON CASCRIP Lua HTTP PROGRACMACION PLC  WECON CA
SCRIP Lua HTTP PROGRACMACION PLC WECON CA
 
3D Printing And Designing Final Report.pdf
3D Printing And Designing Final Report.pdf3D Printing And Designing Final Report.pdf
3D Printing And Designing Final Report.pdf
 
Cheap Rate Call girls Malviya Nagar 9205541914 shot 1500 night
Cheap Rate Call girls Malviya Nagar 9205541914 shot 1500 nightCheap Rate Call girls Malviya Nagar 9205541914 shot 1500 night
Cheap Rate Call girls Malviya Nagar 9205541914 shot 1500 night
 
The_Canvas_of_Creative_Mastery_Newsletter_April_2024_Version.pdf
The_Canvas_of_Creative_Mastery_Newsletter_April_2024_Version.pdfThe_Canvas_of_Creative_Mastery_Newsletter_April_2024_Version.pdf
The_Canvas_of_Creative_Mastery_Newsletter_April_2024_Version.pdf
 
Call Us ✡️97111⇛47426⇛Call In girls Vasant Vihar༒(Delhi)
Call Us ✡️97111⇛47426⇛Call In girls Vasant Vihar༒(Delhi)Call Us ✡️97111⇛47426⇛Call In girls Vasant Vihar༒(Delhi)
Call Us ✡️97111⇛47426⇛Call In girls Vasant Vihar༒(Delhi)
 
PORTAFOLIO 2024_ ANASTASIYA KUDINOVA
PORTAFOLIO   2024_  ANASTASIYA  KUDINOVAPORTAFOLIO   2024_  ANASTASIYA  KUDINOVA
PORTAFOLIO 2024_ ANASTASIYA KUDINOVA
 
Kindergarten Assessment Questions Via LessonUp
Kindergarten Assessment Questions Via LessonUpKindergarten Assessment Questions Via LessonUp
Kindergarten Assessment Questions Via LessonUp
 
VIP Call Girls Service Bhagyanagar Hyderabad Call +91-8250192130
VIP Call Girls Service Bhagyanagar Hyderabad Call +91-8250192130VIP Call Girls Service Bhagyanagar Hyderabad Call +91-8250192130
VIP Call Girls Service Bhagyanagar Hyderabad Call +91-8250192130
 
WAEC Carpentry and Joinery Past Questions
WAEC Carpentry and Joinery Past QuestionsWAEC Carpentry and Joinery Past Questions
WAEC Carpentry and Joinery Past Questions
 
NATA 2024 SYLLABUS, full syllabus explained in detail
NATA 2024 SYLLABUS, full syllabus explained in detailNATA 2024 SYLLABUS, full syllabus explained in detail
NATA 2024 SYLLABUS, full syllabus explained in detail
 
VIP Call Girl Amravati Aashi 8250192130 Independent Escort Service Amravati
VIP Call Girl Amravati Aashi 8250192130 Independent Escort Service AmravatiVIP Call Girl Amravati Aashi 8250192130 Independent Escort Service Amravati
VIP Call Girl Amravati Aashi 8250192130 Independent Escort Service Amravati
 
Cheap Rate ➥8448380779 ▻Call Girls In Iffco Chowk Gurgaon
Cheap Rate ➥8448380779 ▻Call Girls In Iffco Chowk GurgaonCheap Rate ➥8448380779 ▻Call Girls In Iffco Chowk Gurgaon
Cheap Rate ➥8448380779 ▻Call Girls In Iffco Chowk Gurgaon
 
A level Digipak development Presentation
A level Digipak development PresentationA level Digipak development Presentation
A level Digipak development Presentation
 
ARt app | UX Case Study
ARt app | UX Case StudyARt app | UX Case Study
ARt app | UX Case Study
 

ANOVA.pptx

  • 3. Example • Gulfstream Aerospace Company produced three different prototypes as candidates for mass production as the company’s newest large-cabin business jet, the Gulfstream IV. Each of the three prototypes has slightly different features, which may bring about differences in performance. Therefore, as part of the decision-making process concerning which model to produce, company engineers are interested in determining whether the three proposed models have about the same average flight range. Each of the models is assigned a random choice of 10 flight routes and departure times, and the flight range on a full standard fuel tank is measured (the planes carry additional fuel on the test flights, to allow them to land safely at certain destination points). Range data for the three prototypes, in nautical miles (measured to the nearest 10 miles), are as follows.
  • 4. Data Prototype A Prototype B Prototype C 4,420 4,230 4,110 4,540 4,220 4,090 4,380 4,100 4,070 4,550 4,300 4,160 4,210 4,420 4,230 4,330 4,110 4,120 4,400 4,230 4,000 4,340 4,280 4,200 4,390 4,090 4,150 4,510 4,320 4,220 Do all three prototypes have the same average range?
  • 5. Introduction • ANOVA is a technique of testing hypotheses about the significance in several population means • Was developed by R A Fisher • Total variation in a data set accounted for two components: between and within
  • 6. ANOVA – What does it tell us? • ANOVA = Analysis of Variance • ANOVA will tell us whether we have sufficient evidence to say that measurements from at least one sample differ significantly from at least one other. – It will not tell us which ones differ, or how many differ
  • 7. Relationship Amongst Test, Analysis of Variance, Analysis of Covariance, & Regression One Independent One or More Metric Dependent Variable t Test Binary Variable One-Way Analysis of Variance One Factor N-Way Analysis of Variance More than One Factor Analysis of Variance Categorical: Factorial Analysis of Covariance Categorical and Interval Regression Interval Independent Variables
  • 8. One-Way Analysis of Variance Marketing researchers are often interested in examining the differences in the mean values of the dependent variable for several categories of a single independent variable or factor. For example: • Do the various segments differ in terms of their volume of product consumption? • Do the brand evaluations of groups exposed to different commercials vary? • What is the effect of consumers' familiarity with the store (measured as high, medium, and low) on preference for the store?
  • 9. ANOVA vs. t-test • ANOVA is like a t-test among multiple data sets simultaneously – t-tests can only be done between two data sets, or between one set and a “true” value • ANOVA uses the F distribution instead of the t-distribution • ANOVA assumes that all of the data sets have equal variances
  • 10. Conducting One-Way ANOVA Interpret the Results Identify the Dependent and Independent Variables Decompose the Total Variation Measure the Effects Test the Significance
  • 11. Completely randomized design Population 1 Population 2….. Population k Mean = 1 Mean = 2 …. Mean = k Variance=1 2 Variance=2 2 … Variance = k 2 We want to know something about how the populations compare. Do they have the same mean? We can collect random samples from each population, which gives us the following data.
  • 12. Completely randomized design Mean = M1 Mean = M2 ..… Mean = Mk Variance=s1 2 Variance=s2 2 …. Variance = sk 2 N1 cases N2 cases …. Nk cases Suppose we want to compare 3 college majors in a business school by the average annual income people make after post graduation. We collect the following data (in Rs 1000s) based on random surveys.
  • 13. Completely randomized design Accounting Marketing Finance 27 23 48 22 36 35 33 27 46 25 44 36 38 39 28 29 32 29
  • 14. Completely randomized design Can the dean conclude that there are differences among the major’s incomes? Ho: 1 = 2 = 3 HA: 1  2  3 In this problem we must take into account: 1) The variance between samples, or the actual differences by major. This is called the sum of squares for treatment (SST) (columns)
  • 15. Completely randomized design 2) The variance within samples, or the variance of incomes within a single major. This is called the sum of squares for error (SSE). Recall that when we sample, there will always be a chance of getting something different than the population. We account for this through #2, or the SSE.
  • 16. F-Statistic For this test, we will calculate a F statistic, which is used to compare variances. F = SST/(k-1) SSE/(n-k) SST=sum of squares for treatment (columns) SSE=sum of squares for error k = the number of populations n = total sample size
  • 17. F-statistic Intuitively, the F statistic is: F = explained variance unexplained variance Explained variance is the difference between majors Unexplained variance is the difference based on random sampling for each group
  • 18. Decision Based on ANOVA • F (calculated) > F (critical or theoretical) • Reject H0 • P < 0.05 (chosen significance level) • Reject H0
  • 19. Calculating SST SST = ni(Mi - )2  = grand mean or =  Mi/k or the sum of all values for all groups divided by total sample size Mi = mean for each sample k= the number of populations
  • 20. Calculating SST By major Accounting M1=29, n1=6 Marketing M2=33.5, n2=6 Finance M3=37, n3=6  = (29+33.5+37)/3 = 33.17 SST = (6)(29-33.17)2 + (6)(33.5-33.17)2 + (6)(37- 33.17)2 = 193
  • 21. Calculating SST Note that when M1 = M2 = M3, then SST=0 which would support the null hypothesis. In this example, the samples are of equal size, but we can also run this analysis with samples of varying size also.
  • 22. Calculating SSE SSE = (Xit – Mi)2 In other words, it is just the variance for each sample added together. SSE = (X1t – M1)2 + (X2t – M2)2 + (X3t – M3)2 SSE = [(27-29)2 + (22-29)2 +…+ (29-29)2] + [(23-33.5)2 + (36-33.5)2 +…] + [(48-37)2 + (35-37)2 +…+ (29-37)2] SSE = 819.5
  • 23. Statistical Output When you estimate this information in a computer program, it will typically be presented in a table as follows: Source of df Sum of Mean F-ratio Variation squares squares Treatment k-1 SST MST=SST/(k-1) F=MST Error n-k SSE MSE=SSE/(n-k) MSE Total n-1 SS=SST+SSE
  • 24. Calculating F for our example F = 193/2 819.5/15 F = 1.77 Our calculated F is compared to the critical value using the F-distribution with F, k-1, n-k degrees of freedom k-1 (numerator df) n-k (denominator df)
  • 25. The Results For 95% confidence (=.05), our critical F is 3.68 In this case, 1.77 < 3.68 so we must accept the null hypothesis. The dean is puzzled by these results because just by eyeballing the data, it looks like finance majors make more money.
  • 26. New Data Many other factors may determine the salary level, such as GPA. The dean decides to collect new data selecting one student randomly from each major with the following average grades.
  • 27. New data Average Accounting Marketing Finance M(b) A+ 41 45 51 M(b1)=45.67 A 36 38 45 M(b2)=39.67 B+ 27 33 31 M(b3)=30.83 B 32 29 35 M(b4)=32 C+ 26 31 32 M(b5)=29.67 C 23 25 27 M(b6)=25 M(t)1=30.83 M(t)2=33.5 M(t)3=36.83  = 33.72
  • 28. Randomized Block Design Now the data in the 3 samples are not independent, they are matched by GPA levels. Just like before, matched samples are superior to unmatched samples because they provide more information. In this case, we have added a factor that may account for some of the SSE.
  • 29. Two way ANOVA Now SS(total) = SST + SSB + SSE Where SSB = the variability among blocks, where a block is a matched group of observations from each of the populations We can calculate a two-way ANOVA to test our null hypothesis.
  • 30. Example: Solving with Excel • Vishal Foods Ltd is a leading manufacturer of biscuits. The company has launched a new brand in the four metros; Delhi, Mumbai, Kolkata and Chennai. After one month, the company realizes that there is a difference in the retail price per pack of biscuits across cities. Before the launch, the company had promised its employees and newly- appointed retailers that the biscuits would be sold at a uniform price in the country. The difference in price can tarnish the image of the company. In order to make a quick inference, the company collected data about the price from six randomly selected stores across the four cities. Based on the sample information, the price per pack of the biscuits, in rupees, is given in the table
  • 31. Data Delhi Mumbai Kolkata Chennai 22 19 18 21 22.5 19.5 17 20 21.5 19 18.5 21.5 22 20 17 20 22.5 19 18.5 21 21.5 21 17 20
  • 32. Hypotheses • Null hypothesis is all the means are all equal • Alternative hypothesis is all the means are unequal
  • 33. Excel Path • Select Data Analysis • From Data Analysis dialogue box, select Anova: Single Factor • Click OK • In the Anova: Single Factor dialogue box, enter the location of the samples in the variable Input Range box. Select Grouped by Columns. • Place the value of α • Click OK
  • 34. Minitab Path • Select Stat from the menu bar • A pull down menu will appear- select ANOVA • Another pull down menu will appear- select One Way Unstacked • One Way dialogue box will appear • By using select, place samples in the Responses (in separate columns) box and place the confidence level • Click OK • F and p values will appear in the output box
  • 35. Output Sheet: Excel df for SST is (4-1) = 3, df for SSE is (24-4) = 20, df for total is (24-1) = 23. F theoretical is 3.098 where as F calculated is 43.03. As F calculated greater than F theoretical, F falls in the rejection region. Null hypothesis is rejected. Anova: Single Factor SUMMARY Groups Count Sum Average Variance Column 1 6 132 22 0.2 Column 2 6 117.5 19.58333 0.641667 Column 3 6 106 17.66667 0.566667 Column 4 6 123.5 20.58333 0.441667 ANOVA Source of Variation SS df MS F P-value F crit Between Groups 59.70833 3 19.90278 43.03303 6.54E-09 3.098391 Within Groups 9.25 20 0.4625 Total 68.95833 23
  • 36. Conclusion • There is enough evidence to believe that there is a significant difference in the prices across four cities
  • 37. Two Way ANOVA • A company which produces stationery items wants to diversify into the photocopy paper manufacturing business. The company has decided to first test market the product in three areas termed as the north area, central area, and the south area. The company takes a random sample of five salesmen S1, S2, S3, S4 and S5 for this purpose. The sales volume generated by these five salesmen, in thousand rupees, and total sales in different regions are given in the table. • Use a randomized block design analysis to examine: – Whether the salesmen significantly differ in performance? – Whether there is a significant difference in terms of sales capacity between the regions? – Take 95% confidence level
  • 38. Data Region Salesmen S1 S2 S3 S4 S5 Region’s Total North 24 30 26 23 32 135 Central 22 32 27 25 31 137 South 23 28 25 22 32 130 Salesmen’s Total 69 90 78 70 95 402
  • 39. Hypotheses • Divided into two parts • For treatments (columns) • For blocks (rows) • For treatments: Null hypothesis: all the treatment means are equal; Alternative hypothesis: all the treatment means are unequal • For blocks: Null hypothesis: all the block means are equal; Alternative hypothesis: all the block means are unequal
  • 40. Excel Path • Select Data Analysis • From Data Analysis dialogue box, select Anova: Two Factor Without Replication • Click OK • In the Anova: Two Factor dialogue box, enter the location of the samples in the variable Input Range box • Place the value of α • Click OK
  • 41. Output Sheet: Excel The F calculated for columns is 30.17, which is greater than F theoretical of 3.84, null hypothesis is rejected. The F calculated for rows is 1.71, which is less than F theoretical of 4.46, null hypothesis is accepted. Anova: Two-Factor Without Replication SUMMARY Count Sum Average Variance Row 1 5 135 27 15 Row 2 5 137 27.4 17.3 Row 3 5 130 26 16.5 Column 1 3 69 23 1 Column 2 3 90 30 4 Column 3 3 78 26 1 Column 4 3 70 23.33333 2.333333 Column 5 3 95 31.66667 0.333333 ANOVA Source of Variation SS df MS F P-value F crit Rows 5.2 2 2.6 1.714286 0.2401 4.45897 Columns 183.0667 4 45.76667 30.17582 7.09E-05 3.837853 Error 12.13333 8 1.516667 Total 200.4 14
  • 42. Conclusion • There is enough evidence to believe that there is a significant difference in the performance of five salesmen in terms of generation of results. On the other hand, there is no significant difference in the capacity of generating sales for the three regions
  • 43. Illustrative Applications of One-Way Analysis of Variance We illustrate the concepts discussed in this chapter using the data presented in the next table The department store is attempting to determine the effect of in-store promotion (X) on sales (Y) The null hypothesis is that the category means are equal: H0: µ1 = µ2 = µ3.
  • 44. 16-44 Effect of Promotion and Clientele on Sales Store Number Coupon Level In-Store Promotion Sales Clientel Rating 1 1.00 1.00 10.00 9.00 2 1.00 1.00 9.00 10.00 3 1.00 1.00 10.00 8.00 4 1.00 1.00 8.00 4.00 5 1.00 1.00 9.00 6.00 6 1.00 2.00 8.00 8.00 7 1.00 2.00 8.00 4.00 8 1.00 2.00 7.00 10.00 9 1.00 2.00 9.00 6.00 10 1.00 2.00 6.00 9.00 11 1.00 3.00 5.00 8.00 12 1.00 3.00 7.00 9.00 13 1.00 3.00 6.00 6.00 14 1.00 3.00 4.00 10.00 15 1.00 3.00 5.00 4.00 16 2.00 1.00 8.00 10.00 17 2.00 1.00 9.00 6.00 18 2.00 1.00 7.00 8.00 19 2.00 1.00 7.00 4.00 20 2.00 1.00 6.00 9.00 21 2.00 2.00 4.00 6.00 22 2.00 2.00 5.00 8.00 23 2.00 2.00 5.00 10.00 24 2.00 2.00 6.00 4.00 25 2.00 2.00 4.00 9.00 26 2.00 3.00 2.00 4.00 27 2.00 3.00 3.00 6.00 28 2.00 3.00 2.00 10.00 29 2.00 3.00 1.00 9.00 30 2.00 3.00 2.00 8.00
  • 45. 16-45 One-Way ANOVA: Effect of In-store Promotion on Store Sales Cell means Level of Count Mean Promotion High (1) 10 8.300 Medium (2) 10 6.200 Low (3) 10 3.700 TOTAL 30 6.067 Source of Sum of df Mean F ratio F prob. Variation squares square Between groups 106.067 2 53.033 17.944 3.35 (Promotion) Within groups 79.800 27 2.956 (Error) TOTAL 185.867 29 6.409
  • 46. • From table we see that for 2 and 27 degrees of freedom, the critical value of F is 3.35 for 95% level. Because the calculated value of F is greater than the critical value, we reject the null hypothesis. Illustrative Applications of One-Way Analysis of Variance
  • 47. SPSS Windows One-way ANOVA can be efficiently performed using the program COMPARE MEANS and then One-way ANOVA. To select this procedure using SPSS for Windows click: Analyze>Compare Means>One-Way ANOVA … N-way analysis of variance and analysis of covariance can be performed using GENERAL LINEAR MODEL. To select this procedure using SPSS for Windows click: Analyze>General Linear Model>Univariate …
  • 48. SPSS Windows: One-Way ANOVA 1. Select ANALYZE from the SPSS menu bar. 2. Click COMPARE MEANS and then ONE-WAY ANOVA. 3. Move “Sales [sales]” in to the DEPENDENT LIST box. 4. Move “In-Store Promotion[promotion]” to the FACTOR box. 5. Click OPTIONS 6. Click Descriptive. 7. Click CONTINUE. 8. Click OK.