Copyright © Dorling Kindersley India Pvt. Ltd
Correlation and Simple Linear Regression Analysis
Learning Objectives
Upon completion of this chapter, you will be able to:
Ø Use the simple linear regression equation
Ø Compute the coefficient of correlation and understand its interpretation
Ø Understand the concept of measures of variation, the coefficient of determination, and the standard error of the estimate
Ø Understand and use residual analysis for testing the assumptions of regression
Ø Measure autocorrelation by using the Durbin–Watson statistic
Ø Understand statistical inference about the slope and the correlation coefficient of the regression model, and test the overall model
Measures of Association
Ø Measures of association are statistics that measure the strength of the relationship between two variables.
Ø Correlation measures the degree of association between two variables.
Ø Karl Pearson’s coefficient of correlation is a quantitative measure of the degree of linear relationship between two variables. If these variables are x and y, then Karl Pearson’s coefficient of correlation is defined as
$$r = \frac{n\sum xy - \sum x \sum y}{\sqrt{\left[n\sum x^2 - \left(\sum x\right)^2\right]\left[n\sum y^2 - \left(\sum y\right)^2\right]}}$$
Ø The coefficient of correlation lies between –1 and +1, inclusive.
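As a minimal sketch of how Karl Pearson's coefficient can be computed from raw sums, the function below uses the usual computational form of r. The function name `pearson_r` and the five data pairs are hypothetical and chosen only for illustration; they are not the figures from the chapter's tables.

```python
from math import sqrt


def pearson_r(x, y):
    """Karl Pearson's coefficient of correlation from raw sums."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    sum_x, sum_y = sum(x), sum(y)
    sum_xy = sum(a * b for a, b in zip(x, y))
    sum_x2 = sum(a * a for a in x)
    sum_y2 = sum(b * b for b in y)
    numerator = n * sum_xy - sum_x * sum_y
    denominator = sqrt((n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2))
    return numerator / denominator


# Hypothetical advertisement and sales figures (illustration only).
advertisement = [1, 2, 3, 4, 5]
sales = [2, 4, 5, 4, 5]

r = pearson_r(advertisement, sales)
print(round(r, 4))
```

A value close to +1 or –1 indicates a strong linear association, while a value near 0 indicates a weak one.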
Figure 15.1: Interpretation of correlation coefficient
Example 15.1
Table 15.2 shows the sales revenue and advertisement expenses of a company for the past 10 months. Find the coefficient of correlation between sales and advertisement.
Table 15.3: Calculation of correlation coefficient between sales and advertisement
Figure 15.9: Five examples of correlation coefficient
Using MS Excel, Minitab and SPSS for Computing Correlation Coefficient
Ø Ch 15 Solved ExamplesExcelEx 15.1.xls
Ø Ch 15 Solved ExamplesMinitabEx 15.1.MPJ
Ø Ch 15 Solved ExamplesSPSSEx 15.1.sav
Ø Ch 15 Solved ExamplesSPSSOutput Ex 15.1.spv
Introduction to Simple Linear Regression
Ø Regression analysis is the process of developing a statistical model that is used to predict the value of a dependent variable from at least one independent variable.
Ø In simple linear regression analysis, there are two types of variables. The variable whose value is influenced or is to be predicted is called the dependent variable, and the variable that influences the value or is used for prediction is called the independent variable.
Ø In regression analysis, the independent variable is also known as the regressor, predictor, or explanatory variable, while the dependent variable is also known as the regressed or explained variable. In simple linear regression analysis, only a straight-line relationship between two variables is examined.
A Deterministic and Probabilistic Model
ε is the error of the regression line in fitting the observed points. If a point is on the regression line, the corresponding value of ε is equal to zero. If the point is not on the regression line, the value of ε measures the error.
In the deterministic model, all the points are assumed to be on the regression line and hence the random error ε is equal to zero in all cases. The probabilistic model includes an error term, which allows the value of y to vary for any given value of x.
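The two models are usually written as follows. The symbols β0 (intercept) and β1 (slope) are assumed notation here, since only ε appears in the text above; this is the standard textbook form rather than a formula recovered from the slide.

```latex
% Deterministic model: every point lies exactly on the line
y = \beta_0 + \beta_1 x

% Probabilistic model: the error term lets y vary for a given x
y = \beta_0 + \beta_1 x + \varepsilon
```

Setting ε = 0 for every observation reduces the probabilistic model to the deterministic one.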
Figure 15.10: Error in simple regression
Figure 15.11: Summary of the estimation process for simple linear regression.
Example 15.2
A cable wire company has spent heavily on advertisements. The sales and advertisement expenses (in thousand rupees) for 12 randomly selected months are given in Table 14.2. Develop a regression model to predict the impact of advertisement on sales.
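A least-squares fit of this kind can be sketched as below. The table for this example is not reproduced in the text, so the five data pairs are hypothetical, and the helper name `fit_simple_linear_regression` is an illustrative choice. The slope and intercept use the standard least-squares formulas.

```python
def fit_simple_linear_regression(x, y):
    """Return (intercept b0, slope b1) of the least-squares line y = b0 + b1 * x."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    sum_x, sum_y = sum(x), sum(y)
    sum_xy = sum(a * b for a, b in zip(x, y))
    sum_x2 = sum(a * a for a in x)
    # Slope: covariation of x and y divided by variation of x.
    b1 = (n * sum_xy - sum_x * sum_y) / (n * sum_x2 - sum_x ** 2)
    # Intercept: the fitted line passes through the point of means.
    b0 = sum_y / n - b1 * sum_x / n
    return b0, b1


# Hypothetical advertisement (x) and sales (y) figures, illustration only.
advertisement = [1, 2, 3, 4, 5]
sales = [2, 4, 5, 4, 5]

b0, b1 = fit_simple_linear_regression(advertisement, sales)
predicted_sales = b0 + b1 * 6  # prediction for an advertisement value of 6
print(round(b0, 2), round(b1, 2), round(predicted_sales, 2))
```

The slope b1 is read as the expected change in sales for a one-unit change in advertisement expense.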
Using MS Excel, Minitab, and SPSS for Simple Linear Regression
Ø Ch 15 Solved ExamplesExcelEx 15.2.xls
Ø Ch 15 Solved ExamplesMinitabEX 15.2.MPJ
Ø Ch 15 Solved ExamplesSPSSEx 15.2.sav
Ø Ch 15 Solved ExamplesSPSSOutput Ex 15.2.spv
Lecture Outline
Ø Understand the concept of ANOVA.
Ø Compute and interpret the result of one-way ANOVA.
Analysis of Variance (ANOVA)
Analysis of Variance
Ø Analysis of variance (ANOVA) is a technique for testing hypotheses about whether several population means differ significantly.
Ø In analysis of variance, the total variation in the sample data is split into two components: variance between the samples and variance within the samples.
Ø Variance between the samples is attributed to the difference among the sample means.
Ø Variance within the samples is the variation due to chance or experimental error.
Figure: Partitioning the total sum of squares of the variation for completely randomized design (one-way ANOVA)
SST (total sum of squares) = SSC (sum of squares between columns) + SSE (sum of squares within samples)
Completely Randomized Design (One-way ANOVA)
A completely randomized design contains only one independent variable, with two or more treatment levels or classifications.
Applying the F-Test Statistic
Ø In ANOVA, the F value is obtained by dividing the treatment variance (MSC) by the error variance (MSE).
Ø F test statistic in one-way ANOVA: $F = \frac{MSC}{MSE}$
Ø The F test statistic follows the F distribution with k – 1 degrees of freedom corresponding to MSC in the numerator and n – k degrees of freedom corresponding to MSE in the denominator.
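The mean squares and the F ratio can be written out as follows. This is a sketch that assumes the standard definitions: each sum of squares divided by its degrees of freedom, with k the number of treatment levels and n the total number of observations.

```latex
MSC = \frac{SSC}{k - 1}, \qquad
MSE = \frac{SSE}{n - k}, \qquad
F = \frac{MSC}{MSE}
```

A large F means the variation between sample means is large relative to the variation within samples, which is evidence against equal population means.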
The ANOVA Summary Table
Figure: Rejection and non-rejection region (acceptance region) when using ANOVA to test the null hypothesis
Example
Vishal Foods Ltd is a leading manufacturer of biscuits. The company has launched a new brand in the four metros: Delhi, Mumbai, Kolkata, and Chennai. After one month, the company realizes that there is a difference in the retail price per pack of biscuits across cities. Before the launch, the company had promised its employees and newly appointed retailers that the biscuits would be sold at a uniform price in the country. The difference in price can tarnish the image of the company. In order to make a quick inference, the company collected data about the price from three randomly selected stores across the four cities. Based on the sample information, the price per pack of the biscuits (in rupees) is given.
Use one-way ANOVA to analyse the significant difference in the prices. Take 95% as the confidence level.
Example: Continued
Table: ANOVA table
Example
Volkswagen wants to examine the safety of compact cars, midsize cars, and full-size cars. It collects a sample of three for each of the treatments (car types). Using the data provided below, test whether the mean pressure applied to the driver’s head during a crash test is equal for each type of car. Use α = 5%.

Compact Cars    Midsize Cars    Full-Size Cars
15              25              10
25              25              5
20              35              15
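The one-way ANOVA for this crash-test data can be sketched as follows. The helper name `one_way_anova` is an illustrative choice, and the critical value is taken from a standard F table for 2 and 6 degrees of freedom at α = 0.05, not computed in the code.

```python
def one_way_anova(groups):
    """Compute the one-way ANOVA sums of squares, mean squares, and F ratio."""
    k = len(groups)                       # number of treatment levels
    n = sum(len(g) for g in groups)       # total number of observations
    grand_mean = sum(sum(g) for g in groups) / n
    means = [sum(g) / len(g) for g in groups]
    # Between-columns (treatment) sum of squares.
    ssc = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    # Within-samples (error) sum of squares.
    sse = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)
    msc = ssc / (k - 1)
    mse = sse / (n - k)
    return {"SSC": ssc, "SSE": sse, "MSC": msc, "MSE": mse,
            "F": msc / mse, "df": (k - 1, n - k)}


# Columns of the table above.
compact = [15, 25, 20]
midsize = [25, 25, 35]
full_size = [10, 5, 15]

result = one_way_anova([compact, midsize, full_size])

# Table value of F at alpha = 0.05 with (2, 6) degrees of freedom.
F_CRITICAL = 5.14
reject_null = result["F"] > F_CRITICAL
print(round(result["F"], 2), result["df"], reject_null)
```

Because the computed F exceeds the table value, the null hypothesis of equal mean pressure across the three car types is rejected at the 5% level.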
Example
Students were given different drug treatments before revising for their exams. Some were given a memory drug, some a placebo drug, and some no treatment. Test whether performance differs across the three groups. The exam scores (%) are shown below for the three groups:

              Memory Drug    Placebo    No Treatment
              70             37         3
              77             43         10
              83             50         17
              90             57         23
              97             63         30
Mean          83.40          50         16.60
Grand Mean    50
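As a quick check on the table, the sketch below recomputes the group means and grand mean from the scores and confirms the partition SST = SSC + SSE for this data. Variable names are illustrative. No critical value is looked up here because the example does not state a significance level.

```python
memory_drug = [70, 77, 83, 90, 97]
placebo = [37, 43, 50, 57, 63]
no_treatment = [3, 10, 17, 23, 30]
groups = [memory_drug, placebo, no_treatment]

all_scores = [x for g in groups for x in g]
n, k = len(all_scores), len(groups)

grand_mean = sum(all_scores) / n
group_means = [sum(g) / len(g) for g in groups]

# Total, between-columns, and within-samples sums of squares.
sst = sum((x - grand_mean) ** 2 for x in all_scores)
ssc = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, group_means))
sse = sum((x - m) ** 2 for g, m in zip(groups, group_means) for x in g)

f_ratio = (ssc / (k - 1)) / (sse / (n - k))

print(group_means, grand_mean)
print(round(sst, 1), round(ssc, 1), round(sse, 1), round(f_ratio, 2))
```

Most of the total variation comes from the between-group component, which is why the F ratio is large for this data.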
Statistical Inference: Hypothesis Testing
Lecture Outline
Ø Understand hypothesis-testing procedure using one-tailed and two-tailed tests
Ø Understand the concepts of Type I and Type II errors in hypothesis testing
Ø Understand the procedure of hypothesis testing
The Concept of Normal Distribution
Introduction to Hypothesis Testing
Ø A statistical hypothesis is an assumption about an unknown population parameter.
Ø Hypothesis testing is a well-defined procedure that helps us decide objectively whether to accept or reject the hypothesis based on the information available from the sample.
Ø In statistical analysis, we use the concept of probability to specify a probability level at which a researcher concludes that the observed difference between the sample statistic and the population parameter is not due to chance.
Hypothesis Testing Procedure
Seven steps of hypothesis testing
Step 1: Set Null and Alternative Hypotheses
Ø The null hypothesis, generally denoted by H0 (H sub-zero), is the hypothesis that is tested for possible rejection under the assumption that it is true. Theoretically, a null hypothesis is set as no difference or status quo and is considered true unless it is proved wrong by the collected sample data.
Ø Symbolically, a null hypothesis is represented as:
Ø The alternative hypothesis, generally denoted by H1 (H sub-one), is the logical opposite of the null hypothesis.
Ø Symbolically, alternative hypothesis is represented as:
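For a test about a population mean, the two hypotheses are commonly written in the form below. The symbol μ0 for the hypothesised value is assumed notation, and the two-sided alternative is shown only as one typical case.

```latex
H_0 : \mu = \mu_0
\qquad\text{versus}\qquad
H_1 : \mu \neq \mu_0
```

For a one-sided question, the alternative is usually written as μ < μ0 or μ > μ0 instead.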
Step 2: Determine the Appropriate Statistical Test
Ø The type, number, and level of data may provide a basis for deciding the statistical test.
Step 3: Set the Level of Significance
Ø The level of significance, generally denoted by α, is the probability of rejecting a null hypothesis when it is true.
Ø The level of significance is also known as the size of the rejection region or the size of the critical region.
Ø The levels of significance generally applied by researchers are 0.01, 0.05, and 0.10.
Type I and Type II Errors
When a researcher tests statistical hypotheses, there can be four
possible outcomes as follows:
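The four outcomes can be enumerated with a small sketch. It assumes the standard definitions: a Type I error is rejecting a true null hypothesis, and a Type II error is failing to reject a false one. The function name `classify_outcome` is hypothetical.

```python
def classify_outcome(null_is_true: bool, null_rejected: bool) -> str:
    """Label the outcome of a test given the truth and the decision."""
    if null_is_true and null_rejected:
        return "Type I error"       # rejected a true null hypothesis
    if not null_is_true and not null_rejected:
        return "Type II error"      # failed to reject a false null hypothesis
    return "Correct decision"


# Enumerate all four combinations of truth and decision.
outcomes = {
    (truth, decision): classify_outcome(truth, decision)
    for truth in (True, False)
    for decision in (True, False)
}

for (truth, decision), label in outcomes.items():
    print(f"H0 true={truth!s:5}  H0 rejected={decision!s:5}  ->  {label}")
```

The probability of the first kind of error is the level of significance α set in Step 3.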
Step 4: Set the Decision Rule
The area under the normal curve is divided into two mutually exclusive regions. These regions are termed the acceptance region (when the null hypothesis is accepted) and the rejection region or critical region (when the null hypothesis is rejected).
Acceptance and rejection regions of null hypothesis (two-tailed test)
Two-Tailed Test of Hypothesis
Ø Let us consider the null and alternative hypotheses as below:
Ø Two-tailed tests contain the rejection region on both tails of the sampling distribution of a test statistic. This means a researcher will reject the null hypothesis if the computed sample statistic is significantly higher or lower than the hypothesized population parameter (considering both the tails, right as well as left).
Acceptance and rejection regions (alpha = 0.05)
One-Tailed Test of Hypothesis
Let us consider the null and alternative hypotheses as below:
A one-tailed test contains the rejection region on one tail of the sampling distribution of a test statistic. In case of a left-tailed test, a researcher rejects the null hypothesis if the computed sample statistic is significantly lower than the hypothesized population parameter.
In case of a right-tailed test, a researcher rejects the null hypothesis if the computed sample statistic is significantly higher than the hypothesized population parameter.
Acceptance and rejection regions for one-tailed (left) test (alpha = 0.05)
Acceptance and rejection regions for one-tailed (right) test (alpha = 0.05)
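The rejection regions described above for two-tailed and one-tailed tests can be sketched for a z statistic as follows. The sketch assumes the test statistic follows the standard normal distribution; the function names and the `tail` labels are hypothetical.

```python
from statistics import NormalDist


def z_rejection_region(alpha, tail):
    """Return (lower, upper) critical z values; None means no cut-off on that side."""
    z = NormalDist()  # standard normal distribution
    if tail == "two":
        c = z.inv_cdf(1 - alpha / 2)   # alpha is split across both tails
        return (-c, c)
    if tail == "left":
        return (z.inv_cdf(alpha), None)
    if tail == "right":
        return (None, z.inv_cdf(1 - alpha))
    raise ValueError("tail must be 'two', 'left', or 'right'")


def reject_null(z_stat, alpha, tail):
    """True if the computed z statistic falls in the rejection region."""
    lower, upper = z_rejection_region(alpha, tail)
    below = lower is not None and z_stat < lower
    above = upper is not None and z_stat > upper
    return below or above


for tail in ("two", "left", "right"):
    print(tail, z_rejection_region(0.05, tail))
```

For α = 0.05 the same computed statistic can lead to different conclusions: a z value of –1.7 falls in the left-tailed rejection region but not in the two-tailed one.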
Step 5: Collect the Sample Data
Ø In this stage, data are collected and the appropriate sample statistics are computed.
Ø The first four steps should be completed before collecting the data for the study.
Ø It is not advisable to collect the data first and then decide on the stages of hypothesis testing.
Step 6: Analyse the Data
Ø In this step, the researcher has to compute the test statistic. This involves selection of an appropriate probability distribution for a particular test.
Ø Some of the commonly used testing procedures are z, t, F, and χ².
Step 7: Arrive at a Statistical Conclusion and Business Implication
Ø In this step, the researchers draw a statistical conclusion. A statistical conclusion is a decision to accept or reject a null hypothesis.
Ø Statisticians present the information obtained using the hypothesis-testing procedure to the decision makers. Decisions are made on the basis of this information. Ultimately, a decision maker decides that a statistically significant result is a substantive result and needs to be implemented for meeting the organization’s goals.
Hypothesis Testing for a Single Population Mean Using the Z Statistic
Ø When the sample size is greater than or equal to 30.
Ø Population has a normal distribution.
Hypothesis Testing for a Single Population Mean Using the Z Statistic
A marketing research firm conducted a survey 10 years ago and found that the average household income of a particular geographic region is Rs 10,000. Mr. Ahmad, who has recently joined the firm as a vice president, has expressed doubts about the accuracy of the data. To verify the data, the firm has decided to take a random sample of 200 households, which yields a sample mean (for household income) of Rs 11,000. Assume that the population standard deviation of the household income is Rs 1200.
Verify Mr. Ahmad’s doubts using the seven steps of hypothesis testing. Let α = 0.05 (5%).
Example (Solution)
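The computation for this example can be sketched as follows, using the figures stated in the problem. A two-tailed test is assumed here, since the doubt is about accuracy in either direction; the variable names are illustrative.

```python
from math import sqrt
from statistics import NormalDist

mu_0 = 10_000    # hypothesised mean household income (Rs)
x_bar = 11_000   # sample mean (Rs)
sigma = 1_200    # population standard deviation (Rs)
n = 200          # sample size
alpha = 0.05     # level of significance

# z statistic for a single population mean with known sigma.
standard_error = sigma / sqrt(n)
z_stat = (x_bar - mu_0) / standard_error

# Two-tailed critical value (assumption: H1 is mu != 10,000).
z_critical = NormalDist().inv_cdf(1 - alpha / 2)
reject_null = abs(z_stat) > z_critical

print(round(z_stat, 2), round(z_critical, 2), reject_null)
```

The computed z lies far beyond the critical value, so under these assumptions the null hypothesis that the mean income is still Rs 10,000 is rejected.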
Hypothesis Testing for a Single Population Mean Using the t Statistic (Case of a Small Random Sample When n < 30)
When a researcher draws a small random sample (n < 30) to estimate the population mean μ, and when the population standard deviation is unknown and the population is normally distributed, the t-test can be applied.
Example
Royal Tyres has launched a new brand of tyres for tractors and claims that under normal circumstances the average life of the tyres is 40,000 km. A retailer wants to test this claim and has taken a random sample of 8 tyres. He tests the life of the tyres under normal circumstances, and the sample mean was found to be 39,750 km (S.D. = 2618.61). The results obtained are presented in Table 10.4.
Example (
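The t statistic for the Royal Tyres example can be sketched from the summary figures given in the problem. The level of significance is not stated in the text, so α = 0.05 with a two-tailed test is an assumption, and the critical value is a standard t-table entry for 7 degrees of freedom rather than a computed quantity.

```python
from math import sqrt

mu_0 = 40_000     # claimed average tyre life (km)
x_bar = 39_750    # sample mean (km)
s = 2618.61       # sample standard deviation (km)
n = 8             # sample size

# t statistic for a single population mean with unknown sigma.
standard_error = s / sqrt(n)
t_stat = (x_bar - mu_0) / standard_error
degrees_of_freedom = n - 1

# Table value of t for a two-tailed test at alpha = 0.05 with 7 df
# (alpha and the two-tailed form are assumptions).
T_CRITICAL = 2.365
reject_null = abs(t_stat) > T_CRITICAL

print(round(t_stat, 2), degrees_of_freedom, reject_null)
```

Under these assumptions the statistic falls well inside the acceptance region, so the sample gives no evidence against the company's claim.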

More Related Content

Similar to Copyright© Dorling Kinde.docx

Regression analysis ppt
Regression analysis pptRegression analysis ppt
Regression analysis ppt
Elkana Rorio
 
In the t test for independent groups, ____.we estimate µ1 µ2.docx
In the t test for independent groups, ____.we estimate µ1 µ2.docxIn the t test for independent groups, ____.we estimate µ1 µ2.docx
In the t test for independent groups, ____.we estimate µ1 µ2.docx
bradburgess22840
 

Similar to Copyright© Dorling Kinde.docx (20)

Regression analysis ppt
Regression analysis pptRegression analysis ppt
Regression analysis ppt
 
Simple Linear Regression.pptx
Simple Linear Regression.pptxSimple Linear Regression.pptx
Simple Linear Regression.pptx
 
Simple egression.pptx
Simple egression.pptxSimple egression.pptx
Simple egression.pptx
 
Simple Regression Analysis ch12.pptx
Simple Regression Analysis ch12.pptxSimple Regression Analysis ch12.pptx
Simple Regression Analysis ch12.pptx
 
Data-Analysis.pptx
Data-Analysis.pptxData-Analysis.pptx
Data-Analysis.pptx
 
Predicting an Applicant Status Using Principal Component, Discriminant and Lo...
Predicting an Applicant Status Using Principal Component, Discriminant and Lo...Predicting an Applicant Status Using Principal Component, Discriminant and Lo...
Predicting an Applicant Status Using Principal Component, Discriminant and Lo...
 
logistic regression.pdf
logistic regression.pdflogistic regression.pdf
logistic regression.pdf
 
In the t test for independent groups, ____.we estimate µ1 µ2.docx
In the t test for independent groups, ____.we estimate µ1 µ2.docxIn the t test for independent groups, ____.we estimate µ1 µ2.docx
In the t test for independent groups, ____.we estimate µ1 µ2.docx
 
Unit 03 - Consolidated.pptx
Unit 03 - Consolidated.pptxUnit 03 - Consolidated.pptx
Unit 03 - Consolidated.pptx
 
linear regression PDF.pdf
linear regression PDF.pdflinear regression PDF.pdf
linear regression PDF.pdf
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
spss
spss spss
spss
 
Correlation
Correlation  Correlation
Correlation
 
Chap013.ppt
Chap013.pptChap013.ppt
Chap013.ppt
 
Inference about means and mean differences
Inference about means and mean differencesInference about means and mean differences
Inference about means and mean differences
 
Asset Relationship - CH 8 - Regression | CMT Level 3 | Chartered Market Techn...
Asset Relationship - CH 8 - Regression | CMT Level 3 | Chartered Market Techn...Asset Relationship - CH 8 - Regression | CMT Level 3 | Chartered Market Techn...
Asset Relationship - CH 8 - Regression | CMT Level 3 | Chartered Market Techn...
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
 
Chapter 10
Chapter 10Chapter 10
Chapter 10
 
Chapter 10
Chapter 10Chapter 10
Chapter 10
 

More from melvinjrobinson2199

Construction Management Jump StartChapter 5Project Sta.docx
Construction Management Jump StartChapter 5Project Sta.docxConstruction Management Jump StartChapter 5Project Sta.docx
Construction Management Jump StartChapter 5Project Sta.docx
melvinjrobinson2199
 
Create a 1-2-page resource that will describe databases that are.docx
Create a 1-2-page resource that will describe databases that are.docxCreate a 1-2-page resource that will describe databases that are.docx
Create a 1-2-page resource that will describe databases that are.docx
melvinjrobinson2199
 
Create a 1-2 page single-spaced Analysis of Research abstract pu.docx
Create a 1-2 page single-spaced Analysis of Research abstract pu.docxCreate a 1-2 page single-spaced Analysis of Research abstract pu.docx
Create a 1-2 page single-spaced Analysis of Research abstract pu.docx
melvinjrobinson2199
 
Coventry University 385ACC (Part-time) Advanced Study fo.docx
Coventry University 385ACC (Part-time) Advanced Study fo.docxCoventry University 385ACC (Part-time) Advanced Study fo.docx
Coventry University 385ACC (Part-time) Advanced Study fo.docx
melvinjrobinson2199
 
COV-19 -Corona Virus -- What a past week in our country and globally.docx
COV-19 -Corona Virus -- What a past week in our country and globally.docxCOV-19 -Corona Virus -- What a past week in our country and globally.docx
COV-19 -Corona Virus -- What a past week in our country and globally.docx
melvinjrobinson2199
 
Course ScenarioReynolds Tool & DieReynolds Mission Stateme.docx
Course ScenarioReynolds Tool & DieReynolds Mission Stateme.docxCourse ScenarioReynolds Tool & DieReynolds Mission Stateme.docx
Course ScenarioReynolds Tool & DieReynolds Mission Stateme.docx
melvinjrobinson2199
 
COURSE REFLECTIONJune 11, 2020How has this course helped you.docx
COURSE REFLECTIONJune 11, 2020How has this course helped you.docxCOURSE REFLECTIONJune 11, 2020How has this course helped you.docx
COURSE REFLECTIONJune 11, 2020How has this course helped you.docx
melvinjrobinson2199
 
Course Reflection GuidelinesPurposeThe purpose of this assignmen.docx
Course Reflection GuidelinesPurposeThe purpose of this assignmen.docxCourse Reflection GuidelinesPurposeThe purpose of this assignmen.docx
Course Reflection GuidelinesPurposeThe purpose of this assignmen.docx
melvinjrobinson2199
 
Course Project Layers of Me” My Humanitarian Professional Pro.docx
Course Project Layers of Me” My Humanitarian Professional Pro.docxCourse Project Layers of Me” My Humanitarian Professional Pro.docx
Course Project Layers of Me” My Humanitarian Professional Pro.docx
melvinjrobinson2199
 

More from melvinjrobinson2199 (20)

Constructivism and Self-Directed Learning in Adult learners An.docx
Constructivism and Self-Directed Learning in Adult learners An.docxConstructivism and Self-Directed Learning in Adult learners An.docx
Constructivism and Self-Directed Learning in Adult learners An.docx
 
Construction Management Jump StartChapter 5Project Sta.docx
Construction Management Jump StartChapter 5Project Sta.docxConstruction Management Jump StartChapter 5Project Sta.docx
Construction Management Jump StartChapter 5Project Sta.docx
 
Create a 10- to 12-slide presentation in which you Compare .docx
Create a 10- to 12-slide presentation in which you Compare .docxCreate a 10- to 12-slide presentation in which you Compare .docx
Create a 10- to 12-slide presentation in which you Compare .docx
 
Create a 10-12-slide presentation about the role of scientist-pr.docx
Create a 10-12-slide presentation about the role of scientist-pr.docxCreate a 10-12-slide presentation about the role of scientist-pr.docx
Create a 10-12-slide presentation about the role of scientist-pr.docx
 
Create a 1-page (front and back) information fact sheet on postpartu.docx
Create a 1-page (front and back) information fact sheet on postpartu.docxCreate a 1-page (front and back) information fact sheet on postpartu.docx
Create a 1-page (front and back) information fact sheet on postpartu.docx
 
Create a 1-2-page resource that will describe databases that are.docx
Create a 1-2-page resource that will describe databases that are.docxCreate a 1-2-page resource that will describe databases that are.docx
Create a 1-2-page resource that will describe databases that are.docx
 
Create a 1-2 page single-spaced Analysis of Research abstract pu.docx
Create a 1-2 page single-spaced Analysis of Research abstract pu.docxCreate a 1-2 page single-spaced Analysis of Research abstract pu.docx
Create a 1-2 page single-spaced Analysis of Research abstract pu.docx
 
CoverTitle PageAbstractBody of Paper10-12 pages wor.docx
CoverTitle PageAbstractBody of Paper10-12 pages wor.docxCoverTitle PageAbstractBody of Paper10-12 pages wor.docx
CoverTitle PageAbstractBody of Paper10-12 pages wor.docx
 
Cover LetterA significant part of a registered nurse’s job i.docx
Cover LetterA significant part of a registered nurse’s job i.docxCover LetterA significant part of a registered nurse’s job i.docx
Cover LetterA significant part of a registered nurse’s job i.docx
 
Coventry University 385ACC (Part-time) Advanced Study fo.docx
Coventry University 385ACC (Part-time) Advanced Study fo.docxCoventry University 385ACC (Part-time) Advanced Study fo.docx
Coventry University 385ACC (Part-time) Advanced Study fo.docx
 
COV-19 -Corona Virus -- What a past week in our country and globally.docx
COV-19 -Corona Virus -- What a past week in our country and globally.docxCOV-19 -Corona Virus -- What a past week in our country and globally.docx
COV-19 -Corona Virus -- What a past week in our country and globally.docx
 
Course ScenarioReynolds Tool & DieReynolds Mission Stateme.docx
Course ScenarioReynolds Tool & DieReynolds Mission Stateme.docxCourse ScenarioReynolds Tool & DieReynolds Mission Stateme.docx
Course ScenarioReynolds Tool & DieReynolds Mission Stateme.docx
 
COURSE REFLECTIONJune 11, 2020How has this course helped you.docx
COURSE REFLECTIONJune 11, 2020How has this course helped you.docxCOURSE REFLECTIONJune 11, 2020How has this course helped you.docx
COURSE REFLECTIONJune 11, 2020How has this course helped you.docx
 
Course Reflection GuidelinesPurposeThe purpose of this assignmen.docx
Course Reflection GuidelinesPurposeThe purpose of this assignmen.docxCourse Reflection GuidelinesPurposeThe purpose of this assignmen.docx
Course Reflection GuidelinesPurposeThe purpose of this assignmen.docx
 
Course ProjectExamine the statement of cash flows for the compan.docx
Course ProjectExamine the statement of cash flows for the compan.docxCourse ProjectExamine the statement of cash flows for the compan.docx
Course ProjectExamine the statement of cash flows for the compan.docx
 
Course PHYSICAL SECURITYDiscussion Question – Primary post du.docx
Course PHYSICAL SECURITYDiscussion Question – Primary post du.docxCourse PHYSICAL SECURITYDiscussion Question – Primary post du.docx
Course PHYSICAL SECURITYDiscussion Question – Primary post du.docx
 
Course Project Layers of Me” My Humanitarian Professional Pro.docx
Course Project Layers of Me” My Humanitarian Professional Pro.docxCourse Project Layers of Me” My Humanitarian Professional Pro.docx
Course Project Layers of Me” My Humanitarian Professional Pro.docx
 
Course ObjectivesCLO #1 Assess elements of contemporary le.docx
Course ObjectivesCLO #1 Assess elements of contemporary le.docxCourse ObjectivesCLO #1 Assess elements of contemporary le.docx
Course ObjectivesCLO #1 Assess elements of contemporary le.docx
 
Course Name Intro to big data.Assignment Big data and CO.docx
Course Name Intro to big data.Assignment Big data and CO.docxCourse Name Intro to big data.Assignment Big data and CO.docx
Course Name Intro to big data.Assignment Big data and CO.docx
 
COURSE MGT211Using the Internet, and credible electronic se.docx
COURSE MGT211Using the Internet, and credible electronic se.docxCOURSE MGT211Using the Internet, and credible electronic se.docx
COURSE MGT211Using the Internet, and credible electronic se.docx
 

Recently uploaded

Recently uploaded (20)

Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 

Copyright© Dorling Kinde.docx

  • 2. . Lt d Correlation and Simple Linear Regression Analysis 1 Correlation and Simple Linear Regression Analysis C o p yr ig h t© D o rl in g K in d er sl
  • 3. ey I n d ia P vt . Lt d Correlation and Simple Linear Regression Analysis 2 Learning Objectives Upon completion of this chapter, you will be able to: Ø Use the simple linear regression equation Ø Compute the coefficient of correlation and understand its interpretation. Ø Understand the concept of measures of variation, coefficient of determination, and standard error of the estimate Ø Understand and use residual analysis for testing the assumptions of regression Ø Measure autocorrelation by using the Durbin–Watson statistic Ø Understand statistical inference about slope, correlation coefficient of the regression model, and testing the overall
  • 5. . Lt d Correlation and Simple Linear Regression Analysis 3 Measures of Association Ø Measures of association are statistics for measuring the strength of relationship between two variables. Ø Correlation measures the degree of association between two variables. Ø Karl Pearson’s coefficient of correlation is a quantitative measure of the degree of relationship between two variables. Suppose these variables are x and y, then Karl Pearson’s coefficient of correlation is defined as Ø The coefficient of correlation lies in between +1 and –1. C o p yr ig h t© D
  • 6. Figure 15.1: Interpretation of correlation coefficient
  • 8. Example 15.1: Table 15.2 shows the sales revenue and advertisement expenses of a company for the past 10 months. Find the coefficient of correlation between sales and advertisement.
  • 9. Table 15.3: Calculation of correlation coefficient between sales and advertisement
  • 10. Figure 15.9: Five examples of correlation coefficient
  • 11. Using MS Excel, Minitab and SPSS for Computing Correlation Coefficient
      - Ch 15 Solved Examples\Excel\Ex 15.1.xls
      - Ch 15 Solved Examples\Minitab\Ex 15.1.MPJ
      - Ch 15 Solved Examples\SPSS\Ex 15.1.sav
      - Ch 15 Solved Examples\SPSS\Output Ex 15.1.spv
  • 13. Introduction to Simple Linear Regression
      - Regression analysis is the process of developing a statistical model that is used to predict the value of a dependent variable from at least one independent variable.
      - In simple linear regression analysis, there are two types of variables. The variable whose value is influenced or is to be predicted is called the dependent variable, and the variable that influences the value or is used for prediction is called the independent variable.
      - In regression analysis, the independent variable is also known as the regressor, predictor, or explanatory variable, while the dependent variable is also known as the regressed or explained variable. In simple linear regression analysis, only a straight-line relationship between two variables is examined.
  • 14. A Deterministic and Probabilistic Model
      Deterministic model: $y = \beta_0 + \beta_1 x$. Probabilistic model: $y = \beta_0 + \beta_1 x + \varepsilon$.
      ε is the error of the regression line in fitting the points of the regression equation. If a point is on the regression line, the corresponding value of ε is equal to zero. If the point is not on the regression line, the value of ε measures the error.
  • 15. In the deterministic model, all the points are assumed to be on the regression line; hence, the random error ε is equal to zero in all cases. The probabilistic model includes an error term, which allows the value of y to vary for any given value of x.
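The difference between the two models can be seen by generating points both ways and measuring each point's error relative to the line. This is only a sketch: the intercept, slope, error standard deviation, and x values are hypothetical, and the error term is assumed to be normally distributed with mean zero.

```python
import random

# Hypothetical line parameters, chosen only for illustration.
beta_0, beta_1 = 2.0, 0.5
x_values = [1, 2, 3, 4, 5]


def deterministic_y(x):
    """Deterministic model: every point lies exactly on the line."""
    return beta_0 + beta_1 * x


def probabilistic_y(x, rng, error_sd=1.0):
    """Probabilistic model: the line plus a random error term epsilon."""
    epsilon = rng.gauss(0.0, error_sd)  # assumed normal error, mean zero
    return deterministic_y(x) + epsilon


def errors_from_line(xs, ys):
    """Vertical distance of each observed y from the line."""
    return [y - (beta_0 + beta_1 * x) for x, y in zip(xs, ys)]


rng = random.Random(42)  # fixed seed so the run is repeatable
det_errors = errors_from_line(x_values, [deterministic_y(x) for x in x_values])
prob_errors = errors_from_line(x_values, [probabilistic_y(x, rng) for x in x_values])

print(det_errors)   # every error is zero
print(prob_errors)  # errors scatter around zero
```

For a fixed x, repeated calls to `probabilistic_y` return different y values, which is the sense in which the error term lets y vary for a given x.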
  • 16. Figure 15.10: Error in simple regression
  • 17. Figure 15.11: Summary of the estimation process for simple linear regression.
  • 18. Example 15.2: A cable wire company has spent heavily on advertisements. The sales and advertisement expenses (in thousand rupees) for 12 randomly selected months are given in Table 14.2. Develop a regression model to predict the impact of advertisement on sales.
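A regression model of this kind is usually fitted by ordinary least squares, which is assumed here. The sketch below estimates the intercept and slope and then computes the residuals; the function name and the five data points are hypothetical, since the table for this example is not reproduced in the transcript.

```python
def fit_simple_linear_regression(x, y):
    """Ordinary least squares estimates (intercept, slope) for y = b0 + b1 * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Hypothetical data in thousand rupees (NOT the values from the example's table).
advertisement = [1, 2, 3, 4, 5]
sales = [2, 4, 5, 4, 5]

b0, b1 = fit_simple_linear_regression(advertisement, sales)
predicted = [b0 + b1 * xi for xi in advertisement]
residuals = [yi - pi for yi, pi in zip(sales, predicted)]

print(round(b0, 4), round(b1, 4))  # 2.2 0.6
```

With an intercept in the model, the least squares residuals sum to zero, which is a quick sanity check on any hand calculation.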
  • 20. Using MS Excel, Minitab, and SPSS for Simple Linear Regression
      - Ch 15 Solved Examples\Excel\Ex 15.2.xls
      - Ch 15 Solved Examples\Minitab\EX 15.2.MPJ
      - Ch 15 Solved Examples\SPSS\Ex 15.2.sav
      - Ch 15 Solved Examples\SPSS\Output Ex 15.2.spv
      Lecture Outline
      - Understand the concept of ANOVA.
      - Compute and interpret the result of one-way ANOVA.
      Analysis of Variance (ANOVA)
      Analysis of Variance
      - Analysis of variance, or ANOVA, is a technique for testing hypotheses about significant differences among several population means.
  • 21.
      - In analysis of variance, the total variation in the sample data can be attributed to two components: variance between the samples and variance within the samples.
      - Variance between the samples is attributed to the difference among the sample means.
      - Variance within the samples is the difference due to chance or experimental errors.
      Figure: Partitioning the total sum of squares of the variation for completely randomized design (one-way ANOVA)
      SST (total sum of squares) = SSC (sum of squares between columns) + SSE (sum of squares within samples)
      Completely Randomized Design (One-way ANOVA)
      A completely randomized design contains only one independent variable, with two or more treatment levels or classifications.
      Applying the F-Test Statistic
      - In ANOVA, the F value is obtained by dividing the treatment variance (MSC) by the error variance (MSE).
      - F test statistic in one-way ANOVA: $F = \mathrm{MSC} / \mathrm{MSE}$
  • 22.
      - The F test statistic follows the F distribution with k – 1 degrees of freedom corresponding to MSC in the numerator and n – k degrees of freedom corresponding to MSE in the denominator.
      The ANOVA Summary Table
      Figure: Rejection and non-rejection region (acceptance region) when using ANOVA to test null hypothesis
      Example: Vishal Foods Ltd is a leading manufacturer of biscuits. The company has launched a new brand in the four metros: Delhi, Mumbai, Kolkata, and Chennai. After one month, the company realizes that there is a difference in the retail price per pack of biscuits across cities. Before the launch, the company had promised its employees and newly appointed retailers that the biscuits would be sold at a uniform price in the country. The difference in price can tarnish the image of the company. In order to make a quick inference, the company collected data about the price from three randomly selected stores across the four cities. Based on the sample information, the price per pack of the biscuits (in rupees) is given.
  • 23. Example (continued): Use one-way ANOVA to analyse the significant difference in the prices. Take 95% as the confidence level.
      Table: ANOVA table
      Example: Volkswagen wants to examine the safety of compact cars, midsize cars, and full-size cars. It collects a sample of three for each of the treatments (car types). Using the data provided below, test whether the mean pressure applied to the driver’s head during a crash test is equal for each type of car. Use α = 5%.
      Compact Cars   Midsize Cars   Full-Size Cars
      15             25             10
      25             25             5
      20             35             15
  • 24. Example: Students were given different drug treatments before revising for their exams. Some were given a memory drug, some a placebo drug, and some no treatment. Test whether performance differs across the three groups. The exam scores (%) for the three groups are shown below:
      Memory Drug    Placebo        No Treatment
      70             37             3
      77             43             10
      83             50             17
      90             57             23
      97             63             30
      Mean 83.40     Mean 50        Mean 16.60
      Grand mean: 50
      THANK YOU!
      Statistical Inference: Hypothesis Testing
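Before moving on to hypothesis testing, the two one-way ANOVA examples above with complete data (crash-test pressure and exam scores) can be checked numerically. The sketch below applies SST = SSC + SSE and F = MSC / MSE; the function name is hypothetical, the tables are assumed to read row by row, and the critical values mentioned afterwards are standard F-table figures rather than numbers taken from the slides.

```python
def one_way_anova(groups):
    """Sums of squares, mean squares, and F for a completely randomized design."""
    k = len(groups)                    # number of treatments (columns)
    n = sum(len(g) for g in groups)    # total number of observations
    grand_mean = sum(sum(g) for g in groups) / n
    means = [sum(g) / len(g) for g in groups]
    # Between-columns and within-samples sums of squares.
    ssc = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    sse = sum((v - m) ** 2 for g, m in zip(groups, means) for v in g)
    msc = ssc / (k - 1)
    mse = sse / (n - k)
    return {"SSC": ssc, "SSE": sse, "MSC": msc, "MSE": mse,
            "F": msc / mse, "df": (k - 1, n - k)}


# Crash-test example: compact, midsize, and full-size cars.
crash = one_way_anova([[15, 25, 20], [25, 25, 35], [10, 5, 15]])
# Exam-score example: memory drug, placebo, no treatment.
exam = one_way_anova([[70, 77, 83, 90, 97], [37, 43, 50, 57, 63], [3, 10, 17, 23, 30]])

print(crash["df"], round(crash["F"], 2))  # (2, 6) 9.1
print(exam["df"], round(exam["F"], 2))    # (2, 12) 50.16
```

At α = 0.05 the tabulated critical values are about 5.14 for (2, 6) degrees of freedom and about 3.89 for (2, 12). On that basis both computed F values fall in the rejection region, so the hypothesis of equal means would be rejected in each case. The exam-score example does not state α, so 0.05 is an assumption there.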
  • 25. Lecture Outline
      - Understand hypothesis-testing procedure using one-tailed and two-tailed tests
      - Understand the concepts of Type I and Type II errors in hypothesis testing
      - Understand the procedure of hypothesis testing
      The Concept of Normal Distribution
      Introduction to Hypothesis Testing
      - A statistical hypothesis is an assumption about an unknown population parameter.
      - Hypothesis testing is a well-defined procedure which helps us to decide objectively whether to accept or reject the hypothesis based on the information available from the sample.
  • 26.
      - In statistical analysis, we use the concept of probability to specify a probability level at which a researcher concludes that the observed difference between the sample statistic and the population parameter is not due to chance.
      Hypothesis Testing Procedure
      Seven steps of hypothesis testing
      Step 1: Set Null and Alternative Hypotheses
      - The null hypothesis, generally denoted by H0 (H sub-zero), is the hypothesis that is tested for possible rejection under the assumption that it is true. Theoretically, a null hypothesis is set as no difference or status quo and is considered true unless it is proved wrong by the collected sample data.
      - Symbolically, a null hypothesis is represented as: $H_0: \mu = \mu_0$
      - The alternative hypothesis, generally denoted by H1 (H sub-one), is the logical opposite of the null hypothesis.
      - Symbolically, the alternative hypothesis is represented as: $H_1: \mu \neq \mu_0$
  • 27. Step 2: Determine the Appropriate Statistical Test
      - The type, number, and level of data may provide a platform for deciding the statistical test.
      Step 3: Set the Level of Significance
      - The level of significance, generally denoted by α, is the probability of rejecting a null hypothesis when it is true.
      - The level of significance is also known as the size of the rejection region or the size of the critical region.
      - The levels of significance generally applied by researchers are 0.01, 0.05, and 0.10.
      Type I and Type II Errors
      When a researcher tests statistical hypotheses, there can be four possible outcomes:
      - The null hypothesis is true and is not rejected (correct decision).
      - The null hypothesis is true but is rejected (Type I error, with probability α).
      - The null hypothesis is false but is not rejected (Type II error, with probability β).
      - The null hypothesis is false and is rejected (correct decision).
      Step 4: Set the Decision Rule
      The area under the normal curve is divided into two mutually exclusive regions: the acceptance region (when the null hypothesis is accepted) and the rejection region or critical region (when the null hypothesis is rejected).
  • 28. Acceptance and rejection regions of null hypothesis (two-tailed test)
      Two-Tailed Test of Hypothesis
      - Let us consider the null and alternative hypotheses as below: $H_0: \mu = \mu_0$, $H_1: \mu \neq \mu_0$
      - A two-tailed test contains rejection regions in both tails of the sampling distribution of a test statistic. This means a researcher will reject the null hypothesis if the computed sample statistic is significantly higher or lower than the hypothesized population parameter (considering both tails, right as well as left).
      Acceptance and rejection regions (alpha = 0.05)
      One-Tailed Test of Hypothesis
      Let us consider the null and alternative hypotheses as below: $H_0: \mu = \mu_0$ with $H_1: \mu < \mu_0$ (left-tailed test) or $H_1: \mu > \mu_0$ (right-tailed test)
      A one-tailed test contains the rejection region in one tail of the sampling distribution of a test statistic. In a left-tailed test, a researcher rejects the null hypothesis if the computed sample statistic is significantly lower than the hypothesized population parameter. In a right-tailed test, a researcher rejects the null hypothesis if the computed sample statistic is significantly higher than the hypothesized population parameter.
  • 29. Acceptance and rejection regions for one-tailed (left) test (alpha = 0.05)
      Acceptance and rejection regions for one-tailed (right) test (alpha = 0.05)
      Step 5: Collect the Sample Data
      - In this stage of sampling, data are collected and the appropriate sample statistics are computed.
      - The first four steps should be completed before collecting the data for the study.
      - It is not advisable to collect the data first and then decide on the stages of hypothesis testing.
      Step 6: Analyse the Data
  • 30.
      - In this step, the researcher has to compute the test statistic. This involves selecting an appropriate probability distribution for a particular test.
      - Some of the commonly used testing procedures are z, t, F, and χ².
      Step 7: Arrive at a Statistical Conclusion and Business Implication
      - In this step, the researchers draw a statistical conclusion. A statistical conclusion is a decision to accept or reject a null hypothesis.
      - Statisticians present the information obtained using the hypothesis-testing procedure to the decision makers. Decisions are made on the basis of this information. Ultimately, the decision maker decides whether a statistically significant result is also a substantive result that needs to be implemented to meet the organization’s goals.
      Hypothesis Testing for a Single Population Mean Using the Z Statistic
      - When the sample size is greater than or equal to 30 (n ≥ 30).
      - Population has a normal distribution.
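For tests based on the z statistic, the acceptance and rejection regions described in Step 4 follow from the chosen α and the direction of the test. The sketch below uses the standard library's `statistics.NormalDist`; the helper names `z_rejection_region` and `reject_null` are hypothetical.

```python
from statistics import NormalDist


def z_rejection_region(alpha, tail="two"):
    """Return (lower, upper) critical z values; None marks a tail with no rejection region."""
    std_normal = NormalDist()  # mean 0, standard deviation 1
    if tail == "two":
        z = std_normal.inv_cdf(1 - alpha / 2)  # alpha is split across both tails
        return (-z, z)
    if tail == "left":
        return (std_normal.inv_cdf(alpha), None)
    if tail == "right":
        return (None, std_normal.inv_cdf(1 - alpha))
    raise ValueError("tail must be 'two', 'left', or 'right'")


def reject_null(z_stat, alpha, tail="two"):
    """True when the computed z statistic falls in the rejection region."""
    lower, upper = z_rejection_region(alpha, tail)
    return (lower is not None and z_stat < lower) or (upper is not None and z_stat > upper)


print(z_rejection_region(0.05, "two"))    # roughly (-1.96, 1.96)
print(z_rejection_region(0.05, "left"))   # roughly (-1.645, None)
print(z_rejection_region(0.05, "right"))  # roughly (None, 1.645)
```

The same computed statistic can lead to different decisions depending on the tail: z = -1.7 is rejected in a left-tailed test at α = 0.05 but not in a two-tailed test at the same level.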
  • 31. Hypothesis Testing for a Single Population Mean Using the Z Statistic
      Example: A marketing research firm conducted a survey 10 years ago and found that the average household income of a particular geographic region is Rs 10,000. Mr. Ahmad, who has recently joined the firm as a vice president, has expressed doubts about the accuracy of the data. To verify the data, the firm has decided to take a random sample of 200 households, which yields a sample mean (for household income) of Rs 11,000. Assume that the population standard deviation of the household income is Rs 1,200. Verify Mr. Ahmad’s doubts using the seven steps of hypothesis testing. Let α = 0.05 (5%).
      Example (Solution)
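The arithmetic for this example can be sketched as follows. The worked solution is not reproduced in the transcript, so the two-tailed setup (H0: μ = 10,000 against H1: μ ≠ 10,000) is an assumption, as are the variable names.

```python
import math
from statistics import NormalDist

# Values stated in the example.
mu_0 = 10_000         # hypothesized mean household income (Rs)
sample_mean = 11_000  # sample mean (Rs)
sigma = 1_200         # population standard deviation (Rs)
n = 200               # sample size
alpha = 0.05

# z statistic for a single population mean with known sigma.
standard_error = sigma / math.sqrt(n)
z = (sample_mean - mu_0) / standard_error

# Assumed two-tailed test: reject H0 when |z| exceeds the critical value.
z_critical = NormalDist().inv_cdf(1 - alpha / 2)
reject_h0 = abs(z) > z_critical

print(round(z, 3))           # 11.785
print(round(z_critical, 2))  # 1.96
print(reject_h0)             # True
```

Under these assumptions the computed z falls far inside the rejection region, so the null hypothesis that the mean income is still Rs 10,000 would be rejected, which is consistent with Mr. Ahmad's doubts.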
  • 32. Hypothesis Testing for a Single Population Mean Using the T Statistic (Case of a Small Random Sample When n < 30)
      When a researcher draws a small random sample (n < 30) to estimate the population mean μ, the population standard deviation is unknown, and the population is normally distributed, the t-test can be applied.
      Royal Tyres has launched a new brand of tyres for tractors and claims that under normal circumstances the average life of the tyres is 40,000 km. A retailer wants to test this claim and has taken a random sample of 8 tyres, and the mean was found to be 39,750 (S.D. = 2618.61). He tests the life of the tyres under normal circumstances. The results obtained
  • 33. are presented in Table 10.4. Example Example (