This document contains excerpts from Chapter 3 and Chapter 12 of the 6th edition of the textbook "Business Statistics" by Ken Black. Chapter 3 discusses measures of shape such as skewness and the coefficient of skewness. Chapter 12 introduces regression analysis and correlation, covering topics like the Pearson correlation coefficient, least squares regression, and residual analysis. Examples are provided to demonstrate calculating the correlation coefficient and estimating the regression equation to predict costs from number of passengers for an airline.
Aris Spanos: A Note on Correlation Analysis: statistically spurious vs. non-spurious results
This note uses an empirical example to illustrate (i) how one can inadvertently derive spurious correlation results and (ii) how such results can be transformed into statistically reliable ones.
Aris Spanos: A Note on Correlation Analysis: statistically spurious vs. non-spurious results
This note uses an empirical example to illustrate (i) how one can inadvertently derive spurious correlation results and (ii) how such results can be transformed into statistically reliable ones.
If you are looking for business statistics homework help, Statisticshelpdesk is your rightest destination. Our experts are capable of solving all grades of business statistics homework with best 100% accuracy and originality. We charge reasonable.
Simple Regression Years with Midwest and Shelf Space Winter .docxbudabrooks46239
Simple Regression Years with Midwest and Shelf Space Winter 2016 Page 1
Lecture Notes for Simple Linear Regression
Problem Definition: Midwest Insurance wants to develop a model able to predict sales
according to time with the company.
Results for: MIDWEST.MTW
Data Display
Row Sales Years with Midwest xy y2 x2
1 487 3 1461 237169 9
2 445 5 2225 198025 25
3 272 2 544 73984 4
4 641 8 5128 410881 64
5 187 2 374 34969 4
6 440 6 2640 193600 36
7 346 7 2422 119716 49
8 238 1 238 56644 1
9 312 4 1248 97344 16
10 269 2 538 72361 4
11 655 9 5895 429025 81
12 563 6 3378 316969 36
y=4855 x=55 xy=26,091 y
2
=2,240,687 x
2
=329
(x)
2
= 3025
(y)
2
= 23571025
Scatterplot of Midwest Data
Graphs>Scatterplot
Years with Midwest
S
a
le
s
9876543210
700
600
500
400
300
200
Scatterplot of Sales vs Years with Midwest
Evaluate the bivariate graph to determine whether a linear relationship exists and the
nature of the relationship. What happens to y as x increases? What type of relationship do
you see?
Simple Regression Years with Midwest and Shelf Space Winter 2016 Page 2
Dialog box for developing correlation coefficient
Explore Linearity of Relationship for significance using t distribution
Pearson Product Moment
Correlation Coefficient
Stat>Basic Stat>Correlation
Correlations: Sales, Years with Midwest – Minitab readout
Pearson correlation of Sales and Years with Midwest = 0.833
P-Value = 0.001
Formula for computing correlation coefficient
2222
yynxxn
yxxyn
r
Hypothesis for t test for significant correlation
H0: =0
H1: ≠0
Decision Rule: Pvalue and critical ratio/critical value technique
Critical Ratio of t
t=
r
r
n
1
2
2
Conclusion:
Interpretation:
Simple Regression Years with Midwest and Shelf Space Winter 2016 Page 3
Simple linear regression assumes that the relationship between the dependent, y
and independent variable, x can be approximated by a straight line.
Population or Deterministic Model – For each x there is an exact value for y.
y = 0 + 1(x) +
y - value of independent variable
(x) - value of independent variable
0 - Value of population y intercept
1 - Slope of population regression line
- Epsilon represents the difference between y and y’. Epsilon also accounts for the independent
variables that affect y but are not in the model. (The .
Biostats coorelation vs rREGRESSION.DIFFERENCE BETWEEN CORRELATION AND REGRES...Payaamvohra1
CORRELATION
REGRESSION
BIOSTATISTICS
SEMESTER 8
M PHARMACY
CORRELATION VS REGRESSION
REGRESSION ANALYSIS
LINEAR AND MULTIPLE REGREISSIO
CORRELATION COEFFICIENT
If you are looking for business statistics homework help, Statisticshelpdesk is your rightest destination. Our experts are capable of solving all grades of business statistics homework with best 100% accuracy and originality. We charge reasonable.
Simple Regression Years with Midwest and Shelf Space Winter .docxbudabrooks46239
Simple Regression Years with Midwest and Shelf Space Winter 2016 Page 1
Lecture Notes for Simple Linear Regression
Problem Definition: Midwest Insurance wants to develop a model able to predict sales
according to time with the company.
Results for: MIDWEST.MTW
Data Display
Row Sales Years with Midwest xy y2 x2
1 487 3 1461 237169 9
2 445 5 2225 198025 25
3 272 2 544 73984 4
4 641 8 5128 410881 64
5 187 2 374 34969 4
6 440 6 2640 193600 36
7 346 7 2422 119716 49
8 238 1 238 56644 1
9 312 4 1248 97344 16
10 269 2 538 72361 4
11 655 9 5895 429025 81
12 563 6 3378 316969 36
y=4855 x=55 xy=26,091 y
2
=2,240,687 x
2
=329
(x)
2
= 3025
(y)
2
= 23571025
Scatterplot of Midwest Data
Graphs>Scatterplot
Years with Midwest
S
a
le
s
9876543210
700
600
500
400
300
200
Scatterplot of Sales vs Years with Midwest
Evaluate the bivariate graph to determine whether a linear relationship exists and the
nature of the relationship. What happens to y as x increases? What type of relationship do
you see?
Simple Regression Years with Midwest and Shelf Space Winter 2016 Page 2
Dialog box for developing correlation coefficient
Explore Linearity of Relationship for significance using t distribution
Pearson Product Moment
Correlation Coefficient
Stat>Basic Stat>Correlation
Correlations: Sales, Years with Midwest – Minitab readout
Pearson correlation of Sales and Years with Midwest = 0.833
P-Value = 0.001
Formula for computing correlation coefficient
2222
yynxxn
yxxyn
r
Hypothesis for t test for significant correlation
H0: =0
H1: ≠0
Decision Rule: Pvalue and critical ratio/critical value technique
Critical Ratio of t
t=
r
r
n
1
2
2
Conclusion:
Interpretation:
Simple Regression Years with Midwest and Shelf Space Winter 2016 Page 3
Simple linear regression assumes that the relationship between the dependent, y
and independent variable, x can be approximated by a straight line.
Population or Deterministic Model – For each x there is an exact value for y.
y = 0 + 1(x) +
y - value of independent variable
(x) - value of independent variable
0 - Value of population y intercept
1 - Slope of population regression line
- Epsilon represents the difference between y and y’. Epsilon also accounts for the independent
variables that affect y but are not in the model. (The .
Biostats coorelation vs rREGRESSION.DIFFERENCE BETWEEN CORRELATION AND REGRES...Payaamvohra1
CORRELATION
REGRESSION
BIOSTATISTICS
SEMESTER 8
M PHARMACY
CORRELATION VS REGRESSION
REGRESSION ANALYSIS
LINEAR AND MULTIPLE REGREISSIO
CORRELATION COEFFICIENT
Please Subscribe to this Channel for more solutions and lectures
http://www.youtube.com/onlineteaching
Chapter 10: Correlation and Regression
10.2: Regression
Complete presentation On Regression Analysis.
Proved By Three methods, Least Square Method, Deviation method by assumed mean, Deviation method By Arithmetic mean.
Finding the relationship between two quantitative variables without being able to infer causal relationships
Correlation is a statistical technique used to determine the degree to which two variables are related
Similar to Applied Business Statistics ,ken black , ch 3 part 2 (20)
The Indian economy is classified into different sectors to simplify the analysis and understanding of economic activities. For Class 10, it's essential to grasp the sectors of the Indian economy, understand their characteristics, and recognize their importance. This guide will provide detailed notes on the Sectors of the Indian Economy Class 10, using specific long-tail keywords to enhance comprehension.
For more information, visit-www.vavaclasses.com
This is a presentation by Dada Robert in a Your Skill Boost masterclass organised by the Excellence Foundation for South Sudan (EFSS) on Saturday, the 25th and Sunday, the 26th of May 2024.
He discussed the concept of quality improvement, emphasizing its applicability to various aspects of life, including personal, project, and program improvements. He defined quality as doing the right thing at the right time in the right way to achieve the best possible results and discussed the concept of the "gap" between what we know and what we do, and how this gap represents the areas we need to improve. He explained the scientific approach to quality improvement, which involves systematic performance analysis, testing and learning, and implementing change ideas. He also highlighted the importance of client focus and a team approach to quality improvement.
Operation “Blue Star” is the only event in the history of Independent India where the state went into war with its own people. Even after about 40 years it is not clear if it was culmination of states anger over people of the region, a political game of power or start of dictatorial chapter in the democratic setup.
The people of Punjab felt alienated from main stream due to denial of their just demands during a long democratic struggle since independence. As it happen all over the word, it led to militant struggle with great loss of lives of military, police and civilian personnel. Killing of Indira Gandhi and massacre of innocent Sikhs in Delhi and other India cities was also associated with this movement.
Unit 8 - Information and Communication Technology (Paper I).pdfThiyagu K
This slides describes the basic concepts of ICT, basics of Email, Emerging Technology and Digital Initiatives in Education. This presentations aligns with the UGC Paper I syllabus.
2024.06.01 Introducing a competency framework for languag learning materials ...Sandy Millin
http://sandymillin.wordpress.com/iateflwebinar2024
Published classroom materials form the basis of syllabuses, drive teacher professional development, and have a potentially huge influence on learners, teachers and education systems. All teachers also create their own materials, whether a few sentences on a blackboard, a highly-structured fully-realised online course, or anything in between. Despite this, the knowledge and skills needed to create effective language learning materials are rarely part of teacher training, and are mostly learnt by trial and error.
Knowledge and skills frameworks, generally called competency frameworks, for ELT teachers, trainers and managers have existed for a few years now. However, until I created one for my MA dissertation, there wasn’t one drawing together what we need to know and do to be able to effectively produce language learning materials.
This webinar will introduce you to my framework, highlighting the key competencies I identified from my research. It will also show how anybody involved in language teaching (any language, not just English!), teacher training, managing schools or developing language learning materials can benefit from using the framework.
Palestine last event orientationfvgnh .pptxRaedMohamed3
An EFL lesson about the current events in Palestine. It is intended to be for intermediate students who wish to increase their listening skills through a short lesson in power point.
The Roman Empire A Historical Colossus.pdfkaushalkr1407
The Roman Empire, a vast and enduring power, stands as one of history's most remarkable civilizations, leaving an indelible imprint on the world. It emerged from the Roman Republic, transitioning into an imperial powerhouse under the leadership of Augustus Caesar in 27 BCE. This transformation marked the beginning of an era defined by unprecedented territorial expansion, architectural marvels, and profound cultural influence.
The empire's roots lie in the city of Rome, founded, according to legend, by Romulus in 753 BCE. Over centuries, Rome evolved from a small settlement to a formidable republic, characterized by a complex political system with elected officials and checks on power. However, internal strife, class conflicts, and military ambitions paved the way for the end of the Republic. Julius Caesar’s dictatorship and subsequent assassination in 44 BCE created a power vacuum, leading to a civil war. Octavian, later Augustus, emerged victorious, heralding the Roman Empire’s birth.
Under Augustus, the empire experienced the Pax Romana, a 200-year period of relative peace and stability. Augustus reformed the military, established efficient administrative systems, and initiated grand construction projects. The empire's borders expanded, encompassing territories from Britain to Egypt and from Spain to the Euphrates. Roman legions, renowned for their discipline and engineering prowess, secured and maintained these vast territories, building roads, fortifications, and cities that facilitated control and integration.
The Roman Empire’s society was hierarchical, with a rigid class system. At the top were the patricians, wealthy elites who held significant political power. Below them were the plebeians, free citizens with limited political influence, and the vast numbers of slaves who formed the backbone of the economy. The family unit was central, governed by the paterfamilias, the male head who held absolute authority.
Culturally, the Romans were eclectic, absorbing and adapting elements from the civilizations they encountered, particularly the Greeks. Roman art, literature, and philosophy reflected this synthesis, creating a rich cultural tapestry. Latin, the Roman language, became the lingua franca of the Western world, influencing numerous modern languages.
Roman architecture and engineering achievements were monumental. They perfected the arch, vault, and dome, constructing enduring structures like the Colosseum, Pantheon, and aqueducts. These engineering marvels not only showcased Roman ingenuity but also served practical purposes, from public entertainment to water supply.
Synthetic Fiber Construction in lab .pptxPavel ( NSTU)
Synthetic fiber production is a fascinating and complex field that blends chemistry, engineering, and environmental science. By understanding these aspects, students can gain a comprehensive view of synthetic fiber production, its impact on society and the environment, and the potential for future innovations. Synthetic fibers play a crucial role in modern society, impacting various aspects of daily life, industry, and the environment. ynthetic fibers are integral to modern life, offering a range of benefits from cost-effectiveness and versatility to innovative applications and performance characteristics. While they pose environmental challenges, ongoing research and development aim to create more sustainable and eco-friendly alternatives. Understanding the importance of synthetic fibers helps in appreciating their role in the economy, industry, and daily life, while also emphasizing the need for sustainable practices and innovation.
Applied Business Statistics ,ken black , ch 3 part 2
1. Copyright 2010 John Wiley & Sons, Inc. 1
Copyright 2010 John Wiley & Sons, Inc.
Business Statistics, 6th ed.
by Ken Black
Chapter 3
Describing Data
Through Statistics
2. Copyright 2010 John Wiley & Sons, Inc. 2
Measures of Shape
Symmetrical – the right half is a mirror image of the
left half
Skewness – shows that the distribution lacks
symmetry; used to denote the data is sparse at one
end, and piled at the other end
Absence of symmetry
Extreme values in one side of a distribution
3. Copyright 2010 John Wiley & Sons, Inc. 3
Coefficient of Skewness
( )
dM
Sk
−
=
3
Coefficient of Skewness (Sk) - compares the mean
and median in light of the magnitude to the standard
deviation; Md is the median; Sk is coefficient of
skewness; σ is the Std Dev
4. Copyright 2010 John Wiley & Sons, Inc. 4
Coefficient of Skewness
Summary measure for skewness
If Sk < 0, the distribution is negatively skewed
(skewed to the left).
If Sk = 0, the distribution is symmetric (not skewed).
If Sk > 0, the distribution is positively skewed (skewed
to the right).
( )
d
k
M
S
−
=
3
5. Copyright 2010 John Wiley & Sons, Inc. 5
Copyright 2010 John Wiley & Sons, Inc.
Business Statistics, 6th ed.
by Ken Black
Chapter 12
Introduction to
Regression Analysis
and Correlation
6. Copyright 2010 John Wiley & Sons, Inc. 6
Learning Objectives
Compute the equation of a simple regression line from a
sample of data, and interpret the slope and intercept of
the equation.
Understand the usefulness of residual analysis in testing
the assumptions underlying regression analysis and in
examining the fit of the regression line to the data.
Compute a standard error of the estimate and interpret
its meaning.
Compute a coefficient of determination and interpret it.
Test hypotheses about the slope of the regression model
and interpret the results.
Estimate values of Y using the regression model.
7. Copyright 2010 John Wiley & Sons, Inc. 7
Regression and Correlation
Regression analysis is the process of constructing a
mathematical model or function that can be used to
predict or determine one variable by another variable.
Correlation is a measure of the degree of relatedness
of two variables.
8. Copyright 2010 John Wiley & Sons, Inc. 8
( )( )
( )( )
( ) ( )
( )( )
( ) ( )
r
SSXY
SSX SSY
X X Y Y
XY
X Y
n
n n
X X Y Y
X
X
Y
Y
=
=
− −
=
−
−
−
− −
2 2
2
2
2
2
− 1 1r
Pearson Product-Moment
Correlation Coefficient
9. Copyright 2010 John Wiley & Sons, Inc. 9
Degrees of Correlation
Correlation is a measure of the degree of relatedness
of variables
Coefficient of Correlation (r) - applicable only if both
variables being analyzed have at least an interval
level of data
10. Copyright 2010 John Wiley & Sons, Inc. 10
Degrees of Correlation
The term (r) is a measure of the linear correlation
of two variables
The number ranges from -1 to 0 to +1
Closer to +1, the higher the correlation between the
dependent and the independent variables
See the formula for Pearson Product Moment correlation
coefficient –
See slide 3-82 for the formula
11. Copyright 2010 John Wiley & Sons, Inc. 11
r < 0 r > 0
r = 0
Three Degrees of Correlation
12. Copyright 2010 John Wiley & Sons, Inc. 12
Day
Interest
X
Futures
Index
Y
1 7.43 221 55.205 48,841 1,642.03
2 7.48 222 55.950 49,284 1,660.56
3 8.00 226 64.000 51,076 1,808.00
4 7.75 225 60.063 50,625 1,743.75
5 7.60 224 57.760 50,176 1,702.40
6 7.63 223 58.217 49,729 1,701.49
7 7.68 223 58.982 49,729 1,712.64
8 7.67 226 58.829 51,076 1,733.42
9 7.59 226 57.608 51,076 1,715.34
10 8.07 235 65.125 55,225 1,896.45
11 8.03 233 64.481 54,289 1,870.99
12 8.00 241 64.000 58,081 1,928.00
Summations 92.93 2,725 720.220 619,207 21,115.07
X2 Y2 XY
Computation of r for
the Economics Example (Part 1)
13. Copyright 2010 John Wiley & Sons, Inc. 13
( )( )
( ) ( )
( )
( )( )
( )
( ) ( )
( )
r
X
X
Y
Y
XY
X Y
n
n n
=
−
−
−
=
−
−
−
=
2
2
2
2
2 2
21115 07
92 93 2725
12
720 22
12
619 207
12
92 93 2725
815
, .
.
. ,
.
.
Computation of r
Economics Example (Part 2)
14. Copyright 2010 John Wiley & Sons, Inc. 14
Computation of r
Economics Example (Part 2)
Means that 81.5% of the dependent variables
are explained by the independent variables.
Is 81.5% high or low?
15. Copyright 2010 John Wiley & Sons, Inc. 15
Bivariate (two variables) linear regression -- the most
elementary regression model
dependent variable, the variable to be predicted, usually
called Y
independent variable, the predictor or explanatory variable,
usually called X
Nonlinear relationships and regression models with
more than one independent variable can be explored
by using multiple regression models
Simple Regression Analysis
16. Copyright 2010 John Wiley & Sons, Inc. 16
Deterministic Regression Model
Y = 0 + 1X
Probabilistic Regression Model
Y = 0 + 1X +
0 and 1 are population parameters
0 and 1 are estimated by sample statistics b0 and b1
Regression Models
17. Copyright 2010 John Wiley & Sons, Inc. 17
YY
where
XY
b
b
bb
ofvaluepredictedthe=ˆ
slopesamplethe=
interceptsamplethe=:
ˆ
1
0
10
+=
Equation of the Simple Regression Line
18. Copyright 2010 John Wiley & Sons, Inc. 18
Least Squares Analysis
Least squares analysis is a process whereby a
regression model is developed by producing the
minimum sum of the squared error values
The vertical distance from each point to the line is
the error of the prediction.
The least squares regression line is the regression line
that results in the smallest sum of errors squared.
19. Copyright 2010 John Wiley & Sons, Inc. 19
( )( )
( )
( )( )
1 2 2 2
2
2b
X X X X X X
X X Y Y XY nXY
n
XY
X Y
n
n
=
− −
=
−
−
=
−
−
−
0 1 1b b bY X
Y
n
X
n
= − = −
Least Squares Analysis
20. Copyright 2010 John Wiley & Sons, Inc. 20
( )( )
( )( )
( )
SS X X Y Y XY
X Y
n
SS
n
SS
SS
XY
XX
XY
XX
X X X X
b
= − − = −
= = −
=
−
2 2
2
1
0 1 1b b bY X
Y
n
X
n
= − = −
Least Squares Analysis
21. Copyright 2010 John Wiley & Sons, Inc. 21
Number of
Passengers Cost ($1,000)
X Y X2
XY
61 4.28 3,721 261.08
63 4.08 3,969 257.04
67 4.42 4,489 296.14
69 4.17 4,761 287.73
70 4.48 4,900 313.60
74 4.30 5,476 318.20
76 4.82 5,776 366.32
81 4.70 6,561 380.70
86 5.11 7,396 439.46
91 5.13 8,281 466.83
95 5.64 9,025 535.80
97 5.56 9,409 539.32
X = 930 Y = 56.69 2
X = 73,764 XY = 4,462.22
Solving for b1 and b0 of the Regression
Line: Airline Cost Example (Part 1)
22. Copyright 2010 John Wiley & Sons, Inc. 22
Solving for b1 and b0 of the Regression
Line: Airline Cost Example (Part 2)
745.68
12
)69.56)(930(
22.462,4 =−=−=
n
YX
XYSSXY
1689
12
)930(
764,73
)( 22
2
=−=−=
n
X
XSSXX
0407.
1689
745.68
1 ===
XX
XY
SS
SS
b
57.1
12
930
)0407(.
12
69.56
10 =−=−=
n
X
b
n
Y
b
XY 0407.57.1ˆ +=
23. Copyright 2010 John Wiley & Sons, Inc. 23
SUMMARY OUTPUT
Regression Statistics
Multiple R 0.94820033
R Square 0.89908386
Adjusted R Square 0.88899225
Standard Error 0.17721746
Observations 12
ANOVA
df SS MS F Significance F
Regression 1 2.79803 2.79803 89.092179 2.7E-06
Residual 10 0.31406 0.03141
Total 11 3.11209
Coefficients Standard Error t Stat P-value
Intercept 1.56979278 0.33808 4.64322 0.0009175
Number of Passengers 0.0407016 0.00431 9.43887 2.692E-06
Airline Cost: Excel Summary Output
24. Copyright 2010 John Wiley & Sons, Inc. 24
Airline Cost: MINITAB Summary Output
25. Copyright 2010 John Wiley & Sons, Inc. 25
Residual Analysis: Airline Cost Example
Number of Predicted
Passengers Cost ($1,000) Value Residual
X Y Yˆ YY ˆ−
61 4.28 4.053 .227
63 4.08 4.134 -.054
67 4.42 4.297 .123
69 4.17 4.378 -.208
70 4.48 4.419 .061
74 4.30 4.582 -.282
76 4.82 4.663 .157
81 4.70 4.867 -.167
86 5.11 5.070 .040
91 5.13 5.274 -.144
95 5.64 5.436 .204
97 5.56 5.518 .042
−=− 001.)ˆ( YY
26. Copyright 2010 John Wiley & Sons, Inc. 26
Compute the residuals for Demonstration Problem
12.1 in which a regression model was developed to
predict the number of full-time equivalent workers
(FTEs) by the number of beds in a hospital. Analyze
the residuals by using MINITAB graphic diagnostics.
Demonstration Problem 14.2
27. Copyright 2010 John Wiley & Sons, Inc. 27
Demonstration Problem 14.2 – MINITAB
Computations for Residuals
28. Copyright 2010 John Wiley & Sons, Inc. 28
Spearman’s Rank Correlation - Analyze the degree
of association of two variables
Applicable to ordinal level data (ranks)
2
2
6
1
( 1)
: = number of pairs being correlated
= the difference in the ranks of each pair
s
n n
where n
d
d
r = −
−
Spearman’s Rank Correlation
29. Copyright 2010 John Wiley & Sons, Inc. 29
Listed below are the average prices in dollars per 100
pounds for choice spring lambs and choice heifers
over a 10-year period. The data were published by
the National Agricultural Statistics Service of the U.S.
Department of Agriculture.
Suppose the researcher want to determine the
strength of association of the prices between these
two commodities by using Spearman’s rank
correlation.
Spearman’s Rank Correlation
30. Copyright 2010 John Wiley & Sons, Inc. 30
Spearman’s Rank Correlation for
Heifer and Lamb Prices
31. Copyright 2010 John Wiley & Sons, Inc. 31
345.0
)110(10
)108(6
1
)1(
6
1 22
2
=
−
−=
−
−=
nn
d
sr
Spearman’s Rank Correlation for
Heifer and Lamb Prices
32. Copyright 2010 John Wiley & Sons, Inc. 32
The lamb prices are ranked and the heifer prices are
ranked.
The difference in ranks is computed for each year.
The differences are squared and summed, producing
∑d2 = 108.
The number of pairs, n, is 10.
The value of rs = 0.345 indicates that there is a very
modest if not poor positive correlation between lamb
and heifer prices.
Spearman’s Rank Correlation for
Heifer and Lamb Prices