SlideShare a Scribd company logo
Table of Content
Correlation
Correlation
Definition
Types of Correlation
Measures of Correlation
Applications
Regression
Definition
Types of Regression
Methods of Regression
Applications
References
• Definition: The extent (degree) and nature of the relationship between two variables is
called correlation.
• Correlation analysis is a statistical tool, that measures the closeness or strength of the
relationship between the variables.
• In correlation, two variables are inter-dependent or co-vary and we can not make distinction
between the independent and dependent variables. E.g birth weight and maternal height,
drug intake and number of days taken to cure etc.
• Correlation analysis is not only establishing relationship but also quantify it. Correlation is
unable to indicate the cause and effect relationship between two variables.
Types of Correlation
On the basis of the nature of relationship between the variables, correlation can be
categorized as
1.Positive and negative correlation.
2.Simple, partial and multiple correlation
3.Linear and non-linear
• In this, increase in one variable causes the
proportionate decrease in the other variable.
• Here the two variables move in the
opposite direction.
• E.g. demand and price of commodity. If the
price of the commodity is more, demand fall
and if price of the commodity goes down,
then the demand goes up. Here there is
negative relationship between demand and
price.
Negative Correlation
A) Depending on direction of relationship
• This correlation is also called, direct
correlation.
• In this, an increase or decrease in the value
of one variable is associated with the increase
or decrease in the value of the other.
• In this, both variables move in the same
direction.
• E.g. Predict how much a man who is 125
cm tall might weigh.
We know the man is 125 cm tall, so we draw a
line up from 125 cm to the line of best fit. We
then draw across to the weight axis. We can
predict that he weighs about 75 - 76 kg.
Positive Correlation
Simple Partial
B) Depending on the number of variables
Multiple
• In this simple correlation only
two variables are involved, and
these two variables are taken
into consideration at a time.
• E.g. yield of wheat and the
amount (dose) of fertilizers.
• Relationship between three or
more variables is studied.
• In this type only two variables
are taken into consideration while
effect of other variables are held
constant.
• E.g. the yield of maize and the
amount of fertilizers applied to it
are taken into consideration and
the effect of the other variables
such as effect of pesticides, type
of soil, availability of water etc.
are not taken into consideration.
• In multiple correlations three or
more variables are studied
simultaneously. However it consist
of measurements of relationships
between a dependable variable
and two or more independent
variable.
• Partial and multiple correlation
are mainly associated with
multivariate analysis.
•E.g. relationship between
agricultural production, rainfall and
quantity of fertilizers
Linear correlation
• Difference between these two is based
on the ratio of change between the
variables under study.
• Linear correlation: values have
constant ratio.
• E.g. X= 30, 60, 90. • Y= 10, 20, 30
Non-linear correlation
• The amount of change in one variable
doesn’t have a constant ratio to the
change in other related variable.
• E.g. If the use of fertilizer is doubled,
yield of maize crop would not be exactly
doubled.
C) Depending on the ratio of change
Measures of correlation
Scatter diagram
Graph method
Correlation
Coefficient
1
2
3
A. Scatter diagram
• This is the simplest method for confirming whether there
is any relationship between two variables by plotting
values on chart or graph.
• It is nothing but a visual representation of two variables
by points (dots) on a graph.
• In a scatter diagram one variable is taken on the X-axis
and other on the Y-axis and the data is represented in the
form of points.
• It is called as a scatter diagram because it indicates
scatter of various points (variables)
• Scatter diagram gives a general idea about existence of
correlation between two variables and type of correlation,
but it does not give correct numerical value of the
correlation.
Depending on the extent of relationship between two variables,
scatter diagrams shows
Perfect correlation Perfect negative correlation
No correlation
High negative correlation
Degree
of
Relationship
High negative correlation
Degree of Relationship Between Variables
Perfect
Correlation
Perfect Negative
Correlation
High Negative
Correlation
High Positive
Correlation
No
Correlation
• All the points lie on
a straight line.
• As the variable value
increases on X-axis
the value on Y-axis
also increases or vice
a versa.
• E.g. height and
biomass.
• In this all the points
lie on a straight line.
• As the value on X-
axis increases, the
value on Y-axis
decreases
proportionately
•e.g. Water
temperature and
amount of dissolved
oxygen.
• In this the line can
not be drawn which
is passing through
most of the plotted
points and the
points are totally
scattered. Hence
there is no
correlation between
variables of X and
Y-axis.
• In this most of the
plotted points lie
on the line and
others near to this
line
• In this, It slopes
downward
• In this most of the
plotted points lie
on the line and
others near to this
line
• In this, It slopes
upward
Scatter Diagram Representation
Note: If the plotted points are very close to each other, it indicates high degree of correlation. If the plotted points
are away from each other, it indicates low degree of correlation.
• Merits of Scatter diagram:
1. It is the simple method to find out nature of correlation between two variables.
2. It is not influenced by extreme limits
3. It is easy to understand.
• Demerits of Scatter diagram:
1. It is unable to give exact degree of correlation between two variables.
2. It is a subjective method.
3. It cannot be applied to qualitative data.
4. Scatter is the only first step in finding out the strength of correlation-ship.
Graphical Method
Numerical
and solution
This method, also known as Correlogram is very simple. The data pertaining to
two series are plotted on a graph sheet. We can find out the correlation by
examining the direction and closeness of two curves. If both the curves drawn on
the graph are moving in the same direction, it is a case of positive correlation. On
the other hand, if both the curves are moving in opposite direction, correlation is
said to be negative. If the graph does not show any definite pattern on account of
erratic fluctuations in the curves, then it shows an absence of correlation.
Find out graphically, if there is any correlation between price yield per plot
(qtls); denoted by Y and quantity of fertilizer used (kg); denote by X.
Plot No.: 1 2 3 4 5 6 7 8 9 10
Y: 3.5 4.3 5.2 5.8 6.4 7.3 7.2 7.5 7.8 8.3
X: 6 8 9 12 10 15 17 20 18 24
The two curves move in the same direction
and, moreover, they are very close to each
other, suggesting a close relationship
between price yield per plot (qtls) and
quantity of fertilizer used (kg)
B. Correlation coefficient
• Scattered diagram and graphic method only gives a rough idea about the relationship between two
variables but does not give numerical measure of correlation.The degree of relationship can be
established by calculating Karl Pearson’s coefficient, which is denoted by ‘r’
• Definition: The coefficient of correlation ‘r’ can be defined as a measure of strength of the linear
relationship between the two variables X and Y.
𝑟 =
Ʃ( X –X’) (Y−Y’ )
√Ʃ ( X –X′)
2
√Ʃ (Y−Y′ )
2
where X = Independent variable , Y= dependent variable
• X -`X = deviation from mean
• Y-`Y = deviation from the mean
Type equation here.
Characteristics of correlation coefficient
• If r>0 correlation is positive , r<0 correlation is negative and r
=0 variables are not related
• Larger the numerical value of ‘r’ more close relationship
between variables.
• The value of r ranges between (-1) and (+1).
• If r = 1, we can say that there is perfect positive relationship
• If r = -1 there is perfect negative relationship.
• If r = 0 there is no relationship at all between the two
variables.
• Relationship is perfect, which means that all the points on the
scatter diagram fall on the straight line, the value of r is +1 or –1,
depending on the direction of line. Other values of r show an
intermediate degree of relationship between the two variables.
• If the value of Y increases as the value of X increases the sign
and slope will be positive whereas if the value Y decreases as
the value of X increases, then the slope will be negative a so
there will be –ve coefficient of correlation.
• Merits of Correlation Coefficient:
1.It is the numerical measure of correlation.
2. It determines a single value which summarizes extent of linear relationship.
3. It also indicates the type of correlation
4. It depends on all the observations so give true picture
• Demerits of Correlation Coefficient :
1.It can not be computed for qualitative data such as flower, colour, honesty, beauty, intelligence etc.
2.It measures only linear relationship, but it fails to measure non-linear relationship among variables.
3.It is difficult to calculate and cannot determine cause-and-effect relationship.
4.It can concludes a positive or negative relationship even though the two variables are actually
unrelated. e.g, the age of students and their score in the examination have no relation with each other.
The two variables may show similar movements but there does not seem to be a common link between
them.
Applications of correlation
In agriculture, genetics, physiology, medicine etc. correlation is used as a tool of the analysis.
1) Agriculture: Correlation is widely used as a tool of analysis in agriculture sciences. E.g. to
estimate the role of various variables (factors) such as fertilizers, irrigation, fertility of soil etc.
on crop yield.
2) Physiology: Using regression and correlation analysis relationship between germination
time and temperature of soil, alkalinity of river water and growth of fungi, etc. can be
estimated.
3) Genetics: Correlation analysis finds a lot of application in genetics. • For instance, when
‘r’=0 (correlation coefficient) then it indicates that the concern genes are located at distance
on same chromosomes. • When r=1, it indicates that genes are linked. Thus, correlation
analysis is very important in gene mapping
REGRESSION
•This term was first used by British
Biometrician Sir Francis Galton in 1877 to
describe the laws of human inheritance.
•Regression describes the liner relationship in
quantitative terms•It is used to make predictions
about one variable based on our knowledge of
the other. The regression analysis is a statistical
tool for measuring the average relationship
between any two, or more closely related
(positively, or negatively) variables in terms of
the original units of their data.
.
A) Simple and Multiple Regression Analysis
A simple regression Analysis is concerning with two variables say, X and Y while multiple regression is
concerning with more than two variables.
Y=f(x), Y=f(x,z)
Types of Regression
Analysis
Simple and Multiple
Regression Analysis
Linear and Non linear
Regression Analysis
Total and Partial
Regression Analysis
B) Total and Partial Regression Analysis
A total regression analysis is one which is made to study the effect of all the important variables on one
another. For example, when the effect of advertising expenditure, income of the people, and price of the
goods on the volume of sales are measured it is a case of total regression analysis. S = f(A , I, P)
While A partial regression analysis, on the other hand, is one which is made to study the effect of one, or
two relevant variables (excluding the irrelevant one) on another variable.
Y = f (X but not of Z and P ); S = f (advertisement but not of price and income of the people)
C) Linear and Non linear Regression Analysis
A linear regression is one in which some change in dependent variable (Y) can be expected for the
change in independent variable (X, irrespective of the values of Y). It give rise to straight line• In studying
the way in which the yield of wheat vary in relation to change the amount of fertilizer applied, yield is
dependent variable (Y) and fertilizer level is independent variable (X) .i.e.Y=a+bx
While Non linear Regression Analysis is one which gives rise to a curved line when the data relating to
two variables are plotted on a graph paper .
METHODS OF SIMPLE REGRESSION ANALYSIS
There are two different methods of studying simple (i.e. linear and partial) regression between two
related variables.
They are : I. Graphic method, 2. Scatter diagram method and 3. Algebraic method.
1. GRAPHIC METHOD
Under this method, one or two regression lines are drawn on a graph paper to estimate the values of
one variable say, X on the basis of the given values of another variable say, Y. The regression line of Y on
X will help us in estimating the value of Y for any value of X, and the regression line of X on Y will help us
in estimating the value of X for any value of Y
(i) When there is perfect positive correlation, i.e. r = 1 (ii) When there is perfect negative correlation, i.e. r = -1
(iv) When there is a high degree of correlation.
(iii) When there is no correlation (i.e. r = 0)
(v) When there is a low degree of correlation
B) SCATTER DIAGRAM METHOD
Under this method a graph paper is taken on which the independent variable say, X is represented
along the horizontal axis, and the dependent variable say, Y is represented along the vertical axis.
The points are then plotted on the graph paper representing the various pair of values of both the
variables X and Y which give the picture of a scatter diagram with several points scattered around.
After this, two free-hand straight lines are drawn across the scattered points in such a manner that
sum of the deviations of the points on one side of a line is equal to sum of the deviations of the
points on its other side. . However, the drawl of the regression lines in such a free hand manner
involves a great deal of difficulties
B) ALGEBRIC METHOD
Under this method we are to draw the lines of best fit as the lines of regression. These, lines of
regression are called the lines of the best fit because, with reference to these lines we can get the best
estimates of the values of one variable for the specified values of the other variable. Under this
method the sum of the squares of the deviations between the given values of a variable and its
estimated values given by the concerned line of regression is the least or minimum possible.
 The line of the best fit for Y on X (Le. the regression lines of Y on X) is obtained by finding the value
of Y for any two (preferably the extreme ones) values of X through the following linear equation :
Y=a+bx
𝑦 = 𝑛𝑎 + 𝑏 𝑥
𝑥𝑦 = 𝑎 𝑥 +b 𝑥2
 the line of the best fit for X on Y (i.e. the regression line of X on Y) is obtained by finding the values
of X for any two (preferably the extreme ones) values of Y through the following linear equation:
X=a+bY
𝑥 = 𝑛𝑎 + 𝑏 𝑦
𝑥𝑦 = 𝑎 𝑦 +b 𝑦2
Methods involved under regression analysis
1
2
3
4
5
6
METHODS
Method of deviation
from the actual
Means
Method of deviation
from the assumed
means
Standard error of
estimate
Method of least
square
Normal equation
method
Galton's graphs and
their interpretation
ADVANTAGES OF REGRESSION ANALYSIS
It provides a functional relationship between two or more related variables with the
help of which we can easily estimate or predict the unknown values of one variable
from the known values of another variable.
It provides a formidable tool of statistical analysis in the field of business and
commerce where people are interested in predicting the future events viz. :
consumption, production, investment, prices, sales, profits, etc. and success of
businessmen depends very much on the degree of accuracy in their various
estimates.
It provides a valuable tool for measuring and estimating the cause and effect
relationship among the economic variables that constitute the essence of economic
theory and economic life.
 This technique is highly used in our day-to-day life and sociological studies as well
to estimate the various factors viz. birth rate, death rate, tax rate, yield rate, etc.
 Last but not the least, the regression analysis technique gives us an idea about the
relative variation of a series
Limitations of linear regression
Despite the above utilities and usefulness, the technique of regression analysis suffers
from the following serious limitations :
 It is assumed that the cause and effect relationship between the variables remains
unchanged. This assumption may not always hold good and hence estimation of the
values of a variable made on the basis of the regression equation may lead to
erroneous and misleading results.
 The functional relationship that is established between any two or more variables
on the basis of some limited data may not hold good if more and more data are
taken into consideration.
 It involves very lengthy and complicated procedure of calculations and analysis.
 It can not be used in case of qualitative phenomenon viz., honesty, crime etc.
.
REFRENCES
KALYANI PUBLISHERS
BUSINESS STATISTIC II:
AN INTRODUCTION TO BUSINESS STATISTIC: AUTHOR SURINDER KUNDU ; VETTER: DR. B. S. BODLA
BUSINESS RESEARCH METHODOLOGY : KALYANI PUBLISHERS
Thank you!

More Related Content

What's hot

One Way ANOVA and Two Way ANOVA using R
One Way ANOVA and Two Way ANOVA using ROne Way ANOVA and Two Way ANOVA using R
One Way ANOVA and Two Way ANOVA using R
Sean Stovall
 
Regression ppt
Regression pptRegression ppt
Regression ppt
Shraddha Tiwari
 
Correlation
CorrelationCorrelation
Correlation
harshika5
 
Correlation - Biostatistics
Correlation - BiostatisticsCorrelation - Biostatistics
Correlation - Biostatistics
Fahmida Swati
 
Two-Way ANOVA
Two-Way ANOVATwo-Way ANOVA
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regression
Avjinder (Avi) Kaler
 
Spearman Rank Correlation - Thiyagu
Spearman Rank Correlation - ThiyaguSpearman Rank Correlation - Thiyagu
Spearman Rank Correlation - Thiyagu
Thiyagu K
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
Abdelaziz Tayoun
 
Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and RegressionShubham Mehta
 
Correlation and Regression
Correlation and Regression Correlation and Regression
Correlation and Regression
Dr. Tushar J Bhatt
 
Regression
RegressionRegression
correlation and regression
correlation and regressioncorrelation and regression
correlation and regression
Unsa Shakir
 
Probability distribution
Probability distributionProbability distribution
Probability distribution
Rohit kumar
 
poisson distribution
poisson distributionpoisson distribution
poisson distribution
sangeeta saini
 
T test
T testT test
Multiple Correlation - Thiyagu
Multiple Correlation - ThiyaguMultiple Correlation - Thiyagu
Multiple Correlation - Thiyagu
Thiyagu K
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
bijuhari
 
correlation.ppt
correlation.pptcorrelation.ppt
correlation.ppt
NayanPatil59
 
Probability.ppt
Probability.pptProbability.ppt
Probability.ppt
Pudhuvai Baveesh
 

What's hot (20)

One Way ANOVA and Two Way ANOVA using R
One Way ANOVA and Two Way ANOVA using ROne Way ANOVA and Two Way ANOVA using R
One Way ANOVA and Two Way ANOVA using R
 
Regression ppt
Regression pptRegression ppt
Regression ppt
 
Correlation
CorrelationCorrelation
Correlation
 
Correlation - Biostatistics
Correlation - BiostatisticsCorrelation - Biostatistics
Correlation - Biostatistics
 
Two-Way ANOVA
Two-Way ANOVATwo-Way ANOVA
Two-Way ANOVA
 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regression
 
Spearman Rank Correlation - Thiyagu
Spearman Rank Correlation - ThiyaguSpearman Rank Correlation - Thiyagu
Spearman Rank Correlation - Thiyagu
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and Regression
 
Correlation and Regression
Correlation and Regression Correlation and Regression
Correlation and Regression
 
Regression
RegressionRegression
Regression
 
correlation and regression
correlation and regressioncorrelation and regression
correlation and regression
 
Probability distribution
Probability distributionProbability distribution
Probability distribution
 
poisson distribution
poisson distributionpoisson distribution
poisson distribution
 
T test
T testT test
T test
 
Multiple Correlation - Thiyagu
Multiple Correlation - ThiyaguMultiple Correlation - Thiyagu
Multiple Correlation - Thiyagu
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
F test
F testF test
F test
 
correlation.ppt
correlation.pptcorrelation.ppt
correlation.ppt
 
Probability.ppt
Probability.pptProbability.ppt
Probability.ppt
 

Similar to correlation and regression.pptx

correlationandregression1-200905162711.pdf
correlationandregression1-200905162711.pdfcorrelationandregression1-200905162711.pdf
correlationandregression1-200905162711.pdf
MuhammadAftab89
 
Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and Regression
Sir Parashurambhau College, Pune
 
Biostatistics - Correlation explanation.pptx
Biostatistics - Correlation explanation.pptxBiostatistics - Correlation explanation.pptx
Biostatistics - Correlation explanation.pptx
UVAS
 
Correlation.pptx
Correlation.pptxCorrelation.pptx
Correlation.pptx
Gauravchaudhary214677
 
Correlationanalysis
CorrelationanalysisCorrelationanalysis
Correlationanalysis
Libu Thomas
 
CORRELATION.ppt
CORRELATION.pptCORRELATION.ppt
CORRELATION.ppt
sadiakhan783184
 
Correlation analysis
Correlation analysis Correlation analysis
Correlation analysis
Misab P.T
 
P G STAT 531 Lecture 9 Correlation
P G STAT 531 Lecture 9 CorrelationP G STAT 531 Lecture 9 Correlation
P G STAT 531 Lecture 9 Correlation
Aashish Patel
 
Correlation and Regression.pptx
Correlation and Regression.pptxCorrelation and Regression.pptx
Correlation and Regression.pptx
Jayaprakash985685
 
Correlation (1)
Correlation (1)Correlation (1)
Correlation (1)
Sudhakar8086
 
13943056.ppt
13943056.ppt13943056.ppt
13943056.ppt
TinasheChatara1
 
BS_5Correlation.pptx
BS_5Correlation.pptxBS_5Correlation.pptx
BS_5Correlation.pptx
muthukrishnaveni anand
 
Correlation IN STATISTICS
Correlation IN STATISTICSCorrelation IN STATISTICS
Correlation IN STATISTICS
Kriace Ward
 
Statistics ppt
Statistics pptStatistics ppt
Statistics ppt
PMuruganBalaMurugan
 
Correlation analysis
Correlation analysis Correlation analysis
Correlation analysis
Anil Pokhrel
 
correlation-ppt [Autosaved].pptx statistics in BBA from parul University
correlation-ppt [Autosaved].pptx statistics in BBA from parul Universitycorrelation-ppt [Autosaved].pptx statistics in BBA from parul University
correlation-ppt [Autosaved].pptx statistics in BBA from parul University
PrafullRai4
 
Correlation Studies - Descriptive Studies
Correlation Studies - Descriptive StudiesCorrelation Studies - Descriptive Studies
Correlation Studies - Descriptive Studies
SalmaAsghar4
 
Correlation
CorrelationCorrelation
Correlation
Anjali Awasthi
 
MCA_UNIT-4_Computer Oriented Numerical Statistical Methods
MCA_UNIT-4_Computer Oriented Numerical Statistical MethodsMCA_UNIT-4_Computer Oriented Numerical Statistical Methods
MCA_UNIT-4_Computer Oriented Numerical Statistical Methods
Rai University
 

Similar to correlation and regression.pptx (20)

correlationandregression1-200905162711.pdf
correlationandregression1-200905162711.pdfcorrelationandregression1-200905162711.pdf
correlationandregression1-200905162711.pdf
 
Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and Regression
 
Biostatistics - Correlation explanation.pptx
Biostatistics - Correlation explanation.pptxBiostatistics - Correlation explanation.pptx
Biostatistics - Correlation explanation.pptx
 
Correlation.pptx
Correlation.pptxCorrelation.pptx
Correlation.pptx
 
Correlationanalysis
CorrelationanalysisCorrelationanalysis
Correlationanalysis
 
CORRELATION.ppt
CORRELATION.pptCORRELATION.ppt
CORRELATION.ppt
 
Correlation analysis
Correlation analysis Correlation analysis
Correlation analysis
 
P G STAT 531 Lecture 9 Correlation
P G STAT 531 Lecture 9 CorrelationP G STAT 531 Lecture 9 Correlation
P G STAT 531 Lecture 9 Correlation
 
Correlation and Regression.pptx
Correlation and Regression.pptxCorrelation and Regression.pptx
Correlation and Regression.pptx
 
Correlation (1)
Correlation (1)Correlation (1)
Correlation (1)
 
Correlation
CorrelationCorrelation
Correlation
 
13943056.ppt
13943056.ppt13943056.ppt
13943056.ppt
 
BS_5Correlation.pptx
BS_5Correlation.pptxBS_5Correlation.pptx
BS_5Correlation.pptx
 
Correlation IN STATISTICS
Correlation IN STATISTICSCorrelation IN STATISTICS
Correlation IN STATISTICS
 
Statistics ppt
Statistics pptStatistics ppt
Statistics ppt
 
Correlation analysis
Correlation analysis Correlation analysis
Correlation analysis
 
correlation-ppt [Autosaved].pptx statistics in BBA from parul University
correlation-ppt [Autosaved].pptx statistics in BBA from parul Universitycorrelation-ppt [Autosaved].pptx statistics in BBA from parul University
correlation-ppt [Autosaved].pptx statistics in BBA from parul University
 
Correlation Studies - Descriptive Studies
Correlation Studies - Descriptive StudiesCorrelation Studies - Descriptive Studies
Correlation Studies - Descriptive Studies
 
Correlation
CorrelationCorrelation
Correlation
 
MCA_UNIT-4_Computer Oriented Numerical Statistical Methods
MCA_UNIT-4_Computer Oriented Numerical Statistical MethodsMCA_UNIT-4_Computer Oriented Numerical Statistical Methods
MCA_UNIT-4_Computer Oriented Numerical Statistical Methods
 

Recently uploaded

How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
Celine George
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Thiyagu K
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
DhatriParmar
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
Atul Kumar Singh
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
SACHIN R KONDAGURI
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
Pavel ( NSTU)
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
heathfieldcps1
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
camakaiclarkmusic
 
Honest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptxHonest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptx
timhan337
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
Tamralipta Mahavidyalaya
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
Ashokrao Mane college of Pharmacy Peth-Vadgaon
 
678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf
CarlosHernanMontoyab2
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
Nguyen Thanh Tu Collection
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
Peter Windle
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
vaibhavrinwa19
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
EduSkills OECD
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 

Recently uploaded (20)

How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
 
Honest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptxHonest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptx
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
 
678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
 

correlation and regression.pptx

  • 1. Table of Content Correlation Correlation Definition Types of Correlation Measures of Correlation Applications Regression Definition Types of Regression Methods of Regression Applications References
  • 2. • Definition: The extent (degree) and nature of the relationship between two variables is called correlation. • Correlation analysis is a statistical tool, that measures the closeness or strength of the relationship between the variables. • In correlation, two variables are inter-dependent or co-vary and we can not make distinction between the independent and dependent variables. E.g birth weight and maternal height, drug intake and number of days taken to cure etc. • Correlation analysis is not only establishing relationship but also quantify it. Correlation is unable to indicate the cause and effect relationship between two variables.
  • 3. Types of Correlation On the basis of the nature of relationship between the variables, correlation can be categorized as 1.Positive and negative correlation. 2.Simple, partial and multiple correlation 3.Linear and non-linear
  • 4. • In this, increase in one variable causes the proportionate decrease in the other variable. • Here the two variables move in the opposite direction. • E.g. demand and price of commodity. If the price of the commodity is more, demand fall and if price of the commodity goes down, then the demand goes up. Here there is negative relationship between demand and price. Negative Correlation A) Depending on direction of relationship
  • 5. • This correlation is also called, direct correlation. • In this, an increase or decrease in the value of one variable is associated with the increase or decrease in the value of the other. • In this, both variables move in the same direction. • E.g. Predict how much a man who is 125 cm tall might weigh. We know the man is 125 cm tall, so we draw a line up from 125 cm to the line of best fit. We then draw across to the weight axis. We can predict that he weighs about 75 - 76 kg. Positive Correlation
  • 6. Simple Partial B) Depending on the number of variables Multiple • In this simple correlation only two variables are involved, and these two variables are taken into consideration at a time. • E.g. yield of wheat and the amount (dose) of fertilizers. • Relationship between three or more variables is studied. • In this type only two variables are taken into consideration while effect of other variables are held constant. • E.g. the yield of maize and the amount of fertilizers applied to it are taken into consideration and the effect of the other variables such as effect of pesticides, type of soil, availability of water etc. are not taken into consideration. • In multiple correlations three or more variables are studied simultaneously. However it consist of measurements of relationships between a dependable variable and two or more independent variable. • Partial and multiple correlation are mainly associated with multivariate analysis. •E.g. relationship between agricultural production, rainfall and quantity of fertilizers
  • 7. Linear correlation • Difference between these two is based on the ratio of change between the variables under study. • Linear correlation: values have constant ratio. • E.g. X= 30, 60, 90. • Y= 10, 20, 30 Non-linear correlation • The amount of change in one variable doesn’t have a constant ratio to the change in other related variable. • E.g. If the use of fertilizer is doubled, yield of maize crop would not be exactly doubled. C) Depending on the ratio of change
  • 8. Measures of correlation Scatter diagram Graph method Correlation Coefficient 1 2 3 A. Scatter diagram • This is the simplest method for confirming whether there is any relationship between two variables by plotting values on chart or graph. • It is nothing but a visual representation of two variables by points (dots) on a graph. • In a scatter diagram one variable is taken on the X-axis and other on the Y-axis and the data is represented in the form of points. • It is called as a scatter diagram because it indicates scatter of various points (variables) • Scatter diagram gives a general idea about existence of correlation between two variables and type of correlation, but it does not give correct numerical value of the correlation.
  • 9. Depending on the extent of relationship between two variables, scatter diagrams shows Perfect correlation Perfect negative correlation No correlation High negative correlation Degree of Relationship High negative correlation
  • 10. Degree of Relationship Between Variables Perfect Correlation Perfect Negative Correlation High Negative Correlation High Positive Correlation No Correlation • All the points lie on a straight line. • As the variable value increases on X-axis the value on Y-axis also increases or vice a versa. • E.g. height and biomass. • In this all the points lie on a straight line. • As the value on X- axis increases, the value on Y-axis decreases proportionately •e.g. Water temperature and amount of dissolved oxygen. • In this the line can not be drawn which is passing through most of the plotted points and the points are totally scattered. Hence there is no correlation between variables of X and Y-axis. • In this most of the plotted points lie on the line and others near to this line • In this, It slopes downward • In this most of the plotted points lie on the line and others near to this line • In this, It slopes upward
  • 11. Scatter Diagram Representation Note: If the plotted points are very close to each other, it indicates high degree of correlation. If the plotted points are away from each other, it indicates low degree of correlation.
  • 12. • Merits of Scatter diagram: 1. It is the simple method to find out nature of correlation between two variables. 2. It is not influenced by extreme limits 3. It is easy to understand. • Demerits of Scatter diagram: 1. It is unable to give exact degree of correlation between two variables. 2. It is a subjective method. 3. It cannot be applied to qualitative data. 4. Scatter is the only first step in finding out the strength of correlation-ship.
  • 13. Graphical Method Numerical and solution This method, also known as Correlogram is very simple. The data pertaining to two series are plotted on a graph sheet. We can find out the correlation by examining the direction and closeness of two curves. If both the curves drawn on the graph are moving in the same direction, it is a case of positive correlation. On the other hand, if both the curves are moving in opposite direction, correlation is said to be negative. If the graph does not show any definite pattern on account of erratic fluctuations in the curves, then it shows an absence of correlation. Find out graphically, if there is any correlation between price yield per plot (qtls); denoted by Y and quantity of fertilizer used (kg); denote by X. Plot No.: 1 2 3 4 5 6 7 8 9 10 Y: 3.5 4.3 5.2 5.8 6.4 7.3 7.2 7.5 7.8 8.3 X: 6 8 9 12 10 15 17 20 18 24 The two curves move in the same direction and, moreover, they are very close to each other, suggesting a close relationship between price yield per plot (qtls) and quantity of fertilizer used (kg)
  • 14. B. Correlation coefficient • Scattered diagram and graphic method only gives a rough idea about the relationship between two variables but does not give numerical measure of correlation.The degree of relationship can be established by calculating Karl Pearson’s coefficient, which is denoted by ‘r’ • Definition: The coefficient of correlation ‘r’ can be defined as a measure of strength of the linear relationship between the two variables X and Y. 𝑟 = Ʃ( X –X’) (Y−Y’ ) √Ʃ ( X –X′) 2 √Ʃ (Y−Y′ ) 2 where X = Independent variable , Y= dependent variable • X -`X = deviation from mean • Y-`Y = deviation from the mean Type equation here.
  • 15. Characteristics of correlation coefficient • If r>0 correlation is positive , r<0 correlation is negative and r =0 variables are not related • Larger the numerical value of ‘r’ more close relationship between variables. • The value of r ranges between (-1) and (+1). • If r = 1, we can say that there is perfect positive relationship • If r = -1 there is perfect negative relationship. • If r = 0 there is no relationship at all between the two variables. • Relationship is perfect, which means that all the points on the scatter diagram fall on the straight line, the value of r is +1 or –1, depending on the direction of line. Other values of r show an intermediate degree of relationship between the two variables. • If the value of Y increases as the value of X increases the sign and slope will be positive whereas if the value Y decreases as the value of X increases, then the slope will be negative a so there will be –ve coefficient of correlation.
  • 16. • Merits of Correlation Coefficient: 1.It is the numerical measure of correlation. 2. It determines a single value which summarizes extent of linear relationship. 3. It also indicates the type of correlation 4. It depends on all the observations so give true picture • Demerits of Correlation Coefficient : 1.It can not be computed for qualitative data such as flower, colour, honesty, beauty, intelligence etc. 2.It measures only linear relationship, but it fails to measure non-linear relationship among variables. 3.It is difficult to calculate and cannot determine cause-and-effect relationship. 4.It can concludes a positive or negative relationship even though the two variables are actually unrelated. e.g, the age of students and their score in the examination have no relation with each other. The two variables may show similar movements but there does not seem to be a common link between them.
  • 17. Applications of correlation In agriculture, genetics, physiology, medicine etc. correlation is used as a tool of the analysis. 1) Agriculture: Correlation is widely used as a tool of analysis in agriculture sciences. E.g. to estimate the role of various variables (factors) such as fertilizers, irrigation, fertility of soil etc. on crop yield. 2) Physiology: Using regression and correlation analysis relationship between germination time and temperature of soil, alkalinity of river water and growth of fungi, etc. can be estimated. 3) Genetics: Correlation analysis finds a lot of application in genetics. • For instance, when ‘r’=0 (correlation coefficient) then it indicates that the concern genes are located at distance on same chromosomes. • When r=1, it indicates that genes are linked. Thus, correlation analysis is very important in gene mapping
  • 18. REGRESSION •This term was first used by British Biometrician Sir Francis Galton in 1877 to describe the laws of human inheritance. •Regression describes the liner relationship in quantitative terms•It is used to make predictions about one variable based on our knowledge of the other. The regression analysis is a statistical tool for measuring the average relationship between any two, or more closely related (positively, or negatively) variables in terms of the original units of their data.
  • 19. . A) Simple and Multiple Regression Analysis A simple regression Analysis is concerning with two variables say, X and Y while multiple regression is concerning with more than two variables. Y=f(x), Y=f(x,z) Types of Regression Analysis Simple and Multiple Regression Analysis Linear and Non linear Regression Analysis Total and Partial Regression Analysis
  • 20. B) Total and Partial Regression Analysis A total regression analysis is one which is made to study the effect of all the important variables on one another. For example, when the effect of advertising expenditure, income of the people, and price of the goods on the volume of sales are measured it is a case of total regression analysis. S = f(A , I, P) While A partial regression analysis, on the other hand, is one which is made to study the effect of one, or two relevant variables (excluding the irrelevant one) on another variable. Y = f (X but not of Z and P ); S = f (advertisement but not of price and income of the people) C) Linear and Non linear Regression Analysis A linear regression is one in which some change in dependent variable (Y) can be expected for the change in independent variable (X, irrespective of the values of Y). It give rise to straight line• In studying the way in which the yield of wheat vary in relation to change the amount of fertilizer applied, yield is dependent variable (Y) and fertilizer level is independent variable (X) .i.e.Y=a+bx While Non linear Regression Analysis is one which gives rise to a curved line when the data relating to two variables are plotted on a graph paper .
  • 21. METHODS OF SIMPLE REGRESSION ANALYSIS There are two different methods of studying simple (i.e. linear and partial) regression between two related variables. They are : I. Graphic method, 2. Scatter diagram method and 3. Algebraic method. 1. GRAPHIC METHOD Under this method, one or two regression lines are drawn on a graph paper to estimate the values of one variable say, X on the basis of the given values of another variable say, Y. The regression line of Y on X will help us in estimating the value of Y for any value of X, and the regression line of X on Y will help us in estimating the value of X for any value of Y (i) When there is perfect positive correlation, i.e. r = 1 (ii) When there is perfect negative correlation, i.e. r = -1
  • 22. (iv) When there is a high degree of correlation. (iii) When there is no correlation (i.e. r = 0) (v) When there is a low degree of correlation
  • 23. B) SCATTER DIAGRAM METHOD Under this method a graph paper is taken on which the independent variable say, X is represented along the horizontal axis, and the dependent variable say, Y is represented along the vertical axis. The points are then plotted on the graph paper representing the various pair of values of both the variables X and Y which give the picture of a scatter diagram with several points scattered around. After this, two free-hand straight lines are drawn across the scattered points in such a manner that sum of the deviations of the points on one side of a line is equal to sum of the deviations of the points on its other side. . However, the drawl of the regression lines in such a free hand manner involves a great deal of difficulties
  • 24. B) ALGEBRIC METHOD Under this method we are to draw the lines of best fit as the lines of regression. These, lines of regression are called the lines of the best fit because, with reference to these lines we can get the best estimates of the values of one variable for the specified values of the other variable. Under this method the sum of the squares of the deviations between the given values of a variable and its estimated values given by the concerned line of regression is the least or minimum possible.  The line of the best fit for Y on X (Le. the regression lines of Y on X) is obtained by finding the value of Y for any two (preferably the extreme ones) values of X through the following linear equation : Y=a+bx 𝑦 = 𝑛𝑎 + 𝑏 𝑥 𝑥𝑦 = 𝑎 𝑥 +b 𝑥2  the line of the best fit for X on Y (i.e. the regression line of X on Y) is obtained by finding the values of X for any two (preferably the extreme ones) values of Y through the following linear equation: X=a+bY 𝑥 = 𝑛𝑎 + 𝑏 𝑦 𝑥𝑦 = 𝑎 𝑦 +b 𝑦2
  • 25. Methods involved under regression analysis 1 2 3 4 5 6 METHODS Method of deviation from the actual Means Method of deviation from the assumed means Standard error of estimate Method of least square Normal equation method Galton's graphs and their interpretation
  • 26. ADVANTAGES OF REGRESSION ANALYSIS It provides a functional relationship between two or more related variables with the help of which we can easily estimate or predict the unknown values of one variable from the known values of another variable. It provides a formidable tool of statistical analysis in the field of business and commerce where people are interested in predicting the future events viz. : consumption, production, investment, prices, sales, profits, etc. and success of businessmen depends very much on the degree of accuracy in their various estimates. It provides a valuable tool for measuring and estimating the cause and effect relationship among the economic variables that constitute the essence of economic theory and economic life.  This technique is highly used in our day-to-day life and sociological studies as well to estimate the various factors viz. birth rate, death rate, tax rate, yield rate, etc.  Last but not the least, the regression analysis technique gives us an idea about the relative variation of a series
  • 27. Limitations of linear regression Despite the above utilities and usefulness, the technique of regression analysis suffers from the following serious limitations :  It is assumed that the cause and effect relationship between the variables remains unchanged. This assumption may not always hold good and hence estimation of the values of a variable made on the basis of the regression equation may lead to erroneous and misleading results.  The functional relationship that is established between any two or more variables on the basis of some limited data may not hold good if more and more data are taken into consideration.  It involves very lengthy and complicated procedure of calculations and analysis.  It can not be used in case of qualitative phenomenon viz., honesty, crime etc. .
  • 28. REFRENCES KALYANI PUBLISHERS BUSINESS STATISTIC II: AN INTRODUCTION TO BUSINESS STATISTIC: AUTHOR SURINDER KUNDU ; VETTER: DR. B. S. BODLA BUSINESS RESEARCH METHODOLOGY : KALYANI PUBLISHERS

Editor's Notes

  1. Photo by Markus Spiske on Unsplash