SlideShare a Scribd company logo
1 of 32
Download to read offline
Correlation and Regression
Dr. Anil V Dusane
Sir Parashurambhau College
Pune, India
anildusane@gmail.com
www.careerguru.co.com
1
Correlation
• Definition: The extent (degree) of the linear relationship between two
variables is called correlation.
• Correlation analysis is a statistical tool, that measures the closeness or
strength of the relationship between the variables.
• In correlation, two variables are inter-dependent or co-vary and we can not
make distinction between the independent and dependent variables. E.g birth
weight and maternal height, drug intake and number of days taken to cure
etc.
• Correlation analysis is not only establishing relationship but also quantify it.
Correlation is unable to indicate the cause and effect relationship between
two variables. 2
Types of Correlation
On the basis of the nature of relationship between the
variables, correlation can be categorized as
1.Positive and negative correlation.
2.Simple, partial and multiple correlation
3.Linear and non-linear
3
Positive correlation
• This correlation is also called,
direct correlation.
• In this, an increase or decrease
in the value of one variable is
associated with the increase or
decrease in the value of the
other.
• In this, both variables move in
the same direction.
• E.g. number of tillers and plant
yield in wheat, plant yield and
number of pods, number of
days and height of the plant,
etc. 4
0
10
20
30
40
50
60
70
1 2 3 4 5
No. of days
No. of days
Negative correlation
• In this, increase in one variable
causes the proportionate
decrease in the other variable.
• Here the two variables move in
the opposite direction.
• E.g. supply and price of
commodity. If the supply of the
commodity is more, price fall
and if there is scarcity of the
commodity, then the price goes
up. Here there is negative
relationship between supply
and price.
5
0
20
40
60
80
100
120
1 2 3 4 5
Supply (Tonnes)
Supply (Tonnes)
Types of Correlations
• Depending of the number of variables the correlation is classified into Simple, partial
and multiple correlations.
• 1. Simple:
• In this only two variables are involved, and these two variables are taken into
consideration at a time.
• E.g. yield of wheat and the amount (dose) of fertilizers.
• 2. Partial correlation:
• Relationship between three or more variables is studied.
• In this type only two variables are taken into consideration and other variables are
excluded.
• E.g. the yield of maize and the amount of fertilizers applied to it are taken into
consideration and the effect of the other variables such as effect of pesticides, type of soil,
availability of water etc. are not taken into consideration.
6
Multiple correlations
• In this three or more variables are studied simultaneously.
• However multiple correlations consist of measurements of relationships
between a dependable variable and two or more independent variable.
• Partial and multiple correlation are mainly associated with multivariate
analysis.
• E.g. relationship between agricultural production, rainfall and quantity of
fertilizers used.
7
Liner correlation
• Linear and non-linear
correlation:
• Difference between these two is
based on the ratio of change
between the variables under
study.
• Linear correlation: values
have constant ratio.
• E.g. X= 30, 60, 90.
• Y= 10, 20, 30
8
0
10
20
30
40
50
60
70
80
90
100
1 2 3
X
X
Non-linear correlation
The amount of change in one
variable doesn’t have a
constant ratio to the change
in other related variable.
• E.g. If the use of fertilizer
is doubled, yield of maize
crop would not be exactly
doubled.
9
0
10
20
30
40
50
60
70
1 2 3 4 5
No. of days
No. of days
Measures of correlation
• Measures of correlation: There are several measures of
correlation but following three are important measures.
1.Scatter diagram
2.Graph method
3.Correlation coefficient
10
Scatter diagram
• This is the simplest method for confirming whether there is any
relationship between two variables by plotting values on chart or
graph.
• It is nothing but a visual representation of two variables by points
(dots) on a graph.
• In a scatter diagram one variable is taken on the X-axis and other on
the Y-axis and the data is represented in the form of points.
• It is called as a scatter diagram because it indicates scatter of
various points (variables).
11
Scatter diagram
• Scatter diagram gives a general idea
about existence of correlation between
two variables and type of correlation,
but it does not give correct numerical
value of the correlation.
• Depending on the extent of
relationship between two variables,
scatter diagrams shows perfect
correlation, perfect negative correlation,
no correlation, high positive and high
negative correlation.
12
0
5
10
15
20
25
30
35
0 10 20 30 40 50 60 70 80 90 100
Y
Merits of Scatter diagram
• Merits of scatter diagram:
1. It is the simple method to find out nature of correlation between two
variables.
2. It is not influenced by extreme limits
3. It is easy to understand.
• Demerits:
1. It doesn’t give correct numerical value of correlation. It is unable to give
exact degree of correlation between two variables.
2. It is a subjective method.
3. It cannot be applied to qualitative data.
4. Scatter is the only first step in finding out the strength of correlation-ship. 13
Correlation coefficient
• Scattered diagram and graphic method only gives a rough idea
about the relationship between two variables but does not give
numerical measure of correlation.
• The degree of relationship can be established by calculating
Karl Pearson’s coefficient, which is denoted by ‘r’
• Definition: The coefficient of correlation ‘r’ can be defined as a
measure of strength of the linear relationship between the two
variables X and Y.
14
Correlation coefficient
• r= ( X -X) (Y-Y )/ ( X -X)(Y-Y)
• where X = Independent variable
• Y= dependent variable
• X -X = deviation from AM
• Y-Y = deviation from the mean
• If r0, correlation is positive and r0, correlation is negative.
• r =0 variables are not related.
15
Correlation coefficient
• Larger the numerical value of ‘r’ more close relationship between
variables.
• If r = 1, we can say that there is perfect positive relationship
• If r = -1 there is perfect negative relationship.
• In general, for r 0.8 we can say that there is high correlation
• If r is between 0.3-0.8 then there is considerable correlation exists
and
• If r  0.3 we can say that there is negligible correlation.
16
Characteristics of correlation coefficient
The value of r ranges between (-1) and (+1):
• If there is no relationship at all between the two variables, then the
value is zero.
• On the other hand if the relationship is perfect, which means that all
the points on the scatter diagram fall on the straight line, the value of
r is +1 or –1, depending on the direction of line.
• Other values of r show an intermediate degree of relationship
between the two variables.
17
Characteristics of correlation coefficient
Sign of the coefficient can be positive or negative:
• It is positive when the slope of the line is positive, and it is
negative when the slope of line is negative.
• If the value of Y increases as the value of X increases the
sign will be positive on the other hand if the value Y
decreases as the value of X increases, then the slope will
be negative a so there will be –ve coefficient of
correlation.
18
Merits of Correlation coefficient
1.It is the numerical measure of correlation.
2. It determines a single value which summarizes
extent of linear relationship.
3. It also indicates the type of correlation
4. It depends on all the observations so give true
picture.
19
Demerits of Correlation coefficient
1.It can not be computed for qualitative data such
as flower colour, honesty, beauty, intelligence
etc.
2.It measures only linear relationship, but it fails
to measure non-linear relationship.
3.It is difficult to calculate.
20
Applications of correlation
• In agriculture, genetics, physiology, medicine etc. correlation is used as
a tool of the analysis.
Agriculture:
• Correlation is widely used as a tool of analysis in agriculture sciences.
• E.g. to estimate the role of various variables (factors) such as
fertilizers, irrigation, fertility of soil etc. on crop yield.
• Physiology:
• Using regression and correlation analysis relationship between
germination time and temperature of soil, alkalinity of river water and
growth of fungi, etc. can be estimated.
21
Applications of Correlations
Genetics:
• Correlation analysis finds a lot of application in
genetics.
• For instance, when ‘r’=0 (correlation coefficient) then it
indicates that the concern genes are located at distance
on same chromosomes.
• When r=1, it indicates that genes are linked. Thus,
correlation analysis is very important in gene mapping.
22
Types of Correlations
• Depending on the extent of relationship between two variables, scatter
diagrams shows perfect correlation, perfect negative correlation, no
correlation, high positive and high negative correlation.
Perfect correlation:
• All the points lie on a straight line.
• As the variable value increases on X-axis the value on Y-axis also increases
or vice a versa.
• E.g. height and biomass.
23
Types of Correlations
Perfect negative correlation:
• In this all the points lie on a straight line.
• As the value on X-axis increases, the value on Y-axis decreases
proportionately
• e.g. Water temperature and amount of dissolved oxygen.
No-correlations:
• In this the line can not be drawn which is passing through most of the
plotted points and the points are totally scattered.
• Hence there is no correlation between variables of X and Y-axis.
24
Types of Correlations
High positive correlation:
In this most of the plotted points lie on the line and others
near to this line.
High negative correlation:
The diagram is showing high negative correlation as the
slope of the lines is more than 90o and most of the points
either lie on the straight line or in close vicinity.
25
Regression
• This term was first used by Sir Francis Galton to describe the laws of human
inheritance.
• Regression describes the liner relationship in quantitative terms.
• It is used to make predictions about one variable based on our knowledge of the
other.
• The regression is divided into two categories i.e. simple regression and multiple
regressions.
• The simple regression is concerning with two variables while multiple regression
is concerning with more than two variables.
• Simple regression is further classified into linear and non-linear type regression.
26
Regression
• A linear regression is one in which some change in dependent variable
(Y) can be expected for the change in independent variable (X,
irrespective of the values of Y).
• In studying the way in which the yield of wheat vary in relation to
change the amount of fertilizer applied, yield is dependent variable
(Y) and fertilizer level is independent variable (X).
• The starting point in regression is to illustrate the relationship between
the dependent variable (weight) and independent variable (age) by
scatter diagram.
27
Regression analysis
• Regression analysis is widely used for prediction and
forecasting.
• It is also used to understand which among the independent
variables are related to the dependent variable, and to
explore the forms of these relationships.
• In restricted circumstances, regression analysis can be
used to infer causal relationships between the independent
and dependent variables.
28
Linear regression
• In statistics linear regression includes any approach to modelling
the relationship between a scalar variable y and one or more
variables denoted X, such that the model depends linearly on the
unknown parameters to be estimated from the data.
• Such a model is called a “linear model”.
• Linear regression has many practical applications.
• This is because models that depend linearly on their unknown
parameters are easier to fit than models which are non-linearly
related to their parameters.
29
Applications of linear regression
• Linear regression is widely used in biological, behavioural and social sciences to
describe possible relationships between variables.
• It ranks as one of the most important tools used in these disciplines.
Prediction or forecasting:
• Linear regression can be used to fit a predictive model to an observed data set
of y and X values.
• After developing such a model, if an additional value of X is then given without
its accompanying value of y, the fitted model can be used to make a prediction
of the value of y.
30
Applications of linear regression
Epidemiology:
• Early evidence relating tobacco smoking to mortality and morbidity came
from observational studies employing regression analysis.
• In order to reduce spurious correlations when analyzing observational data,
researchers usually include several variables in their regression models in
addition to the variable of primary interest.
• For example, suppose we have a regression model in which cigarette smoking
is the independent variable of interest, and the dependent variable is lifespan
measured in years.
31
Applications of linear regression
Environmental science:
• Linear regression finds application in a wide range of
environmental science.
• In Canada, the Environmental Effects Monitoring
Program uses statistical analyses on fish and benthic
surveys to measure the effects of pulp mill or metal
mine effluent on the aquatic ecosystem.
32

More Related Content

What's hot

design of experiments.ppt
design of experiments.pptdesign of experiments.ppt
design of experiments.ppt9814857865
 
Correlation biostatistics
Correlation biostatisticsCorrelation biostatistics
Correlation biostatisticsLekhan Lodhi
 
multiple regression
multiple regressionmultiple regression
multiple regressionPriya Sharma
 
Confounding in Experimental Design
Confounding in Experimental DesignConfounding in Experimental Design
Confounding in Experimental DesignMdShakilSikder
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression AnalysisSalim Azad
 
Karl pearson's correlation
Karl pearson's correlationKarl pearson's correlation
Karl pearson's correlationfairoos1
 
Application of excel and spss programme in statistical
Application of excel and spss programme in statisticalApplication of excel and spss programme in statistical
Application of excel and spss programme in statisticalVeenaV29
 
Definition of dispersion
Definition of dispersionDefinition of dispersion
Definition of dispersionShah Alam Asim
 
Applications of sas and minitab in data analysis
Applications of sas and minitab in data analysisApplications of sas and minitab in data analysis
Applications of sas and minitab in data analysisVeenaV29
 
Non parametric test
Non parametric testNon parametric test
Non parametric testNeetathakur3
 
Simple & Multiple Regression Analysis
Simple & Multiple Regression AnalysisSimple & Multiple Regression Analysis
Simple & Multiple Regression AnalysisShailendra Tomar
 
DATA GRAPHICS 8th Sem.pdf
DATA GRAPHICS 8th Sem.pdfDATA GRAPHICS 8th Sem.pdf
DATA GRAPHICS 8th Sem.pdfRavinandan A P
 

What's hot (20)

Degree of freedom.pptx
Degree of freedom.pptxDegree of freedom.pptx
Degree of freedom.pptx
 
design of experiments.ppt
design of experiments.pptdesign of experiments.ppt
design of experiments.ppt
 
Correlation biostatistics
Correlation biostatisticsCorrelation biostatistics
Correlation biostatistics
 
Biostatistics Measures of dispersion
Biostatistics Measures of dispersionBiostatistics Measures of dispersion
Biostatistics Measures of dispersion
 
multiple regression
multiple regressionmultiple regression
multiple regression
 
Confounding in Experimental Design
Confounding in Experimental DesignConfounding in Experimental Design
Confounding in Experimental Design
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
 
Karl pearson's correlation
Karl pearson's correlationKarl pearson's correlation
Karl pearson's correlation
 
Application of excel and spss programme in statistical
Application of excel and spss programme in statisticalApplication of excel and spss programme in statistical
Application of excel and spss programme in statistical
 
Definition of dispersion
Definition of dispersionDefinition of dispersion
Definition of dispersion
 
Correlation
CorrelationCorrelation
Correlation
 
Regression ppt
Regression pptRegression ppt
Regression ppt
 
Correlation
CorrelationCorrelation
Correlation
 
Measure of Dispersion in statistics
Measure of Dispersion in statisticsMeasure of Dispersion in statistics
Measure of Dispersion in statistics
 
Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and Regression
 
Applications of sas and minitab in data analysis
Applications of sas and minitab in data analysisApplications of sas and minitab in data analysis
Applications of sas and minitab in data analysis
 
Non parametric test
Non parametric testNon parametric test
Non parametric test
 
Regression
RegressionRegression
Regression
 
Simple & Multiple Regression Analysis
Simple & Multiple Regression AnalysisSimple & Multiple Regression Analysis
Simple & Multiple Regression Analysis
 
DATA GRAPHICS 8th Sem.pdf
DATA GRAPHICS 8th Sem.pdfDATA GRAPHICS 8th Sem.pdf
DATA GRAPHICS 8th Sem.pdf
 

Similar to correlationandregression1-200905162711.pdf

Biostatistics - Correlation explanation.pptx
Biostatistics - Correlation explanation.pptxBiostatistics - Correlation explanation.pptx
Biostatistics - Correlation explanation.pptxUVAS
 
Correlation and Regression.pptx
Correlation and Regression.pptxCorrelation and Regression.pptx
Correlation and Regression.pptxJayaprakash985685
 
P G STAT 531 Lecture 9 Correlation
P G STAT 531 Lecture 9 CorrelationP G STAT 531 Lecture 9 Correlation
P G STAT 531 Lecture 9 CorrelationAashish Patel
 
Correlation analysis
Correlation analysis Correlation analysis
Correlation analysis Misab P.T
 
Correlation analysis
Correlation analysis Correlation analysis
Correlation analysis Anil Pokhrel
 
Correlationanalysis
CorrelationanalysisCorrelationanalysis
CorrelationanalysisLibu Thomas
 
correlation and regression
correlation and regressioncorrelation and regression
correlation and regressionKeyur Tejani
 
Correlation IN STATISTICS
Correlation IN STATISTICSCorrelation IN STATISTICS
Correlation IN STATISTICSKriace Ward
 
correlation-ppt [Autosaved].pptx statistics in BBA from parul University
correlation-ppt [Autosaved].pptx statistics in BBA from parul Universitycorrelation-ppt [Autosaved].pptx statistics in BBA from parul University
correlation-ppt [Autosaved].pptx statistics in BBA from parul UniversityPrafullRai4
 
Correlation Studies - Descriptive Studies
Correlation Studies - Descriptive StudiesCorrelation Studies - Descriptive Studies
Correlation Studies - Descriptive StudiesSalmaAsghar4
 
STATISTICAL REGRESSION MODELS
STATISTICAL REGRESSION MODELSSTATISTICAL REGRESSION MODELS
STATISTICAL REGRESSION MODELSAneesa K Ayoob
 
Correlation Analysis
Correlation AnalysisCorrelation Analysis
Correlation AnalysisSaqib Ali
 
Correlationanalysis
CorrelationanalysisCorrelationanalysis
CorrelationanalysisLibu Thomas
 

Similar to correlationandregression1-200905162711.pdf (20)

correlation and regression.pptx
correlation and regression.pptxcorrelation and regression.pptx
correlation and regression.pptx
 
Biostatistics - Correlation explanation.pptx
Biostatistics - Correlation explanation.pptxBiostatistics - Correlation explanation.pptx
Biostatistics - Correlation explanation.pptx
 
CORRELATION.ppt
CORRELATION.pptCORRELATION.ppt
CORRELATION.ppt
 
Correlation analysis
Correlation analysisCorrelation analysis
Correlation analysis
 
Correlation (1)
Correlation (1)Correlation (1)
Correlation (1)
 
Correlation and Regression.pptx
Correlation and Regression.pptxCorrelation and Regression.pptx
Correlation and Regression.pptx
 
Correlation
CorrelationCorrelation
Correlation
 
P G STAT 531 Lecture 9 Correlation
P G STAT 531 Lecture 9 CorrelationP G STAT 531 Lecture 9 Correlation
P G STAT 531 Lecture 9 Correlation
 
Correlation analysis
Correlation analysis Correlation analysis
Correlation analysis
 
Correlation analysis
Correlation analysis Correlation analysis
Correlation analysis
 
Correlationanalysis
CorrelationanalysisCorrelationanalysis
Correlationanalysis
 
13943056.ppt
13943056.ppt13943056.ppt
13943056.ppt
 
correlation and regression
correlation and regressioncorrelation and regression
correlation and regression
 
Correlation.pptx
Correlation.pptxCorrelation.pptx
Correlation.pptx
 
Correlation IN STATISTICS
Correlation IN STATISTICSCorrelation IN STATISTICS
Correlation IN STATISTICS
 
correlation-ppt [Autosaved].pptx statistics in BBA from parul University
correlation-ppt [Autosaved].pptx statistics in BBA from parul Universitycorrelation-ppt [Autosaved].pptx statistics in BBA from parul University
correlation-ppt [Autosaved].pptx statistics in BBA from parul University
 
Correlation Studies - Descriptive Studies
Correlation Studies - Descriptive StudiesCorrelation Studies - Descriptive Studies
Correlation Studies - Descriptive Studies
 
STATISTICAL REGRESSION MODELS
STATISTICAL REGRESSION MODELSSTATISTICAL REGRESSION MODELS
STATISTICAL REGRESSION MODELS
 
Correlation Analysis
Correlation AnalysisCorrelation Analysis
Correlation Analysis
 
Correlationanalysis
CorrelationanalysisCorrelationanalysis
Correlationanalysis
 

Recently uploaded

2024 Numerator Consumer Study of Cannabis Usage
2024 Numerator Consumer Study of Cannabis Usage2024 Numerator Consumer Study of Cannabis Usage
2024 Numerator Consumer Study of Cannabis UsageNeil Kimberley
 
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...anilsa9823
 
Progress Report - Oracle Database Analyst Summit
Progress  Report - Oracle Database Analyst SummitProgress  Report - Oracle Database Analyst Summit
Progress Report - Oracle Database Analyst SummitHolger Mueller
 
BEST Call Girls In Greater Noida ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Greater Noida ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,BEST Call Girls In Greater Noida ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Greater Noida ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,noida100girls
 
7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...Paul Menig
 
Regression analysis: Simple Linear Regression Multiple Linear Regression
Regression analysis:  Simple Linear Regression Multiple Linear RegressionRegression analysis:  Simple Linear Regression Multiple Linear Regression
Regression analysis: Simple Linear Regression Multiple Linear RegressionRavindra Nath Shukla
 
Lowrate Call Girls In Sector 18 Noida ❤️8860477959 Escorts 100% Genuine Servi...
Lowrate Call Girls In Sector 18 Noida ❤️8860477959 Escorts 100% Genuine Servi...Lowrate Call Girls In Sector 18 Noida ❤️8860477959 Escorts 100% Genuine Servi...
Lowrate Call Girls In Sector 18 Noida ❤️8860477959 Escorts 100% Genuine Servi...lizamodels9
 
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...lizamodels9
 
Sales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessSales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessAggregage
 
/:Call Girls In Jaypee Siddharth - 5 Star Hotel New Delhi ➥9990211544 Top Esc...
/:Call Girls In Jaypee Siddharth - 5 Star Hotel New Delhi ➥9990211544 Top Esc.../:Call Girls In Jaypee Siddharth - 5 Star Hotel New Delhi ➥9990211544 Top Esc...
/:Call Girls In Jaypee Siddharth - 5 Star Hotel New Delhi ➥9990211544 Top Esc...lizamodels9
 
Insurers' journeys to build a mastery in the IoT usage
Insurers' journeys to build a mastery in the IoT usageInsurers' journeys to build a mastery in the IoT usage
Insurers' journeys to build a mastery in the IoT usageMatteo Carbone
 
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Grateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfGrateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfPaul Menig
 
Pharma Works Profile of Karan Communications
Pharma Works Profile of Karan CommunicationsPharma Works Profile of Karan Communications
Pharma Works Profile of Karan Communicationskarancommunications
 
rishikeshgirls.in- Rishikesh call girl.pdf
rishikeshgirls.in- Rishikesh call girl.pdfrishikeshgirls.in- Rishikesh call girl.pdf
rishikeshgirls.in- Rishikesh call girl.pdfmuskan1121w
 
GD Birla and his contribution in management
GD Birla and his contribution in managementGD Birla and his contribution in management
GD Birla and his contribution in managementchhavia330
 
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...Dave Litwiller
 
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,noida100girls
 
Cash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call GirlsCash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call GirlsApsara Of India
 

Recently uploaded (20)

2024 Numerator Consumer Study of Cannabis Usage
2024 Numerator Consumer Study of Cannabis Usage2024 Numerator Consumer Study of Cannabis Usage
2024 Numerator Consumer Study of Cannabis Usage
 
KestrelPro Flyer Japan IT Week 2024 (English)
KestrelPro Flyer Japan IT Week 2024 (English)KestrelPro Flyer Japan IT Week 2024 (English)
KestrelPro Flyer Japan IT Week 2024 (English)
 
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
 
Progress Report - Oracle Database Analyst Summit
Progress  Report - Oracle Database Analyst SummitProgress  Report - Oracle Database Analyst Summit
Progress Report - Oracle Database Analyst Summit
 
BEST Call Girls In Greater Noida ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Greater Noida ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,BEST Call Girls In Greater Noida ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Greater Noida ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
 
7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...
 
Regression analysis: Simple Linear Regression Multiple Linear Regression
Regression analysis:  Simple Linear Regression Multiple Linear RegressionRegression analysis:  Simple Linear Regression Multiple Linear Regression
Regression analysis: Simple Linear Regression Multiple Linear Regression
 
Lowrate Call Girls In Sector 18 Noida ❤️8860477959 Escorts 100% Genuine Servi...
Lowrate Call Girls In Sector 18 Noida ❤️8860477959 Escorts 100% Genuine Servi...Lowrate Call Girls In Sector 18 Noida ❤️8860477959 Escorts 100% Genuine Servi...
Lowrate Call Girls In Sector 18 Noida ❤️8860477959 Escorts 100% Genuine Servi...
 
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...
 
Sales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessSales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for Success
 
/:Call Girls In Jaypee Siddharth - 5 Star Hotel New Delhi ➥9990211544 Top Esc...
/:Call Girls In Jaypee Siddharth - 5 Star Hotel New Delhi ➥9990211544 Top Esc.../:Call Girls In Jaypee Siddharth - 5 Star Hotel New Delhi ➥9990211544 Top Esc...
/:Call Girls In Jaypee Siddharth - 5 Star Hotel New Delhi ➥9990211544 Top Esc...
 
Insurers' journeys to build a mastery in the IoT usage
Insurers' journeys to build a mastery in the IoT usageInsurers' journeys to build a mastery in the IoT usage
Insurers' journeys to build a mastery in the IoT usage
 
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
 
Grateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfGrateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdf
 
Pharma Works Profile of Karan Communications
Pharma Works Profile of Karan CommunicationsPharma Works Profile of Karan Communications
Pharma Works Profile of Karan Communications
 
rishikeshgirls.in- Rishikesh call girl.pdf
rishikeshgirls.in- Rishikesh call girl.pdfrishikeshgirls.in- Rishikesh call girl.pdf
rishikeshgirls.in- Rishikesh call girl.pdf
 
GD Birla and his contribution in management
GD Birla and his contribution in managementGD Birla and his contribution in management
GD Birla and his contribution in management
 
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
 
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
 
Cash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call GirlsCash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call Girls
 

correlationandregression1-200905162711.pdf

  • 1. Correlation and Regression Dr. Anil V Dusane Sir Parashurambhau College Pune, India anildusane@gmail.com www.careerguru.co.com 1
  • 2. Correlation • Definition: The extent (degree) of the linear relationship between two variables is called correlation. • Correlation analysis is a statistical tool, that measures the closeness or strength of the relationship between the variables. • In correlation, two variables are inter-dependent or co-vary and we can not make distinction between the independent and dependent variables. E.g birth weight and maternal height, drug intake and number of days taken to cure etc. • Correlation analysis is not only establishing relationship but also quantify it. Correlation is unable to indicate the cause and effect relationship between two variables. 2
  • 3. Types of Correlation On the basis of the nature of relationship between the variables, correlation can be categorized as 1.Positive and negative correlation. 2.Simple, partial and multiple correlation 3.Linear and non-linear 3
  • 4. Positive correlation • This correlation is also called, direct correlation. • In this, an increase or decrease in the value of one variable is associated with the increase or decrease in the value of the other. • In this, both variables move in the same direction. • E.g. number of tillers and plant yield in wheat, plant yield and number of pods, number of days and height of the plant, etc. 4 0 10 20 30 40 50 60 70 1 2 3 4 5 No. of days No. of days
  • 5. Negative correlation • In this, increase in one variable causes the proportionate decrease in the other variable. • Here the two variables move in the opposite direction. • E.g. supply and price of commodity. If the supply of the commodity is more, price fall and if there is scarcity of the commodity, then the price goes up. Here there is negative relationship between supply and price. 5 0 20 40 60 80 100 120 1 2 3 4 5 Supply (Tonnes) Supply (Tonnes)
  • 6. Types of Correlations • Depending of the number of variables the correlation is classified into Simple, partial and multiple correlations. • 1. Simple: • In this only two variables are involved, and these two variables are taken into consideration at a time. • E.g. yield of wheat and the amount (dose) of fertilizers. • 2. Partial correlation: • Relationship between three or more variables is studied. • In this type only two variables are taken into consideration and other variables are excluded. • E.g. the yield of maize and the amount of fertilizers applied to it are taken into consideration and the effect of the other variables such as effect of pesticides, type of soil, availability of water etc. are not taken into consideration. 6
  • 7. Multiple correlations • In this three or more variables are studied simultaneously. • However multiple correlations consist of measurements of relationships between a dependable variable and two or more independent variable. • Partial and multiple correlation are mainly associated with multivariate analysis. • E.g. relationship between agricultural production, rainfall and quantity of fertilizers used. 7
  • 8. Liner correlation • Linear and non-linear correlation: • Difference between these two is based on the ratio of change between the variables under study. • Linear correlation: values have constant ratio. • E.g. X= 30, 60, 90. • Y= 10, 20, 30 8 0 10 20 30 40 50 60 70 80 90 100 1 2 3 X X
  • 9. Non-linear correlation The amount of change in one variable doesn’t have a constant ratio to the change in other related variable. • E.g. If the use of fertilizer is doubled, yield of maize crop would not be exactly doubled. 9 0 10 20 30 40 50 60 70 1 2 3 4 5 No. of days No. of days
  • 10. Measures of correlation • Measures of correlation: There are several measures of correlation but following three are important measures. 1.Scatter diagram 2.Graph method 3.Correlation coefficient 10
  • 11. Scatter diagram • This is the simplest method for confirming whether there is any relationship between two variables by plotting values on chart or graph. • It is nothing but a visual representation of two variables by points (dots) on a graph. • In a scatter diagram one variable is taken on the X-axis and other on the Y-axis and the data is represented in the form of points. • It is called as a scatter diagram because it indicates scatter of various points (variables). 11
  • 12. Scatter diagram • Scatter diagram gives a general idea about existence of correlation between two variables and type of correlation, but it does not give correct numerical value of the correlation. • Depending on the extent of relationship between two variables, scatter diagrams shows perfect correlation, perfect negative correlation, no correlation, high positive and high negative correlation. 12 0 5 10 15 20 25 30 35 0 10 20 30 40 50 60 70 80 90 100 Y
  • 13. Merits of Scatter diagram • Merits of scatter diagram: 1. It is the simple method to find out nature of correlation between two variables. 2. It is not influenced by extreme limits 3. It is easy to understand. • Demerits: 1. It doesn’t give correct numerical value of correlation. It is unable to give exact degree of correlation between two variables. 2. It is a subjective method. 3. It cannot be applied to qualitative data. 4. Scatter is the only first step in finding out the strength of correlation-ship. 13
  • 14. Correlation coefficient • Scattered diagram and graphic method only gives a rough idea about the relationship between two variables but does not give numerical measure of correlation. • The degree of relationship can be established by calculating Karl Pearson’s coefficient, which is denoted by ‘r’ • Definition: The coefficient of correlation ‘r’ can be defined as a measure of strength of the linear relationship between the two variables X and Y. 14
  • 15. Correlation coefficient • r= ( X -X) (Y-Y )/ ( X -X)(Y-Y) • where X = Independent variable • Y= dependent variable • X -X = deviation from AM • Y-Y = deviation from the mean • If r0, correlation is positive and r0, correlation is negative. • r =0 variables are not related. 15
  • 16. Correlation coefficient • Larger the numerical value of ‘r’ more close relationship between variables. • If r = 1, we can say that there is perfect positive relationship • If r = -1 there is perfect negative relationship. • In general, for r 0.8 we can say that there is high correlation • If r is between 0.3-0.8 then there is considerable correlation exists and • If r  0.3 we can say that there is negligible correlation. 16
  • 17. Characteristics of correlation coefficient The value of r ranges between (-1) and (+1): • If there is no relationship at all between the two variables, then the value is zero. • On the other hand if the relationship is perfect, which means that all the points on the scatter diagram fall on the straight line, the value of r is +1 or –1, depending on the direction of line. • Other values of r show an intermediate degree of relationship between the two variables. 17
  • 18. Characteristics of correlation coefficient Sign of the coefficient can be positive or negative: • It is positive when the slope of the line is positive, and it is negative when the slope of line is negative. • If the value of Y increases as the value of X increases the sign will be positive on the other hand if the value Y decreases as the value of X increases, then the slope will be negative a so there will be –ve coefficient of correlation. 18
  • 19. Merits of Correlation coefficient 1.It is the numerical measure of correlation. 2. It determines a single value which summarizes extent of linear relationship. 3. It also indicates the type of correlation 4. It depends on all the observations so give true picture. 19
  • 20. Demerits of Correlation coefficient 1.It can not be computed for qualitative data such as flower colour, honesty, beauty, intelligence etc. 2.It measures only linear relationship, but it fails to measure non-linear relationship. 3.It is difficult to calculate. 20
  • 21. Applications of correlation • In agriculture, genetics, physiology, medicine etc. correlation is used as a tool of the analysis. Agriculture: • Correlation is widely used as a tool of analysis in agriculture sciences. • E.g. to estimate the role of various variables (factors) such as fertilizers, irrigation, fertility of soil etc. on crop yield. • Physiology: • Using regression and correlation analysis relationship between germination time and temperature of soil, alkalinity of river water and growth of fungi, etc. can be estimated. 21
  • 22. Applications of Correlations Genetics: • Correlation analysis finds a lot of application in genetics. • For instance, when ‘r’=0 (correlation coefficient) then it indicates that the concern genes are located at distance on same chromosomes. • When r=1, it indicates that genes are linked. Thus, correlation analysis is very important in gene mapping. 22
  • 23. Types of Correlations • Depending on the extent of relationship between two variables, scatter diagrams shows perfect correlation, perfect negative correlation, no correlation, high positive and high negative correlation. Perfect correlation: • All the points lie on a straight line. • As the variable value increases on X-axis the value on Y-axis also increases or vice a versa. • E.g. height and biomass. 23
  • 24. Types of Correlations Perfect negative correlation: • In this all the points lie on a straight line. • As the value on X-axis increases, the value on Y-axis decreases proportionately • e.g. Water temperature and amount of dissolved oxygen. No-correlations: • In this the line can not be drawn which is passing through most of the plotted points and the points are totally scattered. • Hence there is no correlation between variables of X and Y-axis. 24
  • 25. Types of Correlations High positive correlation: In this most of the plotted points lie on the line and others near to this line. High negative correlation: The diagram is showing high negative correlation as the slope of the lines is more than 90o and most of the points either lie on the straight line or in close vicinity. 25
  • 26. Regression • This term was first used by Sir Francis Galton to describe the laws of human inheritance. • Regression describes the liner relationship in quantitative terms. • It is used to make predictions about one variable based on our knowledge of the other. • The regression is divided into two categories i.e. simple regression and multiple regressions. • The simple regression is concerning with two variables while multiple regression is concerning with more than two variables. • Simple regression is further classified into linear and non-linear type regression. 26
  • 27. Regression • A linear regression is one in which some change in dependent variable (Y) can be expected for the change in independent variable (X, irrespective of the values of Y). • In studying the way in which the yield of wheat vary in relation to change the amount of fertilizer applied, yield is dependent variable (Y) and fertilizer level is independent variable (X). • The starting point in regression is to illustrate the relationship between the dependent variable (weight) and independent variable (age) by scatter diagram. 27
  • 28. Regression analysis • Regression analysis is widely used for prediction and forecasting. • It is also used to understand which among the independent variables are related to the dependent variable, and to explore the forms of these relationships. • In restricted circumstances, regression analysis can be used to infer causal relationships between the independent and dependent variables. 28
  • 29. Linear regression • In statistics linear regression includes any approach to modelling the relationship between a scalar variable y and one or more variables denoted X, such that the model depends linearly on the unknown parameters to be estimated from the data. • Such a model is called a “linear model”. • Linear regression has many practical applications. • This is because models that depend linearly on their unknown parameters are easier to fit than models which are non-linearly related to their parameters. 29
  • 30. Applications of linear regression • Linear regression is widely used in biological, behavioural and social sciences to describe possible relationships between variables. • It ranks as one of the most important tools used in these disciplines. Prediction or forecasting: • Linear regression can be used to fit a predictive model to an observed data set of y and X values. • After developing such a model, if an additional value of X is then given without its accompanying value of y, the fitted model can be used to make a prediction of the value of y. 30
  • 31. Applications of linear regression Epidemiology: • Early evidence relating tobacco smoking to mortality and morbidity came from observational studies employing regression analysis. • In order to reduce spurious correlations when analyzing observational data, researchers usually include several variables in their regression models in addition to the variable of primary interest. • For example, suppose we have a regression model in which cigarette smoking is the independent variable of interest, and the dependent variable is lifespan measured in years. 31
  • 32. Applications of linear regression Environmental science: • Linear regression finds application in a wide range of environmental science. • In Canada, the Environmental Effects Monitoring Program uses statistical analyses on fish and benthic surveys to measure the effects of pulp mill or metal mine effluent on the aquatic ecosystem. 32