SlideShare a Scribd company logo
1 of 32
Dr. Manoj Kumar Meher
Kalahandi University
meher.manoj@gmail.com
 The measures of association refer to a wide variety of
coefficients that measure the statistical strength of the
relationship on the variables of interest; these measures of
strength, or association, can be described in several ways,
depending on the analysis.
There are certain statistical distinctions that a researcher should know
in order to better understand the measures of statistical association.
1. The student should know that measures of association are not
the same as measures of statistical significance. The measures of
significance have a null hypothesis that states that there is no
significant difference between the strength of an observed
relationship and the strength of an expected relationship by
means of simple random sampling. Therefore, there is a
possibility of having a relationship that depicts strong measures
of association but is not statistically significant, and a
relationship that depicts weak measures of association but is
very significant.
2. The coefficient that measures statistical association, which can
vary depending on the analysis, that has a value of zero signifies
no relationship exists.
i. In correlation analyses, if the coefficient (r) has a value of one, it
signifies a perfect relationship on the variables of interest.
ii. In regression analyses, if the standardized beta weight (?) has a
value of one, it also signifies a perfect relationship on the
variables of interest.
iii. In regards to linear relationships, the measures of association are
those which deal with strictly monotonic, ordered monotonic,
predictive monotonic, and weak monotonic relationships.
iv. The researcher should note that if the relationships in measures
of association are perfect due to strict monotonicity, then it
should be perfect by other conditions as well.
v. However, in measures of association, one cannot have perfect
ordered and perfect predictive monotonicity at the same time.
The researcher should note that the linear definitions of perfect
relationships in measures of association are inappropriate for
curvilinear relationships or discontinuous relationships.
3. The measures of association define the strength of the linear
relationship in terms of the degree of monotonicity. This degree
of monotonicity used by the measures of association is on the
counting of various types of pairs in a relationship. There are
basically four types of pairs in the measures of association.
These are concordant pairs (i.e. the pairs that agree with each
other),discordant pairs (i.e. the pairs that do not agree with each
other), the tied pair on one variable, and the tied pair on the
other variable. The researcher should note that as the concordant
pair increases, all the linear definitions of perfect relationships in
measures of association increases the coefficient of association
towards +1.
There are certain assumptions that are made on the
measures of association:
 The measures of association assume categorical
(nominal or ordinal) and continuous types
 Statistics Solutions of level data. The measures of
association assume a symmetrical or asymmetrical
type of causal direction.
 The measures of association that define an ideal
relationship in terms of the strict monotonicity will
attain the value of one only if the two variables have
evolved from the same marginal distribution. The
measures of association also ignore those rows and
columns which have null values.
Product movement correlation
Rank correlation
Test of Significance
Coefficient of determination
Linear regression
The Pearson product-moment correlation
coefficient (or Pearson correlation coefficient, for
short) is a measure of the strength of a linear
association between two variables and is denoted
by r. Basically, a Pearson product-moment
correlation attempts to draw a line of best fit through
the data of two variables, and the Pearson correlation
coefficient, r, indicates how far away all these data
points are to this line of best fit (i.e., how well the
data points fit this new model/line of best fit).
Product Movement Correlation
r Strength of relationship
<0.2 Negligible
0.2 - 0.4 Low
0.4 – 0.7 Moderate
0.7 - 0.9 High
>0.9 Very High
Thumb rule
Formula
Pearson correlation coefficient ( r )
𝑟 =
∑(𝑥 − 𝑥)(𝑦 − 𝑦)
∑(𝑥 − 𝑥)2(𝑦 − 𝑦)2
Or
𝑟 =
∑(𝑥 − 𝑥)(𝑦 − 𝑦)
(σ𝑥 )(σ𝑦)
Production
Paddy in
Qnt./Hact.
(x)
Fertiliser
used
Kg/Hact.
(y)
x-x̄ y-ȳ (x-x̄)² (y-ȳ)² (x-x̄)*(y-ȳ)
40 90 -1.55 28.27 2.39 799.35 -43.69
42 65 0.45 3.27 0.21 10.71 1.49
37 56 -4.55 -5.73 20.66 32.80 26.03
28 47 -13.55 -14.73 183.48 216.89 199.49
75 59 33.45 -2.73 1119.21 7.44 -91.24
15 38 -26.55 -23.73 704.66 562.98 629.85
45 89 3.45 27.27 11.93 743.80 94.21
47 125 5.45 63.27 29.75 4003.44 345.12
49 25 7.45 -36.73 55.57 1348.89 -273.79
28 58 -13.55 -3.73 183.48 13.89 50.49
51 27 9.45 -34.73 89.39 1205.98 -328.33
x=41.55 y=61.73 ∑=2400.7 ∑=8946.18 ∑= 609.64
𝑟 =
∑(𝑥 − 𝑥)(𝑦 − 𝑦)
∑(𝑥 − 𝑥)2(𝑦 − 𝑦)2
(𝑥 − 𝑥)(𝑦 − 𝑦) = 609.64
(𝑥 − 𝑥)2 = 2400.73
(𝑦 − 𝑦)2 = 8946.18
=
609.64
2400.73 (8946.18)
= 0.132
r Strength of
relationship
<0.2 Negligible
0.2 - 0.4 Low
0.4 – 0.7 Moderate
0.7 - 0.9 High
>0.9 Very High
Solve the problem
X 5 7 8 6 10 25 15 10 7 3
Y 10 25 15 10 8 12 6 11 9 5
Sl No Urban Population Literacy (%)
1 60 73
2 35 29
3 15 36
4 22 14
5 18 20
6 38 48
7 47 45
8 5 12
9 12 13
10 9 10
Exercise
 The Spearman’s Rank Correlation Coefficient is the
non-parametric statistical measure used to study the
strength of association between the two ranked variables.
This method is applied to the ordinal set of numbers,
which can be arranged in order, i.e. one after the other so
that ranks can be given to each.
 In the rank correlation coefficient method, the ranks are
given to each individual on the basis of its quality or
quantity, such as ranking starts from position 1st and
goes till Nth position for the one ranked last in the
group.
Rank Correlation
R= 𝟏 −
𝟔∑𝑫𝟐
𝑵(𝑵𝟐−𝟏)
= 𝟏 −
𝟔∑𝑫𝟐
𝑵𝟑−𝑵
Where,
R = Rank coefficient of correlation
D = Difference of ranks
N = Number of Observations
Equal Ranks or Tie in Ranks: In case the same ranks are
assigned to two or more entities, then the ranks are assigned
on an average basis. Such as if two individuals are ranked
equal at third position, then the ranks shall be calculated as:
(4+5)/2 = 4.5
formula
Population Density No of District
100-120 1
120-140 3
140-160 4
160-180 6
180-200 8
200-220 14
220-240 12
240-260 11
260-280 15
280-300 7
Population
Density (X)
No of
District (Y)
Order (X) Order (Y) D= (X-Y) D²
100-120 1 1 10 -9 81
120-140 3 2 9 -7 49
140-160 4 3 8 -5 25
160-180 6 4 7 -4 16
180-200 8 5 5 0 0
200-220 14 6 2 4 16
220-240 12 7 3 3 9
240-260 11 8 4 4 16
260-280 15 9 1 8 64
280-300 7 10 6 4 16
∑=292
R= 𝟏 −
𝟔∑𝑫𝟐
𝑵𝟑−𝑵
1-
𝟔∗𝟐𝟗𝟐
𝟗𝟗𝟎
= 1 - 1.77 = 0 .77
Exercise
Sl No Urban Population (,000) Literacy (%)
1 60 73
2 35 29
3 15 36
4 22 14
5 18 20
6 38 48
7 47 45
8 5 36
9 12 13
10 22 36
Exercise
X Y
100 1025
120 3336
111 4258
200 150
250 589
99 7589
98 1587
135 987
189 687
60 1523
 The coefficient of determination, denoted R2 or r2 and pronounced "R
squared", is the proportion of the variance in the dependent variable that
is predictable from the independent variable(s).
 It is a statistic used in the context of statistical models whose main
purpose is either the prediction of future outcomes or the testing
of hypotheses, on the basis of related information. It provides a measure
of how well observed outcomes are replicated by the model, based on
the proportion of total variation of outcomes explained by the model.
 There are several definitions of R2 that are only sometimes equivalent.
One class of such cases includes that of simple linear
regression where r2 is used instead of R2. When an intercept is included,
then r2 is simply the square of the sample correlation coefficient (r)
between the observed outcomes and the observed predictor values. If
additional regressors are included, R2 is the square of the coefficient of
multiple correlation. In both such cases, the coefficient of determination
normally ranges from 0 to 1.
Coefficient of Determination
Steps to Find the Coefficient of Determination
 Find r, Correlation Coefficient
 Square ‘r’.
 Change r to percentage.
How to interpret the coefficient of determination?
The coefficient of determination, or the R-squared value, is a value
between 0.0 and 1.0 that expresses what proportion of the variance
in Y can be explained by X:
 If R2 = 1, then we have a perfect fit, which means that the values
of Y are fully determined (i.e., without any error) by the values
of X, and all data points lie precisely at the estimated best-fit line.
 If R2 = 0, then our model is no better at predicting the values
of Y than the model which always returns the average value
of Y as a prediction.
Multiplying R2 by 100%, you get the percentage of the variance
in Y which is explained with help of X. For instance:
 If R2 = 0.8, then 80% of the variance in Y is predicted by X
 If R2 = 0.5 then half of the variance in Y can be explained by X
The complementary percentage, i.e., (1 - R2) * 100%, quantifies the
unexplained variance:
 If R2 = 0.6, then 60% of the variance in Y has been explained with
help of X, while the remaining 40% remains unaccounted for.
Formula for the Coefficient of Determination, R2
Here are a few (equivalent) formulae:
R2 = SSR / SST
or
R2 = 1 - SSE / SST
or
R2 = SSR / (SSR + SSE)
TO BE DISCUSS AFTER REGRESSION
 The sum of squares of errors (SSE in short), also called
the residual sum of squares:
 SSE= ∑(yi - ŷi)² SSE quantifies the discrepancy between real
values of Y and those predicted by our model.
 The Regression Sum of Squares (shortened to SSR), which
is sometimes also called the explained sum of squares:
 SSR = ∑(ŷi - ȳ)² SSR measures the difference between the
values predicted by the regression model and those
predicted in the most basic way, namely by
ignoring X completely and using only the average value
of Y as a universal predictor.
 The Total Sum of Squares (SST), which quantifies the total
variability in Y:
 SST = ∑(yi - ȳ)² It turns out that those three sums of squares
satisfy:
 SST= SSR + SSE so you only need to calculate any two of
them, and the remaining one can be easily found!
Sum of
Squares of
Errors
Regression
Sum of
Squares
Total
Sum of
Squares
Origina
l Value
Original
Value
Predicted
Value
(Predicted-
Original)²
(Predicted-
Mean)²
(Y Value-
Mean)²
Yi Xi Y^* SSE SSR SST
3.5 16 3.45 0.0025 0.2025 0.25
3.2 14 3.15 0.0025 0.0225 0.04
3.0 12 2.85 0.0225 0.0225 0.00
2.6 11 2.70 0.0100 0.0900 0.16
2.9 12 2.85 0.0025 0.0225 0.01
3.3 15 3.30 0.0000 0.0900 0.09
2.7 13 3.00 0.0900 0.0000 0.09
2.8 11 2.70 0.0100 0.0900 0.04
SUM 0.1400 0.5400 0.68
Mean 3.0 3
Y^= 1.05+0.15X
R2= SSR/SST 0.7941
1-SSE/SST 0.7941 0.2593
SSR/(SSR+S
SE) 0.7941
Original
Value
Original
Value
Predicted
Value
(Predited-
Original)²
(Predicted-
Mean)²
(Y Value-
Mean)²
Yi Xi Y^* SSE SSR SST
25.00 12.00 30.89 34.69 36.24
35.00 18.00 36.05 1.10 0.74
58.00 22.00 39.49 342.62 6.66
37.00 15.00 33.47 12.46 11.83
27.00 25.00 42.07 227.10 26.63
45.00 17.00 35.19 96.24 2.96
62.00 22.00 39.49 506.70 6.66
32.00 32.00 48.09 258.89 124.99
12.00 8.00 27.45 238.70 89.49
36.91 1718.51 306.19
Y=a+bx
y = 0.8647x +
20.57 a= 20.57
R2= SSR/SST b= 0.8600
1-SSE/SST
SSR/(SSR+SSE) 0.1512
 In any distribution the line of best fit is known as
regression line. In a bivariate distribution there are two
regression line because there are two variable. If x on y are
two variable we get the regression x on y and y on x i.e, by
allotting a set of values to x a set of value for y where as a
set of value for x can be obtain respective to a set of values
y.
 The line can be means of least square methods i.e the
square of the deviation from the expected value are
minimum.
 The least-square method states that the curve that best fits a
given set of observations, is said to be a curve having
a minimum sum of the squared residuals (or deviations or
errors) from the given data points.
Linear regression
Formula
Y= a+bX
Now, here we need to find the value of the slope
of the line, b, plotted in scatter plot and the intercept, a
Where
N = no of observation
X= variable X
Y= Variable Y
X Y
1 1
2 4
3 5
4 6
5
y = 0.15x + 1.05
R² = 0.7941
2
2.5
3
3.5
4
10 12 14 16
Y
X
y
Linear (y)
Sl x y x² y² xy
1 16 3.5 256 12.25 56
2 14 3.2 196 10.24 44.8
3 12 3 144 9 36
4 11 2.6 121 6.76 28.6
5 12 2.9 144 8.41 34.8
6 15 3.3 225 10.89 49.5
7 13 2.7 169 7.29 35.1
8 11 2.8 121 7.84 30.8
∑ 104 24 1376 72.68 315.6
a = (24*1376) - (104*315.6)/8*1376 - (104*104)
=201.6/192
=1.05
b= (8*315.6) - (104*24)/8*1376 - (104*104)
= 28.8/192
=0.15
Find out X if Y= 3.0
Y= a+bx =
3=1.05+0.15x
a= 1.05
b= 0.15
3=1.05+0.15x
0.15x=3-1.05
0.15x=1.95
X=1.95/0.15
x=13
Find out Y if X= 20
Y= a+bX
Y=1.05+0.15X
x y
10 2.55
11 2.7
12 2.85
13 3
14 3.15
15 3.3
16 3.45
17 3.6
18 3.75
19 3.9
20 4.05
x y
16 3.5
14 3.2
12 3
11 2.6
12 2.9
15 3.3
13 2.7
11 2.8
X Y
12 25
18 35
22 58
15 37
25 27
17 45
22 62
32 32
8 12
Find Y if X = 25, 50 , 75 & 100
Exercise

More Related Content

What's hot

Sample Size Determination
Sample Size DeterminationSample Size Determination
Sample Size DeterminationTina Sepehrifar
 
descriptive and inferential statistics
descriptive and inferential statisticsdescriptive and inferential statistics
descriptive and inferential statisticsMona Sajid
 
Test of hypothesis
Test of hypothesisTest of hypothesis
Test of hypothesisvikramlawand
 
Inferential Statistics
Inferential StatisticsInferential Statistics
Inferential Statisticsewhite00
 
The Normal distribution
The Normal distributionThe Normal distribution
The Normal distributionSarfraz Ahmad
 
Confidence interval & probability statements
Confidence interval & probability statements Confidence interval & probability statements
Confidence interval & probability statements DrZahid Khan
 
Skewness and kurtosis ppt
Skewness and kurtosis pptSkewness and kurtosis ppt
Skewness and kurtosis pptDrishti Rajput
 
DIstinguish between Parametric vs nonparametric test
 DIstinguish between Parametric vs nonparametric test DIstinguish between Parametric vs nonparametric test
DIstinguish between Parametric vs nonparametric testsai prakash
 
Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive StatisticsCIToolkit
 
Statistical inference
Statistical inferenceStatistical inference
Statistical inferenceJags Jagdish
 
Measures of association
Measures of associationMeasures of association
Measures of associationIAU Dent
 

What's hot (20)

Tests of significance
Tests of significanceTests of significance
Tests of significance
 
Sample Size Determination
Sample Size DeterminationSample Size Determination
Sample Size Determination
 
descriptive and inferential statistics
descriptive and inferential statisticsdescriptive and inferential statistics
descriptive and inferential statistics
 
Correlation
CorrelationCorrelation
Correlation
 
Normality tests
Normality testsNormality tests
Normality tests
 
Correlation analysis
Correlation analysisCorrelation analysis
Correlation analysis
 
Test of hypothesis
Test of hypothesisTest of hypothesis
Test of hypothesis
 
Inferential Statistics
Inferential StatisticsInferential Statistics
Inferential Statistics
 
The Normal distribution
The Normal distributionThe Normal distribution
The Normal distribution
 
Confidence interval & probability statements
Confidence interval & probability statements Confidence interval & probability statements
Confidence interval & probability statements
 
Skewness and kurtosis ppt
Skewness and kurtosis pptSkewness and kurtosis ppt
Skewness and kurtosis ppt
 
Measure of central tendency
Measure of central tendencyMeasure of central tendency
Measure of central tendency
 
T test statistics
T test statisticsT test statistics
T test statistics
 
DIstinguish between Parametric vs nonparametric test
 DIstinguish between Parametric vs nonparametric test DIstinguish between Parametric vs nonparametric test
DIstinguish between Parametric vs nonparametric test
 
Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive Statistics
 
Non parametric tests
Non parametric testsNon parametric tests
Non parametric tests
 
T test
T testT test
T test
 
Confidence Intervals
Confidence IntervalsConfidence Intervals
Confidence Intervals
 
Statistical inference
Statistical inferenceStatistical inference
Statistical inference
 
Measures of association
Measures of associationMeasures of association
Measures of association
 

Similar to Measure of Association

Correlation analysis notes
Correlation analysis notesCorrelation analysis notes
Correlation analysis notesJapheth Muthama
 
5 regressionand correlation
5 regressionand correlation5 regressionand correlation
5 regressionand correlationLama K Banna
 
Research Methodology Module-06
Research Methodology Module-06Research Methodology Module-06
Research Methodology Module-06Kishor Ade
 
Association between-variables
Association between-variablesAssociation between-variables
Association between-variablesBorhan Uddin
 
Association between-variables
Association between-variablesAssociation between-variables
Association between-variablesBorhan Uddin
 
Correlation Analysis for MSc in Development Finance .pdf
Correlation Analysis for MSc in Development Finance .pdfCorrelation Analysis for MSc in Development Finance .pdf
Correlation Analysis for MSc in Development Finance .pdfErnestNgehTingum
 
Factor Extraction method in factor analysis with example in R studio.pptx
Factor Extraction method in factor analysis with example in R studio.pptxFactor Extraction method in factor analysis with example in R studio.pptx
Factor Extraction method in factor analysis with example in R studio.pptxGauravRajole
 
Artificial Intelligence (Unit - 8).pdf
Artificial Intelligence   (Unit  -  8).pdfArtificial Intelligence   (Unit  -  8).pdf
Artificial Intelligence (Unit - 8).pdfSathyaNarayanan47813
 
Assessment 2 ContextIn many data analyses, it is desirable.docx
Assessment 2 ContextIn many data analyses, it is desirable.docxAssessment 2 ContextIn many data analyses, it is desirable.docx
Assessment 2 ContextIn many data analyses, it is desirable.docxfestockton
 
Assessment 2 ContextIn many data analyses, it is desirable.docx
Assessment 2 ContextIn many data analyses, it is desirable.docxAssessment 2 ContextIn many data analyses, it is desirable.docx
Assessment 2 ContextIn many data analyses, it is desirable.docxgalerussel59292
 

Similar to Measure of Association (20)

Correlation analysis notes
Correlation analysis notesCorrelation analysis notes
Correlation analysis notes
 
Simple Regression.pptx
Simple Regression.pptxSimple Regression.pptx
Simple Regression.pptx
 
Quantitative Data analysis
Quantitative Data analysisQuantitative Data analysis
Quantitative Data analysis
 
Linear Correlation
Linear Correlation Linear Correlation
Linear Correlation
 
9. parametric regression
9. parametric regression9. parametric regression
9. parametric regression
 
5 regressionand correlation
5 regressionand correlation5 regressionand correlation
5 regressionand correlation
 
Research Methodology Module-06
Research Methodology Module-06Research Methodology Module-06
Research Methodology Module-06
 
Association between-variables
Association between-variablesAssociation between-variables
Association between-variables
 
Association between-variables
Association between-variablesAssociation between-variables
Association between-variables
 
Correlation Analysis for MSc in Development Finance .pdf
Correlation Analysis for MSc in Development Finance .pdfCorrelation Analysis for MSc in Development Finance .pdf
Correlation Analysis for MSc in Development Finance .pdf
 
2-20-04.ppt
2-20-04.ppt2-20-04.ppt
2-20-04.ppt
 
Factor Extraction method in factor analysis with example in R studio.pptx
Factor Extraction method in factor analysis with example in R studio.pptxFactor Extraction method in factor analysis with example in R studio.pptx
Factor Extraction method in factor analysis with example in R studio.pptx
 
Chapter 10
Chapter 10Chapter 10
Chapter 10
 
Chapter 10
Chapter 10Chapter 10
Chapter 10
 
Correlation and Regression
Correlation and Regression Correlation and Regression
Correlation and Regression
 
12943625.ppt
12943625.ppt12943625.ppt
12943625.ppt
 
Artificial Intelligence (Unit - 8).pdf
Artificial Intelligence   (Unit  -  8).pdfArtificial Intelligence   (Unit  -  8).pdf
Artificial Intelligence (Unit - 8).pdf
 
Assessment 2 ContextIn many data analyses, it is desirable.docx
Assessment 2 ContextIn many data analyses, it is desirable.docxAssessment 2 ContextIn many data analyses, it is desirable.docx
Assessment 2 ContextIn many data analyses, it is desirable.docx
 
Assessment 2 ContextIn many data analyses, it is desirable.docx
Assessment 2 ContextIn many data analyses, it is desirable.docxAssessment 2 ContextIn many data analyses, it is desirable.docx
Assessment 2 ContextIn many data analyses, it is desirable.docx
 
Correlation
CorrelationCorrelation
Correlation
 

Recently uploaded

Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一F La
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一F La
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degreeyuu sss
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 

Recently uploaded (20)

Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 

Measure of Association

  • 1. Dr. Manoj Kumar Meher Kalahandi University meher.manoj@gmail.com
  • 2.  The measures of association refer to a wide variety of coefficients that measure the statistical strength of the relationship on the variables of interest; these measures of strength, or association, can be described in several ways, depending on the analysis. There are certain statistical distinctions that a researcher should know in order to better understand the measures of statistical association. 1. The student should know that measures of association are not the same as measures of statistical significance. The measures of significance have a null hypothesis that states that there is no significant difference between the strength of an observed relationship and the strength of an expected relationship by means of simple random sampling. Therefore, there is a possibility of having a relationship that depicts strong measures of association but is not statistically significant, and a relationship that depicts weak measures of association but is very significant.
  • 3. 2. The coefficient that measures statistical association, which can vary depending on the analysis, that has a value of zero signifies no relationship exists. i. In correlation analyses, if the coefficient (r) has a value of one, it signifies a perfect relationship on the variables of interest. ii. In regression analyses, if the standardized beta weight (?) has a value of one, it also signifies a perfect relationship on the variables of interest. iii. In regards to linear relationships, the measures of association are those which deal with strictly monotonic, ordered monotonic, predictive monotonic, and weak monotonic relationships. iv. The researcher should note that if the relationships in measures of association are perfect due to strict monotonicity, then it should be perfect by other conditions as well. v. However, in measures of association, one cannot have perfect ordered and perfect predictive monotonicity at the same time. The researcher should note that the linear definitions of perfect relationships in measures of association are inappropriate for curvilinear relationships or discontinuous relationships.
  • 4. 3. The measures of association define the strength of the linear relationship in terms of the degree of monotonicity. This degree of monotonicity used by the measures of association is on the counting of various types of pairs in a relationship. There are basically four types of pairs in the measures of association. These are concordant pairs (i.e. the pairs that agree with each other),discordant pairs (i.e. the pairs that do not agree with each other), the tied pair on one variable, and the tied pair on the other variable. The researcher should note that as the concordant pair increases, all the linear definitions of perfect relationships in measures of association increases the coefficient of association towards +1.
  • 5. There are certain assumptions that are made on the measures of association:  The measures of association assume categorical (nominal or ordinal) and continuous types  Statistics Solutions of level data. The measures of association assume a symmetrical or asymmetrical type of causal direction.  The measures of association that define an ideal relationship in terms of the strict monotonicity will attain the value of one only if the two variables have evolved from the same marginal distribution. The measures of association also ignore those rows and columns which have null values.
  • 6. Product movement correlation Rank correlation Test of Significance Coefficient of determination Linear regression
  • 7. The Pearson product-moment correlation coefficient (or Pearson correlation coefficient, for short) is a measure of the strength of a linear association between two variables and is denoted by r. Basically, a Pearson product-moment correlation attempts to draw a line of best fit through the data of two variables, and the Pearson correlation coefficient, r, indicates how far away all these data points are to this line of best fit (i.e., how well the data points fit this new model/line of best fit). Product Movement Correlation
  • 8.
  • 9. r Strength of relationship <0.2 Negligible 0.2 - 0.4 Low 0.4 – 0.7 Moderate 0.7 - 0.9 High >0.9 Very High Thumb rule
  • 10. Formula Pearson correlation coefficient ( r ) 𝑟 = ∑(𝑥 − 𝑥)(𝑦 − 𝑦) ∑(𝑥 − 𝑥)2(𝑦 − 𝑦)2 Or 𝑟 = ∑(𝑥 − 𝑥)(𝑦 − 𝑦) (σ𝑥 )(σ𝑦)
  • 11. Production Paddy in Qnt./Hact. (x) Fertiliser used Kg/Hact. (y) x-x̄ y-ȳ (x-x̄)² (y-ȳ)² (x-x̄)*(y-ȳ) 40 90 -1.55 28.27 2.39 799.35 -43.69 42 65 0.45 3.27 0.21 10.71 1.49 37 56 -4.55 -5.73 20.66 32.80 26.03 28 47 -13.55 -14.73 183.48 216.89 199.49 75 59 33.45 -2.73 1119.21 7.44 -91.24 15 38 -26.55 -23.73 704.66 562.98 629.85 45 89 3.45 27.27 11.93 743.80 94.21 47 125 5.45 63.27 29.75 4003.44 345.12 49 25 7.45 -36.73 55.57 1348.89 -273.79 28 58 -13.55 -3.73 183.48 13.89 50.49 51 27 9.45 -34.73 89.39 1205.98 -328.33 x=41.55 y=61.73 ∑=2400.7 ∑=8946.18 ∑= 609.64
  • 12. 𝑟 = ∑(𝑥 − 𝑥)(𝑦 − 𝑦) ∑(𝑥 − 𝑥)2(𝑦 − 𝑦)2 (𝑥 − 𝑥)(𝑦 − 𝑦) = 609.64 (𝑥 − 𝑥)2 = 2400.73 (𝑦 − 𝑦)2 = 8946.18 = 609.64 2400.73 (8946.18) = 0.132 r Strength of relationship <0.2 Negligible 0.2 - 0.4 Low 0.4 – 0.7 Moderate 0.7 - 0.9 High >0.9 Very High Solve the problem X 5 7 8 6 10 25 15 10 7 3 Y 10 25 15 10 8 12 6 11 9 5
  • 13. Sl No Urban Population Literacy (%) 1 60 73 2 35 29 3 15 36 4 22 14 5 18 20 6 38 48 7 47 45 8 5 12 9 12 13 10 9 10 Exercise
  • 14.  The Spearman’s Rank Correlation Coefficient is the non-parametric statistical measure used to study the strength of association between the two ranked variables. This method is applied to the ordinal set of numbers, which can be arranged in order, i.e. one after the other so that ranks can be given to each.  In the rank correlation coefficient method, the ranks are given to each individual on the basis of its quality or quantity, such as ranking starts from position 1st and goes till Nth position for the one ranked last in the group. Rank Correlation
  • 15. R= 𝟏 − 𝟔∑𝑫𝟐 𝑵(𝑵𝟐−𝟏) = 𝟏 − 𝟔∑𝑫𝟐 𝑵𝟑−𝑵 Where, R = Rank coefficient of correlation D = Difference of ranks N = Number of Observations Equal Ranks or Tie in Ranks: In case the same ranks are assigned to two or more entities, then the ranks are assigned on an average basis. Such as if two individuals are ranked equal at third position, then the ranks shall be calculated as: (4+5)/2 = 4.5 formula
  • 16. Population Density No of District 100-120 1 120-140 3 140-160 4 160-180 6 180-200 8 200-220 14 220-240 12 240-260 11 260-280 15 280-300 7
  • 17. Population Density (X) No of District (Y) Order (X) Order (Y) D= (X-Y) D² 100-120 1 1 10 -9 81 120-140 3 2 9 -7 49 140-160 4 3 8 -5 25 160-180 6 4 7 -4 16 180-200 8 5 5 0 0 200-220 14 6 2 4 16 220-240 12 7 3 3 9 240-260 11 8 4 4 16 260-280 15 9 1 8 64 280-300 7 10 6 4 16 ∑=292 R= 𝟏 − 𝟔∑𝑫𝟐 𝑵𝟑−𝑵 1- 𝟔∗𝟐𝟗𝟐 𝟗𝟗𝟎 = 1 - 1.77 = 0 .77
  • 18. Exercise Sl No Urban Population (,000) Literacy (%) 1 60 73 2 35 29 3 15 36 4 22 14 5 18 20 6 38 48 7 47 45 8 5 36 9 12 13 10 22 36
  • 19. Exercise X Y 100 1025 120 3336 111 4258 200 150 250 589 99 7589 98 1587 135 987 189 687 60 1523
  • 20.  The coefficient of determination, denoted R2 or r2 and pronounced "R squared", is the proportion of the variance in the dependent variable that is predictable from the independent variable(s).  It is a statistic used in the context of statistical models whose main purpose is either the prediction of future outcomes or the testing of hypotheses, on the basis of related information. It provides a measure of how well observed outcomes are replicated by the model, based on the proportion of total variation of outcomes explained by the model.  There are several definitions of R2 that are only sometimes equivalent. One class of such cases includes that of simple linear regression where r2 is used instead of R2. When an intercept is included, then r2 is simply the square of the sample correlation coefficient (r) between the observed outcomes and the observed predictor values. If additional regressors are included, R2 is the square of the coefficient of multiple correlation. In both such cases, the coefficient of determination normally ranges from 0 to 1. Coefficient of Determination
  • 21. Steps to Find the Coefficient of Determination  Find r, Correlation Coefficient  Square ‘r’.  Change r to percentage.
  • 22. How to interpret the coefficient of determination? The coefficient of determination, or the R-squared value, is a value between 0.0 and 1.0 that expresses what proportion of the variance in Y can be explained by X:  If R2 = 1, then we have a perfect fit, which means that the values of Y are fully determined (i.e., without any error) by the values of X, and all data points lie precisely at the estimated best-fit line.  If R2 = 0, then our model is no better at predicting the values of Y than the model which always returns the average value of Y as a prediction. Multiplying R2 by 100%, you get the percentage of the variance in Y which is explained with help of X. For instance:  If R2 = 0.8, then 80% of the variance in Y is predicted by X  If R2 = 0.5 then half of the variance in Y can be explained by X The complementary percentage, i.e., (1 - R2) * 100%, quantifies the unexplained variance:  If R2 = 0.6, then 60% of the variance in Y has been explained with help of X, while the remaining 40% remains unaccounted for.
  • 23. Formula for the Coefficient of Determination, R2 Here are a few (equivalent) formulae: R2 = SSR / SST or R2 = 1 - SSE / SST or R2 = SSR / (SSR + SSE) TO BE DISCUSS AFTER REGRESSION
  • 24.  The sum of squares of errors (SSE in short), also called the residual sum of squares:  SSE= ∑(yi - ŷi)² SSE quantifies the discrepancy between real values of Y and those predicted by our model.  The Regression Sum of Squares (shortened to SSR), which is sometimes also called the explained sum of squares:  SSR = ∑(ŷi - ȳ)² SSR measures the difference between the values predicted by the regression model and those predicted in the most basic way, namely by ignoring X completely and using only the average value of Y as a universal predictor.  The Total Sum of Squares (SST), which quantifies the total variability in Y:  SST = ∑(yi - ȳ)² It turns out that those three sums of squares satisfy:  SST= SSR + SSE so you only need to calculate any two of them, and the remaining one can be easily found!
  • 25. Sum of Squares of Errors Regression Sum of Squares Total Sum of Squares Origina l Value Original Value Predicted Value (Predicted- Original)² (Predicted- Mean)² (Y Value- Mean)² Yi Xi Y^* SSE SSR SST 3.5 16 3.45 0.0025 0.2025 0.25 3.2 14 3.15 0.0025 0.0225 0.04 3.0 12 2.85 0.0225 0.0225 0.00 2.6 11 2.70 0.0100 0.0900 0.16 2.9 12 2.85 0.0025 0.0225 0.01 3.3 15 3.30 0.0000 0.0900 0.09 2.7 13 3.00 0.0900 0.0000 0.09 2.8 11 2.70 0.0100 0.0900 0.04 SUM 0.1400 0.5400 0.68 Mean 3.0 3 Y^= 1.05+0.15X R2= SSR/SST 0.7941 1-SSE/SST 0.7941 0.2593 SSR/(SSR+S SE) 0.7941
  • 26. Original Value Original Value Predicted Value (Predited- Original)² (Predicted- Mean)² (Y Value- Mean)² Yi Xi Y^* SSE SSR SST 25.00 12.00 30.89 34.69 36.24 35.00 18.00 36.05 1.10 0.74 58.00 22.00 39.49 342.62 6.66 37.00 15.00 33.47 12.46 11.83 27.00 25.00 42.07 227.10 26.63 45.00 17.00 35.19 96.24 2.96 62.00 22.00 39.49 506.70 6.66 32.00 32.00 48.09 258.89 124.99 12.00 8.00 27.45 238.70 89.49 36.91 1718.51 306.19 Y=a+bx y = 0.8647x + 20.57 a= 20.57 R2= SSR/SST b= 0.8600 1-SSE/SST SSR/(SSR+SSE) 0.1512
  • 27.  In any distribution the line of best fit is known as regression line. In a bivariate distribution there are two regression line because there are two variable. If x on y are two variable we get the regression x on y and y on x i.e, by allotting a set of values to x a set of value for y where as a set of value for x can be obtain respective to a set of values y.  The line can be means of least square methods i.e the square of the deviation from the expected value are minimum.  The least-square method states that the curve that best fits a given set of observations, is said to be a curve having a minimum sum of the squared residuals (or deviations or errors) from the given data points. Linear regression
  • 28. Formula Y= a+bX Now, here we need to find the value of the slope of the line, b, plotted in scatter plot and the intercept, a Where N = no of observation X= variable X Y= Variable Y X Y 1 1 2 4 3 5 4 6 5
  • 29. y = 0.15x + 1.05 R² = 0.7941 2 2.5 3 3.5 4 10 12 14 16 Y X y Linear (y) Sl x y x² y² xy 1 16 3.5 256 12.25 56 2 14 3.2 196 10.24 44.8 3 12 3 144 9 36 4 11 2.6 121 6.76 28.6 5 12 2.9 144 8.41 34.8 6 15 3.3 225 10.89 49.5 7 13 2.7 169 7.29 35.1 8 11 2.8 121 7.84 30.8 ∑ 104 24 1376 72.68 315.6 a = (24*1376) - (104*315.6)/8*1376 - (104*104) =201.6/192 =1.05 b= (8*315.6) - (104*24)/8*1376 - (104*104) = 28.8/192 =0.15
  • 30. Find out X if Y= 3.0 Y= a+bx = 3=1.05+0.15x a= 1.05 b= 0.15 3=1.05+0.15x 0.15x=3-1.05 0.15x=1.95 X=1.95/0.15 x=13 Find out Y if X= 20
  • 31. Y= a+bX Y=1.05+0.15X x y 10 2.55 11 2.7 12 2.85 13 3 14 3.15 15 3.3 16 3.45 17 3.6 18 3.75 19 3.9 20 4.05 x y 16 3.5 14 3.2 12 3 11 2.6 12 2.9 15 3.3 13 2.7 11 2.8
  • 32. X Y 12 25 18 35 22 58 15 37 25 27 17 45 22 62 32 32 8 12 Find Y if X = 25, 50 , 75 & 100 Exercise

Editor's Notes

  1. Monotonic: a sequence or function; consistently increasing and never decreasing or consistently decreasing and never increasing in value
  2. In the case of only two random variables, this is called a bivariate distribution, but the concept generalizes to any number of random variables, giving a multivariate distribution.