SlideShare a Scribd company logo
1 of 36
Descriptive
Statistics
Skewness
& Kurtosis
Correlation
Agenda
•Stages of Analytics
•Data Life Cycle
•Data Types
•First Business Moment Decision (Measure of CentralTendency)
•Second Business Moment Decision (Measure of Spread)
•Third Business Moment Decision (Skewness)
•Fourth Business Moment Decision(Kurtosis)
•Correlation
CRISP
DM
Business
Understandi
ng
Data
Acquisition
Data
Processing
Data
Exploration
Modelling
Model
Evaluation
Deployment
Data Life Cycle
1. Descriptive Analytics
2. Diagnostic Analytics
3. Predictive analytics
4. Prescriptive Analytics
Stages of Analytics
Data
Quantitative
Continuous
Interval Ratio
Discrete
Qualitative
Categorical
Nominal Ordinal
Balanced Vs Imbalanced Data
Skewness
degree of asymmetry observed in a probability distribution that deviates from the symmetrical normal distribution
(bell Curve)
1. Positive skewed or right-skewed
The extreme positive skewness is not
desirable for distribution, as a high level of
skewness can cause misleading results.
For positively skewed distributions, the
famous transformation is the log
transformation. The log transformation
proposes the calculations of the natural
logarithm for each value in the dataset.
2. Negative skewed or left-skewed
a large number of data-pushed on the left-
hand side
Median is the middle value, and mode is the
highest value, and due to unbalanced
distribution median will be higher than the
mean.
•If the skewness is between -0.5 & 0.5, the data are nearly symmetrical.
•If the skewness is between -1 & -0.5 (negative skewed) or between 0.5 & 1(positive skewed), the
data are slightly skewed.
•If the skewness is lower than -1 (negative skewed) or greater than 1 (positive skewed), the data are
extremely skewed.
Calculate the skewness coefficient of the sample
Kurtosis
Kurtosis refers to the degree of
presence of outliers in the
distribution.
Kurtosis is a statistical measure,
whether the data is heavy-tailed or
light-tailed in a normal distribution
Types of Kurtosis
1.Leptokurtic (kurtosis > 3)
2.platykurtic (kurtosis < 3)
3.Mesokurtic (kurtosis = 3)
Correlation
1.“Correlation is an analysis of the co-variation between two or
more variables”—(A.M Tuttle)
2. “Correlation analysis attempts to determine the degree of
relationship between variables”—(Ya Lun Chou)
3. “Correlation analysis deals with the association between two
or more variables”— (Simpson and Kafka)
Correlation and Causation
Correlation: It is a numerical measure of the direction
and magnitude of the mutual relationship between the
variables(X and Y).
Causation: X is the cause of change in Y i.e, the
change of Y is the effect of change in X.
NOTE:
If X and Y are correlated then X and Y may or may not have a casual relationship.
If X and Y have a causal relationship then X and Y must be correlated.
Reasons Behind Correlation
1. Mutual dependence Between the variables: Both the variables may be mutually
influencing each other so that neither can be designated as the cause and the other the
effect.
When two variables(X and Y) affect each other mutually, we cannot say X is the cause or Y
is the cause.
For Example, The price of a commodity is affected by demand and supply.
2. Due to pure chance: In a small sample, X and Y are highly correlated but in the
universe X and Y are not correlated.
For Example, Correlation between income and weight of a person. This may be due to:
– Sampling fluctuations
– Bias of investigator in selecting the sample.
Such a relation is called a non-sense or spurious relation.
3. Correlation due to any third common factor: Both the correlated variables may be
influenced by one or other variables.
– X and Y don’t have a direct correlation.
For Example, It is between the production of tea and rice per hectare. Here they are not
directly correlated instead the cause is the good rainfall well in time.
Types of Correlation
Key Takeaway
After this session you learnt:
1. Stages of Analytics
2. Data Life Cycle
3. Data Types
4. First Business Moment Decision (Measure of CentralTendency)
5. Second Business Moment Decision (Measure of Spread)
6. Third Business Moment Decision (Skewness)
7. Fourth Business Moment Decision(Kurtosis)
8. Correlation

More Related Content

Similar to Descriptive Statistics: Skewness, Kurtosis & Correlation

29510Nominal Data and the Chi-Square TestsJupiterimag.docx
29510Nominal Data and  the Chi-Square TestsJupiterimag.docx29510Nominal Data and  the Chi-Square TestsJupiterimag.docx
29510Nominal Data and the Chi-Square TestsJupiterimag.docxrhetttrevannion
 
Statistics and probability pp
Statistics and  probability ppStatistics and  probability pp
Statistics and probability ppRuby Vidal
 
Chapter 03 scatterplots and correlation
Chapter 03 scatterplots and correlationChapter 03 scatterplots and correlation
Chapter 03 scatterplots and correlationHamdy F. F. Mahmoud
 
Selection of appropriate data analysis technique
Selection of appropriate data analysis techniqueSelection of appropriate data analysis technique
Selection of appropriate data analysis techniqueRajaKrishnan M
 
SUITABILITY OF COINTEGRATION TESTS ON DATA STRUCTURE OF DIFFERENT ORDERS
SUITABILITY OF COINTEGRATION TESTS ON DATA STRUCTURE  OF DIFFERENT ORDERSSUITABILITY OF COINTEGRATION TESTS ON DATA STRUCTURE  OF DIFFERENT ORDERS
SUITABILITY OF COINTEGRATION TESTS ON DATA STRUCTURE OF DIFFERENT ORDERSBRNSS Publication Hub
 
Inferential Statistics - DAY 4 - B.Ed - AIOU
Inferential Statistics - DAY 4 - B.Ed - AIOUInferential Statistics - DAY 4 - B.Ed - AIOU
Inferential Statistics - DAY 4 - B.Ed - AIOUEqraBaig
 
What do youwant to doHow manyvariablesWhat level.docx
What do youwant to doHow manyvariablesWhat level.docxWhat do youwant to doHow manyvariablesWhat level.docx
What do youwant to doHow manyvariablesWhat level.docxphilipnelson29183
 
Statistics Chapter 01[1]
Statistics  Chapter 01[1]Statistics  Chapter 01[1]
Statistics Chapter 01[1]plisasm
 
2013jsm,Proceedings,DSweitzer,26sep
2013jsm,Proceedings,DSweitzer,26sep2013jsm,Proceedings,DSweitzer,26sep
2013jsm,Proceedings,DSweitzer,26sepDennis Sweitzer
 
Areas In Statistics
Areas In StatisticsAreas In Statistics
Areas In Statisticsguestc94d8c
 
Da gs for_commonbiases2
Da gs for_commonbiases2Da gs for_commonbiases2
Da gs for_commonbiases2mglymour
 
Overview of different statistical tests used in epidemiological
Overview of different  statistical tests used in epidemiologicalOverview of different  statistical tests used in epidemiological
Overview of different statistical tests used in epidemiologicalshefali jain
 
Common statistical tests and applications in epidemiological literature
Common statistical tests and applications in epidemiological literatureCommon statistical tests and applications in epidemiological literature
Common statistical tests and applications in epidemiological literatureKadium
 
Common statistical tests and applications in epidemiological literature
Common statistical tests and applications in epidemiological literatureCommon statistical tests and applications in epidemiological literature
Common statistical tests and applications in epidemiological literatureKadium
 

Similar to Descriptive Statistics: Skewness, Kurtosis & Correlation (20)

29510Nominal Data and the Chi-Square TestsJupiterimag.docx
29510Nominal Data and  the Chi-Square TestsJupiterimag.docx29510Nominal Data and  the Chi-Square TestsJupiterimag.docx
29510Nominal Data and the Chi-Square TestsJupiterimag.docx
 
Statistics and probability pp
Statistics and  probability ppStatistics and  probability pp
Statistics and probability pp
 
stats notes.doc
stats notes.docstats notes.doc
stats notes.doc
 
Important terminologies
Important terminologiesImportant terminologies
Important terminologies
 
Chapter 03 scatterplots and correlation
Chapter 03 scatterplots and correlationChapter 03 scatterplots and correlation
Chapter 03 scatterplots and correlation
 
Selection of appropriate data analysis technique
Selection of appropriate data analysis techniqueSelection of appropriate data analysis technique
Selection of appropriate data analysis technique
 
SUITABILITY OF COINTEGRATION TESTS ON DATA STRUCTURE OF DIFFERENT ORDERS
SUITABILITY OF COINTEGRATION TESTS ON DATA STRUCTURE  OF DIFFERENT ORDERSSUITABILITY OF COINTEGRATION TESTS ON DATA STRUCTURE  OF DIFFERENT ORDERS
SUITABILITY OF COINTEGRATION TESTS ON DATA STRUCTURE OF DIFFERENT ORDERS
 
Inferential Statistics - DAY 4 - B.Ed - AIOU
Inferential Statistics - DAY 4 - B.Ed - AIOUInferential Statistics - DAY 4 - B.Ed - AIOU
Inferential Statistics - DAY 4 - B.Ed - AIOU
 
Presentation1
Presentation1Presentation1
Presentation1
 
What do youwant to doHow manyvariablesWhat level.docx
What do youwant to doHow manyvariablesWhat level.docxWhat do youwant to doHow manyvariablesWhat level.docx
What do youwant to doHow manyvariablesWhat level.docx
 
Statistics Chapter 01[1]
Statistics  Chapter 01[1]Statistics  Chapter 01[1]
Statistics Chapter 01[1]
 
2013jsm,Proceedings,DSweitzer,26sep
2013jsm,Proceedings,DSweitzer,26sep2013jsm,Proceedings,DSweitzer,26sep
2013jsm,Proceedings,DSweitzer,26sep
 
Econometrics ch2
Econometrics ch2Econometrics ch2
Econometrics ch2
 
Areas In Statistics
Areas In StatisticsAreas In Statistics
Areas In Statistics
 
Da gs for_commonbiases2
Da gs for_commonbiases2Da gs for_commonbiases2
Da gs for_commonbiases2
 
Overview of different statistical tests used in epidemiological
Overview of different  statistical tests used in epidemiologicalOverview of different  statistical tests used in epidemiological
Overview of different statistical tests used in epidemiological
 
Data science
Data scienceData science
Data science
 
Common statistical tests and applications in epidemiological literature
Common statistical tests and applications in epidemiological literatureCommon statistical tests and applications in epidemiological literature
Common statistical tests and applications in epidemiological literature
 
Common statistical tests and applications in epidemiological literature
Common statistical tests and applications in epidemiological literatureCommon statistical tests and applications in epidemiological literature
Common statistical tests and applications in epidemiological literature
 
Correlation
CorrelationCorrelation
Correlation
 

Recently uploaded

VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 

Recently uploaded (20)

VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 

Descriptive Statistics: Skewness, Kurtosis & Correlation

  • 2. Agenda •Stages of Analytics •Data Life Cycle •Data Types •First Business Moment Decision (Measure of CentralTendency) •Second Business Moment Decision (Measure of Spread) •Third Business Moment Decision (Skewness) •Fourth Business Moment Decision(Kurtosis) •Correlation
  • 4. 1. Descriptive Analytics 2. Diagnostic Analytics 3. Predictive analytics 4. Prescriptive Analytics Stages of Analytics
  • 6.
  • 7.
  • 8.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
  • 21.
  • 22.
  • 23.
  • 24.
  • 25.
  • 26. Skewness degree of asymmetry observed in a probability distribution that deviates from the symmetrical normal distribution (bell Curve)
  • 27. 1. Positive skewed or right-skewed The extreme positive skewness is not desirable for distribution, as a high level of skewness can cause misleading results. For positively skewed distributions, the famous transformation is the log transformation. The log transformation proposes the calculations of the natural logarithm for each value in the dataset.
  • 28. 2. Negative skewed or left-skewed a large number of data-pushed on the left- hand side Median is the middle value, and mode is the highest value, and due to unbalanced distribution median will be higher than the mean.
  • 29. •If the skewness is between -0.5 & 0.5, the data are nearly symmetrical. •If the skewness is between -1 & -0.5 (negative skewed) or between 0.5 & 1(positive skewed), the data are slightly skewed. •If the skewness is lower than -1 (negative skewed) or greater than 1 (positive skewed), the data are extremely skewed. Calculate the skewness coefficient of the sample
  • 30. Kurtosis Kurtosis refers to the degree of presence of outliers in the distribution. Kurtosis is a statistical measure, whether the data is heavy-tailed or light-tailed in a normal distribution
  • 31. Types of Kurtosis 1.Leptokurtic (kurtosis > 3) 2.platykurtic (kurtosis < 3) 3.Mesokurtic (kurtosis = 3)
  • 32. Correlation 1.“Correlation is an analysis of the co-variation between two or more variables”—(A.M Tuttle) 2. “Correlation analysis attempts to determine the degree of relationship between variables”—(Ya Lun Chou) 3. “Correlation analysis deals with the association between two or more variables”— (Simpson and Kafka)
  • 33. Correlation and Causation Correlation: It is a numerical measure of the direction and magnitude of the mutual relationship between the variables(X and Y). Causation: X is the cause of change in Y i.e, the change of Y is the effect of change in X. NOTE: If X and Y are correlated then X and Y may or may not have a casual relationship. If X and Y have a causal relationship then X and Y must be correlated.
  • 34. Reasons Behind Correlation 1. Mutual dependence Between the variables: Both the variables may be mutually influencing each other so that neither can be designated as the cause and the other the effect. When two variables(X and Y) affect each other mutually, we cannot say X is the cause or Y is the cause. For Example, The price of a commodity is affected by demand and supply. 2. Due to pure chance: In a small sample, X and Y are highly correlated but in the universe X and Y are not correlated. For Example, Correlation between income and weight of a person. This may be due to: – Sampling fluctuations – Bias of investigator in selecting the sample. Such a relation is called a non-sense or spurious relation. 3. Correlation due to any third common factor: Both the correlated variables may be influenced by one or other variables. – X and Y don’t have a direct correlation. For Example, It is between the production of tea and rice per hectare. Here they are not directly correlated instead the cause is the good rainfall well in time.
  • 36. Key Takeaway After this session you learnt: 1. Stages of Analytics 2. Data Life Cycle 3. Data Types 4. First Business Moment Decision (Measure of CentralTendency) 5. Second Business Moment Decision (Measure of Spread) 6. Third Business Moment Decision (Skewness) 7. Fourth Business Moment Decision(Kurtosis) 8. Correlation