SlideShare a Scribd company logo
WHICH NUTRITIONAL VALUE AFFECTS THE AMOUNT OF
CALORIES IN CANDY THE MOST?
OR WHY YOU GET SO MANY CALORIES FROM A CANDY WITH
HIGH CONTENTS OF TOTAL FAT
Total Fat
Prepared by:
Alexander
Voronin
Bonnie Pang
Kayla Mina
Stephanie
HeintzmanImage source: Google Images
EXECUTIVE SUMMARY
Calories in candy products are highly impacted by
Total Fat (0,8 correlation)
High levels of cholesterol in candies aren’t connected
to the contents of saturated fat (0,47 correlation)
2
TABLE OF CONTENTS
1. Our Data….……………………………………………………………4-
5
2. Data
Assumptions……………………………………………………….6
 Correlation coefficient as method of
research…………………………..7-8
 Using SAS to get insights about the
data………………………………..9-12
3. Conclusions about the
data.………………………………………..........13
4. How you can reach
3
BACKGROUND
Our data set consists of 75 candies.
For each candy, the following information is available:
Servings, Weight, Calories, Total Fat, Saturated Fat,
Cholesterol, Sodium, Carbohydrates, Fiber, Sugars,
Protein, Vitamin A, Vitamin C, Calcium, and Iron.
4
HERE’S HOW OUR DATA
LOOKS IN SAS
5
WE HYPOTHESIZE THAT…
1. The more Saturated Fat there is,
the more Cholesterol there is.
2. Calories are impacted more by
Total Fat than by Sugar.
6
METHOD
We will use the correlation coefficient (r) to
indicate the strength of the relationships
This will be done using the analytical software,
SAS Enterprise Guide
7
HOW WE USE THE
CORRELATION COEFFICIENT?
It measures the strength and the direction of a linear
relationship between two variables
When the correlation is positive (r > 0), it means that
as the value of one variable increases, so does the
other.
If a correlation is negative (r < 0), it indicates that
when one variable increases, the other variable
decreases. This means there is an inverse relationship
between the two variables.
[Shen, David. "Computation of Correlation Coefficient and It's Confidence Interval in SAS." Sas.com. Web. ]
8
STEP 1
Open data set
Analyze > Multivariate > Correlations…
9
STEP 2
Window opens
Drag the indicated
variables under “Variables
to assign” to the Analysis
variable <variable
required> under “Task
roles”
10
STEP 3
Under the Results tab to
the left, check “Create a
scatter plot for each
correlation pair” and
uncheck “Show significance
probabilities associated
with correlations”.
Finally, Run the correlation
11
RESULTSEach cell in the following data output shows the strength of the relationship
between the variables listed in the corresponding rows and columns.
The higher the number the stronger the relationship is
Numbers in the orange box link to
hypothesis 1Numbers in the red box link to hypothesis
2
12
Highest number in the dataset
– strongest relationship
CONCLUSIONS
Hypothesis 1- The more Saturated
Fat there is, the more Cholesterol
there is.
0.47270 – SatFat & Cholesterol
The relationship between
Saturated Fat and Cholesterol is
47%. This is a weak correlation.
This means that having more
Cholesterol does not indicate
higher Saturated Fat levels, and
vice versa.
Therefore we reject this
hypothesis.
Hypothesis 2 – When a candy is
high in calories, there is more
likely to be higher levels of Total
Fat than Sugar.
0.80707 – Calories & Total Fat
0.41692 – Calories & Sugar
The relationship is about 2X
stronger between calories and Total
Fat (80%), than Calories and Sugar
(41.6%).
This means that the higher levels of
Calories can be more likely
determined by the levels of Total Fat
than levels of Sugar.
Therefore we accept this hypothesis. 13
HOW YOU CAN REACH US
Our email address: team3@dataresearch.com
Feel free to contact us for data research using other
analytical tools and approaches
Using data analysis we’re able to find other
correlations in your dataset
14

More Related Content

Similar to Correlation

Lecture 5 practical_guidelines_assignments
Lecture 5 practical_guidelines_assignmentsLecture 5 practical_guidelines_assignments
Lecture 5 practical_guidelines_assignmentsDaria Bogdanova
 
1. What is a codebook Who might create one, what would she or h.docx
1.  What is a codebook  Who might create one, what would she or h.docx1.  What is a codebook  Who might create one, what would she or h.docx
1. What is a codebook Who might create one, what would she or h.docx
SONU61709
 
Spss & minitab
Spss & minitabSpss & minitab
Spss & minitab
asifusman1998
 
Education Management Corp PSY2060__05 Week 5 Lecture 1We
Education Management Corp  PSY2060__05 Week 5 Lecture 1WeEducation Management Corp  PSY2060__05 Week 5 Lecture 1We
Education Management Corp PSY2060__05 Week 5 Lecture 1We
EvonCanales257
 
Affect of Metabolic Obesity and Body Mass Index in Coronary Artery Diseases
Affect of Metabolic Obesity and Body Mass Index in Coronary Artery DiseasesAffect of Metabolic Obesity and Body Mass Index in Coronary Artery Diseases
Affect of Metabolic Obesity and Body Mass Index in Coronary Artery Diseases
Nikhil Gupta
 
ECON104RoughDraft1
ECON104RoughDraft1ECON104RoughDraft1
ECON104RoughDraft1John Nguyen
 
Weka for clustering and regression itb vgsom
Weka for clustering and regression itb vgsomWeka for clustering and regression itb vgsom
Weka for clustering and regression itb vgsom
mithai1985
 
Opticon 2017 Experimenting with Stats Engine
Opticon 2017 Experimenting with Stats EngineOpticon 2017 Experimenting with Stats Engine
Opticon 2017 Experimenting with Stats Engine
Optimizely
 
Predicting breast cancer: Adrian Valles
Predicting breast cancer: Adrian VallesPredicting breast cancer: Adrian Valles
Predicting breast cancer: Adrian Valles
Adrián Vallés
 
Predicting diabetes using a machine learning approach linked in
Predicting diabetes using a machine learning approach   linked inPredicting diabetes using a machine learning approach   linked in
Predicting diabetes using a machine learning approach linked in
venkatvajradhar1
 
Feature Reduction Techniques
Feature Reduction TechniquesFeature Reduction Techniques
Feature Reduction Techniques
Vishal Patel
 
Chapter 3.pptx
Chapter 3.pptxChapter 3.pptx
Chapter 3.pptx
mahamoh6
 
Causality in Python PyCon 2021 ISRAEL
Causality in Python PyCon 2021 ISRAELCausality in Python PyCon 2021 ISRAEL
Causality in Python PyCon 2021 ISRAEL
Hanan Shteingart
 
You clearly understand the concepts of this assignment. You’ve don.docx
You clearly understand the concepts of this assignment. You’ve don.docxYou clearly understand the concepts of this assignment. You’ve don.docx
You clearly understand the concepts of this assignment. You’ve don.docx
jeffevans62972
 
Sample Data Preparation
Sample Data PreparationSample Data Preparation
Sample Data Preparation
Sai Chandan
 
Statistical Analysis
Statistical AnalysisStatistical Analysis
Statistical Analysis
Eiji Slideshare
 
Multivariate data analysis regression, cluster and factor analysis on spss
Multivariate data analysis   regression, cluster and factor analysis on spssMultivariate data analysis   regression, cluster and factor analysis on spss
Multivariate data analysis regression, cluster and factor analysis on spss
Aditya Banerjee
 
Regression and Classification Analysis
Regression and Classification AnalysisRegression and Classification Analysis
Regression and Classification Analysis
YashIyengar
 
Predicting deaths from COVID-19 using Machine Learning
Predicting deaths from COVID-19 using Machine LearningPredicting deaths from COVID-19 using Machine Learning
Predicting deaths from COVID-19 using Machine Learning
IdanGalShohet
 
6MODULE 2Module 2 Problem SetEXAMPLEGrand .docx
6MODULE 2Module 2 Problem SetEXAMPLEGrand .docx6MODULE 2Module 2 Problem SetEXAMPLEGrand .docx
6MODULE 2Module 2 Problem SetEXAMPLEGrand .docx
blondellchancy
 

Similar to Correlation (20)

Lecture 5 practical_guidelines_assignments
Lecture 5 practical_guidelines_assignmentsLecture 5 practical_guidelines_assignments
Lecture 5 practical_guidelines_assignments
 
1. What is a codebook Who might create one, what would she or h.docx
1.  What is a codebook  Who might create one, what would she or h.docx1.  What is a codebook  Who might create one, what would she or h.docx
1. What is a codebook Who might create one, what would she or h.docx
 
Spss & minitab
Spss & minitabSpss & minitab
Spss & minitab
 
Education Management Corp PSY2060__05 Week 5 Lecture 1We
Education Management Corp  PSY2060__05 Week 5 Lecture 1WeEducation Management Corp  PSY2060__05 Week 5 Lecture 1We
Education Management Corp PSY2060__05 Week 5 Lecture 1We
 
Affect of Metabolic Obesity and Body Mass Index in Coronary Artery Diseases
Affect of Metabolic Obesity and Body Mass Index in Coronary Artery DiseasesAffect of Metabolic Obesity and Body Mass Index in Coronary Artery Diseases
Affect of Metabolic Obesity and Body Mass Index in Coronary Artery Diseases
 
ECON104RoughDraft1
ECON104RoughDraft1ECON104RoughDraft1
ECON104RoughDraft1
 
Weka for clustering and regression itb vgsom
Weka for clustering and regression itb vgsomWeka for clustering and regression itb vgsom
Weka for clustering and regression itb vgsom
 
Opticon 2017 Experimenting with Stats Engine
Opticon 2017 Experimenting with Stats EngineOpticon 2017 Experimenting with Stats Engine
Opticon 2017 Experimenting with Stats Engine
 
Predicting breast cancer: Adrian Valles
Predicting breast cancer: Adrian VallesPredicting breast cancer: Adrian Valles
Predicting breast cancer: Adrian Valles
 
Predicting diabetes using a machine learning approach linked in
Predicting diabetes using a machine learning approach   linked inPredicting diabetes using a machine learning approach   linked in
Predicting diabetes using a machine learning approach linked in
 
Feature Reduction Techniques
Feature Reduction TechniquesFeature Reduction Techniques
Feature Reduction Techniques
 
Chapter 3.pptx
Chapter 3.pptxChapter 3.pptx
Chapter 3.pptx
 
Causality in Python PyCon 2021 ISRAEL
Causality in Python PyCon 2021 ISRAELCausality in Python PyCon 2021 ISRAEL
Causality in Python PyCon 2021 ISRAEL
 
You clearly understand the concepts of this assignment. You’ve don.docx
You clearly understand the concepts of this assignment. You’ve don.docxYou clearly understand the concepts of this assignment. You’ve don.docx
You clearly understand the concepts of this assignment. You’ve don.docx
 
Sample Data Preparation
Sample Data PreparationSample Data Preparation
Sample Data Preparation
 
Statistical Analysis
Statistical AnalysisStatistical Analysis
Statistical Analysis
 
Multivariate data analysis regression, cluster and factor analysis on spss
Multivariate data analysis   regression, cluster and factor analysis on spssMultivariate data analysis   regression, cluster and factor analysis on spss
Multivariate data analysis regression, cluster and factor analysis on spss
 
Regression and Classification Analysis
Regression and Classification AnalysisRegression and Classification Analysis
Regression and Classification Analysis
 
Predicting deaths from COVID-19 using Machine Learning
Predicting deaths from COVID-19 using Machine LearningPredicting deaths from COVID-19 using Machine Learning
Predicting deaths from COVID-19 using Machine Learning
 
6MODULE 2Module 2 Problem SetEXAMPLEGrand .docx
6MODULE 2Module 2 Problem SetEXAMPLEGrand .docx6MODULE 2Module 2 Problem SetEXAMPLEGrand .docx
6MODULE 2Module 2 Problem SetEXAMPLEGrand .docx
 

Recently uploaded

一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
oz8q3jxlp
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
ocavb
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
pchutichetpong
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
MaleehaSheikh2
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
Tiktokethiodaily
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
nscud
 

Recently uploaded (20)

一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
 

Correlation

  • 1. WHICH NUTRITIONAL VALUE AFFECTS THE AMOUNT OF CALORIES IN CANDY THE MOST? OR WHY YOU GET SO MANY CALORIES FROM A CANDY WITH HIGH CONTENTS OF TOTAL FAT Total Fat Prepared by: Alexander Voronin Bonnie Pang Kayla Mina Stephanie HeintzmanImage source: Google Images
  • 2. EXECUTIVE SUMMARY Calories in candy products are highly impacted by Total Fat (0,8 correlation) High levels of cholesterol in candies aren’t connected to the contents of saturated fat (0,47 correlation) 2
  • 3. TABLE OF CONTENTS 1. Our Data….……………………………………………………………4- 5 2. Data Assumptions……………………………………………………….6  Correlation coefficient as method of research…………………………..7-8  Using SAS to get insights about the data………………………………..9-12 3. Conclusions about the data.………………………………………..........13 4. How you can reach 3
  • 4. BACKGROUND Our data set consists of 75 candies. For each candy, the following information is available: Servings, Weight, Calories, Total Fat, Saturated Fat, Cholesterol, Sodium, Carbohydrates, Fiber, Sugars, Protein, Vitamin A, Vitamin C, Calcium, and Iron. 4
  • 5. HERE’S HOW OUR DATA LOOKS IN SAS 5
  • 6. WE HYPOTHESIZE THAT… 1. The more Saturated Fat there is, the more Cholesterol there is. 2. Calories are impacted more by Total Fat than by Sugar. 6
  • 7. METHOD We will use the correlation coefficient (r) to indicate the strength of the relationships This will be done using the analytical software, SAS Enterprise Guide 7
  • 8. HOW WE USE THE CORRELATION COEFFICIENT? It measures the strength and the direction of a linear relationship between two variables When the correlation is positive (r > 0), it means that as the value of one variable increases, so does the other. If a correlation is negative (r < 0), it indicates that when one variable increases, the other variable decreases. This means there is an inverse relationship between the two variables. [Shen, David. "Computation of Correlation Coefficient and It's Confidence Interval in SAS." Sas.com. Web. ] 8
  • 9. STEP 1 Open data set Analyze > Multivariate > Correlations… 9
  • 10. STEP 2 Window opens Drag the indicated variables under “Variables to assign” to the Analysis variable <variable required> under “Task roles” 10
  • 11. STEP 3 Under the Results tab to the left, check “Create a scatter plot for each correlation pair” and uncheck “Show significance probabilities associated with correlations”. Finally, Run the correlation 11
  • 12. RESULTSEach cell in the following data output shows the strength of the relationship between the variables listed in the corresponding rows and columns. The higher the number the stronger the relationship is Numbers in the orange box link to hypothesis 1Numbers in the red box link to hypothesis 2 12 Highest number in the dataset – strongest relationship
  • 13. CONCLUSIONS Hypothesis 1- The more Saturated Fat there is, the more Cholesterol there is. 0.47270 – SatFat & Cholesterol The relationship between Saturated Fat and Cholesterol is 47%. This is a weak correlation. This means that having more Cholesterol does not indicate higher Saturated Fat levels, and vice versa. Therefore we reject this hypothesis. Hypothesis 2 – When a candy is high in calories, there is more likely to be higher levels of Total Fat than Sugar. 0.80707 – Calories & Total Fat 0.41692 – Calories & Sugar The relationship is about 2X stronger between calories and Total Fat (80%), than Calories and Sugar (41.6%). This means that the higher levels of Calories can be more likely determined by the levels of Total Fat than levels of Sugar. Therefore we accept this hypothesis. 13
  • 14. HOW YOU CAN REACH US Our email address: team3@dataresearch.com Feel free to contact us for data research using other analytical tools and approaches Using data analysis we’re able to find other correlations in your dataset 14

Editor's Notes

  1. JUST IN CASE
  2. http://www2.sas.com/proceedings/sugi31/170-31.pdf
  3. 0.80707 – Calories & Total Fat 0.41692 – Calories & Sugar 0.47270 – SatFat & Cholesterol