SlideShare a Scribd company logo
Identifying individuals, variables and categorical
variables in a data set
Millions of Americans rely on caffeine to get them up in the morning. Here’s
nutritional data on some popular drinks at Ben’s Beans Coffee Shop:
The individuals in the dataset are?
The dataset contains how many variables?
Drink Type Calories Sugars (g) Caffeine (mg)
Brewed Coffee Hot 4 0 260
Café Mocha Hot 100 14 95
Café latte Hot 170 15 75
Cappuccino Hot 60 27 75
Iced Brewed Coffee Cold 60 8 120
Chai latte Hot 120 25 60
The school counselor needs to meet with students who
have more than three absences. Here is some of the data
about the students.
• The individuals in this data set are ?
 Students
 School Counselor
 Homeroom Teachers
• This data set contains ?
 2 variables, 1 of which is quantitative
 2 variables, 2 of which is quantitative
 5 variables, 1 of which is quantitative
 5 variables, 3 of which is quantitative
Bar Graph
What was the median score for the
Final exam?
What is the midrange of the midterm
scores?
What was the average student score for
the final exam?
What was the mode for the final exam
score?
What is the range of the midterm
scores?
Median score for the final exam : 75 80 100 100 100
Midrange of the midterm scores: 80
90 90 60 100 80
Max. Mark = 100 Min. Mark = 60 (100+60)/2 80
Average student score for the final exam: 91
(75+80+100+100+100)/5
Mode for the final exam score: 100
75 80 100 100 100 -------- Common Score ---100
Range of the midterm scores: 40
Max. Mark = 100 Min. Mark = 60 100-60 40
For the past year, a travel agency has collected data
about the number of individual tickets that it sells for
signature product: a Mediterranean cruise.
The monthly data on the
ticket sales is shown below.
What are the best and worst?
Histogram
A histogram displays numerical data by grouping data into "bins" of equal width.
Each bin is plotted as a bar whose height corresponds to how many data points are
in that bin.
Bins are also sometimes called "intervals", "classes", or "buckets".
Histogram
Age: 1, 3, 27, 32, 5, 63, 26, 25, 18, 16, 4, 45, 29, 19, 22, 51, 58, 9, 42, 6
Bin No
0-9 6
10-19 3
20-29 5
30-39 1
40-49 2
50-59 2
60-69 1
Histogram
1, 4, 2, 1, 0, 2, 1, 0, 1, 2, 1, 0, 0, 2, 2, 3, 1, 1, 3, 6
1
1
1 2
0 1 2
0 1 2
0 1 2 3
0 1 2 3 4 5 6
------------------------------------------------------------------
4 7 5 2 1 0 1
Shape of the distribution
Clusters, Gaps, Peaks and Outliers
Clusters, Gaps, Peaks and Outliers
Select all that apply
• The distribution has an outlier
• The distribution has a cluster
from 0 to 39 guests
• None of the above
Clusters, Gaps, Peaks and Outliers

More Related Content

Similar to Unit 3 univariate Analysis Exploratory Data Analysis

Kumpulan soal statistik dalam Bahasa Inggris
Kumpulan soal statistik dalam Bahasa InggrisKumpulan soal statistik dalam Bahasa Inggris
Kumpulan soal statistik dalam Bahasa Inggris
Retna Rindayani
 
MEASURES OF CENTRAL TENDENCY AND VARIABILITY
MEASURES OF CENTRAL TENDENCY AND VARIABILITYMEASURES OF CENTRAL TENDENCY AND VARIABILITY
MEASURES OF CENTRAL TENDENCY AND VARIABILITY
Mariele Brutas
 
The analysis of the data has been done using excel statistical sof.docx
The analysis of the data has been done using excel statistical sof.docxThe analysis of the data has been done using excel statistical sof.docx
The analysis of the data has been done using excel statistical sof.docx
mattinsonjanel
 
CUDA_ICLR_2023.pdf
CUDA_ICLR_2023.pdfCUDA_ICLR_2023.pdf
CUDA_ICLR_2023.pdf
sumyeong ahn
 
Teoria y problemas de graficos circulares GC316 ccesa007
Teoria y problemas de graficos circulares  GC316  ccesa007Teoria y problemas de graficos circulares  GC316  ccesa007
Teoria y problemas de graficos circulares GC316 ccesa007
Demetrio Ccesa Rayme
 
Teoría y problemas de gráficos circulares GC316 ccesa007
Teoría y problemas de gráficos circulares  GC316  ccesa007Teoría y problemas de gráficos circulares  GC316  ccesa007
Teoría y problemas de gráficos circulares GC316 ccesa007
Demetrio Ccesa Rayme
 

Similar to Unit 3 univariate Analysis Exploratory Data Analysis (20)

Qnt 351 final exam new 2016
Qnt 351 final exam new 2016Qnt 351 final exam new 2016
Qnt 351 final exam new 2016
 
Qnt 351 final exam new 2016
Qnt 351 final exam new 2016Qnt 351 final exam new 2016
Qnt 351 final exam new 2016
 
Qnt 351 final exam new 2016
Qnt 351 final exam new 2016Qnt 351 final exam new 2016
Qnt 351 final exam new 2016
 
Kumpulan soal statistik dalam Bahasa Inggris
Kumpulan soal statistik dalam Bahasa InggrisKumpulan soal statistik dalam Bahasa Inggris
Kumpulan soal statistik dalam Bahasa Inggris
 
How Leaders See Data? (Level 1)
How Leaders See Data? (Level 1)How Leaders See Data? (Level 1)
How Leaders See Data? (Level 1)
 
Introduction to Six Sigma (Basic)
Introduction to Six Sigma (Basic)Introduction to Six Sigma (Basic)
Introduction to Six Sigma (Basic)
 
Descriptive Statistics, Numerical Description
Descriptive Statistics, Numerical DescriptionDescriptive Statistics, Numerical Description
Descriptive Statistics, Numerical Description
 
QNT 351 PAPER Education for Service--qnt351paper.com
  QNT 351 PAPER Education for Service--qnt351paper.com  QNT 351 PAPER Education for Service--qnt351paper.com
QNT 351 PAPER Education for Service--qnt351paper.com
 
Chapter 3
Chapter 3Chapter 3
Chapter 3
 
Chapter 3
Chapter 3Chapter 3
Chapter 3
 
MEASURES OF CENTRAL TENDENCY AND VARIABILITY
MEASURES OF CENTRAL TENDENCY AND VARIABILITYMEASURES OF CENTRAL TENDENCY AND VARIABILITY
MEASURES OF CENTRAL TENDENCY AND VARIABILITY
 
QNT 351 PAPER Become Exceptional--qnt351paper.com
QNT 351 PAPER Become Exceptional--qnt351paper.comQNT 351 PAPER Become Exceptional--qnt351paper.com
QNT 351 PAPER Become Exceptional--qnt351paper.com
 
QNT 351 PAPER Achievement Education--qnt351paper.com
QNT 351 PAPER Achievement Education--qnt351paper.comQNT 351 PAPER Achievement Education--qnt351paper.com
QNT 351 PAPER Achievement Education--qnt351paper.com
 
STRAND 5 DATA HANDLING AND PROBABILITY.pptx
STRAND 5  DATA HANDLING AND PROBABILITY.pptxSTRAND 5  DATA HANDLING AND PROBABILITY.pptx
STRAND 5 DATA HANDLING AND PROBABILITY.pptx
 
The analysis of the data has been done using excel statistical sof.docx
The analysis of the data has been done using excel statistical sof.docxThe analysis of the data has been done using excel statistical sof.docx
The analysis of the data has been done using excel statistical sof.docx
 
data handeling.pptx
data handeling.pptxdata handeling.pptx
data handeling.pptx
 
Jwan kareem.biostatic exercise
Jwan kareem.biostatic exerciseJwan kareem.biostatic exercise
Jwan kareem.biostatic exercise
 
CUDA_ICLR_2023.pdf
CUDA_ICLR_2023.pdfCUDA_ICLR_2023.pdf
CUDA_ICLR_2023.pdf
 
Teoria y problemas de graficos circulares GC316 ccesa007
Teoria y problemas de graficos circulares  GC316  ccesa007Teoria y problemas de graficos circulares  GC316  ccesa007
Teoria y problemas de graficos circulares GC316 ccesa007
 
Teoría y problemas de gráficos circulares GC316 ccesa007
Teoría y problemas de gráficos circulares  GC316  ccesa007Teoría y problemas de gráficos circulares  GC316  ccesa007
Teoría y problemas de gráficos circulares GC316 ccesa007
 

Recently uploaded

Recently uploaded (20)

How to Manage Notification Preferences in the Odoo 17
How to Manage Notification Preferences in the Odoo 17How to Manage Notification Preferences in the Odoo 17
How to Manage Notification Preferences in the Odoo 17
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
 
B.ed spl. HI pdusu exam paper-2023-24.pdf
B.ed spl. HI pdusu exam paper-2023-24.pdfB.ed spl. HI pdusu exam paper-2023-24.pdf
B.ed spl. HI pdusu exam paper-2023-24.pdf
 
Morse OER Some Benefits and Challenges.pptx
Morse OER Some Benefits and Challenges.pptxMorse OER Some Benefits and Challenges.pptx
Morse OER Some Benefits and Challenges.pptx
 
INU_CAPSTONEDESIGN_비밀번호486_업로드용 발표자료.pdf
INU_CAPSTONEDESIGN_비밀번호486_업로드용 발표자료.pdfINU_CAPSTONEDESIGN_비밀번호486_업로드용 발표자료.pdf
INU_CAPSTONEDESIGN_비밀번호486_업로드용 발표자료.pdf
 
slides CapTechTalks Webinar May 2024 Alexander Perry.pptx
slides CapTechTalks Webinar May 2024 Alexander Perry.pptxslides CapTechTalks Webinar May 2024 Alexander Perry.pptx
slides CapTechTalks Webinar May 2024 Alexander Perry.pptx
 
Application of Matrices in real life. Presentation on application of matrices
Application of Matrices in real life. Presentation on application of matricesApplication of Matrices in real life. Presentation on application of matrices
Application of Matrices in real life. Presentation on application of matrices
 
An Overview of the Odoo 17 Discuss App.pptx
An Overview of the Odoo 17 Discuss App.pptxAn Overview of the Odoo 17 Discuss App.pptx
An Overview of the Odoo 17 Discuss App.pptx
 
[GDSC YCCE] Build with AI Online Presentation
[GDSC YCCE] Build with AI Online Presentation[GDSC YCCE] Build with AI Online Presentation
[GDSC YCCE] Build with AI Online Presentation
 
Advances in production technology of Grapes.pdf
Advances in production technology of Grapes.pdfAdvances in production technology of Grapes.pdf
Advances in production technology of Grapes.pdf
 
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
 
Sectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdfSectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdf
 
NCERT Solutions Power Sharing Class 10 Notes pdf
NCERT Solutions Power Sharing Class 10 Notes pdfNCERT Solutions Power Sharing Class 10 Notes pdf
NCERT Solutions Power Sharing Class 10 Notes pdf
 
Salient features of Environment protection Act 1986.pptx
Salient features of Environment protection Act 1986.pptxSalient features of Environment protection Act 1986.pptx
Salient features of Environment protection Act 1986.pptx
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 
size separation d pharm 1st year pharmaceutics
size separation d pharm 1st year pharmaceuticssize separation d pharm 1st year pharmaceutics
size separation d pharm 1st year pharmaceutics
 
How to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS ModuleHow to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS Module
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Pragya Champions Chalice 2024 Prelims & Finals Q/A set, General Quiz
Pragya Champions Chalice 2024 Prelims & Finals Q/A set, General QuizPragya Champions Chalice 2024 Prelims & Finals Q/A set, General Quiz
Pragya Champions Chalice 2024 Prelims & Finals Q/A set, General Quiz
 
Benefits and Challenges of Using Open Educational Resources
Benefits and Challenges of Using Open Educational ResourcesBenefits and Challenges of Using Open Educational Resources
Benefits and Challenges of Using Open Educational Resources
 

Unit 3 univariate Analysis Exploratory Data Analysis

  • 1.
  • 2. Identifying individuals, variables and categorical variables in a data set Millions of Americans rely on caffeine to get them up in the morning. Here’s nutritional data on some popular drinks at Ben’s Beans Coffee Shop: The individuals in the dataset are? The dataset contains how many variables? Drink Type Calories Sugars (g) Caffeine (mg) Brewed Coffee Hot 4 0 260 Café Mocha Hot 100 14 95 Café latte Hot 170 15 75 Cappuccino Hot 60 27 75 Iced Brewed Coffee Cold 60 8 120 Chai latte Hot 120 25 60
  • 3. The school counselor needs to meet with students who have more than three absences. Here is some of the data about the students. • The individuals in this data set are ?  Students  School Counselor  Homeroom Teachers • This data set contains ?  2 variables, 1 of which is quantitative  2 variables, 2 of which is quantitative  5 variables, 1 of which is quantitative  5 variables, 3 of which is quantitative
  • 4. Bar Graph What was the median score for the Final exam? What is the midrange of the midterm scores? What was the average student score for the final exam? What was the mode for the final exam score? What is the range of the midterm scores?
  • 5. Median score for the final exam : 75 80 100 100 100 Midrange of the midterm scores: 80 90 90 60 100 80 Max. Mark = 100 Min. Mark = 60 (100+60)/2 80 Average student score for the final exam: 91 (75+80+100+100+100)/5 Mode for the final exam score: 100 75 80 100 100 100 -------- Common Score ---100 Range of the midterm scores: 40 Max. Mark = 100 Min. Mark = 60 100-60 40
  • 6. For the past year, a travel agency has collected data about the number of individual tickets that it sells for signature product: a Mediterranean cruise. The monthly data on the ticket sales is shown below. What are the best and worst?
  • 7. Histogram A histogram displays numerical data by grouping data into "bins" of equal width. Each bin is plotted as a bar whose height corresponds to how many data points are in that bin. Bins are also sometimes called "intervals", "classes", or "buckets".
  • 8. Histogram Age: 1, 3, 27, 32, 5, 63, 26, 25, 18, 16, 4, 45, 29, 19, 22, 51, 58, 9, 42, 6 Bin No 0-9 6 10-19 3 20-29 5 30-39 1 40-49 2 50-59 2 60-69 1
  • 9. Histogram 1, 4, 2, 1, 0, 2, 1, 0, 1, 2, 1, 0, 0, 2, 2, 3, 1, 1, 3, 6 1 1 1 2 0 1 2 0 1 2 0 1 2 3 0 1 2 3 4 5 6 ------------------------------------------------------------------ 4 7 5 2 1 0 1
  • 10. Shape of the distribution
  • 11. Clusters, Gaps, Peaks and Outliers
  • 12. Clusters, Gaps, Peaks and Outliers
  • 13. Select all that apply • The distribution has an outlier • The distribution has a cluster from 0 to 39 guests • None of the above Clusters, Gaps, Peaks and Outliers