SlideShare a Scribd company logo
Data Visualization
April 3, 2015
• When you should graph
• What you should graph
• Given some data, how would you graph it
When should you graph your data?
2Data Visualization
Always
Don’t just make graphs for client reports -- graph your data for
yourself, so you understand it.
If you use a table in a report, see if you can make it into a graph.
Why graphs?
Because of the environment that humans evolved in, we are much
better at getting info from color, size, shape, and position than from
reading text.
3Data Visualization
Find the dangerous creatures!
Why graphs work
• Color
•Size
• Shape
• Position
4Data Visualization
Why else do people like graphs?
People like cool-looking stuff.
5Data Visualization
Not cool Cool
What are we currently doing?
• Making lots of tables
6Data Visualization
Group Mean 25% 50% 75%
Bananas 11.3 2.7 4.6 23.1
Kittens 4.0 0.9 3.6 7.5
Phones -3.1 -11.0 -2.9 2.2
Variable Parameter
Estimate
Cuteness 0.6***
Ability to Fly 1.4***
Deadliness 11.2***
Telepathy -9.8***
Big Ears -17.3***
What is wrong with tables?
Tables give only a partial picture – means only tell us so much.
Figuring out what’s bigger, and by how much, requires more work.
The information is not necessarily in any order, so we need to read
all the numbers.
7Data Visualization
What kinds of graphs should you make?
• The distribution, instead of
giving just mean, median, etc.
• The relationship between two
variables – the conditional
distribution
• Graph estimation results’ point
estimates and confidence
intervals
8Data Visualization
What to expect out of this presentation
1. Discussion of the type of graph (e.g. distributions)
2. How the type of graph applies to continuous vs. categorical data
3. Extensions (e.g. graphing more than one at a time)
What not to expect: how to do these in any particular software.
9Data Visualization
Distributions
10Data Visualization
Distributions – Continuous variables
Make density plots/histograms for continuous variables. These give
much more information than means, medians, etc.
Two distributions with the same mean, but which are dramatically different.
11Data Visualization
Density vs. histogram
A density plot is basically a smoothed histogram.
12Data Visualization
Distributions – Categorical variables
Make bar charts for categorical variables.
Tip: if your categories don’t have any inherent order, order them
from largest to smallest.
13Data Visualization
Compare distributions using color
Suppose we want to compare the distribution of income among
different occupations. Plot all the distributions, distinguished by
color, and use transparency to make them all visible simultaneously.
14Data Visualization
Highlighting important facts
Add vertical lines to highlight the means.
15Data Visualization
Relationships
16Data Visualization
Relationships between variables
If we’re asking, for example, what GDP growth looks like at different
levels of government spending, we can show this using a
scatterplot.
17Data Visualization
How to show trends
We can highlight the trend using scatterplot smoothing, which
adapts the shape of the trend line to the data.
18Data Visualization
How to show multiple groups
We can see if the relationship differs among groups by giving each
group a color.
19Data Visualization
Another use for colors
Suppose we want to come up with rules to identify people’s favorite
food based on population density and elevation (bear with me)
Can we see this on a graph?
20Data Visualization
Graphing relationships with categorical data
With categorical data, you typically can’t use scatterplots because
points fall right on top of each other (‘overplotting’).
However! We can use jittering to move the plotted points slightly.
21Data Visualization
Without jittering With jittering
Graphing relationships with categorical data
The next step beyond jittering is to use a boxplot, which shows
– The mean,
– 25th and 75th percentiles,
– 1.5 times the inter-quartile range (IQR)
– outliers (plotted as points)
22Data Visualization
mean
75th pctile
mean + 1.5 *IQR
outlier
Looping back
A boxplot isn’t, after all, all that different from the multi-colored
density plot we showed earlier. Which is better depends on what
you’re trying to show.
23Data Visualization
Use log scale if your data spans a wide range
Let’s say you have a large
range of values, but most of
your data is concentrated to
one part of the range.
It’s easier to see what’s
going when we use log
scale.
24Data Visualization
Estimation results
25Data Visualization
Graphing estimation results
We make a lot of regression tables, but we can make them easier to
understand by putting them into graphs.
26Data Visualization
ggplot(df, aes(population_density, elevation, color = favorite_food)) +
geom_point()
27Data Visualization
dataset x variable y variable
make scatterplot
color variable
All graphs made in R and ggplot2
Data Visualization Checklist
• Always graph
• Use color, size, shape, and position
• Three important types of graph:
– Distribution
– Relationship
– Estimation results
• Highlight important facts
• Make it cool-looking
28Data Visualization

More Related Content

What's hot

Top 8 Different Types Of Charts In Statistics And Their Uses
Top 8 Different Types Of Charts In Statistics And Their UsesTop 8 Different Types Of Charts In Statistics And Their Uses
Top 8 Different Types Of Charts In Statistics And Their Uses
Stat Analytica
 
Machine Learning Algorithm - KNN
Machine Learning Algorithm - KNNMachine Learning Algorithm - KNN
Machine Learning Algorithm - KNN
Kush Kulshrestha
 
Data Handling
Data HandlingData Handling
Data Handling
Hemant Karakoti
 
Statistics
StatisticsStatistics
Statisticsdiereck
 
Effective Business Presentations with Storyboarding and Data Visualization
Effective Business Presentations with Storyboarding and Data VisualizationEffective Business Presentations with Storyboarding and Data Visualization
Effective Business Presentations with Storyboarding and Data Visualization
Carmen Proctor
 
Communicating Effectively with Data Visualization
Communicating Effectively with Data VisualizationCommunicating Effectively with Data Visualization
Communicating Effectively with Data Visualization
Eamonn Maguire
 
Different types of charts
Different types of chartsDifferent types of charts
Different types of charts
Zakaria Salim
 
Advanced excel
Advanced excelAdvanced excel
Advanced excel
PremananthamM
 
Types of Charts
Types of ChartsTypes of Charts
Types of Charts
Loyd Calangan
 
Interpret data for use in charts and graphs
Interpret data for use in charts and graphsInterpret data for use in charts and graphs
Interpret data for use in charts and graphsCharles Flynt
 
Charts And Graphs
Charts And GraphsCharts And Graphs
Charts And Graphs
Himadri Shekhar
 
Making a Pie Chart
Making a Pie ChartMaking a Pie Chart
Making a Pie Chart
Daksha Bhat
 
Data visualization & Story Telling with Data
Data visualization & Story Telling with DataData visualization & Story Telling with Data
Data visualization & Story Telling with Data
Dr Nisha Arora
 
TID Chapter 5 Introduction To Charts And Graph
TID Chapter 5 Introduction To Charts And GraphTID Chapter 5 Introduction To Charts And Graph
TID Chapter 5 Introduction To Charts And GraphWanBK Leo
 
Basics of Educational Statistics (Graphs & its Types)
Basics of Educational Statistics (Graphs & its Types)Basics of Educational Statistics (Graphs & its Types)
Basics of Educational Statistics (Graphs & its Types)
HennaAnsari
 
Summary data visualization
Summary data visualizationSummary data visualization
Summary data visualization
Novita Sari
 
Types Of Charts
Types Of ChartsTypes Of Charts
Types Of Chartswmassie
 

What's hot (17)

Top 8 Different Types Of Charts In Statistics And Their Uses
Top 8 Different Types Of Charts In Statistics And Their UsesTop 8 Different Types Of Charts In Statistics And Their Uses
Top 8 Different Types Of Charts In Statistics And Their Uses
 
Machine Learning Algorithm - KNN
Machine Learning Algorithm - KNNMachine Learning Algorithm - KNN
Machine Learning Algorithm - KNN
 
Data Handling
Data HandlingData Handling
Data Handling
 
Statistics
StatisticsStatistics
Statistics
 
Effective Business Presentations with Storyboarding and Data Visualization
Effective Business Presentations with Storyboarding and Data VisualizationEffective Business Presentations with Storyboarding and Data Visualization
Effective Business Presentations with Storyboarding and Data Visualization
 
Communicating Effectively with Data Visualization
Communicating Effectively with Data VisualizationCommunicating Effectively with Data Visualization
Communicating Effectively with Data Visualization
 
Different types of charts
Different types of chartsDifferent types of charts
Different types of charts
 
Advanced excel
Advanced excelAdvanced excel
Advanced excel
 
Types of Charts
Types of ChartsTypes of Charts
Types of Charts
 
Interpret data for use in charts and graphs
Interpret data for use in charts and graphsInterpret data for use in charts and graphs
Interpret data for use in charts and graphs
 
Charts And Graphs
Charts And GraphsCharts And Graphs
Charts And Graphs
 
Making a Pie Chart
Making a Pie ChartMaking a Pie Chart
Making a Pie Chart
 
Data visualization & Story Telling with Data
Data visualization & Story Telling with DataData visualization & Story Telling with Data
Data visualization & Story Telling with Data
 
TID Chapter 5 Introduction To Charts And Graph
TID Chapter 5 Introduction To Charts And GraphTID Chapter 5 Introduction To Charts And Graph
TID Chapter 5 Introduction To Charts And Graph
 
Basics of Educational Statistics (Graphs & its Types)
Basics of Educational Statistics (Graphs & its Types)Basics of Educational Statistics (Graphs & its Types)
Basics of Educational Statistics (Graphs & its Types)
 
Summary data visualization
Summary data visualizationSummary data visualization
Summary data visualization
 
Types Of Charts
Types Of ChartsTypes Of Charts
Types Of Charts
 

Similar to Data Visualization by David Kretch

QQ Plot.pptx
QQ Plot.pptxQQ Plot.pptx
QQ Plot.pptx
Rahul Borate
 
DATA VISUALIZATION
DATA VISUALIZATIONDATA VISUALIZATION
DATA VISUALIZATION
Aabhika Samantaray
 
Visual Analytics in Big Data
Visual Analytics in Big DataVisual Analytics in Big Data
Visual Analytics in Big Data
Saurabh Shanbhag
 
Data Visualization Techniques
Data Visualization TechniquesData Visualization Techniques
Data Visualization TechniquesAllAnalytics
 
Datascape Introduction
Datascape IntroductionDatascape Introduction
Datascape Introduction
Daden Limited
 
Tableau Final Presentation
Tableau Final PresentationTableau Final Presentation
Tableau Final Presentation
Anvesh Rao
 
Tableau Presentation
Tableau PresentationTableau Presentation
Tableau Presentation
Andrea Bissoli
 
BigData Visualization and Usecase@TDGA-Stelligence-11july2019-share
BigData Visualization and Usecase@TDGA-Stelligence-11july2019-shareBigData Visualization and Usecase@TDGA-Stelligence-11july2019-share
BigData Visualization and Usecase@TDGA-Stelligence-11july2019-share
stelligence
 
Art and Science of Dashboard Design
Art and Science of Dashboard DesignArt and Science of Dashboard Design
Art and Science of Dashboard Design
SavvyData
 
Design for Delight
Design for DelightDesign for Delight
Design for Delight
Amanda Makulec
 
Data visualization
Data visualizationData visualization
Data visualization
Moushmi Dasgupta
 
Analyzing and Visualizing Data Chapter 6Data Represent.docx
Analyzing and Visualizing Data Chapter 6Data Represent.docxAnalyzing and Visualizing Data Chapter 6Data Represent.docx
Analyzing and Visualizing Data Chapter 6Data Represent.docx
durantheseldine
 
The Data Stroytelling Handbook
The Data Stroytelling HandbookThe Data Stroytelling Handbook
The Data Stroytelling Handbook
ssuserd075da
 
Unit III.pptx
Unit III.pptxUnit III.pptx
Unit III.pptx
KennyPratheepKumar
 
Diowane2003
Diowane2003Diowane2003
Diowane2003SFYC
 
A Tour through the Data Vizualization Zoo - Communications of the ACM
A Tour through the Data Vizualization Zoo - Communications of the ACMA Tour through the Data Vizualization Zoo - Communications of the ACM
A Tour through the Data Vizualization Zoo - Communications of the ACM
Reynolds Center for Business Journalism
 
How to visualize web analytics data and choose a graph?
How to visualize web analytics data and choose a graph?How to visualize web analytics data and choose a graph?
How to visualize web analytics data and choose a graph?
Janvi Arora
 
Design PatternsChristian Behrenshttpswww.behance.netgall.docx
Design PatternsChristian Behrenshttpswww.behance.netgall.docxDesign PatternsChristian Behrenshttpswww.behance.netgall.docx
Design PatternsChristian Behrenshttpswww.behance.netgall.docx
carolinef5
 
From data to diagrams: an introduction to basic graphs and charts
From data to diagrams: an introduction to basic graphs and chartsFrom data to diagrams: an introduction to basic graphs and charts
From data to diagrams: an introduction to basic graphs and charts
School of Data
 

Similar to Data Visualization by David Kretch (20)

QQ Plot.pptx
QQ Plot.pptxQQ Plot.pptx
QQ Plot.pptx
 
DATA VISUALIZATION
DATA VISUALIZATIONDATA VISUALIZATION
DATA VISUALIZATION
 
Visual Analytics in Big Data
Visual Analytics in Big DataVisual Analytics in Big Data
Visual Analytics in Big Data
 
Data Visualization Techniques
Data Visualization TechniquesData Visualization Techniques
Data Visualization Techniques
 
Datascape Introduction
Datascape IntroductionDatascape Introduction
Datascape Introduction
 
Tableau Final Presentation
Tableau Final PresentationTableau Final Presentation
Tableau Final Presentation
 
Tableau Presentation
Tableau PresentationTableau Presentation
Tableau Presentation
 
BigData Visualization and Usecase@TDGA-Stelligence-11july2019-share
BigData Visualization and Usecase@TDGA-Stelligence-11july2019-shareBigData Visualization and Usecase@TDGA-Stelligence-11july2019-share
BigData Visualization and Usecase@TDGA-Stelligence-11july2019-share
 
Art and Science of Dashboard Design
Art and Science of Dashboard DesignArt and Science of Dashboard Design
Art and Science of Dashboard Design
 
Design for Delight
Design for DelightDesign for Delight
Design for Delight
 
Data visualization
Data visualizationData visualization
Data visualization
 
Analyzing and Visualizing Data Chapter 6Data Represent.docx
Analyzing and Visualizing Data Chapter 6Data Represent.docxAnalyzing and Visualizing Data Chapter 6Data Represent.docx
Analyzing and Visualizing Data Chapter 6Data Represent.docx
 
The Data Stroytelling Handbook
The Data Stroytelling HandbookThe Data Stroytelling Handbook
The Data Stroytelling Handbook
 
Unit III.pptx
Unit III.pptxUnit III.pptx
Unit III.pptx
 
Diowane2003
Diowane2003Diowane2003
Diowane2003
 
A Tour through the Data Vizualization Zoo - Communications of the ACM
A Tour through the Data Vizualization Zoo - Communications of the ACMA Tour through the Data Vizualization Zoo - Communications of the ACM
A Tour through the Data Vizualization Zoo - Communications of the ACM
 
Data Visualization Techniques
Data Visualization TechniquesData Visualization Techniques
Data Visualization Techniques
 
How to visualize web analytics data and choose a graph?
How to visualize web analytics data and choose a graph?How to visualize web analytics data and choose a graph?
How to visualize web analytics data and choose a graph?
 
Design PatternsChristian Behrenshttpswww.behance.netgall.docx
Design PatternsChristian Behrenshttpswww.behance.netgall.docxDesign PatternsChristian Behrenshttpswww.behance.netgall.docx
Design PatternsChristian Behrenshttpswww.behance.netgall.docx
 
From data to diagrams: an introduction to basic graphs and charts
From data to diagrams: an introduction to basic graphs and chartsFrom data to diagrams: an introduction to basic graphs and charts
From data to diagrams: an introduction to basic graphs and charts
 

Recently uploaded

【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
ukgaet
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
correoyaya
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
Tiktokethiodaily
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
John Andrews
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
James Polillo
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
theahmadsaood
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 

Recently uploaded (20)

【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 

Data Visualization by David Kretch

  • 1. Data Visualization April 3, 2015 • When you should graph • What you should graph • Given some data, how would you graph it
  • 2. When should you graph your data? 2Data Visualization Always Don’t just make graphs for client reports -- graph your data for yourself, so you understand it. If you use a table in a report, see if you can make it into a graph.
  • 3. Why graphs? Because of the environment that humans evolved in, we are much better at getting info from color, size, shape, and position than from reading text. 3Data Visualization Find the dangerous creatures!
  • 4. Why graphs work • Color •Size • Shape • Position 4Data Visualization
  • 5. Why else do people like graphs? People like cool-looking stuff. 5Data Visualization Not cool Cool
  • 6. What are we currently doing? • Making lots of tables 6Data Visualization Group Mean 25% 50% 75% Bananas 11.3 2.7 4.6 23.1 Kittens 4.0 0.9 3.6 7.5 Phones -3.1 -11.0 -2.9 2.2 Variable Parameter Estimate Cuteness 0.6*** Ability to Fly 1.4*** Deadliness 11.2*** Telepathy -9.8*** Big Ears -17.3***
  • 7. What is wrong with tables? Tables give only a partial picture – means only tell us so much. Figuring out what’s bigger, and by how much, requires more work. The information is not necessarily in any order, so we need to read all the numbers. 7Data Visualization
  • 8. What kinds of graphs should you make? • The distribution, instead of giving just mean, median, etc. • The relationship between two variables – the conditional distribution • Graph estimation results’ point estimates and confidence intervals 8Data Visualization
  • 9. What to expect out of this presentation 1. Discussion of the type of graph (e.g. distributions) 2. How the type of graph applies to continuous vs. categorical data 3. Extensions (e.g. graphing more than one at a time) What not to expect: how to do these in any particular software. 9Data Visualization
  • 11. Distributions – Continuous variables Make density plots/histograms for continuous variables. These give much more information than means, medians, etc. Two distributions with the same mean, but which are dramatically different. 11Data Visualization
  • 12. Density vs. histogram A density plot is basically a smoothed histogram. 12Data Visualization
  • 13. Distributions – Categorical variables Make bar charts for categorical variables. Tip: if your categories don’t have any inherent order, order them from largest to smallest. 13Data Visualization
  • 14. Compare distributions using color Suppose we want to compare the distribution of income among different occupations. Plot all the distributions, distinguished by color, and use transparency to make them all visible simultaneously. 14Data Visualization
  • 15. Highlighting important facts Add vertical lines to highlight the means. 15Data Visualization
  • 17. Relationships between variables If we’re asking, for example, what GDP growth looks like at different levels of government spending, we can show this using a scatterplot. 17Data Visualization
  • 18. How to show trends We can highlight the trend using scatterplot smoothing, which adapts the shape of the trend line to the data. 18Data Visualization
  • 19. How to show multiple groups We can see if the relationship differs among groups by giving each group a color. 19Data Visualization
  • 20. Another use for colors Suppose we want to come up with rules to identify people’s favorite food based on population density and elevation (bear with me) Can we see this on a graph? 20Data Visualization
  • 21. Graphing relationships with categorical data With categorical data, you typically can’t use scatterplots because points fall right on top of each other (‘overplotting’). However! We can use jittering to move the plotted points slightly. 21Data Visualization Without jittering With jittering
  • 22. Graphing relationships with categorical data The next step beyond jittering is to use a boxplot, which shows – The mean, – 25th and 75th percentiles, – 1.5 times the inter-quartile range (IQR) – outliers (plotted as points) 22Data Visualization mean 75th pctile mean + 1.5 *IQR outlier
  • 23. Looping back A boxplot isn’t, after all, all that different from the multi-colored density plot we showed earlier. Which is better depends on what you’re trying to show. 23Data Visualization
  • 24. Use log scale if your data spans a wide range Let’s say you have a large range of values, but most of your data is concentrated to one part of the range. It’s easier to see what’s going when we use log scale. 24Data Visualization
  • 26. Graphing estimation results We make a lot of regression tables, but we can make them easier to understand by putting them into graphs. 26Data Visualization
  • 27. ggplot(df, aes(population_density, elevation, color = favorite_food)) + geom_point() 27Data Visualization dataset x variable y variable make scatterplot color variable All graphs made in R and ggplot2
  • 28. Data Visualization Checklist • Always graph • Use color, size, shape, and position • Three important types of graph: – Distribution – Relationship – Estimation results • Highlight important facts • Make it cool-looking 28Data Visualization