SlideShare a Scribd company logo
1 of 72
Download to read offline
Telling Stories With Data
Class #1
March 20th, 2017
David Newbury — @workergnome 1
What We're Doing Today:
— Syllabus Review
— (Brief) History of Data Visualization
— (Tiny) Theory of Visualization
— (Nerdy) Overview of Concepts
— (Fake) Data Exploration
David Newbury — @workergnome 2
Course Website:
datastories.davidnewbury.com
David Newbury — @workergnome 3
Which is biggest?
15012, 8271, 30193, 1189, 9913, 16000, 92481, 49801,
100407, 2910, 3809, 8018, 61528, 18083, 38691, 1800
David Newbury — @workergnome 4
Which is biggest?
David Newbury — @workergnome 5
Which is biggest?
David Newbury — @workergnome 6
Why do we
visualize?
David Newbury — @workergnome 7
(Brief)
History of
Data Visualization
David Newbury — @workergnome 8
Tabula Peutingeriana, 5th century CE
David Newbury — @workergnome 9
David Newbury — @workergnome 10
Rene Descartes, 1600s
David Newbury — @workergnome 11
Joseph Priestly, New Chart of History (1769)
David Newbury — @workergnome 12
William Playfair, (1786 & 1801)
David Newbury — @workergnome 13
David Newbury — @workergnome 14
John Snow, London Cholera Map (1854)
David Newbury — @workergnome 15
Cholera Map
David Newbury — @workergnome 16
Florence Nightingale, War Deaths (1855)
David Newbury — @workergnome 17
Charles Minard, March on Moscow (1862)
David Newbury — @workergnome 18
More recent history.
David Newbury — @workergnome 19
David Newbury — @workergnome 20
New York Times
David Newbury — @workergnome 21
(tiny)
Theory
of Visualization
David Newbury — @workergnome 22
Dataviz is constructed reality.
You are telling a story, not (just) stating facts.
David Newbury — @workergnome 23
data art
as opposed to
data visualization
as opposed to
statistical graphics
David Newbury — @workergnome 24
Statistical
Graphics
How do I create Statistical Graphs
in SAS 9.1.3 without Proc Gplot.
UCLA: Statistical Consulting
Group.
http://www.ats.ucla.edu/stat/sas/
notes2/
David Newbury — @workergnome 25
Data Art
Dear Data
Giorgia Lupi & Stefanie Posavec.
http://www.dear-data.com
David Newbury — @workergnome 26
Two Uses1). help people grasp things outside their reach
David Newbury — @workergnome 27
Two Uses1). help people grasp things outside their reach
2.) tell stories
David Newbury — @workergnome 28
explanatory visualization work
as opposed to
exploratory visualizations
David Newbury — @workergnome 29
Dataviz is constructed reality.
Do you care how true your story is?
Do you care how accurate your story is?
Are you trying to teach, entertain, or convince?
David Newbury — @workergnome 30
(Nerdy)
Overview of Concepts
David Newbury — @workergnome 31
What can you visualise?
David Newbury — @workergnome 32
Potential Subjects.
subways, sheep, the solar system,
shoes, sleep, skyline,
snow, supermarket, sausages,
school,the sea, spiders,
staircases, syrup, soap,
sawmills, stereos...
David Newbury — @workergnome 33
Potential Subjects.
subways, sheep, the solar system,
shoes, sleep, skyline,
snow, supermarket, sausages,
school,the sea, spiders,
staircases, syrup, soap,
sawmills, stereos...
...and other things that begin with S.
David Newbury — @workergnome 34
What are you interested in?
I'm interested in subways.
David Newbury — @workergnome 35
Data Visualization starts with...
A Question.
David Newbury — @workergnome 36
What question about your subject are you interested in?
— Are subways more efficient than owning a car?
— How often do I ride the subway in a year?
— What's locations have the best access to subways?
— What's the average subway commute in Pittsburgh?
David Newbury — @workergnome 37
Dimension and Scope
are about choosing
what to focus on.
David Newbury — @workergnome 38
Dimension
Which bits of information about a subject
are you going to focus on?
David Newbury — @workergnome 39
Possible Dimensions
number of cars
duration of ride
date of a ride
different lines
number of stops
cost per ride
number of stops per day
time between stops
cleanliness
David Newbury — @workergnome 40
Scope
Out of the infinite ways to look at your subject,
how are you going to choose one?
David Newbury — @workergnome 41
Possible Scopes
All trains in a day
All the rides that I've been on this year
My train this morning
All of the stops in the city
Each line
Every train stop in the past 50 years
David Newbury — @workergnome 42
(Fake)
Data Exploration
David Newbury — @workergnome 43
Choose one.
subways, sheep, the solar system,
shoes, sleep, skyline,
snow, supermarket, sausages,
school,the sea, spiders,
staircases, syrup, soap,
sawmills, stereos...
...and other things that begin with S.
David Newbury — @workergnome 44
TRY IT.1. Write down your subject
2. Write down your question
3. Write down as many dimensions as you can
4. Write down possible scopes for your data
David Newbury — @workergnome 45
What does your
data look like?
David Newbury — @workergnome 46
Types of Data
Dates
Numbers
Geo Coordinate
Strings
Categories
David Newbury — @workergnome 47
Types of Data
number of cars - Numeric
duration of ride - Numeric
date of a ride - Date
different lines - Category
number of stops - Numeric
cost per ride - Category
number of stops per day - Numeric
time between stops - Numeric
cleanliness - String
David Newbury — @workergnome 48
Two (related ides):
Categories & measures
David Newbury — @workergnome 49
Categories are Discrete Things
Measures are for Counting
David Newbury — @workergnome 50
number of cars - Measure
duration of ride - Measure
date of a ride - Measure
different lines - Categories
number of stops - Measure
cost per ride - Categories
number of stops per day - Measure
time between stops - Measure
cleanliness - Categories
David Newbury — @workergnome 51
A hidden dimension:
David, Daniel, Dawn, Danique
David Newbury — @workergnome 52
A hidden dimension:
David (1), Daniel (2), Dawn (3), Danique (4)
Position of the item in the group.
David Newbury — @workergnome 53
TRY IT.1. Choose a scope for your data.
2. Identify which dimensions are relevant.
3. Is the dimension is a category or a measure?
David Newbury — @workergnome 54
Now
What?David Newbury — @workergnome 55
We need to map our data
from a domain
to a range.
David Newbury — @workergnome 56
Domain
number of cars - 1...8
duration of ride - 30 sec...2 hours
date of a ride - - 24ft...200ft
different lines - Red line, Blue line, Green line, Silver
Line, Yellow Line
number of stops - **2..20
cost per ride - "$2.50, $1.75, $3.00, $0.00"
number of stops per day - ??...???
time between stops - 30 sec..5 minutes
David Newbury — @workergnome 57
Range
Domain is the possible input values
Range is the possible output values
David Newbury — @workergnome 58
Data
3, 7, 10, 6, 2
Position of the item in the group.
Domain
[0-10]
[1-5]
Range
X: 400px
Y: 800px
Mapping
X: item position
Y: numeric value
David Newbury — @workergnome 59
Data
3, 7, 10, 6, 2
Position of the item in the group.
Area
David Newbury — @workergnome 60
Data
3, 7, 10, 6, 2
Position of the item in the group.
Color
David Newbury — @workergnome 61
Data
val1: 3, 7, 10, 6, 2
val2: 5, 8, 1, 8, 3
val3: Cat, Dog, Cat, Cat, Dog
Position of the item in the group.
Mapping
X: item position
Y: val1
Size: val2
Color: val3
David Newbury — @workergnome 62
Dimensions beyond X and Y.
Color
Size
Shape
Labels
Patterns
Icons
Anything Else You Can Imagine
David Newbury — @workergnome 63
TRY IT.1. Identify your domains
2. For each domain, choose a range
3. Draw it!
David Newbury — @workergnome 64
Finishing
Touches
David Newbury — @workergnome 65
Measures get Axis
Categories get Headers
David Newbury — @workergnome 66
Labels
David Newbury — @workergnome 67
Axis
Category Axis
Number Axis
Date Axis
Log axis
David Newbury — @workergnome 68
Legends
David Newbury — @workergnome 69
TRY IT.1. Add a title to your chart
2. Label your axis
3. Add legends and labels as needed
David Newbury — @workergnome 70
Review
Dimensions
Scope
Domain
Range
Categories
Measures
David Newbury — @workergnome 71
Thank You.
David Newbury — @workergnome 72

More Related Content

Similar to Telling Stories With Data: Class 1

Computational Social Science, Lecture 07: Counting Fast, Part I
Computational Social Science, Lecture 07: Counting Fast, Part IComputational Social Science, Lecture 07: Counting Fast, Part I
Computational Social Science, Lecture 07: Counting Fast, Part I
jakehofman
 
From the Big Bang to Ecommerce, a journey in making sense of Big Data
From the Big Bang to Ecommerce, a journey in making sense of Big DataFrom the Big Bang to Ecommerce, a journey in making sense of Big Data
From the Big Bang to Ecommerce, a journey in making sense of Big Data
Patrick Deglon
 

Similar to Telling Stories With Data: Class 1 (14)

NDSR Learning Enrichment: Data Models and Linked Data
NDSR Learning Enrichment: Data Models and Linked DataNDSR Learning Enrichment: Data Models and Linked Data
NDSR Learning Enrichment: Data Models and Linked Data
 
Sharing Data Across Memory Institutions
Sharing Data Across Memory InstitutionsSharing Data Across Memory Institutions
Sharing Data Across Memory Institutions
 
Big data
Big dataBig data
Big data
 
How to collect and organize data (v. ITA 2021)
How to collect and organize data (v. ITA 2021)How to collect and organize data (v. ITA 2021)
How to collect and organize data (v. ITA 2021)
 
Computational Social Science, Lecture 07: Counting Fast, Part I
Computational Social Science, Lecture 07: Counting Fast, Part IComputational Social Science, Lecture 07: Counting Fast, Part I
Computational Social Science, Lecture 07: Counting Fast, Part I
 
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slides
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slidesMining the Social Web - Lecture 1 - T61.6020 lecture-01-slides
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slides
 
Handling Uncertainty in Geo-Spatial Data.
Handling Uncertainty in Geo-Spatial Data.Handling Uncertainty in Geo-Spatial Data.
Handling Uncertainty in Geo-Spatial Data.
 
Public Data and Data Mining Competitions - What are Lessons?
Public Data and Data Mining Competitions - What are Lessons?Public Data and Data Mining Competitions - What are Lessons?
Public Data and Data Mining Competitions - What are Lessons?
 
Skillwise Big data
Skillwise Big dataSkillwise Big data
Skillwise Big data
 
[TRECVID 2018] Instance Search
[TRECVID 2018] Instance Search[TRECVID 2018] Instance Search
[TRECVID 2018] Instance Search
 
From the Big Bang to Ecommerce, a journey in making sense of Big Data
From the Big Bang to Ecommerce, a journey in making sense of Big DataFrom the Big Bang to Ecommerce, a journey in making sense of Big Data
From the Big Bang to Ecommerce, a journey in making sense of Big Data
 
Denis Reznik Data driven future
Denis Reznik Data driven futureDenis Reznik Data driven future
Denis Reznik Data driven future
 
SE2016 BigData Denis Reznik "Data driven future"
SE2016 BigData Denis Reznik "Data driven future"SE2016 BigData Denis Reznik "Data driven future"
SE2016 BigData Denis Reznik "Data driven future"
 
Big Data and official statistics with examples of their use
Big Data and official statistics with examples of their useBig Data and official statistics with examples of their use
Big Data and official statistics with examples of their use
 

More from David Newbury

IIIF For Small Projects
IIIF  For Small ProjectsIIIF  For Small Projects
IIIF For Small Projects
David Newbury
 
Authority Cascades: A presentation strategy for Linked Open Data
Authority Cascades: A presentation strategy for Linked Open DataAuthority Cascades: A presentation strategy for Linked Open Data
Authority Cascades: A presentation strategy for Linked Open Data
David Newbury
 

More from David Newbury (20)

Linked Data in Production: Moving Beyond Ontologies
Linked Data in Production: Moving Beyond OntologiesLinked Data in Production: Moving Beyond Ontologies
Linked Data in Production: Moving Beyond Ontologies
 
The LOD Gateway: Open Source Infrastructure for Linked Data
The LOD Gateway: Open Source Infrastructure for Linked DataThe LOD Gateway: Open Source Infrastructure for Linked Data
The LOD Gateway: Open Source Infrastructure for Linked Data
 
Linked Data on a Budget
Linked Data on a BudgetLinked Data on a Budget
Linked Data on a Budget
 
USE ME: progressive integration of IIIF with new software services at the Getty
USE ME: progressive integration of IIIF with new software services at the GettyUSE ME: progressive integration of IIIF with new software services at the Getty
USE ME: progressive integration of IIIF with new software services at the Getty
 
IIIF Across Platforms | IIIF Community Call, January 2021
IIIF Across Platforms | IIIF Community Call, January 2021IIIF Across Platforms | IIIF Community Call, January 2021
IIIF Across Platforms | IIIF Community Call, January 2021
 
IIIF Canvases as First Class Citizens
IIIF Canvases as First Class CitizensIIIF Canvases as First Class Citizens
IIIF Canvases as First Class Citizens
 
IIIF and Linked Open Data: LODLAM 2020
IIIF and Linked Open Data: LODLAM 2020IIIF and Linked Open Data: LODLAM 2020
IIIF and Linked Open Data: LODLAM 2020
 
How to Fail Interdisciplinarily
How to Fail InterdisciplinarilyHow to Fail Interdisciplinarily
How to Fail Interdisciplinarily
 
Extending IIIF 3.0
Extending IIIF 3.0Extending IIIF 3.0
Extending IIIF 3.0
 
Fuzzy Dates & the Digital Humanities
Fuzzy Dates & the Digital HumanitiesFuzzy Dates & the Digital Humanities
Fuzzy Dates & the Digital Humanities
 
21st Century Provenance: Lessons Learned Building Art Tracks
21st Century Provenance:  Lessons Learned Building Art Tracks21st Century Provenance:  Lessons Learned Building Art Tracks
21st Century Provenance: Lessons Learned Building Art Tracks
 
Art Tracks: From Provenance to Structured Data
Art Tracks: From Provenance to Structured DataArt Tracks: From Provenance to Structured Data
Art Tracks: From Provenance to Structured Data
 
Linked Data: Worse is Better
Linked Data:  Worse is BetterLinked Data:  Worse is Better
Linked Data: Worse is Better
 
Understanding D3
Understanding D3Understanding D3
Understanding D3
 
Art Tracks: A technical deep dive.
Art Tracks:  A technical deep dive.Art Tracks:  A technical deep dive.
Art Tracks: A technical deep dive.
 
Using Linked Data: American Art Collaborative, Oct. 3, 2016
Using Linked Data:  American Art Collaborative, Oct. 3, 2016Using Linked Data:  American Art Collaborative, Oct. 3, 2016
Using Linked Data: American Art Collaborative, Oct. 3, 2016
 
IIIF For Small Projects
IIIF  For Small ProjectsIIIF  For Small Projects
IIIF For Small Projects
 
Authority Cascades: A presentation strategy for Linked Open Data
Authority Cascades: A presentation strategy for Linked Open DataAuthority Cascades: A presentation strategy for Linked Open Data
Authority Cascades: A presentation strategy for Linked Open Data
 
Art Tracks: Presentation to the imProvenance Group
Art Tracks: Presentation to the imProvenance GroupArt Tracks: Presentation to the imProvenance Group
Art Tracks: Presentation to the imProvenance Group
 
How To Internet: The Magic Words
How To Internet:  The Magic WordsHow To Internet:  The Magic Words
How To Internet: The Magic Words
 

Recently uploaded

Personalisation of Education by AI and Big Data - Lourdes Guàrdia
Personalisation of Education by AI and Big Data - Lourdes GuàrdiaPersonalisation of Education by AI and Big Data - Lourdes Guàrdia
Personalisation of Education by AI and Big Data - Lourdes Guàrdia
EADTU
 
MuleSoft Integration with AWS Textract | Calling AWS Textract API |AWS - Clou...
MuleSoft Integration with AWS Textract | Calling AWS Textract API |AWS - Clou...MuleSoft Integration with AWS Textract | Calling AWS Textract API |AWS - Clou...
MuleSoft Integration with AWS Textract | Calling AWS Textract API |AWS - Clou...
MysoreMuleSoftMeetup
 
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
中 央社
 
SURVEY I created for uni project research
SURVEY I created for uni project researchSURVEY I created for uni project research
SURVEY I created for uni project research
CaitlinCummins3
 

Recently uploaded (20)

Personalisation of Education by AI and Big Data - Lourdes Guàrdia
Personalisation of Education by AI and Big Data - Lourdes GuàrdiaPersonalisation of Education by AI and Big Data - Lourdes Guàrdia
Personalisation of Education by AI and Big Data - Lourdes Guàrdia
 
Sternal Fractures & Dislocations - EMGuidewire Radiology Reading Room
Sternal Fractures & Dislocations - EMGuidewire Radiology Reading RoomSternal Fractures & Dislocations - EMGuidewire Radiology Reading Room
Sternal Fractures & Dislocations - EMGuidewire Radiology Reading Room
 
Graduate Outcomes Presentation Slides - English (v3).pptx
Graduate Outcomes Presentation Slides - English (v3).pptxGraduate Outcomes Presentation Slides - English (v3).pptx
Graduate Outcomes Presentation Slides - English (v3).pptx
 
OS-operating systems- ch05 (CPU Scheduling) ...
OS-operating systems- ch05 (CPU Scheduling) ...OS-operating systems- ch05 (CPU Scheduling) ...
OS-operating systems- ch05 (CPU Scheduling) ...
 
Basic Civil Engineering notes on Transportation Engineering & Modes of Transport
Basic Civil Engineering notes on Transportation Engineering & Modes of TransportBasic Civil Engineering notes on Transportation Engineering & Modes of Transport
Basic Civil Engineering notes on Transportation Engineering & Modes of Transport
 
UChicago CMSC 23320 - The Best Commit Messages of 2024
UChicago CMSC 23320 - The Best Commit Messages of 2024UChicago CMSC 23320 - The Best Commit Messages of 2024
UChicago CMSC 23320 - The Best Commit Messages of 2024
 
OSCM Unit 2_Operations Processes & Systems
OSCM Unit 2_Operations Processes & SystemsOSCM Unit 2_Operations Processes & Systems
OSCM Unit 2_Operations Processes & Systems
 
When Quality Assurance Meets Innovation in Higher Education - Report launch w...
When Quality Assurance Meets Innovation in Higher Education - Report launch w...When Quality Assurance Meets Innovation in Higher Education - Report launch w...
When Quality Assurance Meets Innovation in Higher Education - Report launch w...
 
Andreas Schleicher presents at the launch of What does child empowerment mean...
Andreas Schleicher presents at the launch of What does child empowerment mean...Andreas Schleicher presents at the launch of What does child empowerment mean...
Andreas Schleicher presents at the launch of What does child empowerment mean...
 
Including Mental Health Support in Project Delivery, 14 May.pdf
Including Mental Health Support in Project Delivery, 14 May.pdfIncluding Mental Health Support in Project Delivery, 14 May.pdf
Including Mental Health Support in Project Delivery, 14 May.pdf
 
AIM of Education-Teachers Training-2024.ppt
AIM of Education-Teachers Training-2024.pptAIM of Education-Teachers Training-2024.ppt
AIM of Education-Teachers Training-2024.ppt
 
ESSENTIAL of (CS/IT/IS) class 07 (Networks)
ESSENTIAL of (CS/IT/IS) class 07 (Networks)ESSENTIAL of (CS/IT/IS) class 07 (Networks)
ESSENTIAL of (CS/IT/IS) class 07 (Networks)
 
MuleSoft Integration with AWS Textract | Calling AWS Textract API |AWS - Clou...
MuleSoft Integration with AWS Textract | Calling AWS Textract API |AWS - Clou...MuleSoft Integration with AWS Textract | Calling AWS Textract API |AWS - Clou...
MuleSoft Integration with AWS Textract | Calling AWS Textract API |AWS - Clou...
 
VAMOS CUIDAR DO NOSSO PLANETA! .
VAMOS CUIDAR DO NOSSO PLANETA!                    .VAMOS CUIDAR DO NOSSO PLANETA!                    .
VAMOS CUIDAR DO NOSSO PLANETA! .
 
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
 
The Liver & Gallbladder (Anatomy & Physiology).pptx
The Liver &  Gallbladder (Anatomy & Physiology).pptxThe Liver &  Gallbladder (Anatomy & Physiology).pptx
The Liver & Gallbladder (Anatomy & Physiology).pptx
 
Major project report on Tata Motors and its marketing strategies
Major project report on Tata Motors and its marketing strategiesMajor project report on Tata Motors and its marketing strategies
Major project report on Tata Motors and its marketing strategies
 
SURVEY I created for uni project research
SURVEY I created for uni project researchSURVEY I created for uni project research
SURVEY I created for uni project research
 
Mattingly "AI and Prompt Design: LLMs with NER"
Mattingly "AI and Prompt Design: LLMs with NER"Mattingly "AI and Prompt Design: LLMs with NER"
Mattingly "AI and Prompt Design: LLMs with NER"
 
DEMONSTRATION LESSON IN ENGLISH 4 MATATAG CURRICULUM
DEMONSTRATION LESSON IN ENGLISH 4 MATATAG CURRICULUMDEMONSTRATION LESSON IN ENGLISH 4 MATATAG CURRICULUM
DEMONSTRATION LESSON IN ENGLISH 4 MATATAG CURRICULUM
 

Telling Stories With Data: Class 1