Making sense of data visually: 
A modern look at data 
visualization 
VLADIMIR MILEV 
NEW VENTURE SOFTWARE
Author Bio 
Vladimir Milev 
MCPD Enterprise 
Speaker (Devreach, NTK Slovenia and others) 
DV Evangelist 
Founder at New Venture Software 
@vmilev 
www.linkedin.com/in/vladimirmilev/
http://www.newventuresoftware.com/
Agenda 
1. Big data and information overload 
2. What problems DataViz solves 
3. DataViz fundamental theory 
4. Basic visualizations 
5. Advanced visualizations
Information Overload 
Twitter: 500 million tweets per day 
Facebook: 55 million status updates per day 
Facebook: 900 million interactions per day (comments, likes etc.) 
Reddit:
Proliferation of smart devices 
• We already live in a world dominated by smart devices
• What does this mean?
• More connected, data is more accessible
• Less space for tables and text
• Must use visual communication
Making Sense of Data 
Increasing amount of data available 
Increasing number of data consumer devices 
Obtaining data no longer a problem 
We have an Information Overload issue 
Quick data analysis is the new problem 
But how quick?
A picture is worth 1,000 words
“With about 1,000,000 ganglion cells, the human retina would transmit data at roughly the rate of an Ethernet connection, or 10 million bits per second.”
- Vijay Balasubramanian, PhD, Professor of Physics at U Penn
OK – That’s a lot of 
bandwidth 
BUT ARE WE USING IT EFFICIENTLY?
Efficiency 
The best readers read up to about 300 words per minute
Average word length is 5.1 letters
300 * 5.1 = 1530 characters per minute
Or 1530 / 60 = 25.5 characters per second, rounded up to 26
1 character is usually stored as 8 bits
26 * 8 = 208 bits per second
Reading bandwidth is ~0.025 KiB/s
Or 0.00208% of the retina's 10 million bits per second
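The arithmetic on this slide can be reproduced with a short sketch. The constant and function names are illustrative, not from the talk, and the 10 million bits per second figure is the retinal estimate quoted on the earlier slide.

```python
import math

# Figures taken from the slides; the names are illustrative.
WORDS_PER_MINUTE = 300                 # reading speed of a fast reader
LETTERS_PER_WORD = 5.1                 # average word length
BITS_PER_CHAR = 8                      # one character stored as one byte
RETINA_BITS_PER_SECOND = 10_000_000    # estimate quoted on the earlier slide


def reading_bandwidth():
    """Return (bits/s, KiB/s, percent of retinal bandwidth) for reading."""
    chars_per_second = WORDS_PER_MINUTE * LETTERS_PER_WORD / 60
    # The slide rounds 25.5 characters up to 26 before converting to bits.
    bits_per_second = math.ceil(chars_per_second) * BITS_PER_CHAR
    kib_per_second = bits_per_second / 8 / 1024
    efficiency_percent = bits_per_second / RETINA_BITS_PER_SECOND * 100
    return bits_per_second, kib_per_second, efficiency_percent


bits, kib, pct = reading_bandwidth()
print(f"{bits} bit/s, {kib:.3f} KiB/s, {pct:.5f}% of retinal bandwidth")
```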
So reading clearly isn’t 
the way to go… 
BUT WHAT IS THE SOLUTION?
Using statistics 
For most of the 20th century
Using arithmetic mean, standard deviation
Variance, correlations, regressions 
Turns out this is not good enough
Anscombe’s Quartet
       I              II             III             IV
   x      y       x      y       x      y       x      y
  10   8.04      10   9.14      10   7.46       8   6.58
   8   6.95       8   8.14       8   6.77       8   5.76
  13   7.58      13   8.74      13  12.74       8   7.71
   9   8.81       9   8.77       9   7.11       8   8.84
  11   8.33      11   9.26      11   7.81       8   8.47
  14   9.96      14   8.10      14   8.84       8   7.04
   6   7.24       6   6.13       6   6.08       8   5.25
   4   4.26       4   3.10       4   5.39      19  12.50
  12  10.84      12   9.13      12   8.15       8   5.56
   7   4.82       7   7.26       7   6.42       8   7.91
   5   5.68       5   4.74       5   5.73       8   6.89
• Summary statistics are nearly identical across the four sets:
• Mean of x (9.0) and mean of y (7.5) are the same in every set
• Variances, correlations and regression lines are nearly the same
• As far as summary statistics are concerned, these sets are almost the same
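The claim can be checked directly from the table. The sketch below computes the mean, sample variance, Pearson correlation and least-squares line for each set; the helper name summarize and the dictionary layout are assumptions made for illustration.

```python
# Anscombe's quartet, copied from the table on this slide.
X_SHARED = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
QUARTET = {
    "I": (X_SHARED, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    "II": (X_SHARED, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    "III": (X_SHARED, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    "IV": ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
           [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}


def summarize(xs, ys):
    """Summary statistics and least-squares fit y = intercept + slope * x."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return {
        "mean_x": mean_x,
        "mean_y": mean_y,
        "var_x": sxx / (n - 1),          # sample variance
        "var_y": syy / (n - 1),
        "r": sxy / (sxx * syy) ** 0.5,   # Pearson correlation
        "slope": slope,
        "intercept": mean_y - slope * mean_x,
    }


for name, (xs, ys) in QUARTET.items():
    stats = summarize(xs, ys)
    print(name, {key: round(value, 2) for key, value in stats.items()})
```

The four rows of output agree to about two decimal places, which is exactly why summary statistics alone cannot tell these sets apart.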
Anscombe’s Quartet
So DataViz is very powerful 
But why does it work so well?
Gestalt Psychology 
Seeing with the brain 
The mind understands external stimuli as a whole rather than as the
sum of their parts
We tend to order our experience in a manner that is regular, 
orderly, symmetric, and simple 
Key principles of gestalt: reification, multistability, invariance 
Gestalt laws of grouping: proximity, similarity, closure, symmetry
Gestalt Principles - Reification 
Our minds tend to 
construct/generate 
information
Gestalt Principles - Multistability
The tendency of our 
mind to jump back and 
forth between 
ambiguous alternative 
interpretations 
Examples: Spinning Girl, Rubin Vase
Gestalt Principles - Invariance 
The tendency to perceive simple geometric 
objects independent of rotation, translation, 
and scale 
Also elastic deformations, different lighting, 
and different component features
Gestalt Laws of Grouping - Similarity 
We group objects based on visual similarity
Gestalt Laws of Grouping - Proximity 
We group items based on spatial proximity
Gestalt Laws of Grouping - Closure 
We perceive objects such as shapes, letters, and pictures as
whole even when they are incomplete
Application in Data Visualization 
• Introducing the visual variables
• Fundamental properties of objects which can encode information into a picture
• Fundamental visual variables:
◦ Position 
◦ Size 
◦ Color 
◦ Shape 
◦ Orientation 
Basis for all Data Visualization!
Basic/Common Visualizations 
Bar graphs 
Line graphs 
Area charts 
Pie charts
Bar Graphs 
• Using color correctly to encode 
gender 
• Using position (ordering) to 
create an orderly scale 
• Using size to encode the values 
• Using orientation to differentiate 
gender again
Bar Graphs continued 
• Labels are used 
• Color is neutral and does not encode 
information 
• Again, we have top-down ordering 
(position) 
• And again size encodes the relative 
numeric value
Bars and Normal Distribution 
Minimum passing grade 
• Distribution of test scores for 
Polish “Matura” exam 
• Normal Distribution is 
expected 
• Red line shows normal 
distribution 
• 30 is the minimum expected 
grade 
• Detecting behavioral changes 
• What happened?
Line Graphs 
Confirming what we already know – 
paper media is declining rapidly. 
• Shape encodes the value 
• Color is not significant 
• Design goal is to show a 
trend/change
Area Graphs 
Effect of school year on 
Team Fortress 2 players 
School starts 
• Similar to line graph 
• The design goal for area
charts is to emphasize
the value/quantity, not
so much the trend
• You can see both 
• Color has no 
meaning
Area Graphs continued 
• This time color carries a 
meaning (legend) 
• The graph is also good for 
displaying ratio between series 
of data over time
Pie Charts
Pie Charts 
Golden Rules for Pie Charts 
• Ratio of one piece to the whole 
• Order the values 
• Fewer than 6 pieces
• Avoid legends 
• Sum up to 100%
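Three of these rules are mechanical enough to check in code. The sketch below is one possible reading of them: the function name check_pie, the tolerance, and the choice of descending order are assumptions, since the slide only says "order the values".

```python
def check_pie(percentages, tolerance=0.01):
    """Return a list of rule violations for pie slices given in percent."""
    problems = []
    # Rule: slices must add up to 100%.
    if abs(sum(percentages) - 100) > tolerance:
        problems.append("slices do not sum to 100%")
    # Rule: fewer than 6 pieces.
    if len(percentages) >= 6:
        problems.append("6 or more slices")
    # Rule: order the values (assumed here to mean largest first).
    if list(percentages) != sorted(percentages, reverse=True):
        problems.append("slices are not ordered by size")
    return problems


print(check_pie([50, 30, 20]))   # no violations
print(check_pie([10, 20, 30]))   # wrong total and wrong order
```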
Abusing Pie Charts 
Don’t break the rules!
Maps 
Plot millions of journal entries from 18th and 19th century ship logs, and 
you reveal a picture of ocean trade you've never seen before 
• Visualization of 
routes 
• Color saturation 
indicates heavily 
used routes
Maps are good with animations too 
• Concentration of NO2 from 
2005 to 2011 
• Using both color and position 
to encode concentration 
• Using continuous color scale 
• Adding another dimension - 
time
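A continuous color scale can be sketched as linear interpolation between two end colors. The function name and the white-to-red palette below are illustrative assumptions, not the scale used in the NO2 map.

```python
def continuous_color(value, vmin, vmax, low=(255, 255, 255), high=(255, 0, 0)):
    """Map value in [vmin, vmax] to an RGB tuple between low and high."""
    # Normalize to 0..1 and clamp values that fall outside the range.
    t = (value - vmin) / (vmax - vmin)
    t = min(1.0, max(0.0, t))
    # Interpolate each RGB channel independently.
    return tuple(round(a + (b - a) * t) for a, b in zip(low, high))


# Low concentrations stay near white, high concentrations approach red.
for concentration in (0, 25, 50, 75, 100):
    print(concentration, continuous_color(concentration, 0, 100))
```

Animating over time then amounts to redrawing the map with the same scale for each time step, so colors stay comparable between frames.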
Choropleth Maps 
Displaying the most popular 
name for a newborn in each 
state 
• Using discrete 
palette to encode 
information
Heat Maps 
• Excellent for plotting 
recurring values 
• Color 
saturation/brightness 
encodes the values 
• Position also encodes 
information 
• Easy to spot 
concentrations and 
find patterns
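As a rough sketch of how a heat map is built: recurring events are counted per grid cell (position), and each count is scaled to a 0..1 intensity that a renderer could map to saturation or brightness. The function name and the (row, column) event format are assumptions for illustration.

```python
from collections import Counter


def heatmap_grid(events, n_rows, n_cols):
    """Count (row, col) events and scale counts to 0..1 intensities."""
    counts = Counter(events)
    peak = max(counts.values(), default=0)
    return [
        [counts[(row, col)] / peak if peak else 0.0 for col in range(n_cols)]
        for row in range(n_rows)
    ]


# Hypothetical events: (weekday index, hour slot) pairs.
events = [(0, 1), (0, 1), (1, 0)]
for row in heatmap_grid(events, n_rows=2, n_cols=2):
    print(row)
```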
Heat Maps medicine/genetics
Tree Maps 
• Excellent for representing 
hierarchical data 
• Color carries a meaning 
• Size carries a meaning as well 
• Position is irrelevant 
• Suitable for annotations
Parallel Coordinates Plot 
• Interactive visualization 
• Good at displaying 
relationships between 
different dimensions of 
data 
• Position encodes 
dimension 
• Color encodes scale
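One common way to prepare data for a parallel coordinates plot is to rescale every dimension to its own 0..1 axis, so that each record becomes a polyline across the axes. The sketch below assumes numeric rows of equal length; the function name is hypothetical.

```python
def to_axis_positions(rows):
    """Min-max scale each dimension so every axis runs from 0 to 1."""
    scaled_columns = []
    for column in zip(*rows):
        lowest, highest = min(column), max(column)
        span = highest - lowest
        # A constant dimension has no spread; place it mid-axis.
        scaled_columns.append(
            [(value - lowest) / span if span else 0.5 for value in column]
        )
    return [tuple(values) for values in zip(*scaled_columns)]


# Hypothetical records with two dimensions each.
rows = [(1, 10), (3, 30), (2, 20)]
print(to_axis_positions(rows))
```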
Parallel Coordinates Plot – in action 
Selecting a subset 
of a dimension to 
display the 
relationships with 
the other 
dimensions
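Selecting a subset of one dimension (often called brushing) can be sketched as a simple range filter; the surviving records are the lines that stay highlighted across the other axes. The function name and the sample records are made up for illustration.

```python
def brush(rows, dimension, low, high):
    """Keep only rows whose value on the given dimension lies in [low, high]."""
    return [row for row in rows if low <= row[dimension] <= high]


# Hypothetical records: (weight, power, fuel economy).
records = [(900, 60, 45), (1500, 150, 28), (2100, 300, 17)]
# Brush the first axis to light cars and inspect the other dimensions.
print(brush(records, dimension=0, low=800, high=1600))
```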
Chord Diagram 
• Similar to Parallel Coordinates 
plot 
• Color and Position used to 
encode data 
• Design is different 
• Filtering of dimensions is not a 
design goal 
• Focuses on selecting a whole 
dimension
Some resources 
http://www.reddit.com/r/dataisbeautiful/ 
http://blog.visual.ly/ 
http://flowingdata.com/ 
http://eagereyes.org/ 
http://www.perceptualedge.com/blog/
Thank You!
