SlideShare a Scribd company logo
1 of 20
DOS AND DON’TS OF DATAVIZ
ATALEOFPIES,DECEPTIONANDMINDTRICKS
IÑAKIPUIGDOLLERS SABIN
Data Scientist
DON’T RESCALE PROPORTIONS!
x1.75 times
bigger
Source: http://cadenaser.com/
15.1 + 70.7 + 15.2
= 101%
DO KEEP THE PROPORTIONSAS THEYARE
YET ANOTHER EXAMPLE …
?
Source: Twitter, @ppmadrid
NOW IN A PROPORTIONAL SCALE
PSOE
PARTIDO
POPULAR
NúmerodeParados
DON’T OMIT THE ORIGIN OF THE Y-AXIS
Where is
the
Axis??
94 is not 0
Source: http://blog.rtve.es/
http://mediamatters.org/
DO SHOW THE Y-AXIS FROM THE ORIGIN
MillionDollars
50.66% 49.07%
THIS ALSO HAPPENS IN SCIENTIFIC PAPERS
This is a big
difference, isn’t it?
According to the
paper,
this should be
1.82
The value of Y
(Rape Myth
Acceptance)
varies between
1 and 5
There are values
placed in the
wrong position
Source: Fox, Jesse; Bailenson, Jeremy N.; Tricase, Liz (2013). "The embodiment of
sexualized virtual selves: The Proteus effect and experiences of self-objectification via
avatars". Computers in Human Behavior 29 (3): 930–938
THE REALITY IS SOMETHING DIFFERENT
Face
It was not that
different in the
end…
Remember:
The value of Y
(Rape Myth
Acceptance)
varies between
1 and 5
DON’T USE INVENTED OR TAILOR-MADE SCALES
How can this be a line?
Source: http://mediamatters.org/
DO PLOT DATAAS IT IS
DON’T USE DIFFERENT SCALES FOR THE SAME
AXIS
Left Y-Axis
(representing the
non-smokers)
starts at 2
Right Y-Axis
(representing
the smokers)
starts at 3
Source: H. Wainer, Visual Revelations, Graphical Tales of Fate and
Deceptions from Napoleon Bonaparte to Ross Perot
Disclaimer! This Graph is
from a tobacco company
DO USE THE SAME SCALE TO MAKE DATA
COMPARABLE
DON’T SHOW MEANINGLESS NUMBERS
DON’T USE PIE CHARTS
193% ???
That’s a big pie!
Source: http://mediamatters.org/
DON’T USE 3D
Perspective makes
percentages look different
Source: http://imgarcade.com/1/misleading-circle-graphs/
SOME THINGS WE LEARNED AT SCHIBSTED
■Know your audience and adapt the visualization to them
■The title matters, it has to be attractive but not distracting
■Select the most suitable plot, there is no one-plot-fit-all
■Show only relevant information, crowded visualizations are
misleading
■Sometimes you can break the rules… 
DO CHOOSE A VISUALIZATION FITTING YOUR
AUDIENCE
Percentage of Sellers per segment
Slack channels sharing users
DON’T USE CROWDED PLOTS WITH MISLEADING
INFORMATION
■Too many elements
■The colours are
meaningless
■The axes are misleading
(not showing the origin)
DO SHOW ONLY WHAT IS IMPORTANT
■Axes starting at 0
■Only the necessary
elements
GOAL
Show the correlation of the
data points
… A DIFFERENTAPPROACH
■We don’t care about the
value  it’s OK to break the
axis rule!!
■The colours have a meaning
GOAL
Show the distribution and
density of the data points
WE ARE LOOKING FOR TALENT!
inaki.puigdollers@schibsted.com
Thanks, questions?
Data Scientist – Schibsted Product & Technology

More Related Content

Viewers also liked

201602 Technology Trends 2016 -spanish
201602 Technology Trends 2016  -spanish201602 Technology Trends 2016  -spanish
201602 Technology Trends 2016 -spanishFrancisco Calzado
 
#2 DataBeersBCN - "Why counting people at public transport" by Caterina Font
 #2 DataBeersBCN - "Why counting people at public transport" by Caterina Font #2 DataBeersBCN - "Why counting people at public transport" by Caterina Font
#2 DataBeersBCN - "Why counting people at public transport" by Caterina FontDataBeersBCN
 
мочевая система
мочевая системамочевая система
мочевая системаOksana Sulaieva
 
Вовлеченность персонала
Вовлеченность персоналаВовлеченность персонала
Вовлеченность персоналаOksana Rakityanskaya
 
قابلية الاستعمال
قابلية الاستعمالقابلية الاستعمال
قابلية الاستعمالWail Skanderi
 
How to win from Programming
How to win from ProgrammingHow to win from Programming
How to win from ProgrammingWail Skanderi
 

Viewers also liked (6)

201602 Technology Trends 2016 -spanish
201602 Technology Trends 2016  -spanish201602 Technology Trends 2016  -spanish
201602 Technology Trends 2016 -spanish
 
#2 DataBeersBCN - "Why counting people at public transport" by Caterina Font
 #2 DataBeersBCN - "Why counting people at public transport" by Caterina Font #2 DataBeersBCN - "Why counting people at public transport" by Caterina Font
#2 DataBeersBCN - "Why counting people at public transport" by Caterina Font
 
мочевая система
мочевая системамочевая система
мочевая система
 
Вовлеченность персонала
Вовлеченность персоналаВовлеченность персонала
Вовлеченность персонала
 
قابلية الاستعمال
قابلية الاستعمالقابلية الاستعمال
قابلية الاستعمال
 
How to win from Programming
How to win from ProgrammingHow to win from Programming
How to win from Programming
 

Similar to #5 DataBeersBCN -"Dos and Don'ts of Data Viz"

Вебинар «Интерактивная визуализация данных при помощи Infogram»
Вебинар «Интерактивная визуализация данных при помощи Infogram»Вебинар «Интерактивная визуализация данных при помощи Infogram»
Вебинар «Интерактивная визуализация данных при помощи Infogram»Newreporter.org Sukhacheva
 
Semiotic strategies: The things you are looking at have names
Semiotic strategies: The things you are looking at have namesSemiotic strategies: The things you are looking at have names
Semiotic strategies: The things you are looking at have namesMRS
 
TCS: Success Strategies Of The Fastest Growing Internet Retailers
TCS: Success Strategies Of The Fastest Growing Internet RetailersTCS: Success Strategies Of The Fastest Growing Internet Retailers
TCS: Success Strategies Of The Fastest Growing Internet RetailersRoland Frasier
 
Santahelena Truthtelling - Por marcas mais humanas, autênticas e verdadeiras
Santahelena Truthtelling - Por marcas mais humanas, autênticas e verdadeirasSantahelena Truthtelling - Por marcas mais humanas, autênticas e verdadeiras
Santahelena Truthtelling - Por marcas mais humanas, autênticas e verdadeirasRaul Santahelena
 
Mind The Gap - ConnectNow
Mind The Gap - ConnectNowMind The Gap - ConnectNow
Mind The Gap - ConnectNowTara Hunt
 
How to Visualize Data Like a Pro
How to Visualize Data Like a ProHow to Visualize Data Like a Pro
How to Visualize Data Like a Pro24Slides
 
[DEVit 360] Opti-pessimism: Design for the best case, build for the worst
[DEVit 360] Opti-pessimism: Design for the best case, build for the worst[DEVit 360] Opti-pessimism: Design for the best case, build for the worst
[DEVit 360] Opti-pessimism: Design for the best case, build for the worstCheryl Platz
 
5 Non-Obvious Trends For 2018 | Exclusive Book Preview
5 Non-Obvious Trends For 2018 | Exclusive Book Preview5 Non-Obvious Trends For 2018 | Exclusive Book Preview
5 Non-Obvious Trends For 2018 | Exclusive Book PreviewRohit Bhargava
 
HIERARCHY_Global shop recap 15
HIERARCHY_Global shop recap 15HIERARCHY_Global shop recap 15
HIERARCHY_Global shop recap 15Hierarchy, Inc.
 
Santahelena Truthtelling Hacktown 2019
Santahelena Truthtelling Hacktown 2019Santahelena Truthtelling Hacktown 2019
Santahelena Truthtelling Hacktown 2019Raul Santahelena
 
Data Design: Where Math and Art Collide
Data Design: Where Math and Art CollideData Design: Where Math and Art Collide
Data Design: Where Math and Art CollideTrina Chiasson
 
Data Driven Marketing
Data Driven MarketingData Driven Marketing
Data Driven MarketingDemandSphere
 
Halley Gray - Use Humor to Increase Your Conversion Rate by 28%
Halley Gray - Use Humor to Increase Your Conversion Rate by 28%Halley Gray - Use Humor to Increase Your Conversion Rate by 28%
Halley Gray - Use Humor to Increase Your Conversion Rate by 28%Julia Grosman
 
3.0 nobody knows... intro planning strategique
3.0 nobody knows... intro planning strategique3.0 nobody knows... intro planning strategique
3.0 nobody knows... intro planning strategiqueHelene Duvoux-Mauguet
 
Palestra sobre o livro TRUTHTELLING
Palestra sobre o livro TRUTHTELLINGPalestra sobre o livro TRUTHTELLING
Palestra sobre o livro TRUTHTELLINGRaul Santahelena
 
Cobalt LLP Social Media Presentation 2012
Cobalt LLP Social Media Presentation 2012Cobalt LLP Social Media Presentation 2012
Cobalt LLP Social Media Presentation 2012Tsan Abrahamson
 
Social Media Optimization for Business 2013
Social Media Optimization for Business 2013Social Media Optimization for Business 2013
Social Media Optimization for Business 2013Jay Feitlinger
 
TCS: Trend Based Marketing - Tomorrow's Campaigns Today
TCS: Trend Based Marketing - Tomorrow's Campaigns TodayTCS: Trend Based Marketing - Tomorrow's Campaigns Today
TCS: Trend Based Marketing - Tomorrow's Campaigns TodayRoland Frasier
 

Similar to #5 DataBeersBCN -"Dos and Don'ts of Data Viz" (20)

Вебинар «Интерактивная визуализация данных при помощи Infogram»
Вебинар «Интерактивная визуализация данных при помощи Infogram»Вебинар «Интерактивная визуализация данных при помощи Infogram»
Вебинар «Интерактивная визуализация данных при помощи Infogram»
 
Semiotic strategies: The things you are looking at have names
Semiotic strategies: The things you are looking at have namesSemiotic strategies: The things you are looking at have names
Semiotic strategies: The things you are looking at have names
 
TCS: Success Strategies Of The Fastest Growing Internet Retailers
TCS: Success Strategies Of The Fastest Growing Internet RetailersTCS: Success Strategies Of The Fastest Growing Internet Retailers
TCS: Success Strategies Of The Fastest Growing Internet Retailers
 
Santahelena Truthtelling - Por marcas mais humanas, autênticas e verdadeiras
Santahelena Truthtelling - Por marcas mais humanas, autênticas e verdadeirasSantahelena Truthtelling - Por marcas mais humanas, autênticas e verdadeiras
Santahelena Truthtelling - Por marcas mais humanas, autênticas e verdadeiras
 
Mind The Gap - ConnectNow
Mind The Gap - ConnectNowMind The Gap - ConnectNow
Mind The Gap - ConnectNow
 
How to Visualize Data Like a Pro
How to Visualize Data Like a ProHow to Visualize Data Like a Pro
How to Visualize Data Like a Pro
 
[DEVit 360] Opti-pessimism: Design for the best case, build for the worst
[DEVit 360] Opti-pessimism: Design for the best case, build for the worst[DEVit 360] Opti-pessimism: Design for the best case, build for the worst
[DEVit 360] Opti-pessimism: Design for the best case, build for the worst
 
Data storytelling
Data storytellingData storytelling
Data storytelling
 
5 Non-Obvious Trends For 2018 | Exclusive Book Preview
5 Non-Obvious Trends For 2018 | Exclusive Book Preview5 Non-Obvious Trends For 2018 | Exclusive Book Preview
5 Non-Obvious Trends For 2018 | Exclusive Book Preview
 
HIERARCHY_Global shop recap 15
HIERARCHY_Global shop recap 15HIERARCHY_Global shop recap 15
HIERARCHY_Global shop recap 15
 
Santahelena Truthtelling Hacktown 2019
Santahelena Truthtelling Hacktown 2019Santahelena Truthtelling Hacktown 2019
Santahelena Truthtelling Hacktown 2019
 
Data Design: Where Math and Art Collide
Data Design: Where Math and Art CollideData Design: Where Math and Art Collide
Data Design: Where Math and Art Collide
 
Data Driven Marketing
Data Driven MarketingData Driven Marketing
Data Driven Marketing
 
Halley Gray - Use Humor to Increase Your Conversion Rate by 28%
Halley Gray - Use Humor to Increase Your Conversion Rate by 28%Halley Gray - Use Humor to Increase Your Conversion Rate by 28%
Halley Gray - Use Humor to Increase Your Conversion Rate by 28%
 
3.0 nobody knows... intro planning strategique
3.0 nobody knows... intro planning strategique3.0 nobody knows... intro planning strategique
3.0 nobody knows... intro planning strategique
 
Palestra sobre o livro TRUTHTELLING
Palestra sobre o livro TRUTHTELLINGPalestra sobre o livro TRUTHTELLING
Palestra sobre o livro TRUTHTELLING
 
World Communications Forum Davos 2013
World Communications Forum Davos 2013World Communications Forum Davos 2013
World Communications Forum Davos 2013
 
Cobalt LLP Social Media Presentation 2012
Cobalt LLP Social Media Presentation 2012Cobalt LLP Social Media Presentation 2012
Cobalt LLP Social Media Presentation 2012
 
Social Media Optimization for Business 2013
Social Media Optimization for Business 2013Social Media Optimization for Business 2013
Social Media Optimization for Business 2013
 
TCS: Trend Based Marketing - Tomorrow's Campaigns Today
TCS: Trend Based Marketing - Tomorrow's Campaigns TodayTCS: Trend Based Marketing - Tomorrow's Campaigns Today
TCS: Trend Based Marketing - Tomorrow's Campaigns Today
 

More from DataBeersBCN

#6 DataBeersBCN -"Whales"
#6 DataBeersBCN -"Whales"#6 DataBeersBCN -"Whales"
#6 DataBeersBCN -"Whales"DataBeersBCN
 
#6 DataBeersBCN -"Data, Beer and Enterprise Architecture"
#6 DataBeersBCN -"Data, Beer and Enterprise Architecture"#6 DataBeersBCN -"Data, Beer and Enterprise Architecture"
#6 DataBeersBCN -"Data, Beer and Enterprise Architecture"DataBeersBCN
 
#6 DataBeersBCN -"GoodCityLife.org"
#6 DataBeersBCN -"GoodCityLife.org"#6 DataBeersBCN -"GoodCityLife.org"
#6 DataBeersBCN -"GoodCityLife.org"DataBeersBCN
 
#6 DataBeersBCN -"The (Big) Data behind the brain"
#6 DataBeersBCN -"The (Big) Data behind the brain"#6 DataBeersBCN -"The (Big) Data behind the brain"
#6 DataBeersBCN -"The (Big) Data behind the brain"DataBeersBCN
 
#5 DataBeersBCN -"How to do Data Journalism… and not die trying"
#5 DataBeersBCN -"How to do Data Journalism… and not die trying"#5 DataBeersBCN -"How to do Data Journalism… and not die trying"
#5 DataBeersBCN -"How to do Data Journalism… and not die trying"DataBeersBCN
 
#5 DataBeersBCN -"The gripping potentials of Sociothermodynamics"
#5 DataBeersBCN -"The gripping potentials of Sociothermodynamics"#5 DataBeersBCN -"The gripping potentials of Sociothermodynamics"
#5 DataBeersBCN -"The gripping potentials of Sociothermodynamics"DataBeersBCN
 
#5 DataBeersBCN -"Location Based Business Oportunity Detector"
#5 DataBeersBCN -"Location Based Business Oportunity Detector"#5 DataBeersBCN -"Location Based Business Oportunity Detector"
#5 DataBeersBCN -"Location Based Business Oportunity Detector"DataBeersBCN
 
#4 DataBeersBCN - "Visualizing Geolocated Tweets" by Joana Simoes
#4 DataBeersBCN - "Visualizing Geolocated Tweets" by Joana Simoes#4 DataBeersBCN - "Visualizing Geolocated Tweets" by Joana Simoes
#4 DataBeersBCN - "Visualizing Geolocated Tweets" by Joana SimoesDataBeersBCN
 
#4 DataBeersBCN - "We know what you did last sonar" by Fernando Cucchietti
#4 DataBeersBCN - "We know what you did last sonar" by Fernando Cucchietti#4 DataBeersBCN - "We know what you did last sonar" by Fernando Cucchietti
#4 DataBeersBCN - "We know what you did last sonar" by Fernando CucchiettiDataBeersBCN
 
#3 DataBeersBCN - "The impact of data in reality" by Karina Gibert
#3 DataBeersBCN - "The impact of data in reality" by Karina Gibert#3 DataBeersBCN - "The impact of data in reality" by Karina Gibert
#3 DataBeersBCN - "The impact of data in reality" by Karina GibertDataBeersBCN
 
#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...
#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...
#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...DataBeersBCN
 
#3 DataBeersBCN - "When we start caring about data" by Dani Pearson & Pau Gar...
#3 DataBeersBCN - "When we start caring about data" by Dani Pearson & Pau Gar...#3 DataBeersBCN - "When we start caring about data" by Dani Pearson & Pau Gar...
#3 DataBeersBCN - "When we start caring about data" by Dani Pearson & Pau Gar...DataBeersBCN
 
#3 DataBeersBCN - "Big Fun Data" by Xavier Guardiola
#3 DataBeersBCN - "Big Fun Data" by Xavier Guardiola#3 DataBeersBCN - "Big Fun Data" by Xavier Guardiola
#3 DataBeersBCN - "Big Fun Data" by Xavier GuardiolaDataBeersBCN
 
#4 DataBeersBCN - "When a Movement Becomes a Party" by Pablo Aragon
#4 DataBeersBCN - "When a Movement Becomes a Party" by Pablo Aragon#4 DataBeersBCN - "When a Movement Becomes a Party" by Pablo Aragon
#4 DataBeersBCN - "When a Movement Becomes a Party" by Pablo AragonDataBeersBCN
 
#2 DataBeersBCN - "Using data to make great and succesful mobile games" by J...
 #2 DataBeersBCN - "Using data to make great and succesful mobile games" by J... #2 DataBeersBCN - "Using data to make great and succesful mobile games" by J...
#2 DataBeersBCN - "Using data to make great and succesful mobile games" by J...DataBeersBCN
 
#2 DataBeersBCN - "Govern Obert - Opengov.cat" by Concha Catalan
#2 DataBeersBCN - "Govern Obert  - Opengov.cat" by Concha Catalan#2 DataBeersBCN - "Govern Obert  - Opengov.cat" by Concha Catalan
#2 DataBeersBCN - "Govern Obert - Opengov.cat" by Concha CatalanDataBeersBCN
 
#1 DataBeersBCN - Xavier
#1 DataBeersBCN - Xavier#1 DataBeersBCN - Xavier
#1 DataBeersBCN - XavierDataBeersBCN
 
#1 DataBeersBCN - David Solans
#1 DataBeersBCN - David Solans#1 DataBeersBCN - David Solans
#1 DataBeersBCN - David SolansDataBeersBCN
 
#1 DataBeersBCN - Dani Villatoro from BBVA DATA ANALYTICS
#1 DataBeersBCN - Dani Villatoro  from BBVA DATA ANALYTICS#1 DataBeersBCN - Dani Villatoro  from BBVA DATA ANALYTICS
#1 DataBeersBCN - Dani Villatoro from BBVA DATA ANALYTICSDataBeersBCN
 
#1 DataBeersBCN - Oscar Marin from Outliers.Collective
#1 DataBeersBCN - Oscar Marin from Outliers.Collective#1 DataBeersBCN - Oscar Marin from Outliers.Collective
#1 DataBeersBCN - Oscar Marin from Outliers.CollectiveDataBeersBCN
 

More from DataBeersBCN (20)

#6 DataBeersBCN -"Whales"
#6 DataBeersBCN -"Whales"#6 DataBeersBCN -"Whales"
#6 DataBeersBCN -"Whales"
 
#6 DataBeersBCN -"Data, Beer and Enterprise Architecture"
#6 DataBeersBCN -"Data, Beer and Enterprise Architecture"#6 DataBeersBCN -"Data, Beer and Enterprise Architecture"
#6 DataBeersBCN -"Data, Beer and Enterprise Architecture"
 
#6 DataBeersBCN -"GoodCityLife.org"
#6 DataBeersBCN -"GoodCityLife.org"#6 DataBeersBCN -"GoodCityLife.org"
#6 DataBeersBCN -"GoodCityLife.org"
 
#6 DataBeersBCN -"The (Big) Data behind the brain"
#6 DataBeersBCN -"The (Big) Data behind the brain"#6 DataBeersBCN -"The (Big) Data behind the brain"
#6 DataBeersBCN -"The (Big) Data behind the brain"
 
#5 DataBeersBCN -"How to do Data Journalism… and not die trying"
#5 DataBeersBCN -"How to do Data Journalism… and not die trying"#5 DataBeersBCN -"How to do Data Journalism… and not die trying"
#5 DataBeersBCN -"How to do Data Journalism… and not die trying"
 
#5 DataBeersBCN -"The gripping potentials of Sociothermodynamics"
#5 DataBeersBCN -"The gripping potentials of Sociothermodynamics"#5 DataBeersBCN -"The gripping potentials of Sociothermodynamics"
#5 DataBeersBCN -"The gripping potentials of Sociothermodynamics"
 
#5 DataBeersBCN -"Location Based Business Oportunity Detector"
#5 DataBeersBCN -"Location Based Business Oportunity Detector"#5 DataBeersBCN -"Location Based Business Oportunity Detector"
#5 DataBeersBCN -"Location Based Business Oportunity Detector"
 
#4 DataBeersBCN - "Visualizing Geolocated Tweets" by Joana Simoes
#4 DataBeersBCN - "Visualizing Geolocated Tweets" by Joana Simoes#4 DataBeersBCN - "Visualizing Geolocated Tweets" by Joana Simoes
#4 DataBeersBCN - "Visualizing Geolocated Tweets" by Joana Simoes
 
#4 DataBeersBCN - "We know what you did last sonar" by Fernando Cucchietti
#4 DataBeersBCN - "We know what you did last sonar" by Fernando Cucchietti#4 DataBeersBCN - "We know what you did last sonar" by Fernando Cucchietti
#4 DataBeersBCN - "We know what you did last sonar" by Fernando Cucchietti
 
#3 DataBeersBCN - "The impact of data in reality" by Karina Gibert
#3 DataBeersBCN - "The impact of data in reality" by Karina Gibert#3 DataBeersBCN - "The impact of data in reality" by Karina Gibert
#3 DataBeersBCN - "The impact of data in reality" by Karina Gibert
 
#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...
#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...
#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...
 
#3 DataBeersBCN - "When we start caring about data" by Dani Pearson & Pau Gar...
#3 DataBeersBCN - "When we start caring about data" by Dani Pearson & Pau Gar...#3 DataBeersBCN - "When we start caring about data" by Dani Pearson & Pau Gar...
#3 DataBeersBCN - "When we start caring about data" by Dani Pearson & Pau Gar...
 
#3 DataBeersBCN - "Big Fun Data" by Xavier Guardiola
#3 DataBeersBCN - "Big Fun Data" by Xavier Guardiola#3 DataBeersBCN - "Big Fun Data" by Xavier Guardiola
#3 DataBeersBCN - "Big Fun Data" by Xavier Guardiola
 
#4 DataBeersBCN - "When a Movement Becomes a Party" by Pablo Aragon
#4 DataBeersBCN - "When a Movement Becomes a Party" by Pablo Aragon#4 DataBeersBCN - "When a Movement Becomes a Party" by Pablo Aragon
#4 DataBeersBCN - "When a Movement Becomes a Party" by Pablo Aragon
 
#2 DataBeersBCN - "Using data to make great and succesful mobile games" by J...
 #2 DataBeersBCN - "Using data to make great and succesful mobile games" by J... #2 DataBeersBCN - "Using data to make great and succesful mobile games" by J...
#2 DataBeersBCN - "Using data to make great and succesful mobile games" by J...
 
#2 DataBeersBCN - "Govern Obert - Opengov.cat" by Concha Catalan
#2 DataBeersBCN - "Govern Obert  - Opengov.cat" by Concha Catalan#2 DataBeersBCN - "Govern Obert  - Opengov.cat" by Concha Catalan
#2 DataBeersBCN - "Govern Obert - Opengov.cat" by Concha Catalan
 
#1 DataBeersBCN - Xavier
#1 DataBeersBCN - Xavier#1 DataBeersBCN - Xavier
#1 DataBeersBCN - Xavier
 
#1 DataBeersBCN - David Solans
#1 DataBeersBCN - David Solans#1 DataBeersBCN - David Solans
#1 DataBeersBCN - David Solans
 
#1 DataBeersBCN - Dani Villatoro from BBVA DATA ANALYTICS
#1 DataBeersBCN - Dani Villatoro  from BBVA DATA ANALYTICS#1 DataBeersBCN - Dani Villatoro  from BBVA DATA ANALYTICS
#1 DataBeersBCN - Dani Villatoro from BBVA DATA ANALYTICS
 
#1 DataBeersBCN - Oscar Marin from Outliers.Collective
#1 DataBeersBCN - Oscar Marin from Outliers.Collective#1 DataBeersBCN - Oscar Marin from Outliers.Collective
#1 DataBeersBCN - Oscar Marin from Outliers.Collective
 

Recently uploaded

20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
Data Warehouse , Data Cube Computation
Data Warehouse   , Data Cube ComputationData Warehouse   , Data Cube Computation
Data Warehouse , Data Cube Computationsit20ad004
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...Suhani Kapoor
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
Spark3's new memory model/management
Spark3's new memory model/managementSpark3's new memory model/management
Spark3's new memory model/managementakshesh doshi
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 

Recently uploaded (20)

20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Data Warehouse , Data Cube Computation
Data Warehouse   , Data Cube ComputationData Warehouse   , Data Cube Computation
Data Warehouse , Data Cube Computation
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
Spark3's new memory model/management
Spark3's new memory model/managementSpark3's new memory model/management
Spark3's new memory model/management
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 

#5 DataBeersBCN -"Dos and Don'ts of Data Viz"

  • 1. DOS AND DON’TS OF DATAVIZ ATALEOFPIES,DECEPTIONANDMINDTRICKS IÑAKIPUIGDOLLERS SABIN Data Scientist
  • 2. DON’T RESCALE PROPORTIONS! x1.75 times bigger Source: http://cadenaser.com/ 15.1 + 70.7 + 15.2 = 101%
  • 3. DO KEEP THE PROPORTIONSAS THEYARE
  • 4. YET ANOTHER EXAMPLE … ? Source: Twitter, @ppmadrid
  • 5. NOW IN A PROPORTIONAL SCALE PSOE PARTIDO POPULAR NúmerodeParados
  • 6. DON’T OMIT THE ORIGIN OF THE Y-AXIS Where is the Axis?? 94 is not 0 Source: http://blog.rtve.es/ http://mediamatters.org/
  • 7. DO SHOW THE Y-AXIS FROM THE ORIGIN MillionDollars 50.66% 49.07%
  • 8. THIS ALSO HAPPENS IN SCIENTIFIC PAPERS This is a big difference, isn’t it? According to the paper, this should be 1.82 The value of Y (Rape Myth Acceptance) varies between 1 and 5 There are values placed in the wrong position Source: Fox, Jesse; Bailenson, Jeremy N.; Tricase, Liz (2013). "The embodiment of sexualized virtual selves: The Proteus effect and experiences of self-objectification via avatars". Computers in Human Behavior 29 (3): 930–938
  • 9. THE REALITY IS SOMETHING DIFFERENT Face It was not that different in the end… Remember: The value of Y (Rape Myth Acceptance) varies between 1 and 5
  • 10. DON’T USE INVENTED OR TAILOR-MADE SCALES How can this be a line? Source: http://mediamatters.org/
  • 11. DO PLOT DATAAS IT IS
  • 12. DON’T USE DIFFERENT SCALES FOR THE SAME AXIS Left Y-Axis (representing the non-smokers) starts at 2 Right Y-Axis (representing the smokers) starts at 3 Source: H. Wainer, Visual Revelations, Graphical Tales of Fate and Deceptions from Napoleon Bonaparte to Ross Perot Disclaimer! This Graph is from a tobacco company
  • 13. DO USE THE SAME SCALE TO MAKE DATA COMPARABLE
  • 14. DON’T SHOW MEANINGLESS NUMBERS DON’T USE PIE CHARTS 193% ??? That’s a big pie! Source: http://mediamatters.org/ DON’T USE 3D Perspective makes percentages look different Source: http://imgarcade.com/1/misleading-circle-graphs/
  • 15. SOME THINGS WE LEARNED AT SCHIBSTED ■Know your audience and adapt the visualization to them ■The title matters, it has to be attractive but not distracting ■Select the most suitable plot, there is no one-plot-fit-all ■Show only relevant information, crowded visualizations are misleading ■Sometimes you can break the rules… 
  • 16. DO CHOOSE A VISUALIZATION FITTING YOUR AUDIENCE Percentage of Sellers per segment Slack channels sharing users
  • 17. DON’T USE CROWDED PLOTS WITH MISLEADING INFORMATION ■Too many elements ■The colours are meaningless ■The axes are misleading (not showing the origin)
  • 18. DO SHOW ONLY WHAT IS IMPORTANT ■Axes starting at 0 ■Only the necessary elements GOAL Show the correlation of the data points
  • 19. … A DIFFERENTAPPROACH ■We don’t care about the value  it’s OK to break the axis rule!! ■The colours have a meaning GOAL Show the distribution and density of the data points
  • 20. WE ARE LOOKING FOR TALENT! inaki.puigdollers@schibsted.com Thanks, questions? Data Scientist – Schibsted Product & Technology

Editor's Notes

  1. -A picture tells a thousand words -Goal: share examples of visualizations showing distorted information and how can this be addressed
  2. -Common practise to fool people’s mind is rescaling porportions -Even though you show the numbers, if the plot is not proportional  contradictory information -A picture tells a thousand words
  3. -Here you see how different the plot looks when the proportions are as they should -However this particular example can be just an error, just not intentional. But what about this one?
  4. -Spatial perception is a very important component of image processing in human’s brain -This is why mass media abuses this kind of blatant distortions to communicate somehow biased message
  5. -Again, if we do the exercise of re-plotting the data in a fairer way we see that reality is something different to what they try to show -So the blue line is flatter than the one they presented originally, take your own conclusions…
  6. -Another technique to show distorted data is omitting the Y-axis. -Messes up with spatial perception again -Comparing is very difficult
  7. -But if we re-plot it truth comes to surface again… -And that incredibly huge difference betwwen both candidates is gone -And the federal wellfare received in US hasn’t grown as much neither...
  8. -No surprise media uses this -We all knew that TV and newspapers provided biased information -Is more strange is to see this in science
  9. -Some scientific studies use distortion techniques as well to “enhance” their message -But if we see how it really looks like this is what we have: the difference between conditions is not that big -Is it science a matter of believe in the end?
  10. -Another great example from Fox news: created a linear growth of the job loss by QUARTER out of the blue
  11. -This is how it really looks, not only the values are not linear, but the periods are not quarters but random months across 3 different years!
  12. -Another good deceiving technique is to use double axis in the same plot -It can be good: enhanced readability, but if the axis are not the same you can create effects like the one from this tobacco company showing that smoking is not affecting with death rate, only the age matters
  13. -However if we re-plot it correctly we see a complete different story -No Surprise it comes from a tobacco company, right?
  14. -And then we have the pie charts. -Should I use them? I ‘ll try to avoid them -If you insist -remember simple rule: pie charts show parts of a whole so make them sum up to 100% and no more - avoid perspective games
  15. -Things we learned at Schisted, I’m going to talk about a couple of them -One of the most delicate points: choosing which visualization to use -Know audience beforehand
  16. -Not everybody understands reality the same way, while a DS may feel comfortable with a network plot, BP tend to prefer bar plots or waterfall plots -In addition, There is no one-plot-fit-all solution
  17. -Once you have decided which way to go you have to be careful with the number elements you add to the plot. By elements I mean : colours, size of the points, width of the bars, regression lines,… amogn others -Crowded plots are, more often than not, misleading and distracting audience's attention from what is important.
  18. -My suggestion: do not add irrelevant elements, every single element you have in the plot has to be meaningful by itself. -Here, for instance we have a clear goal, so we sticked to it and showed only elements that helped us to explain that message
  19. -If your goal is different, so is your plot -All in all, I would say that the golden rule in data visualization is two folded to communicate a message (this is your goal) based on some observed data (which you have to respect)
  20. -If your goal is different, so is your plot -All in all, I would say that the golden rule in data visualization is two folded to communicate a message (this is your goal) based on some observed data (which you have to respect)