Don’t let Excel’s default settings ruin your data analysis! Learn insights from research into visual perception and interpretation. Robin Gower will present some great ideas stolen from the likes of Edward Tufte, Leland Wilkinson, and Stephen Few. You don’t need to be a technical user to enjoy the talk but you should be prepared never to look at a pie chart quite the same way again!
Robin is a freelance data engineer (http://infonomics.ltd.uk/) and long-term mitherer at Open Data Manchester.
1. Stop Making Pie Charts!
An opinionated guide to the craft of data visualisation
Robin Gower
Open Data Manchester
30.06.15
infonomics.ltd.uk
@robsteranium
48. Chart Junk – 3d pies are a great way to deceive
2008 Macworld Expo via Engadget
49. Chart Junk – you can lie with line charts too
Florida Dept of Law Enforcement via Reuters
50. Chart Junk – improves memorability
Bateman et al (2010) Useful Junk?
51. Data-Ink Ratio
Tufte (1983) The Visual Display of Quantitative Information
Data-ink ratio
= data-ink / total ink used to print the graphic
= proportion of a graphic’s ink devoted to the non-redundant display of data-information
= 1 – proportion of the graphic that can be erased
64. Stop Making Pie Charts!
An opinionated guide to the craft of data visualisation
Robin Gower
Open Data Manchester
30.06.15
infonomics.ltd.uk
@robsteranium
Editor's Notes
Why do we visualise data?
Data are the raw symbols that allow us to store, transmit, and process information outside of our brains.
Information is data that is given meaning through contextual relationships.
Here the term from above is given meaning in the context of the other terms organised on this tablet.
Visualisation is the representation of abstract data encoded in visual (and interactive) form.
We encode information into a visualisation by setting aesthetic attributes according to the data.
The viewer must study the visualisation to decode the information.
We leverage the power of visual perception to help us interpret information.
Anscombe's quartet provides an excellent demonstration of the power of visualisation to aid interpretation.
How similar are these 4 sets?
Statistical analysis finds them to be similar: the means, variances, correlations, and regression lines are virtually identical.
Visualisation shows the differences very clearly.
Anscombe's quartet demonstrates both the effect of outliers on statistics and the importance of inspecting your data graphically as part of the analytical process.
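To make that concrete, here is a minimal sketch (my own, not from the talk) that recomputes the quartet's headline statistics in Python; the values are Anscombe's published 1973 data.

```python
import numpy as np

# Anscombe's quartet (Anscombe, 1973): four datasets with near-identical
# summary statistics but radically different shapes when plotted.
x123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
quartet = {
    "I":   (x123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    "II":  (x123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    "III": (x123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    "IV":  ([8] * 7 + [19] + [8] * 3,
            [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}

for name, (x, y) in quartet.items():
    x, y = np.asarray(x), np.asarray(y)
    r = np.corrcoef(x, y)[0, 1]
    print(f"{name}: mean(y)={y.mean():.2f}  var(y)={y.var(ddof=1):.2f}  corr={r:.3f}")
# All four report ~7.50, ~4.13 and ~0.816 – yet scatter plots of them look nothing alike.
```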
“charts are usually instances of much more general objects… a pie is a divided bar with polar coordinates”
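A small matplotlib sketch of Wilkinson's point (my illustration, with made-up shares): drawing a divided bar on polar coordinates literally produces a pie.

```python
import matplotlib.pyplot as plt
import numpy as np

shares = np.array([45, 30, 15, 10])              # illustrative values
angles = 2 * np.pi * shares / shares.sum()       # widths of the "bar" segments
lefts = np.concatenate(([0], np.cumsum(angles)[:-1]))

ax = plt.figure().add_subplot(projection="polar")
ax.bar(x=lefts, height=1, width=angles, align="edge",
       color=plt.cm.tab10.colors[:len(shares)], edgecolor="white")
ax.set_axis_off()   # a stacked bar in polar coordinates: a pie chart
plt.show()
```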
Variables are created from source datasets.
Here we have library loans data opened as part of the Greater Manchester Data Synchronisation Project.
Each column provides a variable. Here each row is a different area of Trafford.
The variables are manipulated in transformations.
Here we add a total for all adult book loans, a ratio of fiction-to-non-fiction and a rank ordering.
These are a critical part of the visualisation. Many design decisions depend upon statistical as well as graphical analysis. For example, we could present a bivariate plot of fiction vs non-fiction or a univariate plot of the ratio.
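A hedged sketch of that transformation step in pandas; the column names and figures are hypothetical, not the actual Trafford schema.

```python
import pandas as pd

# Illustrative loans data – the real dataset has one row per area of Trafford.
loans = pd.DataFrame({
    "area": ["Altrincham", "Sale", "Stretford", "Urmston"],
    "adult_fiction": [12000, 9500, 7800, 6100],
    "adult_nonfiction": [8000, 7200, 5100, 4900],
})

loans["adult_total"] = loans["adult_fiction"] + loans["adult_nonfiction"]
loans["fiction_ratio"] = loans["adult_fiction"] / loans["adult_nonfiction"]
loans["rank"] = loans["adult_total"].rank(ascending=False).astype(int)
print(loans)
```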
Scales are used to map variables into a common measurement.
Logarithmic scales make it easier to compare values which either cover a large range, or cluster towards one end of the range.
Under the linear scale, the larger absolute movements in the past 20 years dwarf previous changes.
What's more important in stocks is percentage change. With a logarithmic scale, the same vertical change is equivalent to the same percentage change whatever the absolute level of the index.
Now we can see the Great Depression and the Post-war Boom.
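As an illustration of the scale choice (synthetic exponential-growth series, not the real index):

```python
import matplotlib.pyplot as plt
import numpy as np

# Synthetic long-run index (exponential growth + noise), standing in for real data.
rng = np.random.default_rng(0)
years = np.arange(1915, 2015)
index = 100 * np.exp(0.05 * (years - years[0]) + rng.normal(0, 0.05, years.size).cumsum())

fig, (ax_lin, ax_log) = plt.subplots(1, 2, figsize=(8, 3), sharex=True)
ax_lin.plot(years, index)
ax_lin.set_title("linear: recent swings dominate")
ax_log.plot(years, index)
ax_log.set_yscale("log")   # equal vertical distance = equal percentage change
ax_log.set_title("log: early history visible")
plt.tight_layout()
plt.show()
```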
The coordinate system maps from the scale to the display.
This chart shows location quotients, the share relative to the average where >100% is “more than their fair share”.
One confounding problem with charting like this is that it also encodes area (Cumbria is big).
Hexagonal binning is a great choice for map data as each bin has roughly similar radius and it tessellates.
Density estimation takes this to the extreme building many overlapping bins and plotting the average.
Elements describe the marks and their aesthetic attributes.
Points, lines, areas, angles, textures, shapes.
There are lots of examples throughout this presentation so I've not sought to display any particular ones here.
Guides provide context – e.g. legends/ axes.
http://maps.nls.uk/os/6inch-england-and-wales/index.html
As we noted above, visualisations require that the viewer is able to decode the representation.
It is important that we choose a representation that is easy to decode accurately, making best use of the brain's abilities and avoiding optical illusions.
Pre-attentive processing allows us to recognise attributes without consciously focussed thought.
How many zeros are there?
Here the task is much easier because we've used a colour-coding that can be processed pre-attentively.
It is rapid, parallel and automatic but approximate.
Attentive processing requires us to identify objects sequentially and hold them in memory. It is slower but more precise.
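A toy version of the zero-counting demo (my own, assuming an ANSI-colour terminal):

```python
import random

# Without a colour cue you must scan the digits serially (attentive processing);
# printing the zeros in red lets colour "pop out" pre-attentively.
random.seed(5)
digits = [str(random.randint(0, 9)) for _ in range(200)]
print("".join(digits))                                                  # hard: serial search
print("".join("\033[31m0\033[0m" if d == "0" else d for d in digits))   # easy: pop-out
print("zeros:", digits.count("0"))
```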
Position is the most accurate, length judgements are second, angle and slope judgements are third, and area judgements are last.
Errors are smaller at the extremes.
The maxima of the error curves are not clustered at 50% but somewhat higher, and they vary by type of judgement.
No distinction between viewers according to training (professional vs college vs high-school).
Jock Mackinley has sought to extend this analysis to include non-quantitative perceptual tasks – ordinal ranking and nominal (categorical) comparisons.
Based upon analyses of perceptual tasks but has not been validated empirically.
Position is still the best performing encoding.
Area is worse at ordinal coding as it's easy to confuse adjacent levels (critical to ordinal comparison but less important in quantitative comparison). It's ranked lower for nominal comparison as the viewer may perceive an ordinal ranking by size.
Can you spot the difference between these pies?
Area is a poor choice for encoding quantitative data.
Although pies can also be read by angle, the angles are not aligned to a common baseline, which makes the comparison harder.
The corresponding bar charts show the differences immediately.
Just because you can do something, doesn't mean you should.
If we're seeking to show trends over time, why not use a line chart?
The equivalent line chart is much easier to interpret.
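A sketch of that comparison (illustrative numbers of my own): the same shares drawn as three pies and as one line chart.

```python
import matplotlib.pyplot as plt

years = ["2013", "2014", "2015"]                 # made-up shares over time
shares = {"A": [30, 32, 34], "B": [28, 27, 25],
          "C": [22, 21, 22], "D": [20, 20, 19]}

fig, axes = plt.subplots(1, 4, figsize=(11, 3))
for i, (ax, year) in enumerate(zip(axes[:3], years)):
    ax.pie([vals[i] for vals in shares.values()], labels=list(shares))
    ax.set_title(year)
for name, vals in shares.items():                # the same data as lines
    axes[3].plot(years, vals, marker="o", label=name)
axes[3].legend(fontsize=8)
axes[3].set_title("as a line chart")
plt.tight_layout()
plt.show()
```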
Note that tables of data (beneath) make use of position to distinguish variable levels nominally or ordinally.
Grayscale is particularly difficult.
A and B are the same colour, although the checkerboard context tricks the eye into seeing them differently.
5% of your audience will not be able to distinguish red and green
It's difficult to retain the meaning of more than 9 colours simultaneously (in short term memory)
XKCD colour survey – 223k user sessions
It's hard enough to perceive more than 5 levels
Colours, therefore, aren't great for quantitative scales
Muted colours are easier on the eye
Brewer colour palette
Different colour schemes for different purposes – spectra, qualitative, diverging.
Muted pastel tones avoid after-images caused by highly saturated colours.
Useful for grouping and search.
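Matplotlib ships colormaps derived from Cynthia Brewer's palettes; a quick sketch (the names are real matplotlib colormap names) showing the three kinds:

```python
import matplotlib.pyplot as plt
import numpy as np

# Strips of three Brewer-derived matplotlib colormaps, one per purpose.
gradient = np.linspace(0, 1, 256).reshape(1, -1)
kinds = [("YlGnBu", "sequential"), ("Set2", "qualitative"), ("RdBu", "diverging")]

fig, axes = plt.subplots(len(kinds), 1, figsize=(6, 2))
for ax, (name, kind) in zip(axes, kinds):
    ax.imshow(gradient, aspect="auto", cmap=name)
    ax.set_ylabel(f"{name}\n({kind})", rotation=0, ha="right", va="center", fontsize=8)
    ax.set_xticks([]); ax.set_yticks([])
plt.tight_layout()
plt.show()
```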
Chart-junk refers to the extraneous elements that don't represent the numbers and are detrimental to our understanding of the data.
The 3D distortion here is not only unnecessary, it actually makes it look like the iPhone has more market share than the “other” category.
The “Stand Your Ground Law” authorises people to defend themselves with lethal force.
This chart inverts the y-axis, giving the impression that murders fell after the introduction of the law.
The author claimed it was a personal preference meant to evoke images of dripping blood.
Nigel Holmes argues that data graphics must engage the reader's interest.
Bateman et al published a study which concludes that participants were better able to recall Holmes-style charts 1-3 weeks later
Robert Kosara on eagereyes distinguishes 3 types of chart-junk: useful (infographics, annotations, explanatory text), harmless, and harmful
“A large share of ink on a graphic should present data-information, the ink changing as the data change.
Data-ink is the non-erasable core of a graphic, the non-redundant ink arranged in response to variation in the numbers represented”
“Erase non-data ink, within reason”
“Erase redundant data-ink, within reason”
The problem is that some non-data ink can help by providing context – e.g. a graph's axis lines.
We shouldn't forget the “within reason” part of Tufte's suggestions – even if he does.
Shrink the dots (but this can't go far enough).
Transparency turns overplotting into density information – darker areas mean more overlapping points.
We still lose individual points, and the main bulk is concentrated in the corner.
Logarithmic scales stretch the point cloud out but are harder to interpret.
Note that the scales now start in different places.
Binning allows us to fully represent overplotted points and outliers.
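The whole sequence of remedies in one sketch (synthetic skewed data standing in for the real scatter):

```python
import matplotlib.pyplot as plt
import numpy as np

# Synthetic skewed point cloud standing in for the overplotted scatter.
rng = np.random.default_rng(2)
x = rng.lognormal(0, 1, 20_000)
y = x * rng.lognormal(0, 0.5, 20_000)

fig, (ax1, ax2, ax3) = plt.subplots(1, 3, figsize=(10, 3))
ax1.scatter(x, y, s=4, alpha=0.05)             # transparency: darker = denser
ax1.set_title("small dots + alpha")
ax2.scatter(x, y, s=4, alpha=0.05)
ax2.set_xscale("log"); ax2.set_yscale("log")   # log scales stretch the cloud out
ax2.set_title("log scales")
ax3.hexbin(x, y, xscale="log", yscale="log", gridsize=30, bins="log")
ax3.set_title("hexagonal binning")             # every point is counted in a bin
plt.tight_layout()
plt.show()
```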
Sparklines are small, word-sized charts.
They sacrifice context by dropping scales and axes, but are thus small enough to fit into paragraphs of text.
Useful for describing the shape of trends.
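A minimal sparkline in matplotlib (my sketch, with random-walk stand-in data):

```python
import matplotlib.pyplot as plt
import numpy as np

# A word-sized chart: tiny figure, no axes, just the shape of the trend.
values = np.random.default_rng(3).normal(0, 1, 60).cumsum()

fig, ax = plt.subplots(figsize=(1.5, 0.3))   # roughly the height of a line of text
ax.plot(values, linewidth=0.8)
ax.plot(len(values) - 1, values[-1], "r.")   # highlight the latest value
ax.set_axis_off()
fig.savefig("sparkline.png", bbox_inches="tight", dpi=200)
```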
A delightful mix of images and text to visualise Euclid's propositions of geometry.
A group of similar charts using similar scales and axes to allow them to be compared.
Comparable – each is 200kcal.
200kcal doesn't need to mean anything – each dish gives context to all of the others.
Consistent plate size provides a scale with figure-ground effect: the more plate you can see, the higher the energy-per-volume.
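A generic small-multiples sketch (random walks as stand-in series): shared scales are what make the panels comparable.

```python
import matplotlib.pyplot as plt
import numpy as np

# Six random walks; sharex/sharey give every panel identical scales,
# which is what makes small multiples directly comparable.
rng = np.random.default_rng(4)
groups = {f"series {i}": rng.normal(0, 1, 50).cumsum() for i in range(1, 7)}

fig, axes = plt.subplots(2, 3, figsize=(9, 5), sharex=True, sharey=True)
for ax, (name, y) in zip(axes.flat, groups.items()):
    ax.plot(y)
    ax.set_title(name, fontsize=9)
plt.tight_layout()
plt.show()
```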