SlideShare a Scribd company logo
1 of 31
Download to read offline
torturing  numbers  
a novice’s guide to descriptive dtatistics
1	
  
Bandhu	
  P.	
  Das	
  
"If you torture the data long
enough, it will confess"
@BPDas_	
   2	
  
– Ronald Harry Coase
why  do  we  torture  numbers?
@BPDas_	
   3	
  
q  Describe the story
q  Find trends in data
against variation
q  Determine if a sample
represents a population
q  Draw conclusions about the story
a tool called
‘descriptive statistics’
is used
@BPDas_	
   4	
  
describing  numbers
@BPDas_	
   5	
  
25 people were asked what an
average person pay in tax?
What do these numbers tell you?
£45,000	
   £3,700	
   £10,000	
   £2,000	
   £2,000	
  
£15,000	
   £3,000	
   £5,000	
   £3,700	
   £2,000	
  
£10,000	
   £2,000	
   £2,000	
   £3,700	
   £2,000	
  
£5,700	
   £2,000	
   £2,000	
   £3,700	
   £2,000	
  
£5,000	
   £2,000	
   £5,000	
   £2,000	
   £2,000	
  
describing  numbers
@BPDas_	
   6	
  
£2,000
Here is the same data ordered from greatest to
least and weighted to show how many times each
value occurs in the data set
•  Now what do the data tell
you?
•  What is the average income?
£45,000
£15,000
£10,000
£5,700
£5,000
£3,700
£3,000
£45,000
£15,000
£10,000
£5,700
£5,000
£3,700
£3,000
describing  numbers
@BPDas_	
   7	
  
BEWARE! The reported ‘average’ might
depend on what you are meant to see.
Which would you use?
MEAN (arithmetic average)
MEDIAN (midpoint in range)
MODE (most frequent)
So, to really understand the
data set you need more than
just the ‘average’
£2,000
spread  and  variability
@BPDas_	
   8	
  
You need to know the spread of the data
•  This histogram
shows the ages
of people that
use a smart
phone
•  Is it typical
for 90 year
olds to use a
smart phone?
spread  and  variability
@BPDas_	
   9	
  
When the mean and median are the same, you
have a special situation called a ‘normal’ curve
On this
symmetrical
curve, the
variability can
be described
using standard
deviations (SD)
spread  and  variability
@BPDas_	
   10	
  
SD is a way to determine how far a data
point is from the mean
You can now say
that 90 year
olds fall more
than 2 SD from
the mean, or
that they make
up less than
2.5% of the
data set
spread  and  variability
@BPDas_	
   11	
  
If we collapse the whole data set to one bar,
we can show the mean with some measure
of variability (std dev, std error, etc.)
Without some indication of variability, you
cannot effectively compare two data sets
spread  and  variability
@BPDas_	
   12	
  
Min Q1 Median Q3 Max
Perhaps the best way to describe any data set is
with five numbers: Minimum, Q1, Median, Q3,
Maximum. This helps when comparing data sets,
and when there are oddities called outliers.
25% 25% 25% 25%
*
“79.48% of all statistics are
made up on the spot.”
@BPDas_	
   13	
  
– John A. Paulos
a  sample  study
@BPDas_	
   14	
  
Researchers want to
know which of three
fertilisers produce the
highest wheat yield in
kg/plot
a  sample  study
@BPDas_	
   15	
  
They design a study with three treatments
and five replications for each treatment
3 Treatments (Fertilisers 1, 2 and 3)
5Replicates
a  sample  study
@BPDas_	
   16	
  
Could a nearby
forest or
river be a
confounding
variable?
Variables like soil type and other local
influences may have unexpected impacts…
a  sample  study
@BPDas_	
   17	
  
This is why a good study is
randomised, to defeat potentially
confounding variables
Does the sample
plot in our study
represent all the
wheat in all the
world?
P
O
P
U
L
A
T
I
O
N
SAMPLE
@BPDas_	
  
18	
  
uncertainty
@BPDas_	
   19	
  
With all the unknown variables, there will
always be a degree of uncertainty that our
sample represents the population
That’s why the more samples we have, the more
confident we are that our study represents the
population
confidence
@BPDas_	
   20	
  
•  Any confidence interval
could be used, but 95% is
often chosen
•  This means that 95% of
the time, you expect your
data represents reality
•  BEWARE reports with no
confidence interval
@BPDas_	
   21	
  
Fer$lizer	
  1	
  Fer$lizer	
  2	
  Fer$lizer	
  3	
  
64.8	
   56.5	
   65.8	
  
60.5	
   53.8	
   73.2	
  
63.4	
   59.4	
   59.5	
  
48.2	
   61.1	
   66.3	
  
55.5	
   58.8	
   70.2	
  
two  ways  to  present  data
Tables are the preferred way to show data,
but graphs paint a quick, easy and
seductive picture
drawing  conclusions
A presenter may want you to see a
relationship between two variables
Fertiliser 3 appears to increase the average yield
of wheat – but what kind of average is this? How big
was the sample? Where is the indication of
variability? Where is the confidence interval?
@BPDas_	
   22	
  
drawing  conclusions
A presenter may want you to see a
relationship between two variables
Fertiliser 3 appears to increase the average yield
of wheat – but what kind of average is this? How big
was the sample? Where is the indication of
variability? Where is the confidence interval?
@BPDas_	
  
23	
  
Bad stats and
presentation may
lead to bad
conclusions
2 SD
drawing  conclusions
@BPDas_	
   24	
  
Correlation does not imply causation
The more firemen fighting a fire, the
bigger the fire is observed to be.
Therefore more firemen cause an increase
in the size of a fire
Often, a presenter wants to lead you to
a conclusion. Newspapers, TV and
online articles should be scrutinised!
BEWARE:
“This is not a scientific poll…”
“These results may not be representative of
the population”
“…based on a list of those that responded”
“Data showed a trend but was not
statistically significant”
it’s  all  in  how  they  are  presented
@BPDas_	
   25	
  
it’s  all  in  how  they  are  presented
@BPDas_	
   26	
  
Pies are for eating
It’s very hard to see differences
BEWARE CHARTJUNK!
it’s  all  in  how  they  are  presented
@BPDas_	
  
27	
  
Amusing graphics are nothing but distractions
Again, it’s very hard to see differences
BEWARE CHARTJUNK!
it’s  all  in  how  they  are  presented
@BPDas_	
   28	
  
Here is the same population growth data
shown on two scales. Which would you use to
demonstrate rapid growth?
BEWARE tricky scales!
it’s  all  in  how  they  are  presented
@BPDas_	
   29	
  
BEWARE statements with no context.
Here’s a made-up example:
Did you know that even speaking to
someone that once smoked, DOUBLES
your chance of getting cancer?! ;)
Your odds go from
to
0.000000001:1
0.000000002:1
conclusion
@BPDas_	
   30	
  
Like any tool, stats can be misused
(intentionally or unintentionally)
Maintain a healthy skepticism and
question charts, tables and conclusions
where insufficient information is provided
references
@BPDas_	
   31	
  
-  The Cartoon Guide to Statistics (1993)
-  Larry Gonick and Woolcott Smith
-  How to Lie with Statistics (1954)
-  Darrel Huff

More Related Content

Similar to A Visual Guide for Describing Numbers

Torturing numbers - Descriptive Statistics for Growers (2013)
Torturing numbers - Descriptive Statistics for Growers (2013)Torturing numbers - Descriptive Statistics for Growers (2013)
Torturing numbers - Descriptive Statistics for Growers (2013)jasondeveau
 
Statistical analysis
Statistical analysisStatistical analysis
Statistical analysishighlandn
 
Homework #1SOCY 3115Spring 20Read the Syllabus and FAQ on ho.docx
Homework #1SOCY 3115Spring 20Read the Syllabus and FAQ on ho.docxHomework #1SOCY 3115Spring 20Read the Syllabus and FAQ on ho.docx
Homework #1SOCY 3115Spring 20Read the Syllabus and FAQ on ho.docxpooleavelina
 
Mat 255 chapter 3 notes
Mat 255 chapter 3 notesMat 255 chapter 3 notes
Mat 255 chapter 3 notesadrushle
 
Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (02/12/2020)
Reuters/Ipsos Core Political Survey: Presidential Approval Tracker  (02/12/2020)Reuters/Ipsos Core Political Survey: Presidential Approval Tracker  (02/12/2020)
Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (02/12/2020)Ipsos Public Affairs
 
Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (03/04/2020)
Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (03/04/2020)Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (03/04/2020)
Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (03/04/2020)Ipsos Public Affairs
 
Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (02/26/2020)
Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (02/26/2020)Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (02/26/2020)
Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (02/26/2020)Ipsos Public Affairs
 
Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (03/11/2020)
Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (03/11/2020)Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (03/11/2020)
Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (03/11/2020)Ipsos Public Affairs
 
Interpretation of Data and Statistical Fallacies
Interpretation of Data and Statistical FallaciesInterpretation of Data and Statistical Fallacies
Interpretation of Data and Statistical FallaciesRHIMRJ Journal
 
Basic statistics for pharmaceutical (Part 1)
Basic statistics for pharmaceutical (Part 1)Basic statistics for pharmaceutical (Part 1)
Basic statistics for pharmaceutical (Part 1)Syed Muhammad Danish
 
Reuters/Ipsos Core Political Survey: Congressional Approval Tracker (02/20/2020)
Reuters/Ipsos Core Political Survey: Congressional Approval Tracker (02/20/2020)Reuters/Ipsos Core Political Survey: Congressional Approval Tracker (02/20/2020)
Reuters/Ipsos Core Political Survey: Congressional Approval Tracker (02/20/2020)Ipsos Public Affairs
 
Fundamental of Biostatics DR.SOMANATH.ppt
Fundamental of Biostatics DR.SOMANATH.pptFundamental of Biostatics DR.SOMANATH.ppt
Fundamental of Biostatics DR.SOMANATH.pptDentalYoutube
 
The%20 Minimum%20 Daily%20 Adult%20 %20 Ca Cmg
The%20 Minimum%20 Daily%20 Adult%20 %20 Ca CmgThe%20 Minimum%20 Daily%20 Adult%20 %20 Ca Cmg
The%20 Minimum%20 Daily%20 Adult%20 %20 Ca Cmgdahirf
 
Storyfying your Data: How to go from Data to Insights to Stories
Storyfying your Data: How to go from Data to Insights to StoriesStoryfying your Data: How to go from Data to Insights to Stories
Storyfying your Data: How to go from Data to Insights to StoriesGramener
 
Statistical ProcessesCan descriptive statistical processes b.docx
Statistical ProcessesCan descriptive statistical processes b.docxStatistical ProcessesCan descriptive statistical processes b.docx
Statistical ProcessesCan descriptive statistical processes b.docxdarwinming1
 
3.2 measures of variation
3.2 measures of variation3.2 measures of variation
3.2 measures of variationleblance
 

Similar to A Visual Guide for Describing Numbers (20)

Torturing numbers - Descriptive Statistics for Growers (2013)
Torturing numbers - Descriptive Statistics for Growers (2013)Torturing numbers - Descriptive Statistics for Growers (2013)
Torturing numbers - Descriptive Statistics for Growers (2013)
 
Statistical analysis
Statistical analysisStatistical analysis
Statistical analysis
 
Chapter 11
Chapter 11Chapter 11
Chapter 11
 
Homework #1SOCY 3115Spring 20Read the Syllabus and FAQ on ho.docx
Homework #1SOCY 3115Spring 20Read the Syllabus and FAQ on ho.docxHomework #1SOCY 3115Spring 20Read the Syllabus and FAQ on ho.docx
Homework #1SOCY 3115Spring 20Read the Syllabus and FAQ on ho.docx
 
Mat 255 chapter 3 notes
Mat 255 chapter 3 notesMat 255 chapter 3 notes
Mat 255 chapter 3 notes
 
Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (02/12/2020)
Reuters/Ipsos Core Political Survey: Presidential Approval Tracker  (02/12/2020)Reuters/Ipsos Core Political Survey: Presidential Approval Tracker  (02/12/2020)
Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (02/12/2020)
 
Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (03/04/2020)
Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (03/04/2020)Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (03/04/2020)
Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (03/04/2020)
 
Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (02/26/2020)
Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (02/26/2020)Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (02/26/2020)
Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (02/26/2020)
 
Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (03/11/2020)
Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (03/11/2020)Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (03/11/2020)
Reuters/Ipsos Core Political Survey: Presidential Approval Tracker (03/11/2020)
 
Interpretation of Data and Statistical Fallacies
Interpretation of Data and Statistical FallaciesInterpretation of Data and Statistical Fallacies
Interpretation of Data and Statistical Fallacies
 
Basic statistics for pharmaceutical (Part 1)
Basic statistics for pharmaceutical (Part 1)Basic statistics for pharmaceutical (Part 1)
Basic statistics for pharmaceutical (Part 1)
 
Statistics
StatisticsStatistics
Statistics
 
Reuters/Ipsos Core Political Survey: Congressional Approval Tracker (02/20/2020)
Reuters/Ipsos Core Political Survey: Congressional Approval Tracker (02/20/2020)Reuters/Ipsos Core Political Survey: Congressional Approval Tracker (02/20/2020)
Reuters/Ipsos Core Political Survey: Congressional Approval Tracker (02/20/2020)
 
Fundamental of Biostatics DR.SOMANATH.ppt
Fundamental of Biostatics DR.SOMANATH.pptFundamental of Biostatics DR.SOMANATH.ppt
Fundamental of Biostatics DR.SOMANATH.ppt
 
The%20 Minimum%20 Daily%20 Adult%20 %20 Ca Cmg
The%20 Minimum%20 Daily%20 Adult%20 %20 Ca CmgThe%20 Minimum%20 Daily%20 Adult%20 %20 Ca Cmg
The%20 Minimum%20 Daily%20 Adult%20 %20 Ca Cmg
 
Storyfying your Data: How to go from Data to Insights to Stories
Storyfying your Data: How to go from Data to Insights to StoriesStoryfying your Data: How to go from Data to Insights to Stories
Storyfying your Data: How to go from Data to Insights to Stories
 
Statistical ProcessesCan descriptive statistical processes b.docx
Statistical ProcessesCan descriptive statistical processes b.docxStatistical ProcessesCan descriptive statistical processes b.docx
Statistical ProcessesCan descriptive statistical processes b.docx
 
SPSS software application.pdf
SPSS software application.pdfSPSS software application.pdf
SPSS software application.pdf
 
Statistics for ess
Statistics for essStatistics for ess
Statistics for ess
 
3.2 measures of variation
3.2 measures of variation3.2 measures of variation
3.2 measures of variation
 

Recently uploaded

Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...only4webmaster01
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...SUHANI PANDEY
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...amitlee9823
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 

Recently uploaded (20)

Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 

A Visual Guide for Describing Numbers

  • 1. torturing  numbers   a novice’s guide to descriptive dtatistics 1   Bandhu  P.  Das  
  • 2. "If you torture the data long enough, it will confess" @BPDas_   2   – Ronald Harry Coase
  • 3. why  do  we  torture  numbers? @BPDas_   3   q  Describe the story q  Find trends in data against variation q  Determine if a sample represents a population q  Draw conclusions about the story
  • 4. a tool called ‘descriptive statistics’ is used @BPDas_   4  
  • 5. describing  numbers @BPDas_   5   25 people were asked what an average person pay in tax? What do these numbers tell you? £45,000   £3,700   £10,000   £2,000   £2,000   £15,000   £3,000   £5,000   £3,700   £2,000   £10,000   £2,000   £2,000   £3,700   £2,000   £5,700   £2,000   £2,000   £3,700   £2,000   £5,000   £2,000   £5,000   £2,000   £2,000  
  • 6. describing  numbers @BPDas_   6   £2,000 Here is the same data ordered from greatest to least and weighted to show how many times each value occurs in the data set •  Now what do the data tell you? •  What is the average income? £45,000 £15,000 £10,000 £5,700 £5,000 £3,700 £3,000
  • 7. £45,000 £15,000 £10,000 £5,700 £5,000 £3,700 £3,000 describing  numbers @BPDas_   7   BEWARE! The reported ‘average’ might depend on what you are meant to see. Which would you use? MEAN (arithmetic average) MEDIAN (midpoint in range) MODE (most frequent) So, to really understand the data set you need more than just the ‘average’ £2,000
  • 8. spread  and  variability @BPDas_   8   You need to know the spread of the data •  This histogram shows the ages of people that use a smart phone •  Is it typical for 90 year olds to use a smart phone?
  • 9. spread  and  variability @BPDas_   9   When the mean and median are the same, you have a special situation called a ‘normal’ curve On this symmetrical curve, the variability can be described using standard deviations (SD)
  • 10. spread  and  variability @BPDas_   10   SD is a way to determine how far a data point is from the mean You can now say that 90 year olds fall more than 2 SD from the mean, or that they make up less than 2.5% of the data set
  • 11. spread  and  variability @BPDas_   11   If we collapse the whole data set to one bar, we can show the mean with some measure of variability (std dev, std error, etc.) Without some indication of variability, you cannot effectively compare two data sets
  • 12. spread  and  variability @BPDas_   12   Min Q1 Median Q3 Max Perhaps the best way to describe any data set is with five numbers: Minimum, Q1, Median, Q3, Maximum. This helps when comparing data sets, and when there are oddities called outliers. 25% 25% 25% 25% *
  • 13. “79.48% of all statistics are made up on the spot.” @BPDas_   13   – John A. Paulos
  • 14. a  sample  study @BPDas_   14   Researchers want to know which of three fertilisers produce the highest wheat yield in kg/plot
  • 15. a  sample  study @BPDas_   15   They design a study with three treatments and five replications for each treatment 3 Treatments (Fertilisers 1, 2 and 3) 5Replicates
  • 16. a  sample  study @BPDas_   16   Could a nearby forest or river be a confounding variable? Variables like soil type and other local influences may have unexpected impacts…
  • 17. a  sample  study @BPDas_   17   This is why a good study is randomised, to defeat potentially confounding variables
  • 18. Does the sample plot in our study represent all the wheat in all the world? P O P U L A T I O N SAMPLE @BPDas_   18  
  • 19. uncertainty @BPDas_   19   With all the unknown variables, there will always be a degree of uncertainty that our sample represents the population That’s why the more samples we have, the more confident we are that our study represents the population
  • 20. confidence @BPDas_   20   •  Any confidence interval could be used, but 95% is often chosen •  This means that 95% of the time, you expect your data represents reality •  BEWARE reports with no confidence interval
  • 21. @BPDas_   21   Fer$lizer  1  Fer$lizer  2  Fer$lizer  3   64.8   56.5   65.8   60.5   53.8   73.2   63.4   59.4   59.5   48.2   61.1   66.3   55.5   58.8   70.2   two  ways  to  present  data Tables are the preferred way to show data, but graphs paint a quick, easy and seductive picture
  • 22. drawing  conclusions A presenter may want you to see a relationship between two variables Fertiliser 3 appears to increase the average yield of wheat – but what kind of average is this? How big was the sample? Where is the indication of variability? Where is the confidence interval? @BPDas_   22  
  • 23. drawing  conclusions A presenter may want you to see a relationship between two variables Fertiliser 3 appears to increase the average yield of wheat – but what kind of average is this? How big was the sample? Where is the indication of variability? Where is the confidence interval? @BPDas_   23   Bad stats and presentation may lead to bad conclusions 2 SD
  • 24. drawing  conclusions @BPDas_   24   Correlation does not imply causation The more firemen fighting a fire, the bigger the fire is observed to be. Therefore more firemen cause an increase in the size of a fire
  • 25. Often, a presenter wants to lead you to a conclusion. Newspapers, TV and online articles should be scrutinised! BEWARE: “This is not a scientific poll…” “These results may not be representative of the population” “…based on a list of those that responded” “Data showed a trend but was not statistically significant” it’s  all  in  how  they  are  presented @BPDas_   25  
  • 26. it’s  all  in  how  they  are  presented @BPDas_   26   Pies are for eating It’s very hard to see differences BEWARE CHARTJUNK!
  • 27. it’s  all  in  how  they  are  presented @BPDas_   27   Amusing graphics are nothing but distractions Again, it’s very hard to see differences BEWARE CHARTJUNK!
  • 28. it’s  all  in  how  they  are  presented @BPDas_   28   Here is the same population growth data shown on two scales. Which would you use to demonstrate rapid growth? BEWARE tricky scales!
  • 29. it’s  all  in  how  they  are  presented @BPDas_   29   BEWARE statements with no context. Here’s a made-up example: Did you know that even speaking to someone that once smoked, DOUBLES your chance of getting cancer?! ;) Your odds go from to 0.000000001:1 0.000000002:1
  • 30. conclusion @BPDas_   30   Like any tool, stats can be misused (intentionally or unintentionally) Maintain a healthy skepticism and question charts, tables and conclusions where insufficient information is provided
  • 31. references @BPDas_   31   -  The Cartoon Guide to Statistics (1993) -  Larry Gonick and Woolcott Smith -  How to Lie with Statistics (1954) -  Darrel Huff