SlideShare a Scribd company logo
1 of 83
Digging Deeper
Investigative Reporting (and data)
Jennifer LaFleur
Reveal | Center for Investigative Reporting
Investigative reporting is one of the most powerful
tools in journalism
What is it?
• Uncovering new information
• Uncovering corruption or injustice
• Holding institutions accountable
http://www.investigative
-journalism-africa.info/
It often means using data
It can make this
Into this
It gives us the power to check assumptions.
Source: NHTSA
complaint data
Findings:
“…unintended
acceleration has
been a problem
across the auto
industry.”
It let’s us combine information in powerful ways.
It gives us extremes.
Caution: This slide contains extreme nerdiness
Getting the records and data
If something is inspected
Licensed
Enforced or
Purchased
…There probably is a database
Tips for records
In rare joyous instances data is
readily available online for
download
Where’s the data?
Sometimes you have to scrape it.
Where’s the data?
More often you need to go to an
agency to get the data
This can be tricky if an agency
doesn’t want to release it. (Stay
tuned for more on that…)
Where’s the data?
Budgets
Parking tickets
Elevator inspections
Health/food inspections
Campus crime
Campus data
SOURCE: Local health department
inspection reports
FINDINGS: At 28% of the venues,
more than half of the concession
stands or restaurants had been cited
for at least one "critical" or "major"
health violation.
Request records and data early
Get out and talk to real people
Keep track of your work and stay
organized
Understand the process of what
you’re covering
Tips for digging
Students are getting sick from eating
in the student center cafeteria.
• Who inspects the cafeteria?
• Has it has problems in the past?
• When/what did the students eat?
Did any of them file complaints?
Understand the process
Request records and data early
Get out and talk to real people
Keep track of your work and stay
organized
Understand the process of what
you’re covering
Confirm and corroborate
Make it something worth reading,
listening to, watching
Tips for digging
Sometimes, there is no data.
But it’s okay because there are
techniques for sampling and building
a database.
ProPublica pulled a random
sample of 500 names from a
list of individuals who had
been granted or denied
pardons (around 2,000). We
created a database from
months or researching
individuals: their crime, age,
sentence…
We found that even after
controlling for other factors,
whites were more likely to get
a pardon.
Stories don’t end at the
records. We must find people
to tell the stories
Source: School district
credit card purchases
Findings: District card
holders made
questionable
purchases with their
cards.
Data isn’t always rows and
columns
Source: 311 calls for downed trees
Findings: After a tornado swept across New York City, 311
calls for downed trees helps trace its path
Bulletproof your data
Before ever reporting data or building an app
Do integrity checks to find the flaws
Add caveats where necessary
Do your own analysis rather than relying on an
agency’s analysis
External checks
Read the documentation. Understand the
contents of every field.
Know how many records you should
have.
Check counts and totals against reports.
Are all possibilities included?
Internal checks
Compare fields to check for red flags
• More teachers than students
• More money going to vendors
than to contractors
• What things just don’t make
sense
Integrity checks for every data set
Check for missing data or misplaced data
Integrity checks for every data set
Check for missing data, misplaced data or blank
fields
Check for duplicates
Integrity checks for every data set
Check for missing data, misplaced data or blank
fields
Check for duplicates
Check for outliers and extreme ups and downs
Truck accidents by year and agency
Beyond the basics
Keep a notes file/git
Don’t work off your original data/documents
Know the source
Check against summary reports
Beyond the basics
Keep a notes file
Don’t work off your original database
Know the source
Check against summary reports
Use the right tool
Beyond the basics
Check with experts
Know the standards
Find out what others have done
Gut check – does it just seem wrong?
Beyond the basics
Check with experts
Know the standards
Find out what others have done
Gut check
Go physically see a record or spot check against
documents
Checks when you’re matching data
A name is not enough. Lots of people have the same name
Get dates of birth and
other information to
make sure you have
the correct person.
Even people with seemingly unique names aren’t so unique
Be transparent: Bounce your findings off targets and
tell users/readers/viewers what you did.
Erroneous government databases can often be
a story themselves
123456789 compared to 123-456-789
Sometimes missing data is the story
Sometimes you might have to fight for
records
(We’ll talk about this tomorrow)
Resources
Investigative Reporters and Editors: www.ire.org
Student Press Law Center: www.splc.org
@j_la28
jlafleur@cironline.org
revealnews.org

More Related Content

Viewers also liked

Data journalism without data
Data journalism without dataData journalism without data
Data journalism without dataJennifer LaFleur
 
Mind the Gap NICAR14 (holes in data)
Mind the Gap NICAR14 (holes in data)Mind the Gap NICAR14 (holes in data)
Mind the Gap NICAR14 (holes in data)Jennifer LaFleur
 
Number Off
Number OffNumber Off
Number OffLouka5
 
Diagnosing dirty data_ire2013
Diagnosing dirty data_ire2013Diagnosing dirty data_ire2013
Diagnosing dirty data_ire2013Jennifer LaFleur
 
Data journalism at Techraking 6
Data journalism at Techraking 6Data journalism at Techraking 6
Data journalism at Techraking 6Jennifer LaFleur
 
VVOJ Intro to data journalism
VVOJ Intro to data journalismVVOJ Intro to data journalism
VVOJ Intro to data journalismJennifer LaFleur
 
Cat techie aka vaidehi sachin nbc newsmakers broadcasting real hidden story
Cat techie aka vaidehi sachin nbc newsmakers broadcasting real hidden storyCat techie aka vaidehi sachin nbc newsmakers broadcasting real hidden story
Cat techie aka vaidehi sachin nbc newsmakers broadcasting real hidden storyVAidehi Sachin
 
Crunching the numbers NR14
Crunching the numbers NR14Crunching the numbers NR14
Crunching the numbers NR14Jennifer LaFleur
 
The CASTLE Principles - mini description
The CASTLE Principles - mini descriptionThe CASTLE Principles - mini description
The CASTLE Principles - mini descriptionLance Secretan
 
The CASTLE Principles - Presentation
The CASTLE Principles - PresentationThe CASTLE Principles - Presentation
The CASTLE Principles - PresentationLance Secretan
 

Viewers also liked (15)

Data journalism without data
Data journalism without dataData journalism without data
Data journalism without data
 
Mind the Gap NICAR14 (holes in data)
Mind the Gap NICAR14 (holes in data)Mind the Gap NICAR14 (holes in data)
Mind the Gap NICAR14 (holes in data)
 
Getting it the rightest
Getting it the rightestGetting it the rightest
Getting it the rightest
 
Number Off
Number OffNumber Off
Number Off
 
Diagnosing dirty data_ire2013
Diagnosing dirty data_ire2013Diagnosing dirty data_ire2013
Diagnosing dirty data_ire2013
 
Data journalism at Techraking 6
Data journalism at Techraking 6Data journalism at Techraking 6
Data journalism at Techraking 6
 
VVOJ Intro to data journalism
VVOJ Intro to data journalismVVOJ Intro to data journalism
VVOJ Intro to data journalism
 
Cat techie aka vaidehi sachin nbc newsmakers broadcasting real hidden story
Cat techie aka vaidehi sachin nbc newsmakers broadcasting real hidden storyCat techie aka vaidehi sachin nbc newsmakers broadcasting real hidden story
Cat techie aka vaidehi sachin nbc newsmakers broadcasting real hidden story
 
Crunching the numbers NR14
Crunching the numbers NR14Crunching the numbers NR14
Crunching the numbers NR14
 
ACP Getting the Goods
ACP Getting the GoodsACP Getting the Goods
ACP Getting the Goods
 
The CASTLE Principles - mini description
The CASTLE Principles - mini descriptionThe CASTLE Principles - mini description
The CASTLE Principles - mini description
 
The CASTLE Principles - Presentation
The CASTLE Principles - PresentationThe CASTLE Principles - Presentation
The CASTLE Principles - Presentation
 
Transparency ire13
Transparency ire13Transparency ire13
Transparency ire13
 
Ona 2012
Ona 2012Ona 2012
Ona 2012
 
Cats stats
Cats statsCats stats
Cats stats
 

Similar to ACP Digging Deeper

Data-driven enterprise off your beat - Doug Caruso - Columbus, Ohio, NewsTrai...
Data-driven enterprise off your beat - Doug Caruso - Columbus, Ohio, NewsTrai...Data-driven enterprise off your beat - Doug Caruso - Columbus, Ohio, NewsTrai...
Data-driven enterprise off your beat - Doug Caruso - Columbus, Ohio, NewsTrai...News Leaders Association's NewsTrain
 
Data collection methods
Data collection methodsData collection methods
Data collection methodsSourabh Modgil
 
Bringing a data mindset to your reporting - Brant Houston - Illinois NewsTrai...
Bringing a data mindset to your reporting - Brant Houston - Illinois NewsTrai...Bringing a data mindset to your reporting - Brant Houston - Illinois NewsTrai...
Bringing a data mindset to your reporting - Brant Houston - Illinois NewsTrai...News Leaders Association's NewsTrain
 
Developing a data mindset to improve stories every day - Aaron Mendelson - Fr...
Developing a data mindset to improve stories every day - Aaron Mendelson - Fr...Developing a data mindset to improve stories every day - Aaron Mendelson - Fr...
Developing a data mindset to improve stories every day - Aaron Mendelson - Fr...News Leaders Association's NewsTrain
 
Data, Data Wherever.pptx
Data, Data Wherever.pptxData, Data Wherever.pptx
Data, Data Wherever.pptxMenchBroqz1
 
Data driven enterprise off your beat - denver news train - april 11-12, 2019
Data driven enterprise off your beat - denver news train - april 11-12, 2019Data driven enterprise off your beat - denver news train - april 11-12, 2019
Data driven enterprise off your beat - denver news train - april 11-12, 2019News Leaders Association's NewsTrain
 
Data-driven enterprise off your beat - Sarah Cohen - Phoenix NewsTrain - Apri...
Data-driven enterprise off your beat - Sarah Cohen - Phoenix NewsTrain - Apri...Data-driven enterprise off your beat - Sarah Cohen - Phoenix NewsTrain - Apri...
Data-driven enterprise off your beat - Sarah Cohen - Phoenix NewsTrain - Apri...News Leaders Association's NewsTrain
 
Data-driven enterprise off your beat - Steve Doig - Seattle NewsTrain - 11.11.17
Data-driven enterprise off your beat - Steve Doig - Seattle NewsTrain - 11.11.17Data-driven enterprise off your beat - Steve Doig - Seattle NewsTrain - 11.11.17
Data-driven enterprise off your beat - Steve Doig - Seattle NewsTrain - 11.11.17News Leaders Association's NewsTrain
 
Questionnaires
QuestionnairesQuestionnaires
QuestionnairesBeth Lee
 
STUDENT REPLIESSTUDENT REPLY #1 Danielle BerlusThe evolution.docx
STUDENT REPLIESSTUDENT REPLY #1 Danielle BerlusThe evolution.docxSTUDENT REPLIESSTUDENT REPLY #1 Danielle BerlusThe evolution.docx
STUDENT REPLIESSTUDENT REPLY #1 Danielle BerlusThe evolution.docxlillie234567
 
Turning Data into Infographics: An Interactive Workshop for Problem Solvers
Turning Data into Infographics: An Interactive Workshop for Problem SolversTurning Data into Infographics: An Interactive Workshop for Problem Solvers
Turning Data into Infographics: An Interactive Workshop for Problem SolversUNCResearchHub
 
Trusting a Distributed Data Pipeline | Masters of Conversion
Trusting a Distributed Data Pipeline | Masters of ConversionTrusting a Distributed Data Pipeline | Masters of Conversion
Trusting a Distributed Data Pipeline | Masters of ConversionVWO
 
Data collection methods
Data collection methodsData collection methods
Data collection methodsashima_sodhi
 

Similar to ACP Digging Deeper (20)

Mendelson: Driving daily enterprise coverage
Mendelson: Driving daily enterprise coverageMendelson: Driving daily enterprise coverage
Mendelson: Driving daily enterprise coverage
 
Jerait PDF.pdf
Jerait PDF.pdfJerait PDF.pdf
Jerait PDF.pdf
 
Serial Killers Presentation1
Serial Killers Presentation1Serial Killers Presentation1
Serial Killers Presentation1
 
Umhoefer: Data-driven enterprise - handout
Umhoefer: Data-driven enterprise - handoutUmhoefer: Data-driven enterprise - handout
Umhoefer: Data-driven enterprise - handout
 
Data-driven enterprise off your beat - Doug Caruso - Columbus, Ohio, NewsTrai...
Data-driven enterprise off your beat - Doug Caruso - Columbus, Ohio, NewsTrai...Data-driven enterprise off your beat - Doug Caruso - Columbus, Ohio, NewsTrai...
Data-driven enterprise off your beat - Doug Caruso - Columbus, Ohio, NewsTrai...
 
DATA-COLLECTION.pptx
DATA-COLLECTION.pptxDATA-COLLECTION.pptx
DATA-COLLECTION.pptx
 
Data collection methods
Data collection methodsData collection methods
Data collection methods
 
Mendelson: Driving daily enterprise coverage
Mendelson: Driving daily enterprise coverageMendelson: Driving daily enterprise coverage
Mendelson: Driving daily enterprise coverage
 
AS Primary & Secondary Data
AS Primary & Secondary DataAS Primary & Secondary Data
AS Primary & Secondary Data
 
Bringing a data mindset to your reporting - Brant Houston - Illinois NewsTrai...
Bringing a data mindset to your reporting - Brant Houston - Illinois NewsTrai...Bringing a data mindset to your reporting - Brant Houston - Illinois NewsTrai...
Bringing a data mindset to your reporting - Brant Houston - Illinois NewsTrai...
 
Developing a data mindset to improve stories every day - Aaron Mendelson - Fr...
Developing a data mindset to improve stories every day - Aaron Mendelson - Fr...Developing a data mindset to improve stories every day - Aaron Mendelson - Fr...
Developing a data mindset to improve stories every day - Aaron Mendelson - Fr...
 
Data, Data Wherever.pptx
Data, Data Wherever.pptxData, Data Wherever.pptx
Data, Data Wherever.pptx
 
Data driven enterprise off your beat - denver news train - april 11-12, 2019
Data driven enterprise off your beat - denver news train - april 11-12, 2019Data driven enterprise off your beat - denver news train - april 11-12, 2019
Data driven enterprise off your beat - denver news train - april 11-12, 2019
 
Data-driven enterprise off your beat - Sarah Cohen - Phoenix NewsTrain - Apri...
Data-driven enterprise off your beat - Sarah Cohen - Phoenix NewsTrain - Apri...Data-driven enterprise off your beat - Sarah Cohen - Phoenix NewsTrain - Apri...
Data-driven enterprise off your beat - Sarah Cohen - Phoenix NewsTrain - Apri...
 
Data-driven enterprise off your beat - Steve Doig - Seattle NewsTrain - 11.11.17
Data-driven enterprise off your beat - Steve Doig - Seattle NewsTrain - 11.11.17Data-driven enterprise off your beat - Steve Doig - Seattle NewsTrain - 11.11.17
Data-driven enterprise off your beat - Steve Doig - Seattle NewsTrain - 11.11.17
 
Questionnaires
QuestionnairesQuestionnaires
Questionnaires
 
STUDENT REPLIESSTUDENT REPLY #1 Danielle BerlusThe evolution.docx
STUDENT REPLIESSTUDENT REPLY #1 Danielle BerlusThe evolution.docxSTUDENT REPLIESSTUDENT REPLY #1 Danielle BerlusThe evolution.docx
STUDENT REPLIESSTUDENT REPLY #1 Danielle BerlusThe evolution.docx
 
Turning Data into Infographics: An Interactive Workshop for Problem Solvers
Turning Data into Infographics: An Interactive Workshop for Problem SolversTurning Data into Infographics: An Interactive Workshop for Problem Solvers
Turning Data into Infographics: An Interactive Workshop for Problem Solvers
 
Trusting a Distributed Data Pipeline | Masters of Conversion
Trusting a Distributed Data Pipeline | Masters of ConversionTrusting a Distributed Data Pipeline | Masters of Conversion
Trusting a Distributed Data Pipeline | Masters of Conversion
 
Data collection methods
Data collection methodsData collection methods
Data collection methods
 

ACP Digging Deeper