Workshop data
Database Journalism:Finding the smoking gunSteve DoigArizona State Universitysteve.doig@asu.edu
Why use “data”?
Precision journalism
What is “data”?
Information -- but not data Steve Doig is a 65-year-old professorwho teaches at Arizona State Universityin Phoenix.
Now it’s data!LastnameFirstname Age Title CityDoig Steve 65 Professor PhoenixJones Bob 45 Editor SydneySmith Tom 34 Report...
Getting data WOBBING.EU Wet Openbaarheid van Bestuur (OpenGovernment Act) ISTAT.IT Request from agencies Build your o...
History lesson
Recent “big data” stories Education cheating (USA Today) UK MPs’ spending (Guardian) Medicare fraud (CalWatch) EU fish...
More examples Budgets and taxes Crime patterns School test scores Auto accidents Demographic change Pet licenses Ai...
Data journalism tools Analysis: Spreadsheet (Excel) Database (Access) Mapping (ArcMap) SPSS, SAS Text editor Social...
Patterns and outliers
Data power tool: MS Excel
A blank spreadsheet
What Excel can do Import data from many formats Sort data by one or more variables Filter data to show only selected ro...
Importing data Common formats *.xls (or *.xlsx) Fixed-width text Delimited text (comma, tab, etc) *.dbf files (old dB...
Delimited text example
Sorting a table
Now it’s sorted
Filtering: Data…Filter…Autofilter
Pick a category…
…and see just that
Transforming data Math functions Add, subtract, multiply, divide Average, median, maximum, minimum Date/Time functions...
Function Wizard (ƒx)
Function Wizard (ƒx)
Summarizing data
Example data
Pivot table example Data: Region, province, population,murders, etc. Question: “How many murders occurredin each region?...
Building a pivot table
Pivot table
Sorted pivot table
Data journalism resources
DataJournalismHandbook.org
Publications
DOMANDE?
Precision Journalism by Steve Doig
Upcoming SlideShare
Loading in …5
×

Precision Journalism by Steve Doig

1,439 views

Published on

These slides are by Steve Doig, journalism professor at ASU's Cronkite School and Pulitzer-winning journalist. The slides are from Doig's workshop Excel for Journalists, part of the School of Data Journalism 2013 at the International Journalism Festival, Perugia.

Published in: Technology, News & Politics
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
1,439
On SlideShare
0
From Embeds
0
Number of Embeds
481
Actions
Shares
0
Downloads
31
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide
  • Evidence, not just anecdote
  • Tools of social science to find stories: Statistics Polling Experiment
  • Data=organized information, Information in table form Columns are the variables Name, date, time, address, age, etc. Rows are the records Persons, incidents, etc.
  • Cronkite in ‘ 52 Meyer in the ‘ 60s Barlett & Steele in the ‘ 60s “ precision journalism ” in 1972 Reporters discover Apple/PC in early ‘ 80s “ Color of Money ” in 1989 “ What Went Wrong ” in 1992 – smoking gun
  • Data journalism is all about finding patterns and spotting outliers Journalists like outliers
  • Good stories can be found in the patterns of data Human mind alone can ’t see the patterns in large sets of data Excel has tools to help us see the patterns in data in table form
  • Excel can handle large tables More than 16,000 columns More than 1 million rows
  • We often want to take a big collection of individual records and pile them into categories Counts, averages, totals, etc… Trick: Visualize the piece of paper that would give you the answer you seek Tool: Pivot tables
  • Where to learn more
  • Precision Journalism by Steve Doig

    1. 1. Workshop data
    2. 2. Database Journalism:Finding the smoking gunSteve DoigArizona State Universitysteve.doig@asu.edu
    3. 3. Why use “data”?
    4. 4. Precision journalism
    5. 5. What is “data”?
    6. 6. Information -- but not data Steve Doig is a 65-year-old professorwho teaches at Arizona State Universityin Phoenix.
    7. 7. Now it’s data!LastnameFirstname Age Title CityDoig Steve 65 Professor PhoenixJones Bob 45 Editor SydneySmith Tom 34 Reporter Melbourne
    8. 8. Getting data WOBBING.EU Wet Openbaarheid van Bestuur (OpenGovernment Act) ISTAT.IT Request from agencies Build your own?
    9. 9. History lesson
    10. 10. Recent “big data” stories Education cheating (USA Today) UK MPs’ spending (Guardian) Medicare fraud (CalWatch) EU fishing subsidies (ICIJ)
    11. 11. More examples Budgets and taxes Crime patterns School test scores Auto accidents Demographic change Pet licenses Air quality Sports statistics
    12. 12. Data journalism tools Analysis: Spreadsheet (Excel) Database (Access) Mapping (ArcMap) SPSS, SAS Text editor Social networkanalysis (NodeXL) Presentation: Google fusion tables Ruby, Django, perl,python, et al. Photoshop
    13. 13. Patterns and outliers
    14. 14. Data power tool: MS Excel
    15. 15. A blank spreadsheet
    16. 16. What Excel can do Import data from many formats Sort data by one or more variables Filter data to show only selected rows Transform data using functions andformulas Summarize data into categories
    17. 17. Importing data Common formats *.xls (or *.xlsx) Fixed-width text Delimited text (comma, tab, etc) *.dbf files (old dBase) HTML tables Data Import Wizard will help
    18. 18. Delimited text example
    19. 19. Sorting a table
    20. 20. Now it’s sorted
    21. 21. Filtering: Data…Filter…Autofilter
    22. 22. Pick a category…
    23. 23. …and see just that
    24. 24. Transforming data Math functions Add, subtract, multiply, divide Average, median, maximum, minimum Date/Time functions Day of week, days between Text functions Extract parts of text strings Search and replace text
    25. 25. Function Wizard (ƒx)
    26. 26. Function Wizard (ƒx)
    27. 27. Summarizing data
    28. 28. Example data
    29. 29. Pivot table example Data: Region, province, population,murders, etc. Question: “How many murders occurredin each region?” Visualize the piece of paper that wouldanswer the question
    30. 30. Building a pivot table
    31. 31. Pivot table
    32. 32. Sorted pivot table
    33. 33. Data journalism resources
    34. 34. DataJournalismHandbook.org
    35. 35. Publications
    36. 36. DOMANDE?

    ×