Monday, 5 March 2012
Watch: West Wing on mapswww.youtube.com/watch?v=n8zBC2dvERMMonday, 5 March 2012
Data 2:    clean, combine, communicate   Online Journalism   City University   Paul BradshawMonday, 5 March 2012
Themes   5 things you need to know about each   Data journalism in action   WalkthroughMonday, 5 March 2012
How clean is the data?   .Monday, 5 March 2012
Watch: The Wire: http://www.youtube.com/watch?v=xH_6_8NOfwIMonday, 5 March 2012
http://onlinejournalismblog.com/2010/10/12/statistical-analysis-as-journalism-benfords-law/Monday, 5 March 2012
“With the help of just Benford’s law and data sets to compare he’s able to demonstrate how the police are systematically h...
http://delicious.com/paulb/benfordslawMonday, 5 March 2012
5 things you need to know about    cleaning data   1. Data always needs cleaning up   2. Treat the ‘source’ like a source ...
Monday, 5 March 2012
“What the Independent have done is confuse the UK’s deficit with our debt [making] the debt problem look around eight time...
Monday, 5 March 2012
Question?  A town has two hospitals. Hospital A is  bigger than hospital B. One of them has  a birth rate of 60% boys. Whi...
Question?  The smaller hospital is more likely to  have a 60% birth rate - larger samples  are more stable.Monday, 5 March...
http://blog.ouseful.info/2011/10/31/power-tools-for-aspiring-data-journalists-r/   16Monday, 5 March 2012
What is the data worth?   Measurement doesnt answer anything if   theres only one variable   Statistical significance   Sa...
Getting data ready to answer    questions   Data > Text to columns or =SPLIT   Find & replace   =IF(condition, if met, if ...
Walkthrough: cleaning data in    Google Refine   Edit cells > common transforms   Edit cells > split multi-valued cells   ...
Communicating data stories   .Monday, 5 March 2012
Monday, 5 March 2012
5 things you need to know about    visualising data   1. Choose the chart for the purpose   2. For answers or for story?  ...
or http://chartchooser.juiceanalytics.com/Monday, 5 March 2012
http://junkcharts.typepad.com/junk_charts/trifecta-checkup/Monday, 5 March 2012
What is wrong with this picture?Monday, 5 March 2012
Monday, 5 March 2012
http://simplecomplexity.net/statistics-without-context/Monday, 5 March 2012
.Monday, 5 March 2012
Visualisation tools   ManyEyes, Tableau, Number Picture   Wordle, Tagxedo   BatchGeo, FusionTables   Gephi   Delicious.com...
Distribution: getting social Publish embed code & link to data Have or join a Flickr group for visualisations, comment on ...
Mashing data   .Monday, 5 March 2012
5 things you need to know about    mashing data   1. It is what a journalist does best   2. Look for a point of connection...
Monday, 5 March 2012
Mashup tools   Yahoo! Pipes, xFruits   OpenHeatMap   Mapalist, Maptube, FusionTables   Scraperwiki   Google RefineMonday, ...
Walkthrough: grabbing geo data    with Google Refine   Edit column > Add column by fetching   URLs   Use GREL (Google Refi...
Questions?  .Monday, 5 March 2012
Links   OnlineJournalismClasses.tumblr.com   Delicious.com/paulb/cityoj09   Delicious.com/paulb/datajournalism   Delicious...
Lab  Before the lab: play with these  techniques yourself, have problems,  find solutions, raise questions. Install  Googl...
Books   Kaiser Fung - Numbers Rule Your World   Ben Goldacre - Bad Science   Donna Wong - The WSJ Guide to   Information G...
Upcoming SlideShare
Loading in...5
×

Data Journalism 2: cleaning, combining, communicating

1,171

Published on

1 Comment
4 Likes
Statistics
Notes
No Downloads
Views
Total Views
1,171
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
30
Comments
1
Likes
4
Embeds 0
No embeds

No notes for slide

Data Journalism 2: cleaning, combining, communicating

  1. 1. Monday, 5 March 2012
  2. 2. Watch: West Wing on mapswww.youtube.com/watch?v=n8zBC2dvERMMonday, 5 March 2012
  3. 3. Data 2: clean, combine, communicate Online Journalism City University Paul BradshawMonday, 5 March 2012
  4. 4. Themes 5 things you need to know about each Data journalism in action WalkthroughMonday, 5 March 2012
  5. 5. How clean is the data? .Monday, 5 March 2012
  6. 6. Watch: The Wire: http://www.youtube.com/watch?v=xH_6_8NOfwIMonday, 5 March 2012
  7. 7. http://onlinejournalismblog.com/2010/10/12/statistical-analysis-as-journalism-benfords-law/Monday, 5 March 2012
  8. 8. “With the help of just Benford’s law and data sets to compare he’s able to demonstrate how the police are systematically hiding over a thousand murders a year in a single state, and that’s just in one small part of the article”Monday, 5 March 2012 - Pete Warden
  9. 9. http://delicious.com/paulb/benfordslawMonday, 5 March 2012
  10. 10. 5 things you need to know about cleaning data 1. Data always needs cleaning up 2. Treat the ‘source’ like a source 3. Use the right ‘average’ and percentage 4. Watch for changing context: inflation, boundaries, classification 5. Always work on copies of raw dataMonday, 5 March 2012
  11. 11. Monday, 5 March 2012
  12. 12. “What the Independent have done is confuse the UK’s deficit with our debt [making] the debt problem look around eight times worse than it is. And it used the whole of its front page to do so.” - James BallMonday, 5 March 2012
  13. 13. Monday, 5 March 2012
  14. 14. Question? A town has two hospitals. Hospital A is bigger than hospital B. One of them has a birth rate of 60% boys. Which one is it more likely to be?Monday, 5 March 2012
  15. 15. Question? The smaller hospital is more likely to have a 60% birth rate - larger samples are more stable.Monday, 5 March 2012
  16. 16. http://blog.ouseful.info/2011/10/31/power-tools-for-aspiring-data-journalists-r/ 16Monday, 5 March 2012
  17. 17. What is the data worth? Measurement doesnt answer anything if theres only one variable Statistical significance Sample size and selection Controls and the placebo effect Regression to the mean Read up.Monday, 5 March 2012
  18. 18. Getting data ready to answer questions Data > Text to columns or =SPLIT Find & replace =IF(condition, if met, if not) =TRIM, =CONCATENATE =RIGHT, =LEFT, =MID =REPLACE, =SUBSTITUTE =LENMonday, 5 March 2012
  19. 19. Walkthrough: cleaning data in Google Refine Edit cells > common transforms Edit cells > split multi-valued cells Facet > text facet Export...Monday, 5 March 2012
  20. 20. Communicating data stories .Monday, 5 March 2012
  21. 21. Monday, 5 March 2012
  22. 22. 5 things you need to know about visualising data 1. Choose the chart for the purpose 2. For answers or for story? 3. Good design is when there’s nothing more to take away 4. It should be self-contained & have refs 5. Be careful with scales and classesMonday, 5 March 2012
  23. 23. or http://chartchooser.juiceanalytics.com/Monday, 5 March 2012
  24. 24. http://junkcharts.typepad.com/junk_charts/trifecta-checkup/Monday, 5 March 2012
  25. 25. What is wrong with this picture?Monday, 5 March 2012
  26. 26. Monday, 5 March 2012
  27. 27. http://simplecomplexity.net/statistics-without-context/Monday, 5 March 2012
  28. 28. .Monday, 5 March 2012
  29. 29. Visualisation tools ManyEyes, Tableau, Number Picture Wordle, Tagxedo BatchGeo, FusionTables Gephi Delicious.com/paulb/vis+toolsMonday, 5 March 2012
  30. 30. Distribution: getting social Publish embed code & link to data Have or join a Flickr group for visualisations, comment on others Tumblr blog Digg, Reddit, Stumbleupon BuzzdataMonday, 5 March 2012
  31. 31. Mashing data .Monday, 5 March 2012
  32. 32. 5 things you need to know about mashing data 1. It is what a journalist does best 2. Look for a point of connection: place? Person? Company? Date? Code? 3. Mashups can be live, updated or static 4. What an API can do 5. What APIs there areMonday, 5 March 2012
  33. 33. Monday, 5 March 2012
  34. 34. Mashup tools Yahoo! Pipes, xFruits OpenHeatMap Mapalist, Maptube, FusionTables Scraperwiki Google RefineMonday, 5 March 2012
  35. 35. Walkthrough: grabbing geo data with Google Refine Edit column > Add column by fetching URLs Use GREL (Google Refine Expression Language) Search web for help & examplesMonday, 5 March 2012
  36. 36. Questions? .Monday, 5 March 2012
  37. 37. Links OnlineJournalismClasses.tumblr.com Delicious.com/paulb/cityoj09 Delicious.com/paulb/datajournalism Delicious.com/paulb/visualisation Delicious.com/paulb/statistics Delicious.com/paulb/mashupsMonday, 5 March 2012
  38. 38. Lab Before the lab: play with these techniques yourself, have problems, find solutions, raise questions. Install Google Refine and Tableau on your laptop to use. - Visualise, interrogate or mash dataMonday, 5 March 2012
  39. 39. Books Kaiser Fung - Numbers Rule Your World Ben Goldacre - Bad Science Donna Wong - The WSJ Guide to Information Graphics Brian Suda - A Practical Guide to Designing with DataMonday, 5 March 2012
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×