Data Journalism 2: cleaning, combining, communicating
 

Usage Rights

CC Attribution-NonCommercial License

    Presentation Transcript

    • Monday, 5 March 2012
    • Watch: West Wing on maps: www.youtube.com/watch?v=n8zBC2dvERM
    • Data 2: clean, combine, communicate. Online Journalism, City University. Paul Bradshaw
    • Themes: 5 things you need to know about each; data journalism in action; walkthrough
    • How clean is the data?
    • Watch: The Wire: http://www.youtube.com/watch?v=xH_6_8NOfwI
    • http://onlinejournalismblog.com/2010/10/12/statistical-analysis-as-journalism-benfords-law/
    • “With the help of just Benford’s law and data sets to compare he’s able to demonstrate how the police are systematically hiding over a thousand murders a year in a single state, and that’s just in one small part of the article” - Pete Warden
    • http://delicious.com/paulb/benfordslaw
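
Benford's law, referenced in the slides above, predicts that in many naturally occurring sets of numbers a leading digit d appears with frequency log10(1 + 1/d), so about 30% of values start with 1. The sketch below is an illustration only and is not taken from the slides: the function names are invented here, and the sample is simply the first 100 powers of 2, a sequence known to fit the law closely. With real figures a large gap between observed and expected shares is a prompt for further questions, not proof of manipulation.

```python
import math
from collections import Counter


def benford_expected(digit):
    """Share of values Benford's law predicts will start with this digit."""
    return math.log10(1 + 1 / digit)


def leading_digit_shares(values):
    """Observed share of each leading digit (1-9) among positive integers."""
    digits = [int(str(v)[0]) for v in values if v > 0]
    counts = Counter(digits)
    return {d: counts[d] / len(digits) for d in range(1, 10)}


# Illustrative sample (an assumption, not data from the slides): powers of 2.
sample = [2 ** k for k in range(1, 101)]
observed = leading_digit_shares(sample)

# Print digit, expected share, observed share side by side.
for d in range(1, 10):
    print(d, round(benford_expected(d), 3), round(observed[d], 3))
```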
    • 5 things you need to know about cleaning data
      1. Data always needs cleaning up
      2. Treat the ‘source’ like a source
      3. Use the right ‘average’ and percentage
      4. Watch for changing context: inflation, boundaries, classification
      5. Always work on copies of raw data
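
Points 3 and 4 can be made concrete with a little arithmetic. The sketch below uses invented figures (the pay list, the rates and the price index values are all assumptions for illustration) to show how mean and median diverge when one value is extreme, how a one percentage point rise can also be a 25% rise, and how a past amount is restated in today's prices.

```python
from statistics import mean, median


def percentage_change(old, new):
    """Relative change from old to new, as a percentage of old."""
    return (new - old) / old * 100


def in_real_terms(amount, index_then, index_now):
    """Restate a past amount in current prices using a price index."""
    return amount * index_now / index_then


# Hypothetical salaries: one very high earner drags the mean upwards.
pay = [20000, 22000, 25000, 27000, 250000]
print("mean:", mean(pay), "median:", median(pay))

# A rate moving from 4% to 5% is 1 percentage point, but a 25% increase.
print("relative rise:", percentage_change(4, 5), "%")

# Hypothetical index: 80 in the earlier year, 100 now.
print("100 then is worth", in_real_terms(100, 80, 100), "now")
```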
    • “What the Independent have done is confuse the UK’s deficit with our debt [making] the debt problem look around eight times worse than it is. And it used the whole of its front page to do so.” - James Ball
    • Question? A town has two hospitals. Hospital A is bigger than hospital B. One of them has a birth rate of 60% boys. Which one is it more likely to be?
    • Answer: the smaller hospital is more likely to have a 60% birth rate of boys, because larger samples are more stable.
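
The hospital answer can be checked with the binomial distribution. The sketch below assumes each birth is a boy with probability 0.5 and picks two hospital sizes, 15 and 45 births, purely for illustration (the slides give no sizes). It computes the chance that at least 60% of a batch of births are boys.

```python
from math import comb


def prob_at_least_percent_boys(births, percent=60, p_boy=0.5):
    """Probability that at least `percent`% of `births` are boys."""
    # Smallest whole number of boys that reaches the percentage (integer ceiling).
    min_boys = (births * percent + 99) // 100
    return sum(
        comb(births, k) * p_boy ** k * (1 - p_boy) ** (births - k)
        for k in range(min_boys, births + 1)
    )


small = prob_at_least_percent_boys(15)  # assumed size of the smaller hospital
large = prob_at_least_percent_boys(45)  # assumed size of the larger hospital
print("small hospital:", round(small, 3))
print("large hospital:", round(large, 3))
```

The smaller batch reaches 60% boys far more often, which is the point of the slide: extreme percentages are less surprising in small samples.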
    • http://blog.ouseful.info/2011/10/31/power-tools-for-aspiring-data-journalists-r/
    • What is the data worth?
      • Measurement doesn't answer anything if there's only one variable
      • Statistical significance
      • Sample size and selection
      • Controls and the placebo effect
      • Regression to the mean
      • Read up.
    • Getting data ready to answer questions
      • Data > Text to columns or =SPLIT
      • Find & replace
      • =IF(condition, if met, if not)
      • =TRIM, =CONCATENATE
      • =RIGHT, =LEFT, =MID
      • =REPLACE, =SUBSTITUTE
      • =LEN
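
For readers who want to see what the spreadsheet functions above do, here is a rough Python equivalent of each. This is a sketch for comparison, not part of the original slides: the helper names and the sample name are made up, and spreadsheet edge cases (for example non-text cells) are ignored.

```python
def trim(text):
    """Like =TRIM: strip the ends and collapse repeated spaces."""
    return " ".join(text.split())


def left(text, n):
    """Like =LEFT: the first n characters."""
    return text[:n]


def right(text, n):
    """Like =RIGHT: the last n characters."""
    return text[max(len(text) - n, 0):]


def mid(text, start, length):
    """Like =MID: `length` characters from 1-based position `start`."""
    return text[start - 1:start - 1 + length]


raw = "  Smith,   John  "                                   # hypothetical messy cell
cleaned = trim(raw)                                         # =TRIM
surname, forename = [trim(p) for p in cleaned.split(",")]   # =SPLIT / Text to columns
full_name = forename + " " + surname                        # =CONCATENATE
respelled = full_name.replace("Smith", "Smyth")             # =SUBSTITUTE
label = "long" if len(full_name) > 8 else "short"           # =IF with =LEN

print(cleaned, "|", full_name, "|", respelled, "|", label)
```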
    • Walkthrough: cleaning data in Google Refine
      • Edit cells > common transforms
      • Edit cells > split multi-valued cells
      • Facet > text facet
      • Export...
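
The three Refine steps in the walkthrough can be imitated in a few lines of Python to show what each one does to a column. This is only an approximation of the idea, not Refine's own implementation; the column values and function names are invented for the example.

```python
from collections import Counter


def split_multi_valued(cells, separator=";"):
    """Roughly 'split multi-valued cells': one output value per item."""
    return [part for cell in cells for part in cell.split(separator)]


def common_transform(value):
    """Roughly 'common transforms': trim, collapse whitespace, title case."""
    return " ".join(value.split()).title()


def text_facet(values):
    """Roughly 'text facet': count how often each distinct value occurs."""
    return Counter(values)


# Hypothetical messy column of place names.
column = ["London ", "london", "Leeds; Bristol", "  LONDON"]

tidy = [common_transform(v) for v in split_multi_valued(column)]
facet = text_facet(tidy)
print(facet.most_common())
```

Running the facet before and after the transform is a quick way to see how many apparent categories were just spelling or spacing variants.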
    • Communicating data stories
    • 5 things you need to know about visualising data
      1. Choose the chart for the purpose
      2. For answers or for story?
      3. Good design is when there’s nothing more to take away
      4. It should be self-contained & have refs
      5. Be careful with scales and classes
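
Point 5 is easy to demonstrate numerically. The sketch below, with invented values and function names, shows two common traps: a truncated axis makes a 5% difference look like a doubling, and the choice of class breaks (equal intervals versus a simple quantile split) puts the same values into very different groups when there is an outlier.

```python
def drawn_ratio(a, b, axis_start=0):
    """How many times taller bar b looks than bar a for a given axis start."""
    return (b - axis_start) / (a - axis_start)


def equal_interval_breaks(values, classes):
    """Class boundaries that cut the value range into equal-width bands."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / classes
    return [lo + width * i for i in range(1, classes)]


def quantile_breaks(values, classes):
    """Simple quantile-style boundaries: roughly equal counts per class."""
    ordered = sorted(values)
    return [ordered[len(ordered) * i // classes] for i in range(1, classes)]


print(drawn_ratio(95, 100))                  # axis from 0: bars look similar
print(drawn_ratio(95, 100, axis_start=90))   # axis from 90: one looks double

values = [1, 2, 3, 4, 5, 6, 7, 8, 100]       # hypothetical data with an outlier
print(equal_interval_breaks(values, 3))      # nearly everything lands in class 1
print(quantile_breaks(values, 3))            # three values per class
```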
    • or http://chartchooser.juiceanalytics.com/
    • http://junkcharts.typepad.com/junk_charts/trifecta-checkup/
    • What is wrong with this picture?
    • http://simplecomplexity.net/statistics-without-context/
    • Visualisation tools
      • ManyEyes, Tableau, Number Picture
      • Wordle, Tagxedo
      • BatchGeo, FusionTables
      • Gephi
      • Delicious.com/paulb/vis+tools
    • Distribution: getting social
      • Publish embed code & link to data
      • Have or join a Flickr group for visualisations, comment on others
      • Tumblr blog
      • Digg, Reddit, Stumbleupon
      • Buzzdata
    • Mashing data
    • 5 things you need to know about mashing data
      1. It is what a journalist does best
      2. Look for a point of connection: place? Person? Company? Date? Code?
      3. Mashups can be live, updated or static
      4. What an API can do
      5. What APIs there are
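
The "point of connection" in point 2 is what programmers call a join key. As a minimal sketch, assuming two small made-up datasets that share an area name, the code below matches rows on that key after tidying case and spacing, since the same place is rarely typed identically in two sources. The function names and all figures are invented for illustration.

```python
def normalise(key):
    """Tidy a key so 'north ' and 'North' are treated as the same place."""
    return " ".join(key.split()).upper()


def mash(left_rows, right_rows, key):
    """Combine rows from two datasets wherever the shared key matches."""
    index = {normalise(row[key]): row for row in right_rows}
    merged = []
    for row in left_rows:
        match = index.get(normalise(row[key]))
        if match is not None:
            merged.append({**match, **row})  # left-hand values win on clashes
    return merged


# Hypothetical datasets connected by area name.
spending = [{"area": "North", "spend": 120}, {"area": "South ", "spend": 90}]
population = [{"area": "north", "population": 1000}, {"area": "East", "population": 2000}]

combined = mash(spending, population, "area")
print(combined)
```

Rows without a match on either side are dropped here; in practice those unmatched rows are worth listing, because they often reveal the cleaning still to be done.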
    • Mashup tools
      • Yahoo! Pipes, xFruits
      • OpenHeatMap
      • Mapalist, Maptube, FusionTables
      • Scraperwiki
      • Google Refine
    • Walkthrough: grabbing geo data with Google Refine
      • Edit column > Add column by fetching URLs
      • Use GREL (Google Refine Expression Language)
      • Search web for help & examples
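
"Add column by fetching URLs" works by building one web address per row, fetching it, and then pulling values out of the response. The Python sketch below shows the same two steps without making any network request. The geocoder address and the shape of the JSON response are assumptions made up for this example; a real service will document its own URL format and response fields.

```python
import json
from urllib.parse import quote

# Hypothetical geocoding endpoint, used only to illustrate URL building.
BASE_URL = "https://geocoder.example/json?address="


def build_url(address):
    """Step 1: turn a cell value into a URL, escaping spaces and commas."""
    return BASE_URL + quote(address)


def extract_lat_lng(response_text):
    """Step 2: parse the JSON text and pull out latitude and longitude."""
    data = json.loads(response_text)
    location = data["results"][0]["location"]  # assumed response structure
    return location["lat"], location["lng"]


url = build_url("Northampton Square, London")
print(url)

# Made-up response text in the assumed structure.
sample_response = '{"results": [{"location": {"lat": 51.5, "lng": -0.1}}]}'
print(extract_lat_lng(sample_response))
```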
    • Questions?
    • Links
      • OnlineJournalismClasses.tumblr.com
      • Delicious.com/paulb/cityoj09
      • Delicious.com/paulb/datajournalism
      • Delicious.com/paulb/visualisation
      • Delicious.com/paulb/statistics
      • Delicious.com/paulb/mashups
    • Lab
      • Before the lab: play with these techniques yourself, have problems, find solutions, raise questions.
      • Install Google Refine and Tableau on your laptop to use.
      • Visualise, interrogate or mash data
    • Books
      • Kaiser Fung - Numbers Rule Your World
      • Ben Goldacre - Bad Science
      • Donna Wong - The WSJ Guide to Information Graphics
      • Brian Suda - A Practical Guide to Designing with Data