An introduction to Data Journalism

458 views

Published on

Presentation at School of Data training on May 14th for journalists at training with Open Data PH Taskforce in the Philippines.

0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
458
On SlideShare
0
From Embeds
0
Number of Embeds
4
Actions
Shares
0
Downloads
10
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide
  • The Tragedy of the Commons – a phrase of the strange economist and ecologist Garrett Hardin – describes a situation where too many people in a village graze their cows on the common. Even as the overgrazed common becomes less productive, individual commoners still benefit from grazing one more cow there.
    Not many people today graze their cows on commons, but the recent history of North Sea cod stocks show that the tragedy of the commons is very much alive today.
  • The Tragedy of the Commons – a phrase of the strange economist and ecologist Garrett Hardin – describes a situation where too many people in a village graze their cows on the common. Even as the overgrazed common becomes less productive, individual commoners still benefit from grazing one more cow there.
    Not many people today graze their cows on commons, but the recent history of North Sea cod stocks show that the tragedy of the commons is very much alive today.
  • The Tragedy of the Commons – a phrase of the strange economist and ecologist Garrett Hardin – describes a situation where too many people in a village graze their cows on the common. Even as the overgrazed common becomes less productive, individual commoners still benefit from grazing one more cow there.
    Not many people today graze their cows on commons, but the recent history of North Sea cod stocks show that the tragedy of the commons is very much alive today.
  • The Tragedy of the Commons – a phrase of the strange economist and ecologist Garrett Hardin – describes a situation where too many people in a village graze their cows on the common. Even as the overgrazed common becomes less productive, individual commoners still benefit from grazing one more cow there.
    Not many people today graze their cows on commons, but the recent history of North Sea cod stocks show that the tragedy of the commons is very much alive today.
  • The Tragedy of the Commons – a phrase of the strange economist and ecologist Garrett Hardin – describes a situation where too many people in a village graze their cows on the common. Even as the overgrazed common becomes less productive, individual commoners still benefit from grazing one more cow there.
    Not many people today graze their cows on commons, but the recent history of North Sea cod stocks show that the tragedy of the commons is very much alive today.
  • The Tragedy of the Commons – a phrase of the strange economist and ecologist Garrett Hardin – describes a situation where too many people in a village graze their cows on the common. Even as the overgrazed common becomes less productive, individual commoners still benefit from grazing one more cow there.
    Not many people today graze their cows on commons, but the recent history of North Sea cod stocks show that the tragedy of the commons is very much alive today.
  • The Tragedy of the Commons – a phrase of the strange economist and ecologist Garrett Hardin – describes a situation where too many people in a village graze their cows on the common. Even as the overgrazed common becomes less productive, individual commoners still benefit from grazing one more cow there.
    Not many people today graze their cows on commons, but the recent history of North Sea cod stocks show that the tragedy of the commons is very much alive today.
  • The Tragedy of the Commons – a phrase of the strange economist and ecologist Garrett Hardin – describes a situation where too many people in a village graze their cows on the common. Even as the overgrazed common becomes less productive, individual commoners still benefit from grazing one more cow there.
    Not many people today graze their cows on commons, but the recent history of North Sea cod stocks show that the tragedy of the commons is very much alive today.
  • The Tragedy of the Commons – a phrase of the strange economist and ecologist Garrett Hardin – describes a situation where too many people in a village graze their cows on the common. Even as the overgrazed common becomes less productive, individual commoners still benefit from grazing one more cow there.
    Not many people today graze their cows on commons, but the recent history of North Sea cod stocks show that the tragedy of the commons is very much alive today.
  • The Tragedy of the Commons – a phrase of the strange economist and ecologist Garrett Hardin – describes a situation where too many people in a village graze their cows on the common. Even as the overgrazed common becomes less productive, individual commoners still benefit from grazing one more cow there.
    Not many people today graze their cows on commons, but the recent history of North Sea cod stocks show that the tragedy of the commons is very much alive today.
  • The Tragedy of the Commons – a phrase of the strange economist and ecologist Garrett Hardin – describes a situation where too many people in a village graze their cows on the common. Even as the overgrazed common becomes less productive, individual commoners still benefit from grazing one more cow there.
    Not many people today graze their cows on commons, but the recent history of North Sea cod stocks show that the tragedy of the commons is very much alive today.
  • The Tragedy of the Commons – a phrase of the strange economist and ecologist Garrett Hardin – describes a situation where too many people in a village graze their cows on the common. Even as the overgrazed common becomes less productive, individual commoners still benefit from grazing one more cow there.
    Not many people today graze their cows on commons, but the recent history of North Sea cod stocks show that the tragedy of the commons is very much alive today.
  • The Tragedy of the Commons – a phrase of the strange economist and ecologist Garrett Hardin – describes a situation where too many people in a village graze their cows on the common. Even as the overgrazed common becomes less productive, individual commoners still benefit from grazing one more cow there.
    Not many people today graze their cows on commons, but the recent history of North Sea cod stocks show that the tragedy of the commons is very much alive today.
  • The Tragedy of the Commons – a phrase of the strange economist and ecologist Garrett Hardin – describes a situation where too many people in a village graze their cows on the common. Even as the overgrazed common becomes less productive, individual commoners still benefit from grazing one more cow there.
    Not many people today graze their cows on commons, but the recent history of North Sea cod stocks show that the tragedy of the commons is very much alive today.
  • The Tragedy of the Commons – a phrase of the strange economist and ecologist Garrett Hardin – describes a situation where too many people in a village graze their cows on the common. Even as the overgrazed common becomes less productive, individual commoners still benefit from grazing one more cow there.
    Not many people today graze their cows on commons, but the recent history of North Sea cod stocks show that the tragedy of the commons is very much alive today.
  • The Tragedy of the Commons – a phrase of the strange economist and ecologist Garrett Hardin – describes a situation where too many people in a village graze their cows on the common. Even as the overgrazed common becomes less productive, individual commoners still benefit from grazing one more cow there.
    Not many people today graze their cows on commons, but the recent history of North Sea cod stocks show that the tragedy of the commons is very much alive today.
  • Start by making an inventory of what data you hold; there are probably more types than you would guess at first. Here are some types and examples that a local authority might come up with.

    Administrative: spending, budgets, org chart, energy use, departments, contact details.
    Services: bin collection dates, school enrollment, fines, licences, housing lists.
    Demographic: households, tenures, age, occupation types.
    Democratic: election data, members, meeting dates, minutes, ward boundaries.
    Geospatial: public buildings, parks, cycle racks, bus stops.
    Environmental: air pollution, water quality, flood plain, contaminated land.
    Real time: transport, parking spaces, anonymised traffic data.
    Planning: applications, housing stock, land use, green belt, catastral data.
  • An introduction to Data Journalism

    1. 1. Data journalism - setting the stage Anders Pedersen @anpe @SchoolOfData
    2. 2. Open Knowledge Open Knowledge is a worldwide non-profit network of people passionate about openness, using advocacy, technology and training to unlock information and enable people to work with it to create and share knowledge.
    3. 3. Evidence is power School of Data works to empower civil society organizations, journalists and citizens with the skills they need to use data effectively – in their effort to create better societies.
    4. 4. Target audience We work mostly with change makers: NGOs and journalists. We empower them to use data effectively to advance their cause and mission through a combination of training and long terms support.
    5. 5. Why School of Data School of Data is a critical component of the open data ecosystem: ● provides tools and training to empower people to use open data for good - especially to people new to open data; ● supports outreach and engagement by creating a supportive community of learners and mentors - working with Open Knowledge Foundation Local Groups; ● creates opportunities for people and communities to use open data to make an impact; ● works both with governments to open up data and data users such as journalists and NGOs.
    6. 6. Slide name here ● Data expeditions - online and offline short gatherings where a group of people with different backgrounds tackle a data related problem ● Data clinics - hands on support working directly with people’s data ● Mentoring - local mentors working with local communities ● Online content - tutorial and walkthroughs ● Offline resources e.g. Data Journalism Handbook
    7. 7. Slide name here ● We work globally, with a focus on the following regions: Latin America, Sub Saharan Africa and Middle East, Europe ● School of Data is translated in Spanish and Portuguese ● Future: French, Greek and Italian ● Over 10 fellows working in countries like: Egypt, Lebanon, Uganda, Mexico, Costa Rica, Brazil, etc.
    8. 8. Data Journalism: Setting the stage
    9. 9. Where do gun owners live? Complex stories can now be told
    10. 10. Budget information that readers can understand But be aware of complexity!
    11. 11. How quickly will the ambulance arrive? Source: http://visualoop.com/media/2012/11/How-fast-is-LAFD- where-you-live-750x298.jpg Enables you to focus locally
    12. 12. And how about the fire truck? Fire fighter response times in London
    13. 13. Granularity is king Tip: the story is almost always buried in granular data Source: Mapumental
    14. 14. Granularity is king Who benefits from government subsidies?
    15. 15. Who are benefiting from government contracts? Source: http://usual-suppliers.pudo.org/
    16. 16. Data journalism is also text mining ● U.K. MP expenses – 700,000 documents in PDF- format ● Wikileaks Iraq war data – 391,832 structured records, each including a text descriptions ● Wikileaks diplomatic cables – 251,287 cables, each a few pages long ● NSA files leaked by Snowden – 50,000 to 200,000 according to the NSA A text document also contains data Source: Jonathan Stray, Overview project
    17. 17. Telling clear stories Where do companies live?
    18. 18. Company ownership networks
    19. 19. Where do people live? Source: Where nobody lives, http://mapsbynik.tumblr.com/post/82791188950/nobody-lives-here-the- nearly-5-million-census Demographics: Where nobody lives
    20. 20. Using statistics can help you find stories Stories in statistics: regression analysis and outliers → test fraud cases
    21. 21. Condition: Machine readable data Nothing beats a good CSV file
    22. 22. Good data is rarely available
    23. 23. How we often get important data Government official: “Please receive our annual audit reports in this stack of papers.” Hard copies = hard work!
    24. 24. Crowd cleaning of data When data is messy: Readers can assist extracting and cleaning data
    25. 25. Crowd cleaning of data Readers can annotate documents
    26. 26. Mapping people, power and money Source: “Who is in charge” created by CIVIO (Spain), http://quienmanda.es/ Mapping relationships
    27. 27. Who are friending who? What is in a picture? Matching faces to names Source: vg.no mapping the royal family network in Norway (left), Dirty Energy Money (right)
    28. 28. Connected China Source: “Who is in charge” created by CIVIO (Spain), http://quienmanda.es/ Data on relationships
    29. 29. Crowd collection of data Readers can assist collecting data
    30. 30. A clear bar chart is often all you need
    31. 31. Spending: make readers understand
    32. 32. Where to find the data?
    33. 33. The data journalism tool box ● Extraction and scraping ○ Tabula ○ Scraperwiki ○ Online OCR ● Data cleaning ○ Open Refine ○ Spreadsheets - yes, you cannot live without ● Visualisation ○ DataWrapper - http://datawrapper.de/ ○ D3.js - http://d3js.org/ The Data Journalism Handbook School of Data The tools you need
    34. 34. The data journalism tool box ● Extraction and scraping ○ Tabula ○ Scraperwiki ○ Online OCR ● Data cleaning ○ Open Refine ○ Spreadsheets - yes, you cannot live without ● Visualisation ○ DataWrapper - http://datawrapper.de/ ○ D3.js - http://d3js.org/ The Data Journalism Handbook School of Data The tools you need
    35. 35. Mailing lists
    36. 36. Thank you! Stay in touch: anders.pedersen@okfn.org | schoolofdata@okfn.org @anpe | @SchooOfData

    ×