Data Mining en Data Driven Story Telling Computer Assisted Research and Reporting Peter Verweij School of Journalism Utrec...
Wikileaks <ul><li>Begonnen als citizen reporters site </li></ul><ul><li>een klokkenluiders site </li></ul><ul><li>Onderzoe...
Nick Davies <ul><li>Copy/paste or new possibilities </li></ul><ul><li>Feiten, nauwkeurigheid en geloofwaardigheid: Nick Da...
Nick Davies 2 <ul><li>Wat is de conclusie? </li></ul><ul><li>Geen research of onderzoekjournalistiek is  nog mogelijk? </l...
Verkeersongelukken in Utrecht
Some classical US examples <ul><li>School bus and drunken drivers </li></ul><ul><ul><li>convictions drunken driven; driver...
What’s in a name? <ul><li>Phil Meyer(Precision Journalism): </li></ul><ul><li>Some practitioners of the &quot; new journal...
Some other examples <ul><li>NY Times:  Gap in life expectancy </li></ul><ul><li>USA Today:  delegate tracker </li></ul><ul...
World Food Prices <ul><li>Simple Story </li></ul><ul><ul><li>World Bank echoes food cost alarm </li></ul></ul><ul><li>Rese...
Examples
How to follow the story about food prices on the web? <ul><li>Find leading media: FT, BBC, Economist and subscribe to RSS ...
What has been changed in reporting? <ul><li>Internet: </li></ul><ul><ul><ul><li>More sources in number and in full text </...
What has been changed in reporting? (2) <ul><li>Tools for handling data from databases </li></ul><ul><ul><li>Spreadsheet; ...
New tools <ul><li>Google public data: directe analyse van databases </li></ul><ul><li>Google forms: online enquete maken <...
Maps masups <ul><li>Web 2.0 and mashups: merging data on the web </li></ul><ul><li>http://projects.latimes.com/homicide/ma...
 
 
Verkiezingen   1998/2003   Grootste partij per gemeente www.nederlandkiest.nl
Gemeente data <ul><li>Gemeente utrecht: </li></ul><ul><li>http://www.utrecht.nl/smartsite.dws?id=86964 </li></ul><ul><li>I...
 
What can we do with these tools? <ul><li>Calculations: averages </li></ul><ul><li>Graphs: bar, line, pie </li></ul><ul><li...
What is the objective? <ul><li>In journalism: </li></ul><ul><ul><ul><li>Graphs are analysis not illustrations </li></ul></...
Other techniques <ul><li>Social network analysis: From  IRE </li></ul><ul><ul><li>Terrorist Network Valdis Krebs published...
Overzicht uit NRC
Netwerken in journalistiek 2 Jury Lidmaatschap Literaire Prijzen
Twitter netwerken tussen politici en journalisten More
Other techniques 2 <ul><li>Collect your own data </li></ul><ul><ul><li>From secondary to primary data </li></ul></ul><ul><...
 
 
Upcoming SlideShare
Loading in...5
×

Computer assisted research and reporting

1,023

Published on

CARR

0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
1,023
On Slideshare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
9
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide
  • Spreadsheet over asiel aanvragen naar nationaliteit; gebaseerd op CBS; toont aan het gemak van spreadsheets voor het maken van berekeningen en grafieken
  • Computer assisted research and reporting

    1. 1. Data Mining en Data Driven Story Telling Computer Assisted Research and Reporting Peter Verweij School of Journalism Utrecht The Netherlands
    2. 2. Wikileaks <ul><li>Begonnen als citizen reporters site </li></ul><ul><li>een klokkenluiders site </li></ul><ul><li>Onderzoeksjournalisten zijn altijd afhankelijk van hulp van buitenaf </li></ul><ul><li>300.000 documenten uit de US diplomatieke post geven, bewerkt door journalisten, een beeld van de buitenlandse politiek </li></ul><ul><li>Julian Assange op de lijst van interpol </li></ul><ul><li>Nieuw is het aantal: door digitalisering. </li></ul><ul><li>Titel: datamining- datadriven story telling </li></ul>
    3. 3. Nick Davies <ul><li>Copy/paste or new possibilities </li></ul><ul><li>Feiten, nauwkeurigheid en geloofwaardigheid: Nick Davies Boek, interview, sites </li></ul><ul><li>http:// www.humedia.nl /profiles/ blogs / nick-davies-over </li></ul><ul><li>recensie http://extra.volkskrant.nl/select/boeken/artikel.php?id=843 </li></ul><ul><li>website boek http://www.flatearthnews.net/ </li></ul><ul><li>Nederlands onderzoek: http://www.volkskrant.nl/multimedia/article1135829.ece/Eenderde_nieuws_is_voorverpakt </li></ul>
    4. 4. Nick Davies 2 <ul><li>Wat is de conclusie? </li></ul><ul><li>Geen research of onderzoekjournalistiek is nog mogelijk? </li></ul><ul><li>Maar: nieuws is overal en de media focussen op eigen productie: bv nrcnext en parlementaire verslaggeving </li></ul><ul><li>Maar: digitalisering biedt meer mogelijkheden </li></ul><ul><li>Voorbeeld: twitternetwerken </li></ul>
    5. 5. Verkeersongelukken in Utrecht
    6. 6. Some classical US examples <ul><li>School bus and drunken drivers </li></ul><ul><ul><li>convictions drunken driven; driver licence number, school bus drivers </li></ul></ul><ul><li>Hurricane Andrew </li></ul><ul><ul><li>damage map related to wind strength; building construction fraude </li></ul></ul>
    7. 7. What’s in a name? <ul><li>Phil Meyer(Precision Journalism): </li></ul><ul><li>Some practitioners of the &quot; new journalism &quot; took to making up their facts in order to keep up with the deadline pressures. Others stopped short of making things up, but combined facts from different cases to write composite portrayals of reality that they passed off as real cases. Despite the problems, the new nonfiction remains an interesting effort at coping with information complexity and finding a way to communicate essential truth. It pushes journalism toward art . Its problem is that journalism requires discipline, and the discipline of art may not be the most appropriate kind. A better solution is to push journalism toward science, incorporating both the powerful data-gathering and analysis tools of science and its disciplined search for verifiable truth. </li></ul><ul><li>After the introduction of internet and spreadsheets: CARR: computer assisted research and reporting </li></ul><ul><li>Now because of analysis of databases: Data-mining </li></ul>
    8. 8. Some other examples <ul><li>NY Times: Gap in life expectancy </li></ul><ul><li>USA Today: delegate tracker </li></ul><ul><li>Volkskrant: topsalarissen </li></ul><ul><li>NRC: voedselprijzen </li></ul><ul><li>NRC: WOZ waarde </li></ul>
    9. 9. World Food Prices <ul><li>Simple Story </li></ul><ul><ul><li>World Bank echoes food cost alarm </li></ul></ul><ul><li>Research and background </li></ul><ul><ul><li>Food price crisis </li></ul></ul><ul><ul><li>Costs of food </li></ul></ul><ul><li>Continuum for reporting: </li></ul><ul><ul><li>Re-active reporting </li></ul></ul><ul><ul><li>Proactive reporting </li></ul></ul><ul><li>From one column press release story to a full investigative scoop </li></ul>
    10. 10. Examples
    11. 11. How to follow the story about food prices on the web? <ul><li>Find leading media: FT, BBC, Economist and subscribe to RSS feed </li></ul><ul><li>Search newspaper archives: lexis/nexis </li></ul><ul><li>Search the web with Google </li></ul><ul><ul><ul><li>Use more keywords; quotation marks </li></ul></ul></ul><ul><ul><ul><li>Look different source type: doc, xls, ppt </li></ul></ul></ul><ul><li>Use Google news and create RSS feeds </li></ul><ul><li>Find institutions and their databases or use Google public data </li></ul><ul><li>Bloggers: using technorati </li></ul><ul><li>Use Twitter search hashtags # related to food prices </li></ul>
    12. 12. What has been changed in reporting? <ul><li>Internet: </li></ul><ul><ul><ul><li>More sources in number and in full text </li></ul></ul></ul><ul><ul><ul><li>Geographical range wider </li></ul></ul></ul><ul><ul><ul><li>Direct access </li></ul></ul></ul><ul><ul><ul><li>Multi media: including audio/video/graphics </li></ul></ul></ul><ul><li>Google indexes about 10 bill pages but that is 20% of the information on the web </li></ul><ul><ul><ul><li>Databases: more data </li></ul></ul></ul><ul><ul><ul><li>How do you find databases? </li></ul></ul></ul><ul><ul><ul><ul><li>Institutional approach for searching </li></ul></ul></ul></ul><ul><ul><ul><ul><li>CBS, Worldbank, IMF FAO, UN, eurostat </li></ul></ul></ul></ul>
    13. 13. What has been changed in reporting? (2) <ul><li>Tools for handling data from databases </li></ul><ul><ul><li>Spreadsheet; excel </li></ul></ul><ul><ul><li>Database; access </li></ul></ul><ul><ul><li>GIS, geographic information systems; mapping; arcgis </li></ul></ul><ul><li>How do you store your data? </li></ul><ul><ul><li>Create your own database or spreadsheets to store your data </li></ul></ul><ul><ul><li>Asksam </li></ul></ul><ul><ul><li>Google notes </li></ul></ul>
    14. 14. New tools <ul><li>Google public data: directe analyse van databases </li></ul><ul><li>Google forms: online enquete maken </li></ul><ul><li>Google fusion tables: data aan kaarten koppelen </li></ul><ul><li>Google maps mashups: adding data to google maps </li></ul><ul><li>Links: memeburn en blog </li></ul><ul><li>Wordpress plugin voor poll </li></ul>
    15. 15. Maps masups <ul><li>Web 2.0 and mashups: merging data on the web </li></ul><ul><li>http://projects.latimes.com/homicide/map/ </li></ul><ul><ul><li>Using google API to create poi’s </li></ul></ul><ul><ul><ul><li>FCJ Utrecht </li></ul></ul></ul><ul><ul><ul><li>Maps in slideshow with audio </li></ul></ul></ul><ul><li>http://www.fao.org/hunger/en/ </li></ul><ul><li>http://www.gapminder.org/world/ </li></ul>
    16. 18. Verkiezingen 1998/2003 Grootste partij per gemeente www.nederlandkiest.nl
    17. 19. Gemeente data <ul><li>Gemeente utrecht: </li></ul><ul><li>http://www.utrecht.nl/smartsite.dws?id=86964 </li></ul><ul><li>Interactieve databank: </li></ul><ul><li>http://utrecht.buurtmonitor.nl/ </li></ul>
    18. 21. What can we do with these tools? <ul><li>Calculations: averages </li></ul><ul><li>Graphs: bar, line, pie </li></ul><ul><li>Maps: </li></ul><ul><li>Interactive graphs </li></ul><ul><li>UNDP data by gapminder </li></ul>
    19. 22. What is the objective? <ul><li>In journalism: </li></ul><ul><ul><ul><li>Graphs are analysis not illustrations </li></ul></ul></ul><ul><ul><ul><li>Cooperation between programmers, design and journalists </li></ul></ul></ul><ul><ul><ul><li>Aim is better journalism; better storytelling, informing public </li></ul></ul></ul><ul><li>What do you need? </li></ul><ul><ul><li>Knowledge about statistics </li></ul></ul><ul><ul><li>How to handle spreadsheets, graphs, maps </li></ul></ul>
    20. 23. Other techniques <ul><li>Social network analysis: From IRE </li></ul><ul><ul><li>Terrorist Network Valdis Krebs published &quot;Uncloaking Terrorist Networks,&quot; an analysis of the Sept. 11, 2001, terrorist network in the April 2002 issue of First Monday, a peer-reviewed Internet journal. This article explains how Krebs was able to construct a visual representation of the network as well as what this visualization can tell us about the network that was previously unknown. Other papers Krebs has authored, including information on InFlow software, can be found at the researcher's Web site: www.orgnet.com </li></ul></ul><ul><ul><li>527 Committee Donors In the 2004 presidential election &quot;huge donations of a handful of wealthy liberals named Linda Pritzker, Stephen L. Bing, Peter B. Lewis and George Soros could determine the outcome. Together, they have given more than $26 million to help finance the most extensive get-out-the vote operation in history, the goal of which is to make John F. Kerry president.&quot; These donations were made to 527 organizations. &quot;Named after a section of the tax code, the 527 groups are doing much of the advertising and field work traditionally left to party organizations.&quot; Included with this story is a diagram displaying contributions to Democratic 527s and a list of the biggest donors to these groups. </li></ul></ul><ul><ul><li>They Rule They Rule is a Web site that allows you to create maps of the interlocking directories of the top 100 companies in the United States in 2001. The data is static, so it is fast becoming out of date, as companies merge and disappear and directors shift boards. A new version of this site is being developed. </li></ul></ul>
    21. 24. Overzicht uit NRC
    22. 25. Netwerken in journalistiek 2 Jury Lidmaatschap Literaire Prijzen
    23. 26. Twitter netwerken tussen politici en journalisten More
    24. 27. Other techniques 2 <ul><li>Collect your own data </li></ul><ul><ul><li>From secondary to primary data </li></ul></ul><ul><li>Design your own survey and collect data online using </li></ul><ul><li>Or Content analysis: for example NRC; talkshow and partij </li></ul><ul><li>Google forms or wordpress plugin </li></ul><ul><li>surveymonkey </li></ul><ul><li>Analysis: </li></ul><ul><ul><li>Online </li></ul></ul><ul><ul><li>Importing in spreadsheet </li></ul></ul><ul><ul><li>Datamatrix using SPSS </li></ul></ul>
    1. A particular slide catching your eye?

      Clipping is a handy way to collect important slides you want to go back to later.

    ×