Ona 2013 data journalism no excuses with la nacion data

5,282 views

Published on

Published in: News & Politics, Technology
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
5,282
On SlideShare
0
From Embeds
0
Number of Embeds
1,673
Actions
Shares
0
Downloads
10
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

Ona 2013 data journalism no excuses with la nacion data

  1. 1. Data Journalism, no excuses! Momi Peralta Ramos @momiperalta Florencia Coelho @fcoel LA NACION DATA www.lanacion.com.ar/data
  2. 2. About LA NACION • Based in Buenos Aires, Argentina • Sunday print circulation: + 360.000 • www.lanacion.com unique visitors/month: + 11MM • 9 magazine titles • Impremedia (90%): US hisp. leading publishing company
  3. 3. LA NACION Data It´s LA NACION´s initiative to develop data journalism and contribute to opening data in Argentina
  4. 4. Transparency? http://www.transparency.org/cpi2012/r esults Argentina: 102 score 35
  5. 5. Not law, but decree http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1857498 David Banisar, FOIA Expert. October 2012
  6. 6. Data.gov Portals? In 2012 – Buenos Aires City and Misiones Province opened data portals 2013 – National Data Portal!
  7. 7. A vision for several actions LA NACION decided to challenge status quo and started opening data and developing data Journalism @fcoel - @momiperalta
  8. 8. Why? - Data is a new raw material for journalism - Move public data closer to the people - Activate demand of public data - Discover new stories hidden in datasets - Allow citizen´s collaboration + innovate - It is the future of journalism! @fcoel - @momiperalta
  9. 9. HOW...is this possible?? • EXCUSES: – There is NO DATA or DATA is not credible – We are not the US or the UK in terms or transparency – We DON’T have programmers in our newsroom – We DON’T have skills in our newsroom to gather or analize datasets – Will all this effort make sense? Will someone use this data? – We don’t… we dont…
  10. 10. HOW...is this possible?? • EXCUSES: – There is NO DATA or DATA is not credible – We are not the US or the UK in terms or transparency – We DON’T have programmers in our newsroom – We DON’T have skills in our newsroom to gather or analize datasets – Will all this effort make sense? Will someone use this data? – We don’t… we dont…
  11. 11. HOW...is this possible?? • EXCUSES: – There is NO DATA or DATA is not credible – We are not the US or the UK in terms or transparency – We DON’T have programmers in our newsroom – We DON’T have skills in our newsroom to gather or analize datasets – Will all this effort make sense? Will someone use this data? KILLING THIS SCKEPTICISM – We don’t… we dont… ONE BY ONE
  12. 12. 1. LEARN HERE!! • Go to conferences or follow it online, and learn in the sessions!! Become a member. • ONA 2010 was our first inspiration into dataj, yes, a pre-conference workshop in ONA. • Learn free online in MOOCs , webinars, books
  13. 13. 2. EMBRACE HACKTIVISM Big community of developers and NGOs willing to help! Embrace, engage and promote hacktivism! @fcoel - @momiperalta
  14. 14. VIDEO FIRST STEPS http://www.youtube.com/watch?v=O1bC5bbie5g&feature=youtu.be
  15. 15. 3. START CREATING DATASETS, START SMALL …BE HUMBLE, BECOME A DATA BUILDER
  16. 16. 4. LEARN TO ASK FOR HELP – TEAMWORK!! • With a little help from my friends… DEVELOPER JOURNALIST The perfect team… @fcoel - @momiperalta IMAGE from Scraperwiki https://scraperwiki.com/
  17. 17. Tools & technology D3.js
  18. 18. Timeline.JS
  19. 19. HIV / AIDS 30 Anniversary VERTICAL Timeline Collaborative Tools: Vertical Timeline with Google Spreadsheets. Team: 1 member LN data + 1 member HIV/AIDS NGO and … @fcoel - @momiperalta http://www.lanacion.com.ar/1583473-a-treinta-anos-del-descubrimiento-del-vihsida
  20. 20. … International Collaboration!! WNYC Vertical Timeline’s Google Spreadsheet model online ready to copy & paste. “Absurdly illustrated” guide by @LisaWilliams @fcoel - @momiperalta
  21. 21. Tableau Public for data interactives (without developers)
  22. 22. PLAY WITH OTHER´s DATA! DATA: The Guardian Data store
  23. 23. DATAJ CASE INFLATION in ARGENTINA How to show (with data) what you can’t tell? @fcoel - @momiperalta
  24. 24. http://www.bloomberg.com/news/2013-02-01/argentina-becomes-first-nationcensured-by-imf-on-inflation-data.html
  25. 25. @fcoel - @momiperalta
  26. 26. ADELCO (Consumer Association) Monthly registration, since 2006, of the same 28 products, untill September 2012 2 Baskets of product prices index
  27. 27. ADELCO (Consumer Association) @fcoel - @momiperalta
  28. 28. http://data.lanacion.com.ar/dashboards/5068/inflacion-y-precios/
  29. 29. DATAJ CASE SUBSIDIES for the BUS TRANSPORTATION SERVICES @fcoel - @momiperalta
  30. 30. Transport Agency – Processing CCP Subsidy corresponding to March 2012 ccp_sistau_marzo12(6).xlsx CCP) (Subsidio BUT: 1.200 Rows (Companies) 21 Columns 1.600 PDF files (subsidies CASH y Gasoil for Buses and Trains) from 2003 to now (March 2013). Extra Challenge: After published, files are updated (up to 10 times)
  31. 31. Transport Secretary – Analysis Monthly consolidation for 3 subsidies for each company, and geographic zone Subsidies paid in 2010  USD 4.260.000 ….. per day - What companies received more subsidies? , Which one have the greatest rise?
  32. 32. @fcoel - @momiperalta
  33. 33. First data experiences from our newsroom http://www.youtube.com/watch?v=w7jcjH5gLSU
  34. 34. www.lanacion.com.ar/data @fcoel - @momiperalta
  35. 35. @fcoel - @momiperalta
  36. 36. data.lanacion.com.ar
  37. 37. DATAJ CASE CENSUS 2001 - 2010 @fcoel - @momiperalta
  38. 38. Census Argentina 2001-2010 Our 2013 Knight-Mozilla OpenNews Fellow in LA NACION, Manuel Aristaran, was in charge of this technological challenge together with our newsroom developer.
  39. 39. A census-thon (download & normalize census variables marathon) • 2 days with Knight ICFJ fellow Sandra Crucianelli @fcoel - @momiperalta
  40. 40. DATAJ CASE ARGENTINA SENATE EXPENSES 2004-2013 @fcoel - @momiperalta
  41. 41. SENATE EXPENSES 2004 – 2013 Google-GEN Data Journalism Award Winner 2013 Finalists: The Guardian, The Associated Press, BBC News, Center for Public Integrity, The Financial Times, Global News, La Nación de Costa Rica, The Los Angeles Times y Mother Jones
  42. 42. CASE: Senate Expenses 2004-2013
  43. 43. First Step More than 33.600 .pdf files
  44. 44. Second step
  45. 45. Third step: detect and extract entities, amounts, places
  46. 46. Fourth Step – parsing and analysis
  47. 47. Fifth Step – Front page story @fcoel - @momiperalta
  48. 48. & online story with interactive dataviz @fcoel - @momiperalta
  49. 49. Conclusions after analysis of overlapped date ranges of trips
  50. 50. Bonus track: Boudou’s luxury furniture with emergency fund @fcoel - @momiperalta
  51. 51. Senate Expenses – Team Video http://www.youtube.com/watch?v=qEZ2xMwPMWo&feature=youtu.be
  52. 52. La Plata City Major Floodings (April 2013) • Collaborative Tools: Google Spreadsheets Google Maps Google Fusion Tables • Other tools: Excel, Tableau Public.
  53. 53. Hypothesis: Gov was hiding real number of deaths to diminish impact of its own responsabilities • We got 150 copies of handwritten death certificates in La Plata for April (1st-15th). • We made a database model, typed each case details into a spreadsheet, then ordered, filtered, analysed…
  54. 54. • Visualizations for time & place helped us confirm that most deaths happened between April 2nd and 4th (or were directly related) and many were located over water streams running under the city and/or flooded blocks.
  55. 55. No Geocoding in GFT. La Plata: “Where the streets have no name”
  56. 56. … so, we turned to good old Google Maps
  57. 57. Then the developer, combined 2 JPGs we got from different sources, water streams (some running under the city) and flooded blocks
  58. 58. And we published this map, based on Google FTables + 2 combined overlayed JPGs.
  59. 59. Impact. Starting from 51 deaths … One day after publishing: A judge confirms 60 deaths due to major floodings 45 days after: 78 deaths officially confirmed
  60. 60. Team explains how collaboration worked http://youtu.be/a56fWexw8uo <- Meet the team. Journalist, dataminer, programmer, designer, data producer and me, multitasker. Goodie for later!
  61. 61. Elections CartoDB Map
  62. 62. Open Assets Declarations. News App in collaboration with 3 transparency NGOs, Collaborative Tools: Google Spreadsheets Trello Document Cloud Team: around 45 LN staff, NGOs staff + 30 volunteers
  63. 63. Before & After Public servant declaration of assets. In total we typed 15.000 rows x 28 cols
  64. 64. The Check-a-thons (checking marathons) @fcoel - @momiperalta
  65. 65. Collaborative Checking (in presence & remote)
  66. 66. STEP BY STEP MANUALLY BUILDING DATASET • • • • • Step 1: Data entry + 15.000 rows x 28 columns Step 2: Raw data checking (paper vs spreadsheet) Step 3: Normalizing Step 4: Front end field by field checking (news app vs paper) Step 5: Publish and…OPEN THE DATA!!
  67. 67. Open Assets Declarations NGOs testimony on collaboration project http://www.youtube.com/watch?v=OmsDdzTvp0E&feature=youtu.be
  68. 68. WE WORK WITH THE COMMUNITY OPEN, OPEN, LEARN, SHARE, SHARE… @fcoel - @momiperalta
  69. 69. Promote data opportunities, tools and events @fcoel - @momiperalta
  70. 70. http://www.datajournalismhandbook.com https://www.dropbox.com/s/5oseg79q96l0slq/ddj_spanish_doc book_fixed.pdf
  71. 71. DATAFEST Our EVENT for opening & mining public data @fcoel - @momiperalta
  72. 72. • Besides daily efforts, we opened 21 datasets and made “Dataset cheat sheets” to make them accessible and ready for analysis with data mining techniques or data visualization in our first DATAFEST in Argentina last November. • This DATAFEST was organized by LA NACION and UNIVERSIDAD AUSTRAL Masters Degree in Data Mining and Universidad Austral Communications Faculty.
  73. 73. DataSets inventory and their “cheat sheets”
  74. 74. Cheat Sheet : Subsidies for the Public Bus Transport System – Cleaned , normalized and open DATA
  75. 75. VIDEO http://youtu.be/ZelT_UrisQk
  76. 76. Thanks… @fcoel @momiperalta LA NACION DATA – @LNdata - Oct, 2013

×