DataJournalism: How To get data and process them?

Workshop on data journalism given at the DataDays, organised by the Open Knowledge Foundation, on 17 February 2014.

Published in: Technology

Transcript

  • 1. Workshop on Data Journalism, February 17, 2014, Ghent. How to get the data and how to process them? Lorenzo Pellizzari
  • 2. About me …
  • 3. Get the data. How to get the data? Receive it; advanced search techniques; FOI laws; scrape it
  • 4. (1) Receive it: Analyzing the War Logs (Associated Press)
  • 5. (2) Advanced search techniques: Google (79,300,000 results)
  • 6. (2) Advanced search techniques: SPARQL http://dbpedia.org/sparql
  • 7. (2) Advanced search techniques: SPARQL
  • 8. (2) Advanced search techniques: SPARQL http://latemar.science.unitn.it/spacetime/spacetime.html
  • 9. (3) Freedom of Information laws
  • 10. (3) Freedom of Information laws
  • 11. (4) Scrape your data: “Web scraping (web harvesting or web data extraction) is a computer software technique of extracting information from websites.” (Wikipedia) http://www-news.iaea.org/
  • 12. (4) Scrape your data
  • 13. (4) Scrape your data
  • 14. Process the data. Poll: “What Analytics, Data mining, Big Data software you used in the past 12 months for a real project (not just evaluation)” [798 voters] http://www.kdnuggets.com/
  • 15. The software for data analysis: share of R- or SAS-related posts to Stack Overflow by week. http://r4stats.com/articles/popularity/
  • 16. The software for data analysis
  • 17. Example: ABC News. Interactive map of gas wells and leases in Australia. Scraping: main data coming from governmental websites. FOI: data on chemical releases. Variety of reports: data on salt and water. http://datajournalismhandbook.org/
  • 18. Example: ABC News
      • A web developer and designer
      • A lead journalist
      • A part-time researcher with expertise in data extraction, Excel spreadsheets and data cleaning
      • A part-time junior journalist
      • A consultant executive producer
      • An academic consultant with expertise in data mining, graphic visualization and advanced research skills
      • The services of a project manager and the administrative assistance of the ABC’s multi-platform unit
      • Importantly, we also had a reference group of journalists and others whom we consulted on a needs basis
    http://datajournalismhandbook.org/
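
Slides 6 to 8 name the DBpedia SPARQL endpoint, but the transcript does not preserve the queries shown. As a rough sketch of what such a search can look like, the Python below prepares a request for that endpoint and flattens the standard SPARQL JSON result format. The query itself (Belgian cities and their population) and the helper names `build_request` and `extract_rows` are illustrative assumptions, not taken from the workshop, and what the endpoint returns depends on the current DBpedia dataset.

```python
import json
import urllib.request
from urllib.parse import urlencode

DBPEDIA_ENDPOINT = "http://dbpedia.org/sparql"  # endpoint named on slide 6

# Illustrative query (an assumption, not from the slides):
# the ten most populous Belgian cities known to DBpedia.
QUERY = """
PREFIX dbo: <http://dbpedia.org/ontology/>
PREFIX dbr: <http://dbpedia.org/resource/>
SELECT ?city ?population WHERE {
  ?city a dbo:City ;
        dbo:country dbr:Belgium ;
        dbo:populationTotal ?population .
}
ORDER BY DESC(?population)
LIMIT 10
"""


def build_request(query, endpoint=DBPEDIA_ENDPOINT):
    """Prepare (but do not send) a SPARQL GET request asking for JSON results."""
    url = endpoint + "?" + urlencode({"query": query.strip()})
    return urllib.request.Request(
        url, headers={"Accept": "application/sparql-results+json"}
    )


def extract_rows(results):
    """Flatten a SPARQL JSON result document into a list of plain dicts."""
    return [
        {name: cell["value"] for name, cell in binding.items()}
        for binding in results["results"]["bindings"]
    ]


request = build_request(QUERY)

# Sending the request needs network access, so it is left commented out:
# with urllib.request.urlopen(request) as response:
#     rows = extract_rows(json.load(response))
```

Keeping the request construction separate from the network call makes it easy to inspect the URL first, or to paste the same query into the endpoint's web form.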

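Slide 11 defines web scraping as extracting information from websites. A minimal sketch of the idea, using only Python's standard library, is to walk the HTML and collect the cells of a table row by row. The `TableScraper` class, the `scrape_table` helper and the sample HTML are hypothetical and made up for illustration; they do not reproduce the markup of the IAEA page cited on the slide, and a real page would normally need its own selectors and error handling.

```python
from html.parser import HTMLParser


class TableScraper(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []      # finished rows, each a list of cell strings
        self._row = None    # cells of the row currently being read
        self._cell = None   # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only keep text that appears inside a table cell.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None


def scrape_table(html):
    """Return the table rows found in an HTML string as lists of cell text."""
    parser = TableScraper()
    parser.feed(html)
    parser.close()
    return parser.rows


# Made-up sample page, standing in for HTML downloaded from a real site.
SAMPLE_HTML = """
<table>
  <tr><th>Date</th><th>Event</th></tr>
  <tr><td>2014-01-01</td><td>Example event A</td></tr>
  <tr><td>2014-01-02</td><td>Example <b>event</b> B</td></tr>
</table>
"""

rows = scrape_table(SAMPLE_HTML)
header, records = rows[0], rows[1:]
```

From here the rows can be written to CSV or loaded into a spreadsheet for the processing step covered in the later slides.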