Scraping the Olympics

Presentation for a workshop at the BBC Data Journalism Day, July 2012

Transcript

  • 1. Scraping the Olympics. Paul Bradshaw, author: Scraping for Journalists. Leanpub.com/scrapingforjournalists
  • 2. Scraping basics; combining data; finding stories in data
  • 4. Function (Parameters)
  • 5. Function (Parameters): =SUM(A2:A50), =AVERAGE(B2:B300), =COUNTIF(A10:A3000,"Smith")
  • 6. ("string", index)
  • 7. Tip: search for documentation
  • 8. Tip: search for structure around data
  • 10. //div[starts-with(@class, 'jobWrap')]
  • 12. Combining data
  • 13. Question: Which torchbearers are from Dorset?
  • 22. Finding leads: Corporate torchbearers?
  • 27. New entries - or disappearing ones
  • 32. Leanpub.com/scrapingforjournalists, @paulbradshaw, onlinejournalismblog.com, helpmeinvestigate.com, slideshare.net/onlinejournalist, linkedin.com/in/onlinejournalist
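
The XPath expression on slide 10 selects every div whose class attribute begins with jobWrap. The following is a minimal Python sketch of the same selection, not the workshop's own tooling. It uses only the standard library, and because xml.etree.ElementTree does not support the XPath starts-with() function, the prefix test is done in Python instead. The sample markup, the helper name select_by_class_prefix, and every class value other than the jobWrap prefix are invented for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed markup standing in for a scraped page.
# A real page is rarely valid XML and would need an HTML-tolerant parser.
SAMPLE = """<html><body>
  <div class="jobWrap odd"><a href="/job/1">First listing</a></div>
  <div class="jobWrap even"><a href="/job/2">Second listing</a></div>
  <div class="sidebar">Not a listing</div>
</body></html>"""


def select_by_class_prefix(markup, tag, prefix):
    """Return elements named `tag` whose class attribute starts with `prefix`.

    This mirrors //tag[starts-with(@class, 'prefix')] for simple cases.
    """
    root = ET.fromstring(markup)
    return [el for el in root.iter(tag) if el.get("class", "").startswith(prefix)]


matches = select_by_class_prefix(SAMPLE, "div", "jobWrap")
link_texts = [div.find("a").text for div in matches]
print(link_texts)
```

Matching on a prefix rather than the full class value is what lets one expression catch both the "odd" and "even" variants in the sample, which is the "structure around data" idea from slide 8.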
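
The question on slide 13 (which torchbearers are from Dorset?) is an instance of the "combining data" step on slide 12: a scraped list that only gives a town has to be joined to a second dataset that maps towns to counties. The sketch below shows one way that join could look in Python. The rows use placeholder names rather than real torchbearers, and the field names, the lookup table, and the add_county helper are all assumptions made for the example.

```python
# Hypothetical scraped rows: placeholder names, not real torchbearers.
torchbearers = [
    {"name": "Torchbearer A", "town": "Weymouth"},
    {"name": "Torchbearer B", "town": "Exeter"},
    {"name": "Torchbearer C", "town": "Bournemouth"},
    {"name": "Torchbearer D", "town": "Unknown Town"},
]

# Second dataset (assumed): a lookup from town to county.
town_to_county = {
    "Weymouth": "Dorset",
    "Bournemouth": "Dorset",
    "Exeter": "Devon",
}


def add_county(rows, lookup):
    """Return copies of rows with a 'county' field; None when the town is unknown."""
    return [{**row, "county": lookup.get(row["town"])} for row in rows]


combined = add_county(torchbearers, town_to_county)
from_dorset = [row["name"] for row in combined if row["county"] == "Dorset"]
unmatched = [row["name"] for row in combined if row["county"] is None]

print(from_dorset)
print(unmatched)
```

Keeping the unmatched rows visible matters in practice: a town missing from the lookup would otherwise silently drop someone from the answer.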
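
Slide 27 points to new entries, or disappearing ones, as a source of leads. If the same page is scraped on two occasions, the comparison reduces to two set differences. A minimal sketch, assuming each scrape has already been reduced to a list of identifying strings; the compare_scrapes function and the placeholder entries are hypothetical.

```python
def compare_scrapes(previous, current):
    """Report entries that appeared or vanished between two scrapes."""
    previous, current = set(previous), set(current)
    return {
        "new": sorted(current - previous),          # present now, absent before
        "disappeared": sorted(previous - current),  # present before, absent now
    }


# Placeholder entries standing in for two runs of the same scraper.
earlier_scrape = ["Torchbearer A", "Torchbearer B", "Torchbearer C"]
later_scrape = ["Torchbearer A", "Torchbearer C", "Torchbearer D"]

changes = compare_scrapes(earlier_scrape, later_scrape)
print(changes)
```

The comparison is only as good as the identifier used: if a name is respelled between scrapes, it will show up as one disappearance and one new entry.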