Scraping the Olympics

Presentation for a workshop at the BBC Data Journalism Day, July 2012

Transcript

  • 1. Scraping the Olympics. Paul Bradshaw, author: Scraping for Journalists. Leanpub.com/scrapingforjournalists
  • 2. Scraping basics; Combining data; Finding stories in data
  • 3. (no transcript text)
  • 4. Function (Parameters)
  • 5. Function (Parameters): =SUM(A2:A50), =AVERAGE(B2:B300), =COUNTIF(A10:A3000,"Smith")
  • 6. ("string", index)
  • 7. Tip: search for documentation
  • 8. Tip: search for structure around data
  • 9. (no transcript text)
  • 10. //div[starts-with(@class, 'jobWrap')]
  • 11. (no transcript text)
  • 12. Combining data
  • 13. Question: Which torchbearers are from Dorset?
  • 14-21. (no transcript text)
  • 22. Finding leads: Corporate torchbearers?
  • 23-26. (no transcript text)
  • 27. New entries - or disappearing ones
  • 28-31. (no transcript text)
  • 32. Leanpub.com/scrapingforjournalists, @paulbradshaw, onlinejournalismblog.com, helpmeinvestigate.com, slideshare.net/onlinejournalist, linkedin.com/in/onlinejournalist
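
Slides 4 and 5 introduce the "function (parameters)" pattern with spreadsheet formulas. The same pattern might look like this in Python. This is only an illustrative sketch: the lists and their values are made up to stand in for spreadsheet ranges, and nothing here comes from the workshop's own files.

```python
from statistics import mean

# Hypothetical column values standing in for spreadsheet ranges.
medals = [3, 5, 2, 8]
surnames = ["Smith", "Jones", "Smith", "Patel"]

# Each call is a function name followed by its parameters in brackets.
total = sum(medals)               # comparable to =SUM(A2:A50)
average = mean(medals)            # comparable to =AVERAGE(B2:B300)
smiths = surnames.count("Smith")  # comparable to =COUNTIF(A10:A3000,"Smith")

print(total, average, smiths)
```

In both the spreadsheet and the Python version, the function name says what to do and the parameters say what to do it to.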
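
Slide 10 gives the XPath expression `//div[starts-with(@class, 'jobWrap')]`, which selects every `div` whose `class` attribute begins with `jobWrap`. Python's standard-library `xml.etree.ElementTree` does not support the `starts-with()` function, so the sketch below emulates the same selection with `str.startswith` instead of running the XPath itself. The markup and the helper name `divs_with_class_prefix` are invented for illustration.

```python
import xml.etree.ElementTree as ET

# Made-up, well-formed markup; the class names only echo the slide's example.
PAGE = """
<html><body>
  <div class="jobWrap odd"><a href="/job/1">Reporter</a></div>
  <div class="sidebar">Ignore me</div>
  <div class="jobWrap even"><a href="/job/2">Producer</a></div>
</body></html>
"""

def divs_with_class_prefix(root, prefix):
    """Return every <div> whose class attribute starts with prefix."""
    return [div for div in root.iter("div")
            if div.get("class", "").startswith(prefix)]

root = ET.fromstring(PAGE.strip())
titles = [div.find("a").text
          for div in divs_with_class_prefix(root, "jobWrap")]
print(titles)
```

Real web pages are often not well-formed XML, so in practice an HTML-aware parser with full XPath 1.0 support (lxml is one option) would be a more likely choice than this stdlib workaround.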
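
Slide 27 suggests looking for new entries, or disappearing ones, between scrapes. One simple way to do that, assuming each scrape yields a list of names, is a set comparison. The function name `compare_scrapes` and the placeholder names are hypothetical, not taken from the presentation.

```python
def compare_scrapes(earlier, later):
    """Return (added, removed) entries between two scrapes of the same list."""
    earlier_set, later_set = set(earlier), set(later)
    added = sorted(later_set - earlier_set)    # present now, absent before
    removed = sorted(earlier_set - later_set)  # present before, absent now
    return added, removed

# Placeholder data standing in for two scrapes taken at different times.
first_scrape = ["A. Example", "B. Sample", "C. Placeholder"]
second_scrape = ["A. Example", "C. Placeholder", "D. Newcomer"]

added, removed = compare_scrapes(first_scrape, second_scrape)
print("New entries:", added)
print("Disappeared:", removed)
```

Exact string matching treats a respelled name as one removal plus one addition, so any differences it reports are leads to check rather than findings.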