Scraping the Olympics
Upcoming SlideShare
Loading in...5
×

Like this? Share it with your network

Share

Scraping the Olympics

  • 5,619 views
Uploaded on

Presentation for a workshop at the BBC Data Journalism Day, July 2012

Presentation for a workshop at the BBC Data Journalism Day, July 2012

  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
No Downloads

Views

Total Views
5,619
On Slideshare
2,390
From Embeds
3,229
Number of Embeds
7

Actions

Shares
Downloads
11
Comments
1
Likes
2

Embeds 3,229

http://visualoop.com 3,068
http://scrapingforjournalists.posterous.com 94
http://www.bloglecom.com.br 62
http://translate.googleusercontent.com 2
http://posterous.com 1
http://www.linkedin.com 1
http://plus.url.google.com 1

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide

Transcript

  • 1. Scraping the OlympicsPaul Bradshaw, author: Scraping for Journalists * Leanpub.com/scrapingforjournalists
  • 2. ?Scraping basicsCombining dataFinding stories in data *
  • 3. *
  • 4. Function (Parameters) *
  • 5. Function (Parameters)=SUM(A2:A50)=AVERAGE(B2:B300)=COUNTIF(A10:A3000,”Smith”) *
  • 6. (“string”, index) *
  • 7. Tip: search fordocumentation *
  • 8. Tip: search for structure around data *
  • 9. *
  • 10. //div[starts-with(@class, ‘jobWrap’)]*
  • 11. *
  • 12. Combining data *
  • 13. ?Question:Which torchbearers arefrom Dorset? *
  • 14. *
  • 15. *
  • 16. *
  • 17. *
  • 18. *
  • 19. *
  • 20. *
  • 21. *
  • 22. ?Finding leads:Corporate torchbearers? *
  • 23. *
  • 24. *
  • 25. *
  • 26. *
  • 27. New entries - ordisappearing ones *
  • 28. *
  • 29. *
  • 30. *
  • 31. *
  • 32. Leanpub.com/scrapingforjournalists @paulbradshaw onlinejournalismblog.com helpmeinvestigate.com slideshare.net/onlinejournalist * linkedin.com/in/onlinejournalist