Your SlideShare is downloading. ×
0
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Scraping the Olympics
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×
Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply

Scraping the Olympics

7,063

Published on

Presentation for a workshop at the BBC Data Journalism Day, July 2012

Presentation for a workshop at the BBC Data Journalism Day, July 2012

1 Comment
2 Likes
Statistics
Notes
No Downloads
Views
Total Views
7,063
On Slideshare
0
From Embeds
0
Number of Embeds
10
Actions
Shares
0
Downloads
13
Comments
1
Likes
2
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide

Transcript

  • 1. Scraping the OlympicsPaul Bradshaw, author: Scraping for Journalists * Leanpub.com/scrapingforjournalists
  • 2. ?Scraping basicsCombining dataFinding stories in data *
  • 3. *
  • 4. Function (Parameters) *
  • 5. Function (Parameters)=SUM(A2:A50)=AVERAGE(B2:B300)=COUNTIF(A10:A3000,”Smith”) *
  • 6. (“string”, index) *
  • 7. Tip: search fordocumentation *
  • 8. Tip: search for structure around data *
  • 9. *
  • 10. //div[starts-with(@class, ‘jobWrap’)]*
  • 11. *
  • 12. Combining data *
  • 13. ?Question:Which torchbearers arefrom Dorset? *
  • 14. *
  • 15. *
  • 16. *
  • 17. *
  • 18. *
  • 19. *
  • 20. *
  • 21. *
  • 22. ?Finding leads:Corporate torchbearers? *
  • 23. *
  • 24. *
  • 25. *
  • 26. *
  • 27. New entries - ordisappearing ones *
  • 28. *
  • 29. *
  • 30. *
  • 31. *
  • 32. Leanpub.com/scrapingforjournalists @paulbradshaw onlinejournalismblog.com helpmeinvestigate.com slideshare.net/onlinejournalist * linkedin.com/in/onlinejournalist

×