Your SlideShare is downloading. ×
  • Like
Webscraping for jounalists
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×

Now you can save presentations on your phone or tablet

Available for both IPhone and Android

Text the download link to your phone

Standard text messaging rates apply

Webscraping for jounalists

  • 1,696 views
Published

From a presentation I have at the Canadian Association of Journalists on how journalists can learn to web scrape. Most of the presentation was real-time demos not included in this PPT deck.

From a presentation I have at the Canadian Association of Journalists on how journalists can learn to web scrape. Most of the presentation was real-time demos not included in this PPT deck.

  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
    Be the first to like this
No Downloads

Views

Total Views
1,696
On SlideShare
0
From Embeds
0
Number of Embeds
1

Actions

Shares
Downloads
9
Comments
0
Likes
0

Embeds 0

No embeds

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide

Transcript

  • 1. “A little Wget magic”
    Webscraping for journalists
    CAJ May 13, 2011
  • 2. Webscraping
    Using software that simulates a web browser to download large quantities of information from a web site.
  • 3. Why webscrape?
    • Assemble your own copy of online data
    • 4. Save time pointing-and-clicking
  • Why webscrape?
    • Data publishers (governments) want you to access data on their terms
  • 5.
  • 6.
  • 7. Is it legal?
    Yes. But.
    Do it ethically.
    Watch for robots.txt
  • 8. Tools for scraping