Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
Paul Bradshaw
Leanpub.com/scrapingforjournalists*
Scraping
in 60 mins
How do you scrape?
Aron Pilhofer, News Rewired
WYSIWYG tools (Import.io,
OutWit Hub)
Google Sheets =IMPORT
Scraperwiki, Morph.io
Scraping tools
OutWit Hub
Import.io
Import.io
*
Chrome extensions:
*
Edit column >
Add column by fetching URLs…
https://ifttt.com/channels
Call it what you want
Put it where you want
*
*
*
Function (Arguments)
(aka parameters)
*
Function (arguments)
=SUM(A2:A50)
=AVERAGE(B2:B300)
=COUNTIF(A10:A3000,”Smith”)
*
Function (parameters)
=SUM(range of cells to be
summed)
=AVERAGE(range of cells to be
averaged)
=COUNTIF(range of cells ...
*
(“string”, index)
*
Tip: search for
documentation
*
Variable
*
Variables
*
Jargon checklist:
Function
Arguments
Parameters
String
Index
Variable
Documentation
IMPORTXML
IMPORTDATA
IMPORTFEED
Paul Bradshaw
Leanpub.com/scrapingforjournalists*
Thank you.
Scraping in 60 minutes
Scraping in 60 minutes
Scraping in 60 minutes
Scraping in 60 minutes
Scraping in 60 minutes
Scraping in 60 minutes
Scraping in 60 minutes
Scraping in 60 minutes
Scraping in 60 minutes
Upcoming SlideShare
Loading in …5
×

Scraping in 60 minutes

594 views

Published on

Presentation at CIJ Summer School 2016

Published in: Education
  • Be the first to comment

  • Be the first to like this

Scraping in 60 minutes

  1. 1. Paul Bradshaw Leanpub.com/scrapingforjournalists* Scraping in 60 mins
  2. 2. How do you scrape? Aron Pilhofer, News Rewired
  3. 3. WYSIWYG tools (Import.io, OutWit Hub) Google Sheets =IMPORT Scraperwiki, Morph.io Scraping tools
  4. 4. OutWit Hub
  5. 5. Import.io
  6. 6. Import.io
  7. 7. * Chrome extensions:
  8. 8. * Edit column > Add column by fetching URLs…
  9. 9. https://ifttt.com/channels
  10. 10. Call it what you want Put it where you want
  11. 11. *
  12. 12. *
  13. 13. * Function (Arguments) (aka parameters)
  14. 14. * Function (arguments) =SUM(A2:A50) =AVERAGE(B2:B300) =COUNTIF(A10:A3000,”Smith”)
  15. 15. * Function (parameters) =SUM(range of cells to be summed) =AVERAGE(range of cells to be averaged) =COUNTIF(range of cells to be counted,what to count)
  16. 16. * (“string”, index)
  17. 17. * Tip: search for documentation
  18. 18. * Variable
  19. 19. * Variables
  20. 20. * Jargon checklist: Function Arguments Parameters String Index Variable Documentation
  21. 21. IMPORTXML IMPORTDATA IMPORTFEED
  22. 22. Paul Bradshaw Leanpub.com/scrapingforjournalists* Thank you.

×