Web Scraping and Data Extraction Service

Learn more about Web Scraping and data extraction services. We have covered various points about scraping, extraction and converting un-structured data to structured format. For more info visit http://promptcloud.com/

Usage Rights

© All Rights Reserved

Web Scraping and Data Extraction Service Presentation Transcript

  • 1. Image Credits: codeatomic
  • 2. What is Web Scraping • Web scraping means using an application to process the HTML of a Web page and extract data for manipulation, such as converting the page to another format (e.g. HTML to XML). • It is also known as Web Harvesting and Web Data Extraction.
  • 3. Web Scraping Architecture Image Credits: dotnet4features
  • 4. • Web scraping scripts and applications simulate a person viewing a Web site with a browser. Using these scripts, you can connect to a Web site and request a page exactly as a browser would. • The Web server sends back the page, which you can then manipulate or extract specific information from.
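
The request-and-extract flow described in slide 4 can be sketched in Python using only the standard library. This is a minimal illustration, not the method used by any particular scraping service: the names `fetch_html`, `TitleAndLinkParser`, and `extract_title_and_links`, and the User-Agent string, are assumptions made for this example.

```python
from html.parser import HTMLParser
from urllib.request import Request, urlopen


def fetch_html(url: str, timeout: float = 10.0) -> str:
    """Request a page over HTTP, as a browser would, and return its HTML."""
    # The User-Agent value is a made-up example, not a required string.
    request = Request(url, headers={"User-Agent": "example-scraper/0.1"})
    with urlopen(request, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


class TitleAndLinkParser(HTMLParser):
    """Collect the page title and every link target found in <a href=...>."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.links = []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        # Text is only accumulated while inside the <title> element.
        if self._in_title:
            self.title += data


def extract_title_and_links(html: str):
    """Return (title, links) extracted from an HTML document string."""
    parser = TitleAndLinkParser()
    parser.feed(html)
    parser.close()
    return parser.title.strip(), parser.links
```

In use, `extract_title_and_links(fetch_html(url))` would fetch a page and pull out its title and links. Separating the network request from the parsing keeps the extraction step testable on saved HTML.
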
  • 5. Converting Unstructured data to Structured data Image Credits: netscavator
  • 6. • The content obtained from scraping is largely unstructured, and structuring it is the tedious part of the process. Most tools today handle this step by segregating the data into fields. After segregation, the data is delivered either through an API or in a format such as: • CSV • XML • XLS • JSON
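
Once scraped data has been segregated into fields, writing it out in the formats listed in slide 6 is mostly mechanical. The sketch below covers CSV, JSON, and XML with the Python standard library; XLS is left out because it needs a third-party library. The function names and the sample record layout are assumptions made for this example.

```python
import csv
import io
import json
import xml.etree.ElementTree as ET


def to_csv(records, fields):
    """Serialize a list of dicts as CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fields, lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


def to_json(records):
    """Serialize a list of dicts as a JSON array."""
    return json.dumps(records, indent=2)


def to_xml(records, root_tag="records", item_tag="record"):
    """Serialize a list of dicts as XML, one child element per field.

    Field names are used as tag names, so they must be valid XML names.
    """
    root = ET.Element(root_tag)
    for record in records:
        item = ET.SubElement(root, item_tag)
        for field, value in record.items():
            ET.SubElement(item, field).text = str(value)
    return ET.tostring(root, encoding="unicode")
```

Each function takes the same list of field dictionaries, for example `[{"name": "Widget", "price": "9.99"}]`, so one extraction step can feed every output format.
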
  • 7. Web Indexing Image Credits: iloveldsclothing
  • 8. • Web scraping is closely related to web indexing, in which a bot or web crawler indexes information on the web. Indexing is a technique adopted by most search engines.
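
To make the idea of indexing in slide 8 concrete, here is a toy inverted index in Python that maps each word to the pages containing it. It assumes the pages have already been crawled and reduced to plain text; the names `build_index` and `search` are hypothetical, and real search engines are far more elaborate than this sketch.

```python
import re
from collections import defaultdict


def build_index(pages):
    """Map each lowercase word to the set of URLs whose text contains it.

    `pages` is a dict of {url: page_text} for pages already crawled.
    """
    index = defaultdict(set)
    for url, text in pages.items():
        for word in re.findall(r"[a-z0-9]+", text.lower()):
            index[word].add(url)
    return index


def search(index, word):
    """Return the sorted list of URLs containing `word` (case-insensitive)."""
    return sorted(index.get(word.lower(), set()))
```

A crawler would keep adding pages to such an index as it discovers them, and a lookup then returns the matching URLs without rereading any page.
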
  • 9. Uses of Scraping Services Image Credits: agconexus
  • 10. The following are some of the uses of scraping services: • Online price comparison • Contact scraping • Weather data monitoring • Website change detection • Collecting data for research work • Web mashups • Web data integration • Scraping articles, blogs and other content • Social media crawling • Crawling review data
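
Of the uses listed in slide 10, website change detection is one of the simplest to sketch: store a fingerprint of the page and compare it on the next visit. The Python example below is one possible approach, assuming that whitespace-only differences should be ignored; `fingerprint` and `has_changed` are hypothetical names.

```python
import hashlib


def fingerprint(html: str) -> str:
    """Return a SHA-256 hash of the page with runs of whitespace collapsed."""
    normalized = " ".join(html.split())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def has_changed(previous_fingerprint: str, html: str) -> bool:
    """Report whether the page content differs from the stored fingerprint."""
    return fingerprint(html) != previous_fingerprint
```

Hashing the whole page flags any edit, including rotating ads or timestamps, so in practice the fingerprint is often computed over only the extracted fields of interest.
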
  • 11. Outsourcing SLA for web crawl Image Credits: cpltechnology
  • 12. If you plan to outsource web crawling or scraping services, consider the following SLA criteria: • Crawlability • Scalability • Data structure capabilities • Data accuracy • Data coverage • Availability • Adaptability • Maintainability
  • 13. For more information, visit http://blog.promptcloud.com/ or reach out to info@promptcloud.com
  • 14. Visit http://promptcloud.com/