Your SlideShare is downloading. ×
Piggy Bank
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×

Saving this for later?

Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime - even offline.

Text the download link to your phone

Standard text messaging rates apply

Piggy Bank

1,626
views

Published on

07-0113 - SWT Presentation 01

07-0113 - SWT Presentation 01

Published in: Technology, Education

0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
1,626
On Slideshare
0
From Embeds
0
Number of Embeds
2
Actions
Shares
0
Downloads
0
Comments
0
Likes
0
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide

Transcript

  • 1. PIGGY BANK Experience the Semantic Web Inside Your Web Browser Qurat ul Ain 07-0113
  • 2. Road Map
    • Vision behind Piggy Bank
    • Piggy Bank Introduction
    • Screen Scrapers
    • Piggy Bank Approach
    • Semantic Bank
    • User Experiences
    • Demo - VacancyGuide Screen Scraper
    • Implementation
    • Architecture
    • Limitations and Recommendations
    • Conclusion
    • Acknowledgments
    • References
    2/2/2010 Piggy Bank
  • 3. Vision behind Piggy Bank
    • Semantic Web envisions a Web wherein information is offered free of presentation, allowing effective exchange across web sites and across web pages.
    • But without substantial Semantic Web content, few tools will be written to consume it; without many such tools, there is little appeal to publish Semantic Web content.
    2/2/2010 Piggy Bank
  • 4. Piggy Bank
    • Mozilla Firefox extension.
    • Turns the browser into a mashup platform, by allowing to extract data from different web sites and mix them together.
    • Wherever Semantic Web content is not available, Piggy Bank can invoke screen scrapers to re-structure information within web pages into Semantic Web format.
    2/2/2010 Piggy Bank
  • 5. Screen Scrapers
    • When the web page does not include the “pure” information itself, Piggy Bank needs to invoke the software code to extract such information. For each web page a different piece of software code is required to perform this translation — that piece of code is called a “screen scraper”.
    2/2/2010 Piggy Bank
  • 6. Available Scrapers
    • FlickrPhoto Scraper
    • Generic Web Page Scraper
    • LinkedIn Scraper
    • Ontoworld Upcoming Events Scraper
    • Orkut Friends Scraper
    • Presidential Candidates 2008 Scraper
    • VacancyGuide Screen Scraper
    2/2/2010 Piggy Bank
  • 7. Approach
  • 8. Approach
    • Piggy Bank, a tool that lets Web users extract pure information items from within web pages and save them in Semantic Web format (RDF).
    • Piggy Bank then lets users make use of these items right inside the same web browser. These items, collected from different sites, can now be browsed, searched, sorted, and organized together, regardless of their origins and types.
    2/2/2010 Piggy Bank
  • 9. Semantic Bank
    • For further experience of Semantic Web.
    • A common repository of RDF to which a community of Piggy Bank users can contribute to share the information they have collected.
    2/2/2010 Piggy Bank
  • 10. User Experiences
  • 11. 2/2/2010 Collecting Information Piggy Bank
  • 12. Sharing Information 2/2/2010 Piggy Bank
  • 13. Installation of Scrapers
    • It supports easy and safe installation of scrapers through the use of RDF.
    • To install a scraper in Piggy Bank, the user only needs to save its metadata into his/her Piggy Bank and then “activates” it.
    2/2/2010 Piggy Bank
  • 14. Demo - VacancyGuide Screen Scraper
  • 15.
    • http://simile.mit.edu/piggy-bank/screencasts/apartments.swf
    2/2/2010 Piggy Bank
  • 16. Implementation
  • 17. Piggy Bank
    • Piggy Bank is implemented as an extension to the web browser rather than as a stand-alone application.
      • Rich and familiar user interface of the web browser.
      • Lets users experience the benefits of Semantic Web technologies without much cost.
      • It also lets users enjoy the RDF-enabled sites while still improving their experience on the scrapable sites.
    2/2/2010 Piggy Bank
  • 18.
    • They selected Java as the Piggy Bank’s core implementation language. It gave them benefit in using the Java-based RDF libraries as well as for many other functionalities like for parsing RSS feeds and etc.
    • They made the act of collecting information items as lightweight as possible, first they make use of a status-bar icon to indicate that a web page is scrapable, and second they support collecting through a single-click on that same icon.
    2/2/2010 Piggy Bank
  • 19.
    • Piggy Bank uses any combination of the following three methods for collection:
      • Links from the current web page to Web resources in other formats are retrieved and are parsed into RDF.
      • Available XSL transformations are applied on the current web page’s DOM (Document Object Model).
      • Available Javascript code is run on the current web page’s DOM, retrieving other web pages to process if necessary.
    • They made use of a servlet capable of generating DHTML-based user interface, so that users can browse the collected information as web pages.
    2/2/2010 Piggy Bank
  • 20. Architecture
  • 21. 2/2/2010 Piggy Bank
  • 22. Semantic Bank
    • Semantic Bank shares a very similar architecture to the Java part of Piggy Bank. They both make use of the same servlet to serve the faceted browsing user interface.
    • It gives each of its subscribed members a different profile for persisting data while it keeps another profile where “published” information from all members gets mixed together.
    • Semantic Bank listens to HTTP POSTs sent by a user’s Piggy Bank to upload his/her data. All of the uploaded data goes into that user’s profile on the Semantic Bank, and those items marked as public are copied to the common profile.
    2/2/2010 Piggy Bank
  • 23. Limitations and Recommendations
  • 24.
    • Needs more Scrapers and more intellectual and radiant ideas.
    • Needs to be adopted widely to make a real and significant change on the Web.
    2/2/2010 Piggy Bank
  • 25.
    • Copyright issues?
    • How will copyright laws adapt to the nature of the information being redistributed?
    • Will scraper writers be held responsible for illegal use of the information their scrapers produce on a massive scale?
    • Is Semantic Bank responsible for checking for copyright infringement of information published to it?
    2/2/2010 Piggy Bank
  • 26.
    • In order to remain in control of their information, one might expect publishers to publish Semantic Web information themselves so to eliminate the need for scraping their sites.
    • They might include copyright information into every item they publish and hold Piggy Bank and Semantic Bank responsible for keeping that information intact as the items are moved about.
    • So that this evolution of Semantic Web would not have to be stopped.
    2/2/2010 Piggy Bank
  • 27. Conclusion
  • 28.
    • In adopting Piggy Bank, users immediately gain flexibility in the ways they use existing Web information without ever leaving their familiar web browser.
    • Through the use of Piggy Bank, as they consume Web information, they automatically produce Semantic Web information.
    • Through Semantic Bank, the information they have collected, on publishing, merges together smoothly, giving rise to higher-ordered patterns and structures. This is how the Semantic Web might emerge from the Web.
    2/2/2010 Piggy Bank
  • 29. Scraping The Web
    • Empowering Web users, giving them control over the information that they encounter. Even in the cases where the web sites do not publish Semantic Web information directly, users can still extract the data using scrapers.
    • It has introduced a means for users to integrate information from multiple sources on the Web at their own choice.
    2/2/2010 Piggy Bank
  • 30. Information wants to be Free!
    • This system goes beyond just collecting Semantic Web information but also enables users to publish the collected information back out to the Web.
    2/2/2010 Piggy Bank
  • 31. Acknowledgments
  • 32. Simile Project
    • This work is conducted by the Simile Project, a collaborative effort between the MIT Libraries, the Haystack group at MIT CSAIL, and the World Wide Web Consortium.
    2/2/2010 Piggy Bank
  • 33. References
  • 34.
    • http://simile.mit.edu/
    • http://simile.mit.edu/wiki/Piggy_Bank
    • http://simile.mit.edu/papers/iswc05.pdf
    • http://simile.mit.edu/piggy-bank/screencasts/apartments.swf
    2/2/2010 Piggy Bank
  • 35. Thankyou Folks!