Moving From Noise to Signal: Semantic Web
Semantic Web

Published in: Education

Transcript of "Semantic Web"

  1. Moving From Noise to Signal: Semantic Web
  2. Agenda
       • Introduction
       • Semantic Web
           • What is the Semantic Web?
           • Why does it matter?
           • How to semantify the Web?
       • Web 3.0
       • Linked Data
  3. Introduction
       • 1.3+ billion people connected to the web
       • 2006
           • 161 EB of information created/replicated (1 EB = 1 billion GB)
           • Technical information doubled every 2 years
       • By 2010
           • Sixfold growth to 988 EB (approximately 1 ZB)
           • Technical information will double every 72 hours
       • Computers, mobile phones, intelligent devices
       • Internet is broken – not one web – unable to communicate
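As a quick sanity check on the figures in slide 3, the arithmetic can be sketched as below. It assumes decimal (SI) units, which the slide's "1 EB = 1 billion GB" implies; the variable names are illustrative only.

```python
# Decimal (SI) byte units, as implied by "1 EB = 1 billion GB" on the slide.
GB = 10**9
EB = 10**18
ZB = 10**21

created_2006_eb = 161    # figure quoted on the slide
forecast_2010_eb = 988   # figure quoted on the slide

gb_per_eb = EB // GB                                 # how many GB fit in one EB
growth_factor = forecast_2010_eb / created_2006_eb   # the "six times" claim
forecast_2010_zb = forecast_2010_eb * EB / ZB        # the "approx 1 ZB" claim

print(f"1 EB = {gb_per_eb:,} GB")
print(f"2006 to 2010 growth: {growth_factor:.1f}x")
print(f"2010 forecast: {forecast_2010_zb:.3f} ZB")
```

The growth factor comes out slightly above six, and 988 EB is just under one zettabyte, so both rounded claims on the slide are consistent with the quoted figures.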
  4. Information Overload
     Is that really how the Web experience is supposed to feel?
       • Key problem – how to share meaning?
       • Filtering, not aggregating.
       • Not more, just smarter.
  5. Semantics?
       • Related to syntax
       • Syntax – how you say something (letters, punctuation, grammar), e.g. HTML
       • Semantics – the meaning behind what you say
       • Example (same meaning, different syntax):
           • I Love Technology
           • I ♥ Technology
  6. What’s the big deal?
       • Internet
           • Standard way to communicate
           • Parrot – mimics without understanding
       • The Web
           • Store and retrieve docs on the internet
           • Syntax to display the doc (HTML)
       • Search Engines
           • Find any website that we want
     Life is good!!! Can we make it any better?? How??
  7. The Answer – Semantic Web
       • Understand the meaning behind webpages
       • Web of Things vs Web of Documents
       • Things can be ANYTHING – people, places, pets, events, music, movies, organizations…
       • Not only identify these things but also the relationships between them (human-like!!!)
       • Embed semantics in HTML docs – microformats, RDF
       • It’s not about the future… it’s about today!!!
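The "things plus relationships" idea in slide 7 can be sketched as a tiny in-memory set of subject-predicate-object statements. This is only an illustration: the `http://example.org/` identifiers, the `Triple` type, and the `match` helper are hypothetical names, and a real system would use an RDF library and published vocabularies instead.

```python
from typing import List, NamedTuple, Optional

EX = "http://example.org/"  # hypothetical namespace, for illustration only


class Triple(NamedTuple):
    subject: str
    predicate: str
    obj: str


# A few "things" (a person, a pet, an event) and the relationships between them.
TRIPLES = [
    Triple(EX + "alice", EX + "type", EX + "Person"),
    Triple(EX + "rex", EX + "type", EX + "Pet"),
    Triple(EX + "alice", EX + "owns", EX + "rex"),
    Triple(EX + "alice", EX + "attended", EX + "concert42"),
]


def match(subject: Optional[str] = None,
          predicate: Optional[str] = None,
          obj: Optional[str] = None) -> List[Triple]:
    """Return the triples that fit the pattern; None acts as a wildcard."""
    return [
        t for t in TRIPLES
        if (subject is None or t.subject == subject)
        and (predicate is None or t.predicate == predicate)
        and (obj is None or t.obj == obj)
    ]


# Everything stated about alice, regardless of the kind of relationship.
for t in match(subject=EX + "alice"):
    print(t.predicate, "->", t.obj)
```

Because every thing and every relationship is named by an identifier, a program can ask "what does alice own?" without parsing any prose, which is the contrast with a web of documents.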
  8. The Possibilities
  10. Why Semantic Web?
       • Spend less time searching
       • Spend less time looking at things that do not matter
       • Spend less time explaining what we want to computers
       • Bottom line – improve the online experience!!!
  11. Cartoon by Geek and Poke
  12. It’s all about the noise…
       • Web 1.0: Get (hear & see) ‘Noise’
       • Web 2.0: Make Noise
       • Web 3.0: Filter the Noise
       • Web 4.0: Going deaf… or SmartNoise
  13. Semantifying the Web - Approaches
       • Bottom Up
           • Annotating information in web pages with machine-readable tags
       • Technical Challenges
           • Representational complexity
           • How to create – manual/automatic?
           • How much can be transformed?
           • Standards issue
       • Business Challenges
           • It’s primitive
           • Consumer value?
           • How to market?
       • Recent Wins:
           • Yahoo search engine to support RDF and microformats (MF)
           • Dapper – automated annotation tool
  14. Annotation Technologies
       • Trade-off between simplicity and completeness
       • RDF
           • Graph based – things, attributes, relationships
           • Precise but complex
           • Statements are triples (subject, predicate, object)
       • Microformats
           • Use specific HTML class names (the same hooks CSS selectors use)
           • Compact
           • Embedded in HTML
           • Gaining popularity because of their simplicity
           • Popular microformats:
               • hCard: describes personal and company contact information
               • hReview: adds meta information to review pages
               • hCalendar: used to describe events
           • Limitations
               • No way to describe type hierarchies
               • Somewhat cryptic, because the focus is on keeping the annotations to a minimum
           • Adopters: Flickr, Eventful, and LinkedIn
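To make the microformat bullet concrete, here is a minimal sketch of reading hCard-style annotations with Python's standard `html.parser`. The class names `fn`, `org`, and `tel` are real hCard properties; the sample markup, the `HCardParser` class, and the decision to ignore everything else are assumptions made for illustration, not a complete hCard implementation.

```python
from html.parser import HTMLParser

# A small subset of hCard property class names handled by this sketch.
HCARD_PROPERTIES = {"fn", "org", "tel"}
# Elements without a closing tag; skipped so the open-element stack stays aligned.
VOID_ELEMENTS = {"br", "img", "hr", "input", "meta", "link"}


class HCardParser(HTMLParser):
    """Collect the text of elements whose class attribute names an hCard property."""

    def __init__(self):
        super().__init__()
        self.card = {}
        self._stack = []  # one entry per open element: a property name or None

    def handle_starttag(self, tag, attrs):
        if tag in VOID_ELEMENTS:
            return
        classes = (dict(attrs).get("class") or "").split()
        prop = next((c for c in classes if c in HCARD_PROPERTIES), None)
        self._stack.append(prop)

    def handle_endtag(self, tag):
        if tag not in VOID_ELEMENTS and self._stack:
            self._stack.pop()

    def handle_data(self, data):
        # The innermost enclosing property, if any, owns this text.
        prop = next((p for p in reversed(self._stack) if p), None)
        text = data.strip()
        if prop and text:
            self.card[prop] = (self.card.get(prop, "") + " " + text).strip()


# Hypothetical page fragment annotated with hCard class names.
sample = """
<div class="vcard">
  <span class="fn">Jane Doe</span>,
  <span class="org">Example Corp</span>
  <span class="tel">+1-555-0100</span>
</div>
"""

parser = HCardParser()
parser.feed(sample)
print(parser.card)
```

The markup stays ordinary HTML that a browser renders as usual; only the class names carry the extra meaning, which is why the slide calls microformats compact but somewhat cryptic.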
  15. Semantifying the Web - Approaches
       • Top Down
           • Focused on leveraging information in existing web pages
           • As-is
           • NLP tools (entity extraction)
               • Calais & TextWise – APIs that recognize people, companies, places in docs
               • Vertical search engines – ZoomInfo, Spock & Retrevo
               • Dapper, BlueOrganizer, ClearForest – recognize objects in web pages & annotate them
               • Yahoo! Shortcuts, Snap, Smartlinks – recognize objects in text and links
       • Challenges
           • Not 100% accurate; has ambiguities
           • May not scale well
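The top-down approach and its ambiguity problem can be illustrated with a deliberately naive dictionary lookup. This is not how the services named on the slide work; the `GAZETTEER` entries, the `extract_entities` function, and the sample sentence are all made up for the sketch.

```python
import re

# Hypothetical lookup table mapping known names to entity types.
GAZETTEER = {
    "Tim Berners-Lee": "Person",
    "Yahoo": "Company",
    "Paris": "Place",
}


def extract_entities(text):
    """Return (surface form, type, start offset) for each gazetteer hit, in text order."""
    hits = []
    for name, kind in GAZETTEER.items():
        pattern = r"\b" + re.escape(name) + r"\b"
        for m in re.finditer(pattern, text):
            hits.append((m.group(0), kind, m.start()))
    return sorted(hits, key=lambda hit: hit[2])


text = ("Tim Berners-Lee spoke in Paris; Paris Hilton did not attend. "
        "Yahoo indexed the talk.")

for surface, kind, _ in extract_entities(text):
    print(f"{surface}: {kind}")
```

The second "Paris" is labelled a place even though it is part of a person's name, which is the kind of ambiguity the slide lists under challenges. Real entity extractors use context to reduce such errors, but do not eliminate them.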
  16. Map+ add-on for Firefox; vertical search engine Spock
  17. More Annotations → Structured Web → More Precise Top-Down
  18. Web 3.0 = Semantic Web = Linked Data
       • Are they equal??
  20. Structured Data
       • RDBMS
           • Powerful and flexible
           • Pre-defined relationships and usage of data
           • Too constraining and too structured
               • Schema changes are expensive
               • Virtually impossible to make different DBs talk to each other
       • Linked Data
           • Establishes linkages at the data level (RDF)
           • Bridges the gap between unstructured and structured data
           • Does not add any semantic meaning to the information
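The contrast in slide 20 can be sketched as follows: two independently produced datasets are combined without any schema change, simply because they refer to the same thing by the same identifier. The `http://example.org/id/...` URIs, the property names, and the `merge` helper are hypothetical; real Linked Data would use RDF and dereferenceable URIs.

```python
from collections import defaultdict

# Hypothetical statements from two unrelated sources, as (subject, property, value).
source_a = [
    ("http://example.org/id/alice", "name", "Alice"),
    ("http://example.org/id/alice", "worksFor", "http://example.org/id/acme"),
]
source_b = [
    ("http://example.org/id/acme", "name", "Acme Ltd"),
    ("http://example.org/id/acme", "locatedIn", "Berlin"),
]


def merge(*sources):
    """Combine statements from any number of sources into one graph keyed by subject."""
    graph = defaultdict(dict)
    for source in sources:
        for subject, prop, value in source:
            graph[subject].setdefault(prop, []).append(value)
    return graph


graph = merge(source_a, source_b)

# Follow the link from alice to her employer, then read data the other source supplied.
employer_uri = graph["http://example.org/id/alice"]["worksFor"][0]
employer_name = graph[employer_uri]["name"][0]
print(employer_name)
```

Note that the merge only connects the data. Nothing here tells a program what "worksFor" means, which matches the slide's point that Linked Data does not itself add semantic meaning.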
  21. Linked Data
       • Medium for the semantic web
       • It does not create smart data, only enables it
       • Relies on clean, granular, structured data
       • Pre-structured
       • Pre-connected
  22. Further Reading
       • RDF, OWL, Microformats, FOAF
       • Linked Data
       • Semantic APIs
