Semantic Web Applications Horatiu Mocian, 04.05.2011
Published in: Technology, Education
Transcript of "SemanticWebApp"

  1. Semantic Web Applications. Horatiu Mocian, 04.05.2011
  2. Use of Semantic Web
     - Semantic knowledge bases
       - Freebase, DBpedia
     - Semantic tagging tools
       - OpenCalais, Zemanta
     - Semantic search
       - TrueKnowledge, Google Squared
     - Data integration
  3. Semantic knowledge bases
     - Databases where information is stored in a semantic web format (RDF) and organised in ontologies
     - Most famous
       - Freebase (www.freebase.com)
       - DBpedia (www.dbpedia.org)
  4. Semantic tagging tools
     - Tools that automatically identify, disambiguate and mark (tag) named entities in a text
     - Commercial: OpenCalais, Zemanta, ...
     - Academic: GATE, Stanford NER, ...
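The "identify and mark" part of the slide above can be sketched with a toy dictionary lookup. This is only an illustration, not how OpenCalais, Zemanta, GATE or Stanford NER work: the `GAZETTEER` table, the `tag_entities` function and the bracketed tag format are all assumptions made up for this example, and the disambiguation step is skipped entirely.

```python
import re

# Hypothetical gazetteer (assumption for this sketch): surface form -> entity type.
GAZETTEER = {
    "Thomson Reuters": "Company",
    "Reuters": "Company",
    "Google": "Company",
    "Metaweb": "Company",
    "OpenCalais": "Product",
}

# Try longer names first so "Thomson Reuters" wins over the shorter "Reuters".
_names = sorted(GAZETTEER, key=len, reverse=True)
_PATTERN = re.compile(r"\b(?:" + "|".join(re.escape(n) for n in _names) + r")\b")


def tag_entities(text: str) -> str:
    """Wrap every known entity name in an inline [Type: name] tag."""
    def mark(match: re.Match) -> str:
        name = match.group(0)
        return f"[{GAZETTEER[name]}: {name}]"
    return _PATTERN.sub(mark, text)


tagged = tag_entities("Google acquired Metaweb, and Thomson Reuters uses OpenCalais.")
print(tagged)
```

A real tagger replaces the fixed table with statistical models and resolves ambiguous names against a knowledge base; the lookup only shows what "marking" an entity means.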
  5. Web search engines
     - Traditional search engines
       - Google, Bing, Yahoo!
     - Structured search engines
       - WolframAlpha, TripAdvisor
     - Semantic search engines
       - TrueKnowledge, Twine (extinct), Powerset (extinct), Google Squared
  6. Semantic search engines
     - Two distinctive features
       - Analyze indexed documents semantically
       - Analyze the query text semantically, so they can answer natural-language questions
     - Not yet mainstream
       - Powerset and Twine were acquired
       - Google and Bing use semantic features
  7. Natural Language Processing
     - Building block of the semantic web
     - Subfields
       - Machine Translation
       - Automatic Summarization
       - Named Entity Recognition
       - Speech Recognition
       - POS Tagging
  8. RDF – a reminder
     - Triple: Subject, Predicate, Object (Value)
     - Formats: RDF/XML, N-Triples, Turtle
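As a rough illustration of the triple structure on the slide above, the sketch below models one statement and writes it out as an N-Triples line. The `Triple` class, the `to_ntriples` helper and the example.org IRIs are hypothetical names chosen for this example; escaping is simplified and a real serializer handles more cases.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    """One RDF statement: subject, predicate, object (the value)."""
    subject: str                  # IRI of the thing being described
    predicate: str                # IRI of the property
    obj: str                      # IRI, or a plain literal value
    obj_is_literal: bool = False  # True when obj is text rather than an IRI


def to_ntriples(t: Triple) -> str:
    """Serialize one triple as a single N-Triples line.

    Simplification: only backslash and double quote are escaped in literals;
    newlines and other control characters are not handled here.
    """
    if t.obj_is_literal:
        escaped = t.obj.replace("\\", "\\\\").replace('"', '\\"')
        obj = f'"{escaped}"'
    else:
        obj = f"<{t.obj}>"
    return f"<{t.subject}> <{t.predicate}> {obj} ."


# Placeholder data: the example.org IRIs are invented for illustration.
triple = Triple(
    subject="http://example.org/talk/1",
    predicate="http://example.org/vocab/title",
    obj="Semantic Web Applications",
    obj_is_literal=True,
)
line = to_ntriples(triple)
print(line)
```

The same statement could equally be written in Turtle or RDF/XML; N-Triples is used here only because each statement fits on one line.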
  9. List of applications
     - Freebase
     - DBpedia
     - data.nytimes.com
     - OpenCalais, Zemanta
     - TrueKnowledge, Google Squared
     - Bing
     - Microformats + Google, Bing
  10. Freebase
     - www.freebase.com
     - One of the largest semantic knowledge bases (2nd?)
     - Created by the community (large input of data from Wikipedia)
     - Over 22,000,000 topics (361M triples)
     - Its parent, Metaweb Technologies, acquired by Google for $700 million in 2010
  11. DBpedia
     - http://dbpedia.org/About
     - Largest semantic knowledge base
     - "A structured Wikipedia"
     - Open community project
     - 3.5 million things (672M triples)
  12. NY Times Linked Open Data
     - http://data.nytimes.com
     - NY Times is one of the largest media companies and news organizations
     - Started publishing semantic data in 2009
     - Currently, it has 10,000 topics
     - Another 30,000 topics will be published
     - A large effort by a large media company
  13. OpenCalais
     - http://viewer.opencalais.com
     - Best NLP tool (for business articles in English)
     - Developed by ClearForest, which was acquired by Reuters (now Thomson Reuters) for $25 million in 2007
     - Used internally to tag Thomson Reuters texts
  14. Zemanta
     - http://www.zemanta.com/
     - Finds entities in texts and enriches them with links and images
     - Useful for blogging
     - Example of bottom-up approach
     - API provides information in RDF format
  15. TrueKnowledge
     - www.trueknowledge.com
     - Pure semantic search (one of the few)
     - Company based in Cambridge, UK
     - 19,000,000 things (531M facts)
  16. Google Squared
     - http://www.google.com/squared/
     - Google's natural search engine
     - Developed by Google NY office
     - Launched in 2009
  17. Bing
     - www.bing.com
     - Microsoft's search engine
     - Handles Yahoo! searches
     - Steadily catching up to Google
     - Has semantic abilities after acquiring Powerset in 2008
     - Examples: recipes, travel, events
  18. RDF Microformats
     - HTML tags that can be interpreted semantically
     - More lightweight than proper RDF, easier to integrate in web pages
     - Most common: hCard, hCalendar, hReview, hResume, hRecipe
     - Recognized by both Google and Bing
     - Example: game reviews on IGN.com
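To make "HTML tags that can be interpreted semantically" concrete, here is a minimal sketch that reads an hCard out of ordinary HTML using only Python's standard `html.parser`. The sample markup, the person and organisation in it, and the `HCardParser` class are invented for illustration; `vcard`, `fn`, `org` and `locality` are hCard class names. The sketch assumes well-nested markup without unclosed void elements such as `<br>`, and it is not how Google or Bing actually process microformats.

```python
from html.parser import HTMLParser

# Hypothetical page fragment: plain HTML whose class names follow hCard.
SAMPLE_HTML = """
<div class="vcard">
  <span class="fn">Jane Doe</span>
  <span class="org">Example Corp</span>
  <span class="locality">Cambridge</span>
</div>
"""


class HCardParser(HTMLParser):
    """Collect the text of elements whose class is a known hCard property."""

    PROPERTIES = {"fn", "org", "locality"}

    def __init__(self) -> None:
        super().__init__()
        self.card: dict[str, str] = {}
        self._stack: list[str | None] = []  # per open element: property or None

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        prop = next((c for c in classes if c in self.PROPERTIES), None)
        self._stack.append(prop)

    def handle_endtag(self, tag):
        if self._stack:
            self._stack.pop()

    def handle_data(self, data):
        # Keep text only when the innermost open element is an hCard property.
        if self._stack and self._stack[-1] and data.strip():
            self.card[self._stack[-1]] = data.strip()


parser = HCardParser()
parser.feed(SAMPLE_HTML)
print(parser.card)
```

The page still renders as normal HTML for a human reader; the class names are the only addition, which is what makes microformats lighter to adopt than embedding full RDF.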
  19. Interested in more?
     - TripIt (tripit.com)
     - Evri (evri.com)
     - Inform (inform.com)
     - Daylife (daylife.com)
     - Google "semantic web applications"
  20. Thank you. Questions?
