Epiphany: Adaptable RDFa Generation Linking the Web of Documents to the Web of Data

This presentation is about Epiphany, a system that automatically generates RDFa annotated versions of web pages based on information from Linked Data models.

Published in: Technology, Design

  1. Epiphany: Adaptable RDFa Generation Linking the Web of Documents to the Web of Data
     Benjamin Adrian, Jörn Hees, Ivan Herman, Michael Sintek, Andreas Dengel
  2. Outline
     • Web of Documents vs. Web of Data
     • RDFa, the glue combining both worlds
     • Use Linked Data for RDFa generation
     • Extract RDF from web pages
     • Visualize RDFa via Linked Data
     • Evaluation and comparison with Open Calais
  3. Web of Documents (World Wide Web, WWW)
     Features
     • Distributed textual content
     • Addressed by URLs
     • Layout in HTML, CSS
     • Connected with hyperlinks
     • Access via HTTP
     Made for human readers!
  4. Web of Data (Linked Open Data)
     Features
     • Distributed data sets
     • Addressed by URIs
     • Format is RDF
     • Connected with RDF links
     • Access via HTTP
     Made for machine readers!
  5. Plain HTML
     RDFa – Adding machine-readable data to web documents
     <BODY>
       <H1>Epiphany</H1>
       <UL>
         <LI>Ben</LI>
         <LI>Jörn</LI>
         <LI>Ivan</LI>
         <LI>Michael</LI>
         <LI>Andreas</LI>
       </UL>
     </BODY>
     RDF in HTML attributes
  6. HTML + RDFa
     RDFa – Adding machine-readable data to web documents
     <BODY vocab="http://xmlns.com/foaf/0.1/" prefix="dc=http://purl.org/dc/terms/">
       <H1 about="#epiphany" property="dc:title">Epiphany</H1>
       <UL inv="member">
         <LI typeof="Person" property="name">Ben</LI>
         <LI typeof="Person" property="name">Jörn</LI>
         <LI typeof="Person" property="name">Ivan</LI>
         <LI typeof="Person" property="name">Michael</LI>
         <LI typeof="Person" property="name">Andreas</LI>
       </UL>
     </BODY>
     RDF in HTML attributes
  7. RDF
     RDFa – Adding machine-readable data to web documents
     @prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
     @prefix foaf: <http://xmlns.com/foaf/0.1/> .
     @prefix dc: <http://purl.org/dc/terms/> .
     <#epiphany> dc:title "Epiphany" .
     [] foaf:name "Ben" ; rdf:type foaf:Person ; foaf:member <#epiphany> .
     [] foaf:name "Jörn" ; rdf:type foaf:Person ; foaf:member <#epiphany> .
     [] foaf:name "Ivan" ; rdf:type foaf:Person ; foaf:member <#epiphany> .
     [] foaf:name "Michael" ; rdf:type foaf:Person ; foaf:member <#epiphany> .
     [] foaf:name "Andreas" ; rdf:type foaf:Person ; foaf:member <#epiphany> .
     RDF in Turtle notation
  8. A bridge from document to data
     [Diagram labels: RDFa, WWW]
  9. A bridge from document to data
     [Diagram labels: RDFa, WWW]
     Benefits
     • RDFa is easy to generate with a CMS (e.g., Drupal) and other dynamic content providers.
     • It is easy to annotate well-structured data with RDFa.
     Open Problems
     • How to annotate unstructured plain-text content?
     • How to annotate the same documents differently based on different databases?
  10. Epiphany
      How to consume Linked Data for RDFa generation
      • Generate RDFa-annotated versions of web pages on the fly.
      • Generate different RDFa annotations depending on the Linked Data model used.
      • Create interactive boxes filled with additional information about annotated resources.
      [Diagram: HTML and a Linked Data Model go into Epiphany RDFa Generation, which outputs HTML+RDFa]
  11. Epiphany - Example
      How to consume Linked Data for RDFa generation
      Input: Tim Burton is a movie maker.
      Output: <span about="http://dbpedia.org/resource/Tim_Burton" property="foaf:name">Tim Burton</span> is a movie maker.
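The example above can be sketched in a few lines of Python. This is only an illustration of how the same sentence yields different annotations under different models, not Epiphany's implementation. The function `annotate`, the dictionary `DBPEDIA_MODEL`, and the second model with its `example.org` URI are hypothetical names introduced here.

```python
import re

# Hypothetical toy models: literal value -> subject URI.
DBPEDIA_MODEL = {"Tim Burton": "http://dbpedia.org/resource/Tim_Burton"}
EXAMPLE_MODEL = {"movie maker": "http://example.org/occupation/movie_maker"}  # made-up URI

def annotate(text, model, prop="foaf:name"):
    """Wrap every literal known to the model in a SPAN with RDFa attributes."""
    if not model:
        return text
    # Longest literals first, so longer names win over their prefixes.
    literals = sorted(model, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(lit) for lit in literals))

    def wrap(match):
        literal = match.group(0)
        return '<span about="{}" property="{}">{}</span>'.format(
            model[literal], prop, literal)

    return pattern.sub(wrap, text)

sentence = "Tim Burton is a movie maker."
print(annotate(sentence, DBPEDIA_MODEL))
print(annotate(sentence, EXAMPLE_MODEL, prop="rdfs:label"))
```

Swapping the model changes which part of the sentence is annotated, while the page text itself stays the same.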
  12. Epiphany
      How to extract RDF from web pages
      [Architecture diagram components: HTML, Preprocessing, Ontology-based Information Extraction, Extraction Pipeline, Linked Data Model, RDF graph, RDF graph store, Cache, RDFa Generator, HTML+RDFa]
  13. Ontology-based Information Extraction
      RDF-based Information Extraction
      Ontology-based Information Extraction Pipeline (stage: output)
      • Text Normalization: "Ben is member of RDFa WG."
      • Text Segmentation: "Ben", "is", "member", "of", "RDFa WG", "."
      • Symbol Recognition: [] foaf:name "Ben" . [] foaf:name "RDFa WG" .
      • Instance Recognition: <#me> foaf:name "Ben" . <#RDFaWG> foaf:name "RDFa WG" .
      • Contextual Fact Recognition: <#me> foaf:member <#RDFaWG> .
      • RDF Generation: <#me> foaf:name "Ben" . <#RDFaWG> foaf:name "RDFa WG" . <#me> foaf:member <#RDFaWG> .
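The first stages of this pipeline can be approximated with a toy sketch. It assumes a tiny in-memory model that maps `foaf:name` literals to instance URIs. The names `MODEL`, `normalize`, `segment`, `recognize_symbols`, `recognize_instances`, and `extract` are hypothetical, and contextual fact recognition is left out.

```python
import re

# Hypothetical toy model: foaf:name literal -> instance URI.
MODEL = {"Ben": "<#me>", "RDFa WG": "<#RDFaWG>"}

def normalize(text):
    """Text normalization: collapse runs of whitespace."""
    return " ".join(text.split())

def segment(text, phrases):
    """Text segmentation: known phrases first, then words and punctuation."""
    alternatives = [re.escape(p) for p in sorted(phrases, key=len, reverse=True)]
    alternatives += [r"\w+", r"[^\w\s]"]
    return re.findall("|".join(alternatives), text)

def recognize_symbols(tokens, model):
    """Symbol recognition: keep tokens that equal a known literal."""
    return [token for token in tokens if token in model]

def recognize_instances(symbols, model):
    """Instance recognition: resolve each symbol to an instance URI."""
    return {symbol: model[symbol] for symbol in symbols}

def extract(text, model):
    """Run the stages and emit one Turtle statement per recognized instance."""
    tokens = segment(normalize(text), model)
    instances = recognize_instances(recognize_symbols(tokens, model), model)
    return ['{} foaf:name "{}" .'.format(uri, name)
            for name, uri in instances.items()]

for statement in extract("Ben is   member of RDFa WG.", MODEL):
    print(statement)
```

A dictionary lookup stands in for the model here. A real system would also have to disambiguate literals shared by several instances, which this sketch ignores.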
  14. RDFa Generation
      How to create semantic annotations
      • Request HTML
      • Tidy to XHTML
      • DOM node traversal
      Input: list of RDF triples with literal object values
        am:Burton foaf:name "Tim Burton" .
        am:august rdfs:label "August" .
        am:autor foaf:name "Autor" .
        am:film foaf:name "Film" .
  15. RDFa Generation
      How to create semantic annotations
      • Request HTML
      • Tidy to XHTML
      • DOM node traversal
      • For each text node: create an html:SPAN element with RDFa attributes around matches
      Input: list of RDF triples with literal object values
        am:Burton foaf:name "Tim Burton" .
        am:august rdfs:label "August" .
        am:autor foaf:name "Autor" .
        am:film foaf:name "Film" .
  16. RDFa Generation
      How to create semantic annotations
      • Request HTML
      • Tidy to XHTML
      • Add a link to the RDF graph to the header
      • DOM node traversal
      • For each text node: create an html:SPAN element with RDFa attributes around matches
      • Add a js:onclick listener to elements with RDFa:about attributes
      Input: list of RDF triples with literal object values
        am:Burton foaf:name "Tim Burton" .
        am:august rdfs:label "August" .
        am:autor foaf:name "Autor" .
        am:film foaf:name "Film" .
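The text-node step can be sketched with Python's standard `xml.dom.minidom`, assuming the page has already been tidied to well-formed XHTML. The function name `annotate_xhtml` and the shape of its `literals` argument are assumptions made for this illustration. The header link and the onclick listener are not covered.

```python
from xml.dom import minidom

def annotate_xhtml(xhtml, literals):
    """Wrap the first match in each text node in a SPAN with RDFa attributes.

    literals maps a literal value to a (subject URI, property) pair.
    """
    doc = minidom.parseString(xhtml)
    # Collect text nodes up front because the tree is modified below.
    text_nodes, stack = [], [doc.documentElement]
    while stack:
        for child in stack.pop().childNodes:
            if child.nodeType == child.TEXT_NODE:
                text_nodes.append(child)
            elif child.nodeType == child.ELEMENT_NODE:
                stack.append(child)
    for node in text_nodes:
        for literal, (subject, prop) in literals.items():
            start = node.data.find(literal)
            if start < 0:
                continue
            match = node.splitText(start)   # node keeps the text before the match
            match.splitText(len(literal))   # the tail becomes a new sibling
            span = doc.createElement("span")
            span.setAttribute("about", subject)
            span.setAttribute("property", prop)
            match.parentNode.replaceChild(span, match)
            span.appendChild(match)
            break  # simplification: one annotation per text node
    return doc.documentElement.toxml()

triples = {"Tim Burton": ("http://dbpedia.org/resource/Tim_Burton", "foaf:name")}
print(annotate_xhtml("<p>Tim Burton is a movie maker.</p>", triples))
```

Working on text nodes rather than on the raw markup keeps tag names and attribute values from being matched by accident.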
  17. RDFa Visualization
      Use RDF to generate Epiphanies
      Participants: Browser, Epiphany, Linked Data
      • onclick event on <SPAN/> elements
      • AJAX call to Information Provider
      • GET /resource/Tim_Burton HTTP/1.1
        Host: dbpedia.org
        Accept: RDF
      • Render RDF data in HTML as a lightbox
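The lookup request on this slide can be sketched with Python's standard library. The slide only says "RDF" for the Accept header, so the media type `application/rdf+xml` used below is an assumption, and `rdf_request` is a hypothetical helper name. The request is only constructed, not sent.

```python
from urllib.request import Request

def rdf_request(resource_uri, media_type="application/rdf+xml"):
    """Build a GET request that asks the server for an RDF representation."""
    return Request(resource_uri, headers={"Accept": media_type}, method="GET")

request = rdf_request("http://dbpedia.org/resource/Tim_Burton")
print(request.get_method(), request.full_url)
print("Accept:", request.get_header("Accept"))
```

Passing the object to `urllib.request.urlopen` would perform the call. In the browser setting described on the slide, the same header would be set on the AJAX request instead.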
  18. Evaluation
      Compared Epiphany and Open Calais
      Goal: Epiphany is at least as good as Open Calais
      But:
  19. Evaluation
      Compared Epiphany and Open Calais
      Linked Data model: 12,462 pages + RDF graphs by BBC Music Artists
        </music/artists/0383dadf-2a4e-4d10-a46a-e9e041da8eb3#artist> rdf:type mo:MusicGroup ;
            rdf:type mo:MusicArtist ;
            foaf:name "Queen" .
        …
      http://www.bbc.co.uk/music/artists/0383dadf-2a4e-4d10-a46a-e9e041da8eb3
      http://www.bbc.co.uk/music/artists/0383dadf-2a4e-4d10-a46a-e9e041da8eb3.rdf
  20. Evaluation – BBC Corpus
      Compared Epiphany and Open Calais
      To compare the results generated by Open Calais and Epiphany, we had to align Open Calais' results to BBC's vocabulary.
      BBC / Epiphany: mo:SoloMusicArtist, mo:MusicGroup, foaf:name
      Open Calais: oc:Person, oc:MusicGroup, oc:match, oc:name
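One way to picture the alignment is as a rewrite of Open Calais triples into BBC terms. The pairing below (oc:Person to mo:SoloMusicArtist, oc:MusicGroup to mo:MusicGroup, oc:match and oc:name to foaf:name) is read off the side-by-side listing on the slide and is an assumption, as are the names `TYPE_ALIGNMENT`, `NAME_PROPERTIES`, and `align`.

```python
# Assumed pairing of Open Calais terms with BBC / Music Ontology terms.
TYPE_ALIGNMENT = {
    "oc:Person": "mo:SoloMusicArtist",
    "oc:MusicGroup": "mo:MusicGroup",
}
NAME_PROPERTIES = ("oc:name", "oc:match")

def align(triple):
    """Rewrite one (subject, predicate, object) triple into BBC vocabulary."""
    subject, predicate, obj = triple
    if predicate == "rdf:type":
        obj = TYPE_ALIGNMENT.get(obj, obj)
    elif predicate in NAME_PROPERTIES:
        predicate = "foaf:name"
    return (subject, predicate, obj)

calais = [("_:e1", "rdf:type", "oc:MusicGroup"), ("_:e1", "oc:name", "Queen")]
print([align(t) for t in calais])
```

Triples with other predicates or unknown types pass through unchanged, so the alignment never drops data.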
  21. Evaluation
      Named Entities
      How well did Open Calais and Epiphany retrieve instances with known foaf:name values?
      e.g., [] foaf:name "Queen" .
  22. Evaluation
      Solo Music Artists
      How well did they retrieve mo:SoloMusicArtist instances with known foaf:name values?
      e.g., [] foaf:name "Brian May" ;
                a mo:SoloMusicArtist .
  23. Evaluation
      Music Groups
      How well did they retrieve mo:MusicGroup instances with known foaf:name values?
      e.g., [] foaf:name "Queen" ;
                a mo:MusicGroup .
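The result charts for these three questions are not part of this transcript. A common way to score such a retrieval comparison is precision and recall over the known foaf:name values. Whether the authors used exactly these measures is an assumption here, and the names in the usage example are made up for illustration, not taken from the evaluation.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for a set of retrieved names."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall

# Illustrative data only, not evaluation results.
known_names = {"Queen", "Brian May", "Freddie Mercury", "Roger Taylor"}
system_output = {"Queen", "Brian May", "August", "Film"}
print(precision_recall(system_output, known_names))
```

Running the same function on the output of each system, per question, would give directly comparable numbers.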
  24. Discussion
      Disambiguation Problems
  25. Summary
      Final Overview
      Applied to several domains
      • DBpedia
      • SKOS-topic maps
      • Personal Information Models (PIMO)
      • BBC Music Artists (Music Ontology)
      • Amazon (Good Relations)
      At a glance
      Epiphany is an RDFa generator that enriches web pages with information from Linked Data models.
      Demo Version
      http://projects.dfki.uni-kl.de/epiphany/
  26. Future Work
      Next Steps
      • Evaluate Epiphany in other domains
      • Increase precision by adding context analyses
      • Deploy Epiphany for DBpedia content on a scalable server
      • Add provenance information
      • Integrate existing RDFa widgets and visualisations
  27. Thank you for Listening!
      Contributors
      • Jörn Hees, University of Kaiserslautern
      • Benjamin Adrian, DFKI
      • Andreas Dengel, DFKI
      • Michael Sintek, DFKI
      • Ivan Herman, W3C
