• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
Epiphany: Adaptable RDFa Generation Linking the Web of Documents to the Web of Data
 

Epiphany: Adaptable RDFa Generation Linking the Web of Documents to the Web of Data

on

  • 2,480 views

This presentation is about Epiphany, a system that automatically generates RDFa annotated versions of web pages based on information from Linked Data models.

This presentation is about Epiphany, a system that automatically generates RDFa annotated versions of web pages based on information from Linked Data models.

Statistics

Views

Total Views
2,480
Views on SlideShare
2,480
Embed Views
0

Actions

Likes
1
Downloads
29
Comments
0

0 Embeds 0

No embeds

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

CC Attribution-NonCommercial-ShareAlike LicenseCC Attribution-NonCommercial-ShareAlike LicenseCC Attribution-NonCommercial-ShareAlike License

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

    Epiphany: Adaptable RDFa Generation Linking the Web of Documents to the Web of Data Epiphany: Adaptable RDFa Generation Linking the Web of Documents to the Web of Data Presentation Transcript

    • EpiphanyAdaptable RDFa Generation Linking the Web of Documents to the Web of Data
      Benjamin Adrian, JörnHeesIvan Herman, Michael Sintek,Andreas Dengel
    • Outline
      Web of Document vs. Web of Data
      RDFa, the glue combining both worlds
      Use Linked Data for RDFa generation
      Extract RDF from web pages
      Visualize RDFa via Linked Data
      Evaluation and Comparison with Open Calais
      2
    • Web of Documents
      Features
      Distributed textual content
      Addressed by URLs
      Layout in HTML, CSS
      Connected with hyperlinks
      Access via HTTP
      Made for human readers!
      3
      World WideWeb
      WWW
    • Web of Data
      Features
      Distributed data sets
      Addressed by URIs
      Format is RDF
      Connected with RDF Links
      Access via HTTP
      Made for machine readers!
      4
      Linked OpenData
    • Plain HTML
      <BODY>
      <H1>Epiphany</H1><UL> <LI>Ben</LI> <LI>Jörn</LI> <LI>Ivan</LI> <LI>Michael</LI> <LI>Andreas</LI></UL>
      </BODY>
      5
      RDFa – Adding machine-readable data to web documents
      RDF in HTMLattributes
    • HTML + RDFa
      <BODY vocab = “http://xmlns.com/foaf/0.1/”prefix = “dc=http://purl.org/dc/terms/”><H1 about = “#epiphany”property = “dc:title”>Epiphany</H1><UL inv = “member”> <LI typeof = “Person” property = “name”>Ben</LI> <LI typeof = “Person”property = “name”>Jörn</LI> <LI typeof = “Person”property = “name”>Ivan</LI> <LI typeof = “Person”property = “name”>Michael</LI> <LI typeof = “Person”property = “name”>Andreas</LI></UL>
      </BODY>
      6
      RDFa – Adding machine-readable data to web documents
      RDF in HTMLattributes
    • RDF
      @prefix foaf: <http://xmlns.com/foaf/0.1/>@prefix dc: <http://purl.org/dc/terms/>#epiphany dc:title “Epiphany” .[] foaf:name “Ben” ;rdf:typefoaf:Person ;foaf:member #epiphany .[] foaf:name “Jörn” ;rdf:typefoaf:Person ;foaf:member #epiphany .[] foaf:name “Ivan” ;rdf:typefoaf:Person ;foaf:member #epiphany .[] foaf:name “Michael” ;rdf:typefoaf:Person ;foaf:member #epiphany .[] foaf:name “Andreas” ;rdf:typefoaf:Person ;foaf:member #epiphany .
      7
      RDFa – Adding machine-readable data to web documents
      RDF in TURTLE notation
    • A bridge from document to data
      8
      RDFa
      RDFa
      WWW
    • A bridge from document to data
      9
      RDFa
      Benefits
      RDFa is easy to generate by CMS ( i.e., Drupal), and other dynamic content providers.
      It is easy to annotate well-structured data with RDFa
      RDFa
      WWW
      Open Problems
      How to annotate unstructured plain text content?
      How to annotate the same documentsdifferently based on different data bases?
    • Epiphany
      10
      How to consumeLinked Data forRDFa generation
      Generate RDFa-annotated versions of web pages on-the-fly.
      For different Linked Data Models used generate different RDFa annotations.
      Create interactive boxes filled with additional information about annotated resources.
      Linked Data Model
      EpiphanyRDFa Generation
      Linked Data Model
      HTML+RDFa
      HTML
    • Epiphany - Example
      11
      How to consumeLinked Data forRDFa generation
      Tim Burton is a movie maker.
      <span about=“http://dbpedia.org/resource/Tim_Burton”property=“foaf:name”>Tim Burton</span>is a movie maker.
    • Epiphany
      12
      How to extractRDF from web pages
      Ontology-based Information Extraction
      RDFgraph
      Preprocessing
      ExtractionPipeline
      RDFgraphstore
      Cache
      HTML
      RDFaGenerator
      Linked Data Model
      HTML+RDFa
    • Ontology-based Information Extraction
      13
      RDF-based Information Extraction
      Ontology-based Information Extraction Pipeline
      “Ben is member of RDFa WG.”
      Text Normalization
      “Ben”, “is”, “member”, “of”, “RDFa WG”, ”.”
      Text Segmentation
      [] foaf:name “Ben”.
      [] foaf:name “RDFa WG” .
      Symbol Recognition
      <#me> foaf:name “Ben”.
      <#RDFaWG> foaf:name ”RDFa WG.”
      Instance Recognition
      <#me> foaf:member of <#RDFaWG>.
      Contextual Fact Recognition
      <#me> foaf:name “Ben”.
      <#RDFaWG> foaf:name ”RDFa WG.”
      <#me> foaf:member <#RDFaWG>.
      RDF Generation
    • RDFa Generation
      14
      How to create semantic annotations
      requestHTML
      tidy toXHTML
      list of RDF triples with literal object values
      DOM node traversal
      am:Burtonfoaf:name “Tim Burton”.
      am:augustrdfs:label “August”.
      am:autorfoaf:name “Autor”.
      Am:filmfoaf:name “Film”.
    • RDFa Generation
      15
      How to create semantic annotations
      for each text node
      around matches create html:SPANelement with RDFa attributes
      requestHTML
      tidy toXHTML
      list of RDF triples with literal object values
      DOM node traversal
      am:Burtonfoaf:name “Tim Burton”.
      am:augustrdfs:label “August”.
      am:autorfoaf:name “Autor”.
      Am:filmfoaf:name “Film”.
    • RDFa Generation
      16
      How to create semantic annotations
      for each text node
      around matches create html:SPANelement with RDFa attributes
      requestHTML
      tidy toXHTML
      add link to RDF graph to header
      list of RDF triples with literal object values
      DOM node traversal
      add js:onclick listener to elements with RDFa:about attributes
      am:Burtonfoaf:name “Tim Burton”.
      am:augustrdfs:label “August”.
      am:autorfoaf:name “Autor”.
      Am:filmfoaf:name “Film”.
    • RDFa Visualization
      17
      Use RDF togenerateEpiphanies
      Browser
      Epiphany
      Linked Data
      AJAX call toInformation Provider
      onclick event on <SPAN/> elements
      GET /resource/Tim_Burton
      HTTP 1.1 HOST: dbpedia.org
      ACCEPT: RDF
      Render RDF data inHTML as lighting box
    • Evaluation
      18
      ComparedEpiphany and Open Calais
      Goal: Epiphany is at least as good as Open Calais
      But:
    • Evaluation
      19
      ComparedEpiphany and Open Calais
      Linked data model : 12,462 pages + RDF graphs by BBC Music Artists
      </music/artists/0383dadf-2a4e-4d10-a46a-e9e041da8eb3#artist> rdf:typemo:MusicGroup ;
      rdf:typemo:MusicArtist ;
      foaf:name „Queen“ .

      http://www.bbc.co.uk/music/artists/0383dadf-2a4e-4d10-a46a-e9e041da8eb3
      http://www.bbc.co.uk/music/artists/0383dadf-2a4e-4d10-a46a-e9e041da8eb3.rdf
    • Evaluation – BBC Corpus
      20
      ComparedEpiphany and Open Calais
      For comparing results generated by Open Calais and Epiphany,
      we had to align Open Calais’ results to BBC’s vocabulary.
      BBC / Epiphany:
      mo:SoloMusicArtistmo:MusicGroup
      foaf:name
      Open Calais:
      oc:Person
      oc:MusicGroup
      oc:match, oc:name
    • Evaluation
      21
      Named Entities
      How did Open Calais and Epiphany
      retrieve any instances with known foaf:name values?
      e.g., [] foaf:name “Queen”
    • Evaluation
      22
      Solo MusicArtists
      mo:SoloMusicArtistwith known foaf:name values?
      e.g., [] foaf:name “Brian May”;
      a mo:SoloMusicArtist.
    • Evaluation
      23
      MusicGroups
      mo:MusicGroupwith known foaf:name values?
      e.g., [] foaf:name “Queen”;
      a mo:MusicGroup .
    • Discussion
      24
      DisambiguationProblems
    • Summary
      25
      FinalOverview
      Applied to several domains
      • DBpedia
      • SKOS-topic maps
      • Personal Information Models (PIMO)
      • BBC Music Artists (Music Ontology)
      • Amazon (Good Relations)
      At a glance
      Epiphany is an RDFa Generator that enriches Web Pages with Information from Linked Data Models.
      Demo Version
      http://projects.dfki.uni-kl.de/epiphany/
    • Future Work
      • Evaluate Epiphany in other domains
      • Increase Precision by adding context analyses
      • Deploy Epiphany for DBpedia content on a scalable server
      • Add provenance information
      • Integrate existing RDFa widgets and visualisations
      26
      NextSteps
    • Thank you for Listening!
      Contributors
      JörnHees
      University of Kaiserslautern
      Benjamin Adrian
      DFKI
      Andreas Dengel
      DFKI
      Michael Sintek
      DFKI
      Ivan Herman
      W3C
      27