SlideShare a Scribd company logo
1 of 25
Download to read offline
A Virtuous Cycle of
Semantic Enhancement with
    DBpedia Spotlight
    Pablo N. Mendes, Christian Bizer
      pablo.mendes@fu-berlin.de
       Web Based Systems Group
        Freie Universität Berlin


         SemTechBiz Berlin
         February 6th 2011
FREIE UNIVERSITÄT BERLIN
SemTechBiz Berlin, February 2012
                                                                                                                                 http://wbsg.de

                                                       Agenda

     • What do we mean by semantic enhancement?

     • How does DBpedia Spotlight work?

     • From Wikipedia to DBpedia Spotlight

     • From DBpedia Spotlight to Wikipedia

     • In your project
                                   Can you also enable a virtuous cycle?

                    Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
FREIE UNIVERSITÄT BERLIN
SemTechBiz Berlin, February 2012
                                                                                                                                 http://wbsg.de

                         Semantic Enhancement?
     • Generally:
            – Making something easier to understand
     • For humans:
            – Say what you mean (reduce ambiguity)
            – Make associations
            – Access to definitions, background
     • For machines:
            – the same, but in structured format
                    Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
FREIE UNIVERSITÄT BERLIN
SemTechBiz Berlin, February 2012
                                                                                                                                 http://wbsg.de

     Semantic Enhancement (Example)
                                                                                                                 http://nyti.ms/qsYAyt

     News Annotation
           Links to “topics”
           Topic pages lead to related
            content

     Semantic Enhancement
           Links text to unique
            identifiers
           Adds background
            information
           Interconnects related
            content
                    Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
FREIE UNIVERSITÄT BERLIN
SemTechBiz Berlin, February 2012
                                                                                                                                 http://wbsg.de



      A Virtuous Cycle of Semantic Enhancement




                    Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
FREIE UNIVERSITÄT BERLIN
SemTechBiz Berlin, February 2012
                                                                                                                                 http://wbsg.de

                                     DBpedia Spotlight
     • DBpedia is a collection of entity descriptions
       extracted from Wikipedia & shared as linked data

     • DBpedia Spotlight uses data from DBpedia and
       text from associated Wikipedia pages

     • Learns how to recognize that a DBpedia resource
       was mentioned

     • Given plain text as input, generates annotated
       text

                    Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
FREIE UNIVERSITÄT BERLIN
SemTechBiz Berlin, February 2012
                                                                                                                                 http://wbsg.de



        DBpedia Spotlight: Text Annotation
     • From:
              (…) Upon their return, Lennon and McCartney
              went to New York to announce the formation of
              Apple Corps.


     • To:
              (…) Upon their return, Lennon and McCartney
              went to New York to announce the formation of
              Apple Corps.
                                       http://dbpedia.org/resource/New_York_City
               http://dbpedia.org/resource/Apple_Corps

                    Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
FREIE UNIVERSITÄT BERLIN
SemTechBiz Berlin, February 2012
                                                                                                                                 http://wbsg.de

                      Challenge: Term Ambiguity

     • ...this apple on the palm of my hand...
     • ...Apple tried to acquire Palm Inc....
     • ...eating an apple sitting next to a palm tree...

     • What do “apple” and “palm” mean in each case?

     • Our objective is to recognize entities/topics and
       disambiguate their meaning, generating DBpedia
       annotation in text.
                    Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
FREIE UNIVERSITÄT BERLIN
SemTechBiz Berlin, February 2012
                                                                                                                                 http://wbsg.de

                                       Stage 1: Spotting

  • Find substrings that seem worthy of annotation
      Input:
      (…) Upon their return, Lennon and McCartney went to New York
      to announce the formation of Apple Corps.
      Output:
      “Lennon”, “McCartney”, “New York”, “Apple Corps”


  • Simplest approach relies on a dictionary of known
    entity names.
         – Other: Named Entity Recognition, Keyphrase Extraction, ...
                    Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
FREIE UNIVERSITÄT BERLIN
SemTechBiz Berlin, February 2012
                                                                                                                                 http://wbsg.de

                   Stage 2: Candidate Mapping
    • Find possible meanings for each of the
      spotted substrings.
       Input (spotted names):
       “Lennon”, “McCartney”, “New York”, “Apple Corps”
       Output (candidate map):
       “Lennon”: { Lennon_(album),
                   Lennon,_Michigan, … }
       “McCartney”: { McCartney(surname),
                        Paul_McCartney, … }
       “New York”: { New_York_State,
                    New_York_City, … }
       “Apple Corps”: { Apple_Corps }
                    Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
FREIE UNIVERSITÄT BERLIN
SemTechBiz Berlin, February 2012
                                                                                                                                 http://wbsg.de

                          Stage 3: Disambiguation
     • Select the correct candidate DBpedia Resource
       for a given substring.
     • Decision is made based on the context(1) the
       substring was mentioned

        con·text (kntkst)n.
           1. the parts of a discourse that surround a word or
           passage and can throw light on its meaning

                                                    http://mw1.merriam-webster.com/dictionary/context

                    Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
FREIE UNIVERSITÄT BERLIN
SemTechBiz Berlin, February 2012
                                                                                                                                 http://wbsg.de

       Learning the Context for a resource
              (…) Upon their return, Lennon and McCartney went to New
              York to announce the formation of Apple Corps.

   • Collect context for DBpedia Resources from all
     articles in Wikipedia
   e.g. Co-occurrence Statistics
   John_Lennon = {John:981, Beatles:320, McCartney:100, ...}

   • Types of context
          – Wikipedia Pages
          – Definitions from disambiguation pages
          – Paragraphs that link to resources

                    Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
FREIE UNIVERSITÄT BERLIN
SemTechBiz Berlin, February 2012
                                                                                                                                 http://wbsg.de

     DBpedia Spotlight
                                                                       http://spotlight.dbpedia.org/demo




                                                                                  Freely available Web Service;
                                                                                    Open Source, Java/Scala
                                                                                       Apache V2 License.
                    Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
FREIE UNIVERSITÄT BERLIN
SemTechBiz Berlin, February 2012
                                                                                                                                 http://wbsg.de

                                        A Virtuous Cycle




                    Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
FREIE UNIVERSITÄT BERLIN
SemTechBiz Berlin, February 2012
                                                                                                                                 http://wbsg.de

                               The “Suggest” Button
                                                           User decides to add a link




                                                                                Suggest




                    Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
FREIE UNIVERSITÄT BERLIN
SemTechBiz Berlin, February 2012
                                                                                                                                 http://wbsg.de

                               The “Suggest” Button
                                               System suggest targets, user chooses




                                                                                Suggest




                    Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
FREIE UNIVERSITÄT BERLIN
SemTechBiz Berlin, February 2012
                                                                                                                                 http://wbsg.de

     Sztakipedia
                                            http://pedia.sztaki.hu

     • Developed by Mihály Héder et al. at MTA SZTAKI
       (Hungarian Academy of Sciences)
     • Adds a toolbar to Wikipedia that can use DBpedia
       Spotlight to suggest links
            – Also suggests Categories, Infoboxes, Books



     • Helps editors to refine knowledge in Wikipedia
            – More interconnections, more entity types, more
              structured data!

                    Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
FREIE UNIVERSITÄT BERLIN
SemTechBiz Berlin, February 2012
                                                                                                                                 http://wbsg.de

                        Sztakipedia (screenshots)
                                                                                 http://pedia.sztaki.hu




                                                               Source: http://www.youtube.com/watch? v=8VW0TrvXpl4
                    Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
FREIE UNIVERSITÄT BERLIN
SemTechBiz Berlin, February 2012
                                                                                                                                 http://wbsg.de

     Beyond Wikipedia? RDFaCE
                                                             http://rdface.aksw.org/


     • Developed by Ali Khalili at U. Leipzig (AKSW)
     • Helps users to add RDFa markup via a
       WYSIWYG interface
     • Can use DBpedia Spotlight, among other
       services to disambiguate entity names
     • Available as a Wordpress Plugin
            – Enables blogs as sources of context for
              disambiguation

                    Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
FREIE UNIVERSITÄT BERLIN
SemTechBiz Berlin, February 2012
                                                                                                                                 http://wbsg.de

                                                       RDFaCE
                                                                          http://rdface.aksw.org




                    Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
FREIE UNIVERSITÄT BERLIN
SemTechBiz Berlin, February 2012
                                                                                                                                 http://wbsg.de

           A Virtuous Cycle in My Enterprise
  • Select a database of entity identifiers
  • Select textual sources that talk about those
    entities
  • Use semantic enhancement editors (with
    automatic suggestions) to annotate text
  • Use annotated text to re-train annotator

      DBpedia Spotlight is ready for MediaWiki XML and TSV.
           Other formats to come! Take a look at NIF
                                                                                                http://nlp2rdf.org/nif-1-0
                    Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
FREIE UNIVERSITÄT BERLIN
SemTechBiz Berlin, February 2012
                                                                                                                                 http://wbsg.de

          Semantic Enhancement Marketplace
     • User-generated annotations are valuable
       crowdsourced knowledge
     • Can be used as currency:
            – “sweat for web service provision”
            – b2b partnerships


     • Example: RoboTagger.com
            – entity annotation service (in German)
            – entity types are not fixed (crowd-sourced)
            – users rewarded with more access to web service

                    Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
FREIE UNIVERSITÄT BERLIN
SemTechBiz Berlin, February 2012
                                                                                                                                 http://wbsg.de

                                                 Conclusion
     • Unstructured information (text) and
       structured information (e.g. RDF)
            – Mutually dependent and beneficial
     • DBpedia Spotlight sits on the border of two
       worlds:
            – From Wikipedia, an automatic annotator
            – From the auto-annotator, a more interconnected
              Wikipedia
     • Fosters a semantic enhancement ecosystem!
                    Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
FREIE UNIVERSITÄT BERLIN
SemTechBiz Berlin, February 2012
                                                                                                                                 http://wbsg.de

                                                 Thank you!
     On Twitter: @pablomendes
     E-mail: pablo.mendes@fu-berlin.de
     Web: http://pablomendes.com
     http://slideshare.net/pablomendes


     • Special thanks to Mihály Héder and Iavor Jelev
       for many fruitful discussions

     • DBpedia Spotlight is partially funded by LOD2.eu

                           http://spotlight.dbpedia.org
                    Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
FREIE UNIVERSITÄT BERLIN
SemTechBiz Berlin, February 2012
                                                                                                                                 http://wbsg.de

                                                           Links
     • Download
      – DBpedia: http://dbpedia.org/Downloads37/
      – DBpedia Spotlight:
        http://sourceforge.net/projects/dbp-spotlight/
      – RDFaCE
             • http://code.google.com/p/rdface/
             • Wordpress plugin:
               http://wordpress.org/extend/plugins/rdface/




                    Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight

More Related Content

Viewers also liked

Introduction to Probabilistic Latent Semantic Analysis
Introduction to Probabilistic Latent Semantic AnalysisIntroduction to Probabilistic Latent Semantic Analysis
Introduction to Probabilistic Latent Semantic AnalysisNYC Predictive Analytics
 
Syntactic Analysis
Syntactic AnalysisSyntactic Analysis
Syntactic AnalysisAleli Lac
 
Semantics: Seven types of meaning
Semantics: Seven types of meaningSemantics: Seven types of meaning
Semantics: Seven types of meaningMiftadia Laula
 
NERD: Evaluating Named Entity Recognition Tools in the Web of Data
NERD: Evaluating Named Entity Recognition Tools in the Web of DataNERD: Evaluating Named Entity Recognition Tools in the Web of Data
NERD: Evaluating Named Entity Recognition Tools in the Web of DataGiuseppe Rizzo
 
Collaborative Filtering at Spotify
Collaborative Filtering at SpotifyCollaborative Filtering at Spotify
Collaborative Filtering at SpotifyErik Bernhardsson
 

Viewers also liked (6)

Introduction to Probabilistic Latent Semantic Analysis
Introduction to Probabilistic Latent Semantic AnalysisIntroduction to Probabilistic Latent Semantic Analysis
Introduction to Probabilistic Latent Semantic Analysis
 
Syntactic Analysis
Syntactic AnalysisSyntactic Analysis
Syntactic Analysis
 
Semantics: Seven types of meaning
Semantics: Seven types of meaningSemantics: Seven types of meaning
Semantics: Seven types of meaning
 
NERD: Evaluating Named Entity Recognition Tools in the Web of Data
NERD: Evaluating Named Entity Recognition Tools in the Web of DataNERD: Evaluating Named Entity Recognition Tools in the Web of Data
NERD: Evaluating Named Entity Recognition Tools in the Web of Data
 
Semantics
SemanticsSemantics
Semantics
 
Collaborative Filtering at Spotify
Collaborative Filtering at SpotifyCollaborative Filtering at Spotify
Collaborative Filtering at Spotify
 

Similar to A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight - SemTech Berlin 2012

Identifying Topics in Social Media Posts
Identifying Topics in Social Media PostsIdentifying Topics in Social Media Posts
Identifying Topics in Social Media Postshavasmedialabs
 
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...MediaEval2012
 
Communicating Systems Biology - Why and How We Should Do Better in a Digital ...
Communicating Systems Biology - Why and How We Should Do Better in a Digital ...Communicating Systems Biology - Why and How We Should Do Better in a Digital ...
Communicating Systems Biology - Why and How We Should Do Better in a Digital ...Philip Bourne
 
DBpedia - An Interlinking Hub in the Web of Data
DBpedia - An Interlinking Hub in the Web of DataDBpedia - An Interlinking Hub in the Web of Data
DBpedia - An Interlinking Hub in the Web of DataChris Bizer
 
SemTechBiz 2012: Domeo: a web-based tool for semantic annotation of online do...
SemTechBiz 2012: Domeo: a web-based tool for semantic annotation of online do...SemTechBiz 2012: Domeo: a web-based tool for semantic annotation of online do...
SemTechBiz 2012: Domeo: a web-based tool for semantic annotation of online do...Paolo Ciccarese
 
Sw 3 bizer etal-d bpedia-crystallization-point-jws-preprint
Sw 3 bizer etal-d bpedia-crystallization-point-jws-preprintSw 3 bizer etal-d bpedia-crystallization-point-jws-preprint
Sw 3 bizer etal-d bpedia-crystallization-point-jws-preprintokeee
 
Nistep_seminar_introduction 20111208
Nistep_seminar_introduction 20111208Nistep_seminar_introduction 20111208
Nistep_seminar_introduction 20111208Kazuhiro Hayashi
 

Similar to A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight - SemTech Berlin 2012 (7)

Identifying Topics in Social Media Posts
Identifying Topics in Social Media PostsIdentifying Topics in Social Media Posts
Identifying Topics in Social Media Posts
 
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
 
Communicating Systems Biology - Why and How We Should Do Better in a Digital ...
Communicating Systems Biology - Why and How We Should Do Better in a Digital ...Communicating Systems Biology - Why and How We Should Do Better in a Digital ...
Communicating Systems Biology - Why and How We Should Do Better in a Digital ...
 
DBpedia - An Interlinking Hub in the Web of Data
DBpedia - An Interlinking Hub in the Web of DataDBpedia - An Interlinking Hub in the Web of Data
DBpedia - An Interlinking Hub in the Web of Data
 
SemTechBiz 2012: Domeo: a web-based tool for semantic annotation of online do...
SemTechBiz 2012: Domeo: a web-based tool for semantic annotation of online do...SemTechBiz 2012: Domeo: a web-based tool for semantic annotation of online do...
SemTechBiz 2012: Domeo: a web-based tool for semantic annotation of online do...
 
Sw 3 bizer etal-d bpedia-crystallization-point-jws-preprint
Sw 3 bizer etal-d bpedia-crystallization-point-jws-preprintSw 3 bizer etal-d bpedia-crystallization-point-jws-preprint
Sw 3 bizer etal-d bpedia-crystallization-point-jws-preprint
 
Nistep_seminar_introduction 20111208
Nistep_seminar_introduction 20111208Nistep_seminar_introduction 20111208
Nistep_seminar_introduction 20111208
 

More from Pablo Mendes

Entity Aware Click Graph
Entity Aware Click GraphEntity Aware Click Graph
Entity Aware Click GraphPablo Mendes
 
WWW2012 Tutorial Visualizing SPARQL Queries
WWW2012 Tutorial Visualizing SPARQL QueriesWWW2012 Tutorial Visualizing SPARQL Queries
WWW2012 Tutorial Visualizing SPARQL QueriesPablo Mendes
 
Sieve - Data Quality and Fusion - LWDM2012
Sieve - Data Quality and Fusion - LWDM2012Sieve - Data Quality and Fusion - LWDM2012
Sieve - Data Quality and Fusion - LWDM2012Pablo Mendes
 
Ligado nos Políticos at ESWC'2011 Workshop
Ligado nos Políticos at ESWC'2011 WorkshopLigado nos Políticos at ESWC'2011 Workshop
Ligado nos Políticos at ESWC'2011 WorkshopPablo Mendes
 
SMWCon Fall 2011 Lightning Talk
SMWCon Fall 2011 Lightning TalkSMWCon Fall 2011 Lightning Talk
SMWCon Fall 2011 Lightning TalkPablo Mendes
 
Dados Ligados (Linked Data) CONSEGI 2011
Dados Ligados (Linked Data) CONSEGI 2011Dados Ligados (Linked Data) CONSEGI 2011
Dados Ligados (Linked Data) CONSEGI 2011Pablo Mendes
 
Cuebee Architecture
Cuebee ArchitectureCuebee Architecture
Cuebee ArchitecturePablo Mendes
 
Twarql Architecture - Streaming Annotated Tweets
Twarql Architecture - Streaming Annotated TweetsTwarql Architecture - Streaming Annotated Tweets
Twarql Architecture - Streaming Annotated TweetsPablo Mendes
 
Dynamic Associative Relationships on the Linked Open Data Web
Dynamic Associative Relationships on the Linked Open Data WebDynamic Associative Relationships on the Linked Open Data Web
Dynamic Associative Relationships on the Linked Open Data WebPablo Mendes
 

More from Pablo Mendes (9)

Entity Aware Click Graph
Entity Aware Click GraphEntity Aware Click Graph
Entity Aware Click Graph
 
WWW2012 Tutorial Visualizing SPARQL Queries
WWW2012 Tutorial Visualizing SPARQL QueriesWWW2012 Tutorial Visualizing SPARQL Queries
WWW2012 Tutorial Visualizing SPARQL Queries
 
Sieve - Data Quality and Fusion - LWDM2012
Sieve - Data Quality and Fusion - LWDM2012Sieve - Data Quality and Fusion - LWDM2012
Sieve - Data Quality and Fusion - LWDM2012
 
Ligado nos Políticos at ESWC'2011 Workshop
Ligado nos Políticos at ESWC'2011 WorkshopLigado nos Políticos at ESWC'2011 Workshop
Ligado nos Políticos at ESWC'2011 Workshop
 
SMWCon Fall 2011 Lightning Talk
SMWCon Fall 2011 Lightning TalkSMWCon Fall 2011 Lightning Talk
SMWCon Fall 2011 Lightning Talk
 
Dados Ligados (Linked Data) CONSEGI 2011
Dados Ligados (Linked Data) CONSEGI 2011Dados Ligados (Linked Data) CONSEGI 2011
Dados Ligados (Linked Data) CONSEGI 2011
 
Cuebee Architecture
Cuebee ArchitectureCuebee Architecture
Cuebee Architecture
 
Twarql Architecture - Streaming Annotated Tweets
Twarql Architecture - Streaming Annotated TweetsTwarql Architecture - Streaming Annotated Tweets
Twarql Architecture - Streaming Annotated Tweets
 
Dynamic Associative Relationships on the Linked Open Data Web
Dynamic Associative Relationships on the Linked Open Data WebDynamic Associative Relationships on the Linked Open Data Web
Dynamic Associative Relationships on the Linked Open Data Web
 

Recently uploaded

sample sample sample sample sample sample
sample sample sample sample sample samplesample sample sample sample sample sample
sample sample sample sample sample sampleCasey Keith
 
Jalpaiguri Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Jalpaiguri Call Girls 🥰 8617370543 Service Offer VIP Hot ModelJalpaiguri Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Jalpaiguri Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Prayagraj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Prayagraj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelPrayagraj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Prayagraj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Sample sample sample sample sample sample
Sample sample sample sample sample sampleSample sample sample sample sample sample
Sample sample sample sample sample sampleCasey Keith
 
TOURISM ATTRACTION IN LESOTHO 2024.pptx.
TOURISM ATTRACTION IN LESOTHO 2024.pptx.TOURISM ATTRACTION IN LESOTHO 2024.pptx.
TOURISM ATTRACTION IN LESOTHO 2024.pptx.lihabaneo
 
abortion pills in Riyadh+966572737505 Cytotec Riyadh
abortion pills in  Riyadh+966572737505    Cytotec Riyadhabortion pills in  Riyadh+966572737505    Cytotec Riyadh
abortion pills in Riyadh+966572737505 Cytotec Riyadhsamsungultra782445
 
TOURISM ATTRACTION IN LESOTHO 2024.Pptx.
TOURISM ATTRACTION IN LESOTHO 2024.Pptx.TOURISM ATTRACTION IN LESOTHO 2024.Pptx.
TOURISM ATTRACTION IN LESOTHO 2024.Pptx.lihabaneo
 
Top places to visit, top tourist destinations
Top places to visit, top tourist destinationsTop places to visit, top tourist destinations
Top places to visit, top tourist destinationsswarajdm34
 
Rudrapur Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Rudrapur Call Girls 🥰 8617370543 Service Offer VIP Hot ModelRudrapur Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Rudrapur Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
sample sample sample sample sample sample
sample sample sample sample sample samplesample sample sample sample sample sample
sample sample sample sample sample sampleCasey Keith
 
Suri Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Suri Call Girls 🥰 8617370543 Service Offer VIP Hot ModelSuri Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Suri Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
bhachau Escort💋 Call Girl (Ramya) Service #bhachau Call Girl @Independent Girls
bhachau Escort💋 Call Girl (Ramya) Service #bhachau Call Girl @Independent Girlsbhachau Escort💋 Call Girl (Ramya) Service #bhachau Call Girl @Independent Girls
bhachau Escort💋 Call Girl (Ramya) Service #bhachau Call Girl @Independent Girlsmountabuangels4u
 
Ooty Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Ooty Call Girls 🥰 8617370543 Service Offer VIP Hot ModelOoty Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Ooty Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
ITALY - Visa Options for expats and digital nomads
ITALY - Visa Options for expats and digital nomadsITALY - Visa Options for expats and digital nomads
ITALY - Visa Options for expats and digital nomadsMarco Mazzeschi
 
Sun World Bana Hills, Vienam Part 2 (越南 巴拿山太陽世界 下集).ppsx
Sun World Bana Hills, Vienam Part 2  (越南 巴拿山太陽世界 下集).ppsxSun World Bana Hills, Vienam Part 2  (越南 巴拿山太陽世界 下集).ppsx
Sun World Bana Hills, Vienam Part 2 (越南 巴拿山太陽世界 下集).ppsxChung Yen Chang
 
Purba Bardhaman Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Purba Bardhaman Call Girls 🥰 8617370543 Service Offer VIP Hot ModelPurba Bardhaman Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Purba Bardhaman Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
sample sample sample sample sample sample
sample sample sample sample sample samplesample sample sample sample sample sample
sample sample sample sample sample sampleCasey Keith
 
Abortion pills in Jeddah +966572737505 <> buy cytotec <> unwanted kit Saudi A...
Abortion pills in Jeddah +966572737505 <> buy cytotec <> unwanted kit Saudi A...Abortion pills in Jeddah +966572737505 <> buy cytotec <> unwanted kit Saudi A...
Abortion pills in Jeddah +966572737505 <> buy cytotec <> unwanted kit Saudi A...samsungultra782445
 
Morbi Escort💋 Call Girl (Komal) Service #Morbi Call Girl @Independent Girls
Morbi Escort💋 Call Girl (Komal) Service #Morbi Call Girl @Independent GirlsMorbi Escort💋 Call Girl (Komal) Service #Morbi Call Girl @Independent Girls
Morbi Escort💋 Call Girl (Komal) Service #Morbi Call Girl @Independent Girlsmountabuangels4u
 

Recently uploaded (20)

sample sample sample sample sample sample
sample sample sample sample sample samplesample sample sample sample sample sample
sample sample sample sample sample sample
 
Discover Mathura And Vrindavan A Spritual Journey.pdf
Discover Mathura And Vrindavan A Spritual Journey.pdfDiscover Mathura And Vrindavan A Spritual Journey.pdf
Discover Mathura And Vrindavan A Spritual Journey.pdf
 
Jalpaiguri Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Jalpaiguri Call Girls 🥰 8617370543 Service Offer VIP Hot ModelJalpaiguri Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Jalpaiguri Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Prayagraj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Prayagraj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelPrayagraj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Prayagraj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Sample sample sample sample sample sample
Sample sample sample sample sample sampleSample sample sample sample sample sample
Sample sample sample sample sample sample
 
TOURISM ATTRACTION IN LESOTHO 2024.pptx.
TOURISM ATTRACTION IN LESOTHO 2024.pptx.TOURISM ATTRACTION IN LESOTHO 2024.pptx.
TOURISM ATTRACTION IN LESOTHO 2024.pptx.
 
abortion pills in Riyadh+966572737505 Cytotec Riyadh
abortion pills in  Riyadh+966572737505    Cytotec Riyadhabortion pills in  Riyadh+966572737505    Cytotec Riyadh
abortion pills in Riyadh+966572737505 Cytotec Riyadh
 
TOURISM ATTRACTION IN LESOTHO 2024.Pptx.
TOURISM ATTRACTION IN LESOTHO 2024.Pptx.TOURISM ATTRACTION IN LESOTHO 2024.Pptx.
TOURISM ATTRACTION IN LESOTHO 2024.Pptx.
 
Top places to visit, top tourist destinations
Top places to visit, top tourist destinationsTop places to visit, top tourist destinations
Top places to visit, top tourist destinations
 
Rudrapur Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Rudrapur Call Girls 🥰 8617370543 Service Offer VIP Hot ModelRudrapur Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Rudrapur Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
sample sample sample sample sample sample
sample sample sample sample sample samplesample sample sample sample sample sample
sample sample sample sample sample sample
 
Suri Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Suri Call Girls 🥰 8617370543 Service Offer VIP Hot ModelSuri Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Suri Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
bhachau Escort💋 Call Girl (Ramya) Service #bhachau Call Girl @Independent Girls
bhachau Escort💋 Call Girl (Ramya) Service #bhachau Call Girl @Independent Girlsbhachau Escort💋 Call Girl (Ramya) Service #bhachau Call Girl @Independent Girls
bhachau Escort💋 Call Girl (Ramya) Service #bhachau Call Girl @Independent Girls
 
Ooty Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Ooty Call Girls 🥰 8617370543 Service Offer VIP Hot ModelOoty Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Ooty Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
ITALY - Visa Options for expats and digital nomads
ITALY - Visa Options for expats and digital nomadsITALY - Visa Options for expats and digital nomads
ITALY - Visa Options for expats and digital nomads
 
Sun World Bana Hills, Vienam Part 2 (越南 巴拿山太陽世界 下集).ppsx
Sun World Bana Hills, Vienam Part 2  (越南 巴拿山太陽世界 下集).ppsxSun World Bana Hills, Vienam Part 2  (越南 巴拿山太陽世界 下集).ppsx
Sun World Bana Hills, Vienam Part 2 (越南 巴拿山太陽世界 下集).ppsx
 
Purba Bardhaman Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Purba Bardhaman Call Girls 🥰 8617370543 Service Offer VIP Hot ModelPurba Bardhaman Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Purba Bardhaman Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
sample sample sample sample sample sample
sample sample sample sample sample samplesample sample sample sample sample sample
sample sample sample sample sample sample
 
Abortion pills in Jeddah +966572737505 <> buy cytotec <> unwanted kit Saudi A...
Abortion pills in Jeddah +966572737505 <> buy cytotec <> unwanted kit Saudi A...Abortion pills in Jeddah +966572737505 <> buy cytotec <> unwanted kit Saudi A...
Abortion pills in Jeddah +966572737505 <> buy cytotec <> unwanted kit Saudi A...
 
Morbi Escort💋 Call Girl (Komal) Service #Morbi Call Girl @Independent Girls
Morbi Escort💋 Call Girl (Komal) Service #Morbi Call Girl @Independent GirlsMorbi Escort💋 Call Girl (Komal) Service #Morbi Call Girl @Independent Girls
Morbi Escort💋 Call Girl (Komal) Service #Morbi Call Girl @Independent Girls
 

A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight - SemTech Berlin 2012

  • 1. A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight Pablo N. Mendes, Christian Bizer pablo.mendes@fu-berlin.de Web Based Systems Group Freie Universität Berlin SemTechBiz Berlin February 6th 2011
  • 2. FREIE UNIVERSITÄT BERLIN SemTechBiz Berlin, February 2012 http://wbsg.de Agenda • What do we mean by semantic enhancement? • How does DBpedia Spotlight work? • From Wikipedia to DBpedia Spotlight • From DBpedia Spotlight to Wikipedia • In your project Can you also enable a virtuous cycle? Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
  • 3. FREIE UNIVERSITÄT BERLIN SemTechBiz Berlin, February 2012 http://wbsg.de Semantic Enhancement? • Generally: – Making something easier to understand • For humans: – Say what you mean (reduce ambiguity) – Make associations – Access to definitions, background • For machines: – the same, but in structured format Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
  • 4. FREIE UNIVERSITÄT BERLIN SemTechBiz Berlin, February 2012 http://wbsg.de Semantic Enhancement (Example) http://nyti.ms/qsYAyt  News Annotation  Links to “topics”  Topic pages lead to related content  Semantic Enhancement  Links text to unique identifiers  Adds background information  Interconnects related content Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
  • 5. FREIE UNIVERSITÄT BERLIN SemTechBiz Berlin, February 2012 http://wbsg.de A Virtuous Cycle of Semantic Enhancement Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
  • 6. FREIE UNIVERSITÄT BERLIN SemTechBiz Berlin, February 2012 http://wbsg.de DBpedia Spotlight • DBpedia is a collection of entity descriptions extracted from Wikipedia & shared as linked data • DBpedia Spotlight uses data from DBpedia and text from associated Wikipedia pages • Learns how to recognize that a DBpedia resource was mentioned • Given plain text as input, generates annotated text Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
  • 7. FREIE UNIVERSITÄT BERLIN SemTechBiz Berlin, February 2012 http://wbsg.de DBpedia Spotlight: Text Annotation • From: (…) Upon their return, Lennon and McCartney went to New York to announce the formation of Apple Corps. • To: (…) Upon their return, Lennon and McCartney went to New York to announce the formation of Apple Corps. http://dbpedia.org/resource/New_York_City http://dbpedia.org/resource/Apple_Corps Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
  • 8. FREIE UNIVERSITÄT BERLIN SemTechBiz Berlin, February 2012 http://wbsg.de Challenge: Term Ambiguity • ...this apple on the palm of my hand... • ...Apple tried to acquire Palm Inc.... • ...eating an apple sitting next to a palm tree... • What do “apple” and “palm” mean in each case? • Our objective is to recognize entities/topics and disambiguate their meaning, generating DBpedia annotation in text. Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
  • 9. FREIE UNIVERSITÄT BERLIN SemTechBiz Berlin, February 2012 http://wbsg.de Stage 1: Spotting • Find substrings that seem worthy of annotation Input: (…) Upon their return, Lennon and McCartney went to New York to announce the formation of Apple Corps. Output: “Lennon”, “McCartney”, “New York”, “Apple Corps” • Simplest approach relies on a dictionary of known entity names. – Other: Named Entity Recognition, Keyphrase Extraction, ... Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
  • 10. FREIE UNIVERSITÄT BERLIN SemTechBiz Berlin, February 2012 http://wbsg.de Stage 2: Candidate Mapping • Find possible meanings for each of the spotted substrings. Input (spotted names): “Lennon”, “McCartney”, “New York”, “Apple Corps” Output (candidate map): “Lennon”: { Lennon_(album), Lennon,_Michigan, … } “McCartney”: { McCartney(surname), Paul_McCartney, … } “New York”: { New_York_State, New_York_City, … } “Apple Corps”: { Apple_Corps } Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
  • 11. FREIE UNIVERSITÄT BERLIN SemTechBiz Berlin, February 2012 http://wbsg.de Stage 3: Disambiguation • Select the correct candidate DBpedia Resource for a given substring. • Decision is made based on the context(1) the substring was mentioned con·text (kntkst)n. 1. the parts of a discourse that surround a word or passage and can throw light on its meaning http://mw1.merriam-webster.com/dictionary/context Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
  • 12. FREIE UNIVERSITÄT BERLIN SemTechBiz Berlin, February 2012 http://wbsg.de Learning the Context for a resource (…) Upon their return, Lennon and McCartney went to New York to announce the formation of Apple Corps. • Collect context for DBpedia Resources from all articles in Wikipedia e.g. Co-occurrence Statistics John_Lennon = {John:981, Beatles:320, McCartney:100, ...} • Types of context – Wikipedia Pages – Definitions from disambiguation pages – Paragraphs that link to resources Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
  • 13. FREIE UNIVERSITÄT BERLIN SemTechBiz Berlin, February 2012 http://wbsg.de DBpedia Spotlight http://spotlight.dbpedia.org/demo Freely available Web Service; Open Source, Java/Scala Apache V2 License. Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
  • 14. FREIE UNIVERSITÄT BERLIN SemTechBiz Berlin, February 2012 http://wbsg.de A Virtuous Cycle Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
  • 15. FREIE UNIVERSITÄT BERLIN SemTechBiz Berlin, February 2012 http://wbsg.de The “Suggest” Button User decides to add a link Suggest Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
  • 16. FREIE UNIVERSITÄT BERLIN SemTechBiz Berlin, February 2012 http://wbsg.de The “Suggest” Button System suggest targets, user chooses Suggest Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
  • 17. FREIE UNIVERSITÄT BERLIN SemTechBiz Berlin, February 2012 http://wbsg.de Sztakipedia http://pedia.sztaki.hu • Developed by Mihály Héder et al. at MTA SZTAKI (Hungarian Academy of Sciences) • Adds a toolbar to Wikipedia that can use DBpedia Spotlight to suggest links – Also suggests Categories, Infoboxes, Books • Helps editors to refine knowledge in Wikipedia – More interconnections, more entity types, more structured data! Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
  • 18. FREIE UNIVERSITÄT BERLIN SemTechBiz Berlin, February 2012 http://wbsg.de Sztakipedia (screenshots) http://pedia.sztaki.hu Source: http://www.youtube.com/watch? v=8VW0TrvXpl4 Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
  • 19. FREIE UNIVERSITÄT BERLIN SemTechBiz Berlin, February 2012 http://wbsg.de Beyond Wikipedia? RDFaCE http://rdface.aksw.org/ • Developed by Ali Khalili at U. Leipzig (AKSW) • Helps users to add RDFa markup via a WYSIWYG interface • Can use DBpedia Spotlight, among other services to disambiguate entity names • Available as a Wordpress Plugin – Enables blogs as sources of context for disambiguation Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
  • 20. FREIE UNIVERSITÄT BERLIN SemTechBiz Berlin, February 2012 http://wbsg.de RDFaCE http://rdface.aksw.org Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
  • 21. FREIE UNIVERSITÄT BERLIN SemTechBiz Berlin, February 2012 http://wbsg.de A Virtuous Cycle in My Enterprise • Select a database of entity identifiers • Select textual sources that talk about those entities • Use semantic enhancement editors (with automatic suggestions) to annotate text • Use annotated text to re-train annotator DBpedia Spotlight is ready for MediaWiki XML and TSV. Other formats to come! Take a look at NIF http://nlp2rdf.org/nif-1-0 Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
  • 22. FREIE UNIVERSITÄT BERLIN SemTechBiz Berlin, February 2012 http://wbsg.de Semantic Enhancement Marketplace • User-generated annotations are valuable crowdsourced knowledge • Can be used as currency: – “sweat for web service provision” – b2b partnerships • Example: RoboTagger.com – entity annotation service (in German) – entity types are not fixed (crowd-sourced) – users rewarded with more access to web service Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
  • 23. FREIE UNIVERSITÄT BERLIN SemTechBiz Berlin, February 2012 http://wbsg.de Conclusion • Unstructured information (text) and structured information (e.g. RDF) – Mutually dependent and beneficial • DBpedia Spotlight sits on the border of two worlds: – From Wikipedia, an automatic annotator – From the auto-annotator, a more interconnected Wikipedia • Fosters a semantic enhancement ecosystem! Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
  • 24. FREIE UNIVERSITÄT BERLIN SemTechBiz Berlin, February 2012 http://wbsg.de Thank you! On Twitter: @pablomendes E-mail: pablo.mendes@fu-berlin.de Web: http://pablomendes.com http://slideshare.net/pablomendes • Special thanks to Mihály Héder and Iavor Jelev for many fruitful discussions • DBpedia Spotlight is partially funded by LOD2.eu http://spotlight.dbpedia.org Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight
  • 25. FREIE UNIVERSITÄT BERLIN SemTechBiz Berlin, February 2012 http://wbsg.de Links • Download – DBpedia: http://dbpedia.org/Downloads37/ – DBpedia Spotlight: http://sourceforge.net/projects/dbp-spotlight/ – RDFaCE • http://code.google.com/p/rdface/ • Wordpress plugin: http://wordpress.org/extend/plugins/rdface/ Pablo N. Mendes, Christian Bizer: A Virtuous Cycle of Semantic Enhancement with DBpedia Spotlight