NERD meets NIF:  Lifting NLP Extraction Results to the Linked Data Cloud
Upcoming SlideShare
Loading in...5
×
 

NERD meets NIF: Lifting NLP Extraction Results to the Linked Data Cloud

on

  • 1,304 views

Talk "NERD meets NIF: Lifting NLP Extraction Results to the Linked Data Cloud" event during LDOW'12 (WWW'12), Lyon, France

Talk "NERD meets NIF: Lifting NLP Extraction Results to the Linked Data Cloud" event during LDOW'12 (WWW'12), Lyon, France

Statistics

Views

Total Views
1,304
Views on SlideShare
1,301
Embed Views
3

Actions

Likes
3
Downloads
19
Comments
0

1 Embed 3

http://www.slashdocs.com 3

Accessibility

Upload Details

Uploaded via as Adobe PDF

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

NERD meets NIF:  Lifting NLP Extraction Results to the Linked Data Cloud NERD meets NIF: Lifting NLP Extraction Results to the Linked Data Cloud Presentation Transcript

  • s NERD meets NIF:Lifting NLP Extraction Results to the Linked Data Cloud Giuseppe Rizzo and Raphaël Troncy EURECOM, France Sebastian Hellmann and Martin Bruemmer Universität Leipzig, Germany
  • What is a Named Entity recognition task?A task that aims to locate and classify the name of a person or anorganization, a location, a brand, a product, a numeric expressionincluding time, date, money and percent in a textual document 16/04/2012 5th Workshop on Linked Data on the Web (LDOW2012) 2/15
  • NER tools Standalone software GATE Stanford CoreNLP Temis Web APIs16/04/2012 5th Workshop on Linked Data on the Web (LDOW2012) 3/15 View slide
  • Factual comparison of 10 Web NER tools Alchemy DBpedia Evri Extractiv Lupedia Open Saplo Wikimeta Yahoo! Zemanta API Spotlight CalaisLanguage EN,FR, EN EN, EN EN,FR, EN,FR EN, EN,FR EN EN GR,IT, GR* IT IT SP SW SP PT,RU, PT* SP,SW SP*Granularity OEN OEN OED OEN OEN OEN OED OEN OEN OEDEntity N/A char N/A word range of char N/A POS range N/Aposition offset offset chars offset offset of charsClassification Alchemy DBpedia Evri DBpedia DBpedia Open N/A ESTER Yahoo FreeBaseschema FreeBase LinkedM Calais Scema.or DB gNumber of 324 320 5 34 319 95 5 7 13 81classesResponse JSON HTML HTM HTML HTML JSON JSON JSON JSON XMLFormat MicroF JSON L JSON JSON MicroF XML XML JSON XML RDF JSO RDF RDFa ormat RDF RDF XML N XML XML RDFQuota 30000 unl 3000 3000 unl 50000 1333 unl 5000 10000(calls/day) 16/04/2012 5th Workshop on Linked Data on the Web (LDOW2012) 4/15 View slide
  • What is NERD? ontology1 REST API2 UI3 The NERD ontology has been integrated in the NIF project, a EU FP7 in the context of the LOD2: Creating Knowledge out of Interlinked Data1 http://nerd.eurecom.fr/ontology2 http://nerd.eurecom.fr/api/application.wadl3 http://nerd.eurecom.fr 16/04/2012 5th Workshop on Linked Data on the Web (LDOW2012) 5/15
  • NERD Ontology Aligned the taxonomies used by the extractors16/04/2012 5th Workshop on Linked Data on the Web (LDOW2012) 6/15
  • NERD type OccurrenceBuilding the NERD Ontology Person 10 Organization 10 Country 6 Company 6 Location 6 Continent 5 City 5 RadioStation 5 Album 5 Product 5 ... ...16/04/2012 5th Workshop on Linked Data on the Web (LDOW2012) 7/15
  • Ontology alignment validation 5 TED talks 1000 NYT news articles 217WWW2011 abstracts 16/04/2012 5th Workshop on Linked Data on the Web (LDOW2012) 8/15
  • Integration  Different outputs for the NLP tools (Standalone and Web APIs) OpenCalais DBpedia Spotlight "_type": "Organization", "@URI": "http://dbpedia.org/resource/DBpedia", “name": "North Atlantic Treaty Organization", "@types": "DBpedia:Software,DBpedia:Work” "organizationtype": "governmental civilian", "@surfaceForm": "dbpedia", "nationality": "N/A", "@offset": "0", "_typeReference": "@support": "11", http://s.opencalais.com/1/type/em/e/Organization", "@similarityScore": "0.2387271374464035", ... …  For integration or reuse manual effort is needed  time consuming  difficult to track definitions  NERD creates a sharable JSON/RDF annotation output16/04/2012 5th Workshop on Linked Data on the Web (LDOW2012) 9/15
  • NERD REST API “entities” : [{ “entity”: “W3C” , “type”: “Organization” , “uri”: "http://dbpedia.org/page/W3C", JSON “nerdType”: "http://nerd.eurecom.fr/ontology#Organization", “startChar”: 30, “endChar”: 32,/document/{idDocument} “confidence”: 1,/user/{idUser} GET, “relevance”: 0.5 POST, }]/annotation/{extractor}/extraction/{idExtraction} PUT,/evaluation DELETE... RDF 16/04/2012 5th Workshop on Linked Data on the Web (LDOW2012) 10/15
  • Textual annotation Lets consider the URI: http://www.w3.org/DesignIssues/LinkedData.html The Semantic Web isnt just about putting data on the web. It is about making links, so that a person or machine can explore the web of data. With linked data, when you have some of it, you can find other, related, data.…. All the above plus, Use open standards from W3C (RDF and SPARQL) to identify things, so that people can point at your stuff ... entities: { … [entity: W3C, startChar: 23107, endChar: 23110], … }16/04/2012 5th Workshop on Linked Data on the Web (LDOW2012) 11/15
  • NERD meets NIF Model documents through a set of strings deferencable within the Web : offset_23107_ 23110 a str:String ; str:referenceContext :offset_0_26546 . Map string to entity : offset_23107_ 23110 sso:oen dbpedia:W3C . Classification dbpedia:W3C rdf:type nerd:Organization .16/04/2012 5th Workshop on Linked Data on the Web (LDOW2012) 12/15
  • NERD User Interface16/04/2012 5th Workshop on Linked Data on the Web (LDOW2012) 13/15
  • Conclusions and perspectivesNERD UI and REST API unified interface for extracting NEs from various type of textsNERD ontology common schema for entity classificationNERD & NIF lift the extraction annotation results to the LOD cloudSystematic comparison for the NE extraction andclassification tasks: ETAPE corpus CoNLL 2003 corpusCombining several extractions to improve the strengthsof a single tool16/04/2012 5th Workshop on Linked Data on the Web (LDOW2012) 14/15
  • Thanks for your time and your attention http://nerd.eurecom.fr @giusepperizzo @rtroncy #nerd http://www.slideshare.net/giusepperizzo16/04/2012 5th Workshop on Linked Data on the Web (LDOW2012) 15/15