• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
NIF 2.0 draft for Pisa
 

NIF 2.0 draft for Pisa

on

  • 583 views

These slides accompagny a slidecast for the ISO Meeting for development of web service exchange protocol meeting in Pisa

These slides accompagny a slidecast for the ISO Meeting for development of web service exchange protocol meeting in Pisa

Statistics

Views

Total Views
583
Views on SlideShare
583
Embed Views
0

Actions

Likes
1
Downloads
9
Comments
0

0 Embeds 0

No embeds

Accessibility

Categories

Upload Details

Uploaded via as Adobe PDF

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

    NIF 2.0 draft for Pisa NIF 2.0 draft for Pisa Presentation Transcript

    • Creating Knowledge out of Interlinked Data Pisa – 2012/10/05 – Page 1 http://lod2.eu NIF 2.0 Draft http://slideshare.net/kurzum http://nlp2rdf.org http://lod2.eu Sebastian Hellmann AKSW, Universität LeipzigLOD2 Presentation . 02.09.2010 . Page http://lod2.eu
    • Pisa – 2012/10/05 – Page 2 http://lod2.eu IntroductionThe NLP Interchange Format (NIF) is an RDF/OWL-based formatthat aims to achieve interoperability between Natural LanguageProcessing (NLP) tools, language resources and annotations. • Version 1.0 published in November 2011 • Version 2.0 is scheduled for completion within 2012
    • Pisa – 2012/10/05 – Page 3 http://lod2.eu NIF Introduction
    • Pisa – 2012/10/05 – Page 4 http://lod2.eu Addressing primary data● file://path/on/my/local/drive/log.txt● http://www.w3.org/DesignIssues/LinkedData.html● “We the People of the United States, in Order to form a more perfectUnion, ...”NIF: use a document URI and add “#offset_717_729” to address asubstring of the text from index 717 to 729- file://path/on/my/local/drive/log.txt#offset_717_729- http://www.w3.org/DesignIssues/LinkedData.html#offset_717_729- urn:content-item-sha1-1bf4330e5a4a707f381513b2e7#offset_717_729
    • Pisa – 2012/10/05 – Page 5 http://lod2.eu Example
    • Pisa – 2012/10/05 – Page 6 http://lod2.eu Normalizing TextNIF provides URIs for Unicode Characters using UnicodeNormalization Form C counted in Code Units.For all NIF URIs, the universe of discourse will then be the wordsover the alphabet of Unicode characters (sometimes called Σ ∗ ).These URIs can become subjects in RDF triples:Structural interoperability:- based on RDF- defines how text is treated and counted- compatible with RFC 5147 “#char=717,729”
    • Pisa – 2012/10/05 – Page 7 http://lod2.eu As a Web Service
    • Pisa – 2012/10/05 – Page 8 http://lod2.eu NIF Combinatorhttp://nlp2rdf.lod2.eu/demo.php
    • Pisa – 2012/10/05 – Page 9 http://lod2.eu NIF Combinator
    • Pisa – 2012/10/05 – Page 10 http://lod2.eu Conceptual InteroperabilityNIF can be extended by Vocabulary ModulesOliAhttp://purl.org/olia
    • Pisa – 2012/10/05 – Page 11 http://lod2.eu Conceptual InteroperabilityNIF can be extended by Vocabulary ModulesApache Stanbolhttp://stanbol.apache.org/
    • Pisa – 2012/10/05 – Page 12 http://lod2.eu Scalabilityhttps://bitbucket.org/srfgkmt/stanbol-nlp
    • Pisa – 2012/10/05 – Page 13 http://lod2.eu Scalability- Less problematic, if only used as exchange format- RDF is flexible and good for data integration, not fast- NIF is very compact (1-3 triples per annotation)- Inference possible, but optional- Other formats add overhead as well (e.g. SOAP-XML)- NIF Web services are RESTful- JSON-LD might be the best option for serialization
    • Pisa – 2012/10/05 – Page 14 http://lod2.eu Scalability
    • Pisa – 2012/10/05 – Page 15 http://lod2.eu NIF 2.0 - plans• NIF 2.0 tries to be compatible to (Vocabulary Module): • FISE used in Apache Stanbol (IKS-EU Project) • LAF/GrAF XML – ISO standard, recently published • Fragment Identifiers by IETF and W3C • Lemon ontology from Monnet EU Project • NERD ontology from EURECOM and LinkedTV EU Project • Xpointer/XPath URI scheme
    • Pisa – 2012/10/05 – Page 16 http://lod2.eu Impact NIFImpact: • Around 600 feedback items or events (email requests, presentation Q&A, personal questions, 70 people on the mailing list) • Five known 3rd party implementations (one for GATE JAPE) • Over 1 million requests per month on the demo web services • Projects that have announced interest / are working on a NIF wrapper: LODifier, Apache Stanbol, LAPPS (NSF project), Tipalo/Fred, DKPro (UIMA instantiation), ITS 2.0 test suites
    • Pisa – 2012/10/05 – Page 17 http://lod2.eu Impact NIFNIF will likely be the recommended RDF conversion of theInternationalisation Tagset 2.0 W3C standard (ITS 2.0) -http://www.w3.org/TR/its20/
    • Pisa – 2012/10/05 – Page 18 http://lod2.eu Thanks for your attentionOpen Community – All feedback is welcome!http://slideshare.net/kurzumDirect email:http://bis.informatik.uni-leipzig.de/SebastianHellmannPublic Mailing List:http://lists.informatik.uni-leipzig.de/mailman/listinfo/nlp2rdfWiki (collection of use cases and issues):http://wiki.nlp2rdf.org/wiki/Use_cases_and_requirements#Use_caseshttp://wiki.nlp2rdf.org/wiki/IssuesWebsite:http://nlp2rdf.org