FREME To Make Linked Data Available to Localizers – FREME at FEISGILTT 2015 WWW.FREME-PROJECT.EU 1
Co-funded by the Horizon 2020
Framework Programme of the European Union
Grant Agreement Number 644771
FEISGILTT 2015 | BERLIN, 3 JUNE 2015
Felix Sasaki, DFKI / W3C Fellow
On behalf of the FREME Consortium
FREME TO MAKE LINKED DATA
AVAILABLE TO LOCALIZERS
www.freme-project.eu
REMINDER: WHAT IS FREME?
• More info: see presentation from yesterday
http://slideshare.net/atcfsenzoku/freme-at-feisgiltt-2015-freme-use-cases
• Design of FREME takes up work from other projects
1. LIDER http://lider-project.eu/
◦ In FREME, we deploy best practices on how to work with linguistic linked data (LLD)
◦ LLD: Linked data used to represent lexica, corpora, language processing workflows etc.
2. FALCON http://falcon-project.eu/
◦ In FREME, we benefit from experience on working with linked data in localisation scenarios
◦ One lesson learned: hide linked data in the right way from (localisation) developers
◦ No need to always process linked data in its native form, see Babelfy http://babelfy.org/
FREME E-SERVICES – BIRD'S-EYE VIEW
• e-Entity
◦ Automatic annotation of named entities
• e-Terminology
◦ Annotation of terms and linkage to term databases
• e-Link
◦ Enrichment with information from (linked) (open) data sources*
• e-Translation
◦ Cloud based machine translation
• e-Internationalisation
◦ ITS 2.0 metadata to govern the multilingual & semantic content workflow
• e-Publishing
◦ Publish enriched content in ePub format
EXAMPLE: E-SERVICE DESIGN
• RESTful API
• Example
http://api.freme-project.eu/0.1/e-entity/dbpedia-spolight
• Under each service endpoint: tool specific versions
• Parameters for e-Entity
◦ Confidence threshold
◦ Informat. Currently text or NIF (explained on the next slides)
◦ Outformat.
• Output: NIF in various serializations
◦ Currently text/turtle or application/ld+json
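A minimal sketch of how a client might assemble a call to the endpoint above. The parameter names informat and outformat come from the slide; the name "confidence" for the threshold and the value strings "text" and "turtle" are assumptions, so the API documentation should be checked before relying on them. The URL is only built, not sent.

```python
from urllib.parse import urlencode

# Endpoint exactly as printed on the slide (FREME version 0.1).
E_ENTITY_ENDPOINT = "http://api.freme-project.eu/0.1/e-entity/dbpedia-spolight"


def build_request_url(endpoint, informat, outformat, confidence=None):
    """Return the endpoint URL with its query parameters appended."""
    params = {"informat": informat, "outformat": outformat}
    if confidence is not None:
        # "confidence" is a hypothetical parameter name for the threshold.
        params["confidence"] = confidence
    return endpoint + "?" + urlencode(params)


url = build_request_url(E_ENTITY_ENDPOINT, "text", "turtle", confidence=0.3)
print(url)
```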
WHAT IS NIF?
• Natural Language Processing Interchange Format
• “The XLIFF of natural language processing workflows” (Phil Ritchie, FEISGILTT 2015)
• NIF: Linked data based representation of digital content and NLP related annotations
• Anchoring in source format possible -> basis for roundtripping
• More info: see http://site.nlp2rdf.org/
NIF EXAMPLE: DESCRIBING DOCUMENTS
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .
@prefix nif: <http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core#> .
<http://example.org/document/1#char=0,11>
a nif:String , nif:Context , nif:RFC5147String ;
nif:isString "the content"^^xsd:string;
nif:beginIndex "0"^^xsd:nonNegativeInteger;
nif:endIndex "11"^^xsd:nonNegativeInteger;
nif:sourceUrl <http://differentday.blogspot.com/2007_01_01_archive.html> .
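The document description above can be generated mechanically from a string. The following is a sketch only: the function name nif_context is made up for illustration, it assumes single-line text, and it assumes that Python's character count matches the offset convention used for the #char= fragment.

```python
NIF = "http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core#"
XSD = "http://www.w3.org/2001/XMLSchema#"


def nif_context(doc_uri, text, source_url):
    """Return Turtle describing `text` as a nif:Context, as on the slide."""
    end = len(text)  # end offset = number of characters in the string
    # Escape backslashes and quotes so the string literal stays valid Turtle.
    literal = text.replace("\\", "\\\\").replace('"', '\\"')
    lines = [
        f"@prefix xsd: <{XSD}> .",
        f"@prefix nif: <{NIF}> .",
        f"<{doc_uri}#char=0,{end}>",
        "    a nif:String , nif:Context , nif:RFC5147String ;",
        f'    nif:isString "{literal}"^^xsd:string ;',
        '    nif:beginIndex "0"^^xsd:nonNegativeInteger ;',
        f'    nif:endIndex "{end}"^^xsd:nonNegativeInteger ;',
        f"    nif:sourceUrl <{source_url}> .",
    ]
    return "\n".join(lines)


turtle = nif_context(
    "http://example.org/document/1",
    "the content",
    "http://differentday.blogspot.com/2007_01_01_archive.html",
)
print(turtle)
```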
DESCRIBING STRINGS
<http://example.org/document/1#char=0,21>
a nif:String , nif:Context , nif:RFC5147String ;
nif:isString "We talk about Xiamen."^^xsd:string;
nif:beginIndex "0" ;
nif:endIndex "21" ;
nif:sourceUrl <http://differentday.blogspot.com/2007_01_01_archive.html> .
<http://example.org/document/1#char=14,20> a nif:String , nif:RFC5147String ,
nif:Word, nif:Phrase;
nif:referenceContext <http://example.org/document/1#char=0,21> ;
nif:anchorOf "Xiamen" ;
nif:beginIndex "14" ;
nif:endIndex "20";
nif:wasConvertedFrom
<http://example.org?t=url&f=html&i=http://somewebpage.com#char=0,2820> .
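The offsets in the string URI follow directly from the position of the phrase in its context. A small sketch (the helper name substring_uri is hypothetical) that computes them for the example sentence:

```python
def substring_uri(doc_uri, context_text, phrase):
    """Return (begin, end, uri) for the first occurrence of `phrase`."""
    begin = context_text.find(phrase)
    if begin < 0:
        raise ValueError(f"{phrase!r} not found in context")
    end = begin + len(phrase)  # end offset is exclusive
    return begin, end, f"{doc_uri}#char={begin},{end}"


context = "We talk about Xiamen."
begin, end, uri = substring_uri("http://example.org/document/1", context, "Xiamen")
print(begin, end, uri)
# The offsets can be checked by slicing the context string.
assert context[begin:end] == "Xiamen"
```

The same check, slicing the context with beginIndex and endIndex and comparing against nif:anchorOf, is a cheap way to validate annotations received from a service.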
STORING E-ENTITY ENRICHMENT
<http://example.org/document/1#char=14,20> a nif:String , nif:RFC5147String ,
nif:Word, nif:Phrase;
itsrdf:taIdentRef <http://dbpedia.org/resource/Xiamen> ;
itsrdf:taClassRef <http://dbpedia.org/ontology/City> ;
itsrdf:taClassRef <http://dbpedia.org/ontology/Settlement> ;
itsrdf:taClassRef <http://dbpedia.org/ontology/PopulatedPlace> ;
itsrdf:taClassRef <http://dbpedia.org/ontology/Place> .
• NIF allows multiple annotations to be added to the same content
• No constraints on the structure of annotations
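To illustrate the two points above, here is a sketch of an in-memory store in which several statements accumulate on one string URI. The store and the annotate helper are illustrative only and not part of NIF or FREME; the itsrdf namespace is the ITS 2.0 RDF namespace.

```python
from collections import defaultdict

ITSRDF = "http://www.w3.org/2005/11/its/rdf#"

# subject URI -> list of (predicate, object) pairs
annotations = defaultdict(list)


def annotate(subject, predicate, obj):
    """Attach one more statement to a string URI; earlier ones are kept."""
    annotations[subject].append((predicate, obj))


xiamen = "http://example.org/document/1#char=14,20"
annotate(xiamen, ITSRDF + "taIdentRef", "http://dbpedia.org/resource/Xiamen")
annotate(xiamen, ITSRDF + "taClassRef", "http://dbpedia.org/ontology/City")
annotate(xiamen, ITSRDF + "taClassRef", "http://dbpedia.org/ontology/Place")

for predicate, obj in annotations[xiamen]:
    print(predicate, obj)
```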
KEEPING PROVENANCE
<http://example.org/document/1#char=0,21> …
nif:wasConvertedFrom
<http://example.com/?informat=html&intype=url&
input=
http://differentday.blogspot.com/2007_01_01_archive.html/
&xpath=/html/body[1]/h2[1]/span[1]/text()[1]>.
• XPath only an example
• nif:wasConvertedFrom can hold source format specific information
• Can be the basis for round-tripping
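A sketch of how source-specific information could be packed into, and later read back from, a provenance URI of this kind. The function names are hypothetical, and unlike the slide, which prints the URI unescaped for readability, the sketch percent-encodes the parameter values.

```python
from urllib.parse import urlencode, urlsplit, parse_qs


def provenance_uri(converter, informat, intype, source, xpath):
    """Pack the source location into one URI; urlencode escapes the values."""
    query = urlencode({"informat": informat, "intype": intype,
                       "input": source, "xpath": xpath})
    return converter + "?" + query


def read_provenance(uri):
    """Recover the parameters from a provenance URI built as above."""
    return {k: v[0] for k, v in parse_qs(urlsplit(uri).query).items()}


uri = provenance_uri(
    "http://example.com/", "html", "url",
    "http://differentday.blogspot.com/2007_01_01_archive.html/",
    "/html/body[1]/h2[1]/span[1]/text()[1]",
)
info = read_provenance(uri)
print(info["xpath"])
```

With the XPath recovered, a tool could locate the original text node and write the enriched or translated string back into the source document.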
BENEFITS AND DRAWBACKS OF NIF
Benefits
• NIF can store all information of enrichment services
◦ e-Entity, e-Link, e-Terminology, e-Translation
• Via NIF we can chain services easily
◦ No constraints on structures: NIF format constitutes general annotation structure
Drawbacks
• Current tooling does not support heterogeneous input formats
◦ Working on that -> integration of Okapi and NIF tooling
• Size of NIF annotations may be an issue
◦ State: currently gathering implementation experience
DEMO: COMBINING E-SERVICES VIA NIF
• Try things yourself at http://api.freme-project.eu/doc/0.1/
Demo workflow:
1. Input: text
2. Processing via e-Entity
3. Output: NIF, input to step 4
4. Processing via e-Link
5. Output: NIF
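The workflow above amounts to function composition: the output of one service is the input of the next. A sketch with stand-in functions; fake_e_entity and fake_e_link are hypothetical placeholders that make no network calls, and the dictionaries merely stand in for NIF.

```python
from functools import reduce


def chain(services, payload):
    """Feed `payload` through each service in turn; output becomes next input."""
    return reduce(lambda data, service: service(data), services, payload)


# Stand-ins for the two services (hypothetical; no network calls are made).
def fake_e_entity(text):
    return {"text": text, "entities": ["http://dbpedia.org/resource/Berlin"]}


def fake_e_link(nif):
    enriched = dict(nif)
    enriched["links"] = {uri: {"template": 1} for uri in nif["entities"]}
    return enriched


result = chain([fake_e_entity, fake_e_link], "Berlin is a city.")
print(result)
```

Because every step reads and writes the same annotation structure, further services can be appended to the list without changing the others.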
HIDING COMPLEXITY (1/2): NIF AND E-SERVICE USER
FREME version 0.1: service endpoints understand text only or NIF content
http://api.freme-project.eu/0.1/e-entity/dbpedia-spolight
• Future version: support additional formats via integrating Okapi into NIF
◦ Informat: HTML, XML, Word, PDF, …
◦ Outformat: NIF, in some cases (HTML, XML, …) roundtripping
• API user sets input and output e.g. via Accept header
• NIF is processed internally, “hidden from the user”
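A sketch of what header-based format selection might look like from the client side. The use of POST, the text/plain content type, and the default Accept value are assumptions for illustration; the request object is built but never sent.

```python
from urllib.request import Request

# Endpoint exactly as printed on the slide.
ENDPOINT = "http://api.freme-project.eu/0.1/e-entity/dbpedia-spolight"


def prepare_request(text, accept="text/turtle"):
    """Build (but do not send) a POST request carrying plain text."""
    return Request(
        ENDPOINT,
        data=text.encode("utf-8"),
        headers={"Content-Type": "text/plain", "Accept": accept},
        method="POST",
    )


req = prepare_request("We talk about Xiamen.")
print(req.get_method(), req.full_url, req.get_header("Accept"))
```

The caller only states which format goes in and which should come out; any NIF handling would stay on the server side.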
HIDING COMPLEXITY (2/2): THE CASE OF E-LINK
• Many users don’t know linked data sources:
◦ What type of data is available?
◦ What linked data vocabularies are used: NIF, LEMON, …
◦ What queries do I need to get information of type X?
• FREME e-Link allows them to query linked data without looking at it
◦ Input: content plus a query template: “Find me all events close to a given entity”, “Find me
all museums close to a given entity”, …
◦ Output: content enriched with information relevant to the query, also as JSON-LD
• Concept of query templates: similar to “Schematron for information architects”
approach, cf. George Bina at XML Prague 2015
http://archive.xmlprague.cz/2015/files/xmlprague-2015-proceedings.pdf#page=199
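The idea of a query template can be sketched as a stored query with one placeholder that is filled with the entity URI. The SPARQL text, the choice of dbo:Museum and dbo:location, and the names below are illustrative assumptions, not a template shipped with FREME.

```python
from string import Template

# Hypothetical template: written once by someone who knows the data source.
MUSEUMS_NEAR = Template(
    "PREFIX dbo: <http://dbpedia.org/ontology/>\n"
    "SELECT ?museum WHERE {\n"
    "  ?museum a dbo:Museum ;\n"
    "          dbo:location <$entity_uri> .\n"
    "}"
)


def fill_template(template, entity_uri):
    """Insert the entity URI found in the content into the stored query."""
    return template.substitute(entity_uri=entity_uri)


query = fill_template(MUSEUMS_NEAR, "http://dbpedia.org/resource/Berlin")
print(query)
```

The user picks a template by what it answers and never has to read or write the query itself.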
E-LINK EXAMPLE: TEMPLATE “PROVIDE GEO-INFORMATION FOR A
GIVEN ENTITY”
<http://example.org/document/1#char=0,6> ...
nif:anchorOf "Berlin"@en;
nif:beginIndex "0";
nif:endIndex "6";
itsrdf:taIdentRef <http://dbpedia.org/resource/Berlin> .
...
<http://dbpedia.org/resource/Berlin>
itsrdf:taIdentRef <http://dbpedia.org/resource/Berlin>;
geo:lat "52.516666";
geo:long "13.383333" .
http://api.freme-project.eu/0.1/e-link/?outformat=turtle&templateid=1
LINKED DATA AND LOCALISATION: LESSONS LEARNED
• Integration of linked data and tooling: loose coupling wins
◦ Localisation tools talking to linked data enabled web services
• Hide complexity in the right manner
◦ Cf. e-Link template approach
• Give people “their output format” – probably JSON
◦ JSON-LD to the rescue
• Linked data world can benefit from localisation tooling
◦ Cf. work on Okapi – NIF integration
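As a sketch of the JSON point above: the Xiamen annotation from the earlier slides, written by hand as JSON-LD and read back with nothing but a JSON parser. The exact shape a service returns may differ; this layout is an assumption.

```python
import json

annotation = {
    "@context": {
        "nif": "http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core#",
        "itsrdf": "http://www.w3.org/2005/11/its/rdf#",
    },
    "@id": "http://example.org/document/1#char=14,20",
    "@type": ["nif:String", "nif:RFC5147String", "nif:Word", "nif:Phrase"],
    "nif:anchorOf": "Xiamen",
    "nif:beginIndex": "14",
    "nif:endIndex": "20",
    "itsrdf:taIdentRef": {"@id": "http://dbpedia.org/resource/Xiamen"},
}

serialized = json.dumps(annotation, indent=2)
# A consumer needs only a JSON parser to reach the entity link.
entity = json.loads(serialized)["itsrdf:taIdentRef"]["@id"]
print(entity)
```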
CONTACTS
FELIX SASAKI
Senior Researcher DFKI / W3C Fellow
On behalf of the FREME consortium
E-mail: felix.sasaki@dfki.de
CONSORTIUM

More Related Content

What's hot

Freme general-overview-version-june-2015
Freme general-overview-version-june-2015Freme general-overview-version-june-2015
Freme general-overview-version-june-2015FREMEProjectH2020
 
Linked Open Data Cloud
Linked Open Data CloudLinked Open Data Cloud
Linked Open Data CloudPretaLLOD
 
Corpus Annotation with Linked Open Data
Corpus Annotation with Linked Open DataCorpus Annotation with Linked Open Data
Corpus Annotation with Linked Open DataPretaLLOD
 
Oc wg-nif-20130711
Oc wg-nif-20130711Oc wg-nif-20130711
Oc wg-nif-20130711STIinnsbruck
 
Digital Publishing and the Open Web Platform
Digital Publishing and the Open Web PlatformDigital Publishing and the Open Web Platform
Digital Publishing and the Open Web PlatformIvan Herman
 
NIF 2.0 Tutorial: Content Analysis and the Semantic Web
NIF 2.0 Tutorial: Content Analysis and the Semantic Web  NIF 2.0 Tutorial: Content Analysis and the Semantic Web
NIF 2.0 Tutorial: Content Analysis and the Semantic Web Sebastian Hellmann
 
Publishing open data and services for the Flemish Research Information Space
Publishing open data and services for the Flemish Research Information SpacePublishing open data and services for the Flemish Research Information Space
Publishing open data and services for the Flemish Research Information SpaceChristophe Debruyne
 
IIIF Introduction given in South Africa - 2019
IIIF Introduction given in South Africa - 2019IIIF Introduction given in South Africa - 2019
IIIF Introduction given in South Africa - 2019Glen Robson
 
TCS eScience Portal - Confusa
TCS eScience Portal - ConfusaTCS eScience Portal - Confusa
TCS eScience Portal - Confusahenrikau
 
:me owl:sameAs flickr:33669349@N00 .
:me owl:sameAs flickr:33669349@N00 .:me owl:sameAs flickr:33669349@N00 .
:me owl:sameAs flickr:33669349@N00 .Alexandre Passant
 
The META-NET Strategic Research Agenda and Linked Open Data
The META-NET Strategic Research Agenda and Linked Open DataThe META-NET Strategic Research Agenda and Linked Open Data
The META-NET Strategic Research Agenda and Linked Open DataGeorg Rehm
 
OntoWiki Application Framework & Erfurt API
OntoWiki Application Framework & Erfurt APIOntoWiki Application Framework & Erfurt API
OntoWiki Application Framework & Erfurt APIPhilipp Frischmuth
 

What's hot (18)

LOD2 Webinar: SIREn
LOD2 Webinar: SIREnLOD2 Webinar: SIREn
LOD2 Webinar: SIREn
 
LOD2 Webinar Series: D2R and Sparqlify
LOD2 Webinar Series: D2R and SparqlifyLOD2 Webinar Series: D2R and Sparqlify
LOD2 Webinar Series: D2R and Sparqlify
 
Lod2 review meeting
Lod2 review meetingLod2 review meeting
Lod2 review meeting
 
Freme general-overview-version-june-2015
Freme general-overview-version-june-2015Freme general-overview-version-june-2015
Freme general-overview-version-june-2015
 
NIF - NLP Interchange Format
NIF - NLP Interchange FormatNIF - NLP Interchange Format
NIF - NLP Interchange Format
 
Linked Open Data Cloud
Linked Open Data CloudLinked Open Data Cloud
Linked Open Data Cloud
 
LOD2: State of Play WP3B - Knowledge Extraction, NLP2RDF + NIF
LOD2: State of Play WP3B - Knowledge Extraction, NLP2RDF + NIFLOD2: State of Play WP3B - Knowledge Extraction, NLP2RDF + NIF
LOD2: State of Play WP3B - Knowledge Extraction, NLP2RDF + NIF
 
Corpus Annotation with Linked Open Data
Corpus Annotation with Linked Open DataCorpus Annotation with Linked Open Data
Corpus Annotation with Linked Open Data
 
Oc wg-nif-20130711
Oc wg-nif-20130711Oc wg-nif-20130711
Oc wg-nif-20130711
 
Digital Publishing and the Open Web Platform
Digital Publishing and the Open Web PlatformDigital Publishing and the Open Web Platform
Digital Publishing and the Open Web Platform
 
LOD2: State of Play WP5 - Linked Data Visualization, Browsing and Authoring
LOD2: State of Play WP5 - Linked Data Visualization, Browsing and AuthoringLOD2: State of Play WP5 - Linked Data Visualization, Browsing and Authoring
LOD2: State of Play WP5 - Linked Data Visualization, Browsing and Authoring
 
NIF 2.0 Tutorial: Content Analysis and the Semantic Web
NIF 2.0 Tutorial: Content Analysis and the Semantic Web  NIF 2.0 Tutorial: Content Analysis and the Semantic Web
NIF 2.0 Tutorial: Content Analysis and the Semantic Web
 
Publishing open data and services for the Flemish Research Information Space
Publishing open data and services for the Flemish Research Information SpacePublishing open data and services for the Flemish Research Information Space
Publishing open data and services for the Flemish Research Information Space
 
IIIF Introduction given in South Africa - 2019
IIIF Introduction given in South Africa - 2019IIIF Introduction given in South Africa - 2019
IIIF Introduction given in South Africa - 2019
 
TCS eScience Portal - Confusa
TCS eScience Portal - ConfusaTCS eScience Portal - Confusa
TCS eScience Portal - Confusa
 
:me owl:sameAs flickr:33669349@N00 .
:me owl:sameAs flickr:33669349@N00 .:me owl:sameAs flickr:33669349@N00 .
:me owl:sameAs flickr:33669349@N00 .
 
The META-NET Strategic Research Agenda and Linked Open Data
The META-NET Strategic Research Agenda and Linked Open DataThe META-NET Strategic Research Agenda and Linked Open Data
The META-NET Strategic Research Agenda and Linked Open Data
 
OntoWiki Application Framework & Erfurt API
OntoWiki Application Framework & Erfurt APIOntoWiki Application Framework & Erfurt API
OntoWiki Application Framework & Erfurt API
 

Viewers also liked

Part 3-functions1-120315220356-phpapp01
Part 3-functions1-120315220356-phpapp01Part 3-functions1-120315220356-phpapp01
Part 3-functions1-120315220356-phpapp01Abdul Samee
 
13 recursion-120712074623-phpapp02
13 recursion-120712074623-phpapp0213 recursion-120712074623-phpapp02
13 recursion-120712074623-phpapp02Abdul Samee
 
Input outputdisplaydevices-140819061228-phpapp02
Input outputdisplaydevices-140819061228-phpapp02Input outputdisplaydevices-140819061228-phpapp02
Input outputdisplaydevices-140819061228-phpapp02Abdul Samee
 
3 operators-expressions-and-statements-120712073351-phpapp01
3 operators-expressions-and-statements-120712073351-phpapp013 operators-expressions-and-statements-120712073351-phpapp01
3 operators-expressions-and-statements-120712073351-phpapp01Abdul Samee
 
Freme at feisgiltt 2015 freme use cases
Freme at feisgiltt 2015   freme use casesFreme at feisgiltt 2015   freme use cases
Freme at feisgiltt 2015 freme use casesFREMEProjectH2020
 
3 operators-expressions-and-statements-120712073351-phpapp01
3 operators-expressions-and-statements-120712073351-phpapp013 operators-expressions-and-statements-120712073351-phpapp01
3 operators-expressions-and-statements-120712073351-phpapp01Abdul Samee
 
arrays-120712074248-phpapp01
arrays-120712074248-phpapp01arrays-120712074248-phpapp01
arrays-120712074248-phpapp01Abdul Samee
 
16 strings-and-text-processing-120712074956-phpapp02
16 strings-and-text-processing-120712074956-phpapp0216 strings-and-text-processing-120712074956-phpapp02
16 strings-and-text-processing-120712074956-phpapp02Abdul Samee
 
Garima Sareen Nagpal - CV
Garima Sareen Nagpal - CVGarima Sareen Nagpal - CV
Garima Sareen Nagpal - CVGarima Sareen
 

Viewers also liked (11)

Sasaki mlkrep-20150710
Sasaki mlkrep-20150710Sasaki mlkrep-20150710
Sasaki mlkrep-20150710
 
Part 3-functions1-120315220356-phpapp01
Part 3-functions1-120315220356-phpapp01Part 3-functions1-120315220356-phpapp01
Part 3-functions1-120315220356-phpapp01
 
13 recursion-120712074623-phpapp02
13 recursion-120712074623-phpapp0213 recursion-120712074623-phpapp02
13 recursion-120712074623-phpapp02
 
Input outputdisplaydevices-140819061228-phpapp02
Input outputdisplaydevices-140819061228-phpapp02Input outputdisplaydevices-140819061228-phpapp02
Input outputdisplaydevices-140819061228-phpapp02
 
3 operators-expressions-and-statements-120712073351-phpapp01
3 operators-expressions-and-statements-120712073351-phpapp013 operators-expressions-and-statements-120712073351-phpapp01
3 operators-expressions-and-statements-120712073351-phpapp01
 
Freme at feisgiltt 2015 freme use cases
Freme at feisgiltt 2015   freme use casesFreme at feisgiltt 2015   freme use cases
Freme at feisgiltt 2015 freme use cases
 
3 operators-expressions-and-statements-120712073351-phpapp01
3 operators-expressions-and-statements-120712073351-phpapp013 operators-expressions-and-statements-120712073351-phpapp01
3 operators-expressions-and-statements-120712073351-phpapp01
 
Linked data tooling XML
Linked data tooling XMLLinked data tooling XML
Linked data tooling XML
 
arrays-120712074248-phpapp01
arrays-120712074248-phpapp01arrays-120712074248-phpapp01
arrays-120712074248-phpapp01
 
16 strings-and-text-processing-120712074956-phpapp02
16 strings-and-text-processing-120712074956-phpapp0216 strings-and-text-processing-120712074956-phpapp02
16 strings-and-text-processing-120712074956-phpapp02
 
Garima Sareen Nagpal - CV
Garima Sareen Nagpal - CVGarima Sareen Nagpal - CV
Garima Sareen Nagpal - CV
 

Similar to Fremeatfeisgiltt2015 fremelinkeddatalocalisers-150603090934-lva1-app6891

Freme at feisgiltt 2015 freme use cases
Freme at feisgiltt 2015   freme use casesFreme at feisgiltt 2015   freme use cases
Freme at feisgiltt 2015 freme use casesFelix Sasaki
 
Semantic web-and-public-data - en
Semantic web-and-public-data - enSemantic web-and-public-data - en
Semantic web-and-public-data - enTenforce
 
Semantics on services: the story so far (SALAD2015 keynote at ESWC2015)
Semantics on services: the story so far (SALAD2015 keynote at ESWC2015)Semantics on services: the story so far (SALAD2015 keynote at ESWC2015)
Semantics on services: the story so far (SALAD2015 keynote at ESWC2015)Sergio Fernández
 
Phil Ritchie | Putting Standards into Action: Multilingual and Semantic Enric...
Phil Ritchie | Putting Standards into Action: Multilingual and Semantic Enric...Phil Ritchie | Putting Standards into Action: Multilingual and Semantic Enric...
Phil Ritchie | Putting Standards into Action: Multilingual and Semantic Enric...semanticsconference
 
Soren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked DataSoren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked DataOpen City Foundation
 
Datalift lod2-paris-24032011
Datalift lod2-paris-24032011Datalift lod2-paris-24032011
Datalift lod2-paris-24032011Datalift
 
SCAPE Webinar: Tools for uncovering preservation risks in large repositories
SCAPE Webinar: Tools for uncovering preservation risks in large repositoriesSCAPE Webinar: Tools for uncovering preservation risks in large repositories
SCAPE Webinar: Tools for uncovering preservation risks in large repositoriesSCAPE Project
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataSören Auer
 
Sasaki practical-linked-data
Sasaki practical-linked-dataSasaki practical-linked-data
Sasaki practical-linked-dataFelix Sasaki
 
Lider Reference Model ld4lt session March, 3rd, 2015
Lider Reference Model ld4lt session  March, 3rd, 2015Lider Reference Model ld4lt session  March, 3rd, 2015
Lider Reference Model ld4lt session March, 3rd, 2015Sebastian Hellmann
 
S. Bartoli & F. Pompermaier – A Semantic Big Data Companion
S. Bartoli & F. Pompermaier – A Semantic Big Data CompanionS. Bartoli & F. Pompermaier – A Semantic Big Data Companion
S. Bartoli & F. Pompermaier – A Semantic Big Data CompanionFlink Forward
 
Linked data-tooling-xml
Linked data-tooling-xmlLinked data-tooling-xml
Linked data-tooling-xmlFelix Sasaki
 
Linked data and semantic wikis
Linked data and semantic wikisLinked data and semantic wikis
Linked data and semantic wikisSören Auer
 
Intro to-technologies-Green-City-Hackathon-Athens
Intro to-technologies-Green-City-Hackathon-AthensIntro to-technologies-Green-City-Hackathon-Athens
Intro to-technologies-Green-City-Hackathon-AthensStoitsis Giannis
 
Briefing on OASIS XLIFF OMOS TC 20160121
Briefing on OASIS XLIFF OMOS TC 20160121Briefing on OASIS XLIFF OMOS TC 20160121
Briefing on OASIS XLIFF OMOS TC 20160121Jamie Clark
 

Similar to Fremeatfeisgiltt2015 fremelinkeddatalocalisers-150603090934-lva1-app6891 (20)

Freme at feisgiltt 2015 freme use cases
Freme at feisgiltt 2015   freme use casesFreme at feisgiltt 2015   freme use cases
Freme at feisgiltt 2015 freme use cases
 
Semantic web-and-public-data - en
Semantic web-and-public-data - enSemantic web-and-public-data - en
Semantic web-and-public-data - en
 
Semantics on services: the story so far (SALAD2015 keynote at ESWC2015)
Semantics on services: the story so far (SALAD2015 keynote at ESWC2015)Semantics on services: the story so far (SALAD2015 keynote at ESWC2015)
Semantics on services: the story so far (SALAD2015 keynote at ESWC2015)
 
Phil Ritchie | Putting Standards into Action: Multilingual and Semantic Enric...
Phil Ritchie | Putting Standards into Action: Multilingual and Semantic Enric...Phil Ritchie | Putting Standards into Action: Multilingual and Semantic Enric...
Phil Ritchie | Putting Standards into Action: Multilingual and Semantic Enric...
 
Soren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked DataSoren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked Data
 
NIF 2.0 draft for Pisa
NIF 2.0 draft for PisaNIF 2.0 draft for Pisa
NIF 2.0 draft for Pisa
 
LOD2: Guest presentation: French datalift project
LOD2: Guest presentation: French datalift projectLOD2: Guest presentation: French datalift project
LOD2: Guest presentation: French datalift project
 
Datalift lod2-paris-24032011
Datalift lod2-paris-24032011Datalift lod2-paris-24032011
Datalift lod2-paris-24032011
 
SCAPE Webinar: Tools for uncovering preservation risks in large repositories
SCAPE Webinar: Tools for uncovering preservation risks in large repositoriesSCAPE Webinar: Tools for uncovering preservation risks in large repositories
SCAPE Webinar: Tools for uncovering preservation risks in large repositories
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
 
Sasaki practical-linked-data
Sasaki practical-linked-dataSasaki practical-linked-data
Sasaki practical-linked-data
 
Towards a Linked Data Publishing Methodology
Towards a Linked Data Publishing MethodologyTowards a Linked Data Publishing Methodology
Towards a Linked Data Publishing Methodology
 
Lider Reference Model ld4lt session March, 3rd, 2015
Lider Reference Model ld4lt session  March, 3rd, 2015Lider Reference Model ld4lt session  March, 3rd, 2015
Lider Reference Model ld4lt session March, 3rd, 2015
 
S. Bartoli & F. Pompermaier – A Semantic Big Data Companion
S. Bartoli & F. Pompermaier – A Semantic Big Data CompanionS. Bartoli & F. Pompermaier – A Semantic Big Data Companion
S. Bartoli & F. Pompermaier – A Semantic Big Data Companion
 
Nifi workshop
Nifi workshopNifi workshop
Nifi workshop
 
Linked data-tooling-xml
Linked data-tooling-xmlLinked data-tooling-xml
Linked data-tooling-xml
 
Linked data and semantic wikis
Linked data and semantic wikisLinked data and semantic wikis
Linked data and semantic wikis
 
Intro to-technologies-Green-City-Hackathon-Athens
Intro to-technologies-Green-City-Hackathon-AthensIntro to-technologies-Green-City-Hackathon-Athens
Intro to-technologies-Green-City-Hackathon-Athens
 
LOD2 Webinar Series: Zemanta / Open refine
LOD2 Webinar Series: Zemanta / Open refine LOD2 Webinar Series: Zemanta / Open refine
LOD2 Webinar Series: Zemanta / Open refine
 
Briefing on OASIS XLIFF OMOS TC 20160121
Briefing on OASIS XLIFF OMOS TC 20160121Briefing on OASIS XLIFF OMOS TC 20160121
Briefing on OASIS XLIFF OMOS TC 20160121
 

Recently uploaded

Getting started with Amazon Bedrock Studio and Control Tower
Getting started with Amazon Bedrock Studio and Control TowerGetting started with Amazon Bedrock Studio and Control Tower
Getting started with Amazon Bedrock Studio and Control TowerVladimir Samoylov
 
Oracle Database Administration I (1Z0-082) Exam Dumps 2024.pdf
Oracle Database Administration I (1Z0-082) Exam Dumps 2024.pdfOracle Database Administration I (1Z0-082) Exam Dumps 2024.pdf
Oracle Database Administration I (1Z0-082) Exam Dumps 2024.pdfSkillCertProExams
 
Acorn Recovery: Restore IT infra within minutes
Acorn Recovery: Restore IT infra within minutesAcorn Recovery: Restore IT infra within minutes
Acorn Recovery: Restore IT infra within minutesIP ServerOne
 
Introduction of Biology in living organisms
Introduction of Biology in living organismsIntroduction of Biology in living organisms
Introduction of Biology in living organismssoumyapottola
 
Sharpen existing tools or get a new toolbox? Contemporary cluster initiatives...
Sharpen existing tools or get a new toolbox? Contemporary cluster initiatives...Sharpen existing tools or get a new toolbox? Contemporary cluster initiatives...
Sharpen existing tools or get a new toolbox? Contemporary cluster initiatives...Orkestra
 
123445566544333222333444dxcvbcvcvharsh.pptx
123445566544333222333444dxcvbcvcvharsh.pptx123445566544333222333444dxcvbcvcvharsh.pptx
123445566544333222333444dxcvbcvcvharsh.pptxgargh1099
 
0x01 - Newton's Third Law: Static vs. Dynamic Abusers
0x01 - Newton's Third Law:  Static vs. Dynamic Abusers0x01 - Newton's Third Law:  Static vs. Dynamic Abusers
0x01 - Newton's Third Law: Static vs. Dynamic AbusersOWASP Beja
 
The Canoga Gardens Development Project. PDF
The Canoga Gardens Development Project. PDFThe Canoga Gardens Development Project. PDF
The Canoga Gardens Development Project. PDFRahsaan L. Browne
 
Writing Sample 2 -Bridging the Divide: Enhancing Public Engagement in Urban D...
Writing Sample 2 -Bridging the Divide: Enhancing Public Engagement in Urban D...Writing Sample 2 -Bridging the Divide: Enhancing Public Engagement in Urban D...
Writing Sample 2 -Bridging the Divide: Enhancing Public Engagement in Urban D...Rahsaan L. Browne
 
Eureka, I found it! - Special Libraries Association 2021 Presentation
Eureka, I found it! - Special Libraries Association 2021 PresentationEureka, I found it! - Special Libraries Association 2021 Presentation
Eureka, I found it! - Special Libraries Association 2021 PresentationAccess Innovations, Inc.
 
05232024 Joint Meeting - Community Networking
05232024 Joint Meeting - Community Networking05232024 Joint Meeting - Community Networking
05232024 Joint Meeting - Community NetworkingMichael Orias
 
Pollinator Ambassador Earth Steward Day Presentation 2024-05-22
Pollinator Ambassador Earth Steward Day Presentation 2024-05-22Pollinator Ambassador Earth Steward Day Presentation 2024-05-22
Pollinator Ambassador Earth Steward Day Presentation 2024-05-22LHelferty
 
527598851-ppc-due-to-various-govt-policies.pdf
527598851-ppc-due-to-various-govt-policies.pdf527598851-ppc-due-to-various-govt-policies.pdf
527598851-ppc-due-to-various-govt-policies.pdfrajpreetkaur75080
 
Hi-Tech Industry 2024-25 Prospective.pptx
Hi-Tech Industry 2024-25 Prospective.pptxHi-Tech Industry 2024-25 Prospective.pptx
Hi-Tech Industry 2024-25 Prospective.pptxShivamM16
 

Recently uploaded (14)

Getting started with Amazon Bedrock Studio and Control Tower
Getting started with Amazon Bedrock Studio and Control TowerGetting started with Amazon Bedrock Studio and Control Tower
Getting started with Amazon Bedrock Studio and Control Tower
 
Oracle Database Administration I (1Z0-082) Exam Dumps 2024.pdf
Oracle Database Administration I (1Z0-082) Exam Dumps 2024.pdfOracle Database Administration I (1Z0-082) Exam Dumps 2024.pdf
Oracle Database Administration I (1Z0-082) Exam Dumps 2024.pdf
 
Acorn Recovery: Restore IT infra within minutes
Acorn Recovery: Restore IT infra within minutesAcorn Recovery: Restore IT infra within minutes
Acorn Recovery: Restore IT infra within minutes
 
Introduction of Biology in living organisms
Introduction of Biology in living organismsIntroduction of Biology in living organisms
Introduction of Biology in living organisms
 
Sharpen existing tools or get a new toolbox? Contemporary cluster initiatives...
Sharpen existing tools or get a new toolbox? Contemporary cluster initiatives...Sharpen existing tools or get a new toolbox? Contemporary cluster initiatives...
Sharpen existing tools or get a new toolbox? Contemporary cluster initiatives...
 
123445566544333222333444dxcvbcvcvharsh.pptx
123445566544333222333444dxcvbcvcvharsh.pptx123445566544333222333444dxcvbcvcvharsh.pptx
123445566544333222333444dxcvbcvcvharsh.pptx
 
0x01 - Newton's Third Law: Static vs. Dynamic Abusers
0x01 - Newton's Third Law:  Static vs. Dynamic Abusers0x01 - Newton's Third Law:  Static vs. Dynamic Abusers
0x01 - Newton's Third Law: Static vs. Dynamic Abusers
 
The Canoga Gardens Development Project. PDF
The Canoga Gardens Development Project. PDFThe Canoga Gardens Development Project. PDF
The Canoga Gardens Development Project. PDF
 
Writing Sample 2 -Bridging the Divide: Enhancing Public Engagement in Urban D...
Writing Sample 2 -Bridging the Divide: Enhancing Public Engagement in Urban D...Writing Sample 2 -Bridging the Divide: Enhancing Public Engagement in Urban D...
Writing Sample 2 -Bridging the Divide: Enhancing Public Engagement in Urban D...
 
Eureka, I found it! - Special Libraries Association 2021 Presentation
Eureka, I found it! - Special Libraries Association 2021 PresentationEureka, I found it! - Special Libraries Association 2021 Presentation
Eureka, I found it! - Special Libraries Association 2021 Presentation
 
05232024 Joint Meeting - Community Networking
05232024 Joint Meeting - Community Networking05232024 Joint Meeting - Community Networking
05232024 Joint Meeting - Community Networking
 
Pollinator Ambassador Earth Steward Day Presentation 2024-05-22
Pollinator Ambassador Earth Steward Day Presentation 2024-05-22Pollinator Ambassador Earth Steward Day Presentation 2024-05-22
Pollinator Ambassador Earth Steward Day Presentation 2024-05-22
 
527598851-ppc-due-to-various-govt-policies.pdf
527598851-ppc-due-to-various-govt-policies.pdf527598851-ppc-due-to-various-govt-policies.pdf
527598851-ppc-due-to-various-govt-policies.pdf
 
Hi-Tech Industry 2024-25 Prospective.pptx
Hi-Tech Industry 2024-25 Prospective.pptxHi-Tech Industry 2024-25 Prospective.pptx
Hi-Tech Industry 2024-25 Prospective.pptx
 

Fremeatfeisgiltt2015 fremelinkeddatalocalisers-150603090934-lva1-app6891

  • 1. FREME To Make Linked Data Available to Localizers – FREME at FEISGILTT 2015 WWW.FREME-PROJECT.EU 1 Co-funded by the Horizon 2020 Framework Programme of the European Union Grant Agreement Number 644771 FEISGILTT 2015 |BERLIN, 3 JUNE 2015 Felix Sasaki, DFKI / W3C Fellow On behalf of the FREME Consortium FREME TO MAKE LINKED DATA AVAILABLE TO LOCALIZERS www.freme-project.eu
  • 2. FREME To Make Linked Data Available to Localizers – FREME at FEISGILTT 2015 WWW.FREME-PROJECT.EU 2 REMINDER: WHAT IS FREME? • More info: see presentation from yesterday http://slideshare.net/atcfsenzoku/freme-at-feisgiltt-2015-freme-use-cases • Design of FREME takes up work from other projects 1. LIDER http://lider-project.eu/ ◦ In FREME, we deploy best Practices on how to work with linguistic linked data (LLD) ◦ LLD: Linked data used to represent lexica, corpora, language processing workflows etc. 2. FALCON http://falcon-project.eu/ ◦ In FREME, we benefit from experience on working with linked data in localisation scenarios ◦ One lesson learned: hide linked data in the right way from (localisation) developers ◦ No need to process linked data always in the native form, see Babelfy http://babelfy.org/
  • 3. FREME To Make Linked Data Available to Localizers – FREME at FEISGILTT 2015 WWW.FREME-PROJECT.EU 3
  • 4. FREME E-SERVICES – BIRD'S-EYE VIEW
       • e-Entity
         ◦ Automatic annotation of named entities
       • e-Terminology
         ◦ Annotation of terms and linkage to term databases
       • e-Link
         ◦ Enrichment with information from (linked) (open) data sources*
       • e-Translation
         ◦ Cloud-based machine translation
       • e-Internationalisation
         ◦ ITS 2.0 metadata to govern the multilingual & semantic content workflow
       • e-Publishing
         ◦ Publish enriched content in ePub format
  • 5. EXAMPLE: E-SERVICE DESIGN
       • RESTful API
       • Example: http://api.freme-project.eu/0.1/e-entity/dbpedia-spolight
       • Under each service endpoint: tool-specific versions
       • Parameters for e-Entity
         ◦ Confidence threshold
         ◦ Informat: currently text or NIF (explained on the next slides)
         ◦ Outformat
       • Output: NIF in various serializations
         ◦ Currently text/turtle or application/json+ld
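The parameters listed on this slide can be sketched as a query string on the example endpoint. This is a minimal sketch, not the documented API: the endpoint is copied from the slide as printed, `outformat` and `informat` are spelled as they appear in URLs elsewhere in this deck, and the name `confidence` for the threshold parameter, as well as the helper `build_e_entity_url`, are assumptions. The code only builds a URL; it does not contact the service.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Endpoint exactly as printed on the slide (API version 0.1).
E_ENTITY_ENDPOINT = "http://api.freme-project.eu/0.1/e-entity/dbpedia-spolight"


def build_e_entity_url(informat: str, outformat: str, confidence: float) -> str:
    """Return the endpoint URL with the three slide parameters as a query string.

    Assumption: the parameter name "confidence" is hypothetical; the slide
    only says "confidence threshold".
    """
    query = urlencode({
        "informat": informat,
        "outformat": outformat,
        "confidence": confidence,
    })
    return f"{E_ENTITY_ENDPOINT}?{query}"


url = build_e_entity_url(informat="text", outformat="turtle", confidence=0.3)
print(url)
```

Keeping URL construction in one small function makes it easy to swap in the real parameter names once they are checked against the API documentation.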
  • 6. WHAT IS NIF?
       • Natural Language Processing Interchange Format
       • “The XLIFF of natural language processing workflows” (Phil Ritchie, FEISGILTT 2015)
       • NIF: linked-data-based representation of digital content and NLP-related annotations
       • Anchoring in the source format is possible -> basis for roundtripping
       • More info: see http://site.nlp2rdf.org/
  • 7. NIF EXAMPLE: DESCRIBING DOCUMENTS
       @prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
       @prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
       @prefix xsd: <http://www.w3.org/2001/XMLSchema#> .
       @prefix nif: <http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core#> .

       <http://example.org/document/1#char=0,11>
         a nif:String , nif:Context , nif:RFC5147String ;
         nif:isString "the content"^^xsd:string ;
         nif:beginIndex "0"^^xsd:nonNegativeInteger ;
         nif:endIndex "11"^^xsd:nonNegativeInteger ;
         nif:sourceUrl <http://differentday.blogspot.com/2007_01_01_archive.html> .
  • 8. DESCRIBING STRINGS
       <http://example.org/document/1#char=0,21>
         a nif:String , nif:Context , nif:RFC5147String ;
         nif:isString "We talk about Xiamen."^^xsd:string ;
         nif:beginIndex "0" ;
         nif:endIndex "21" ;
         nif:sourceUrl <http://differentday.blogspot.com/2007_01_01_archive.html> .

       <http://example.org/document/1#char=14,20>
         a nif:String , nif:RFC5147String , nif:Word , nif:Phrase ;
         nif:referenceContext <http://example.org/document/1#char=0,21> ;
         nif:anchorOf "Xiamen" ;
         nif:beginIndex "14" ;
         nif:endIndex "20" ;
         nif:wasConvertedFrom <http://example.org?t=url&f=html&i=http://somewebpage.com#char=0,2820> .
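The offsets in the two string descriptions above can be reproduced with a few lines of Python. This is a minimal sketch assuming that NIF begin and end indices count characters from zero with an exclusive end, which matches the values on the slide; the helper name `char_fragment` is hypothetical.

```python
def char_fragment(context: str, anchor: str, base: str) -> tuple[int, int, str]:
    """Locate the first occurrence of `anchor` in `context`.

    Returns (beginIndex, endIndex, URI), where the URI carries an
    RFC 5147 style "#char=begin,end" fragment as used on the slide.
    """
    begin = context.index(anchor)   # raises ValueError if anchor is absent
    end = begin + len(anchor)       # end index is exclusive
    return begin, end, f"{base}#char={begin},{end}"


context = "We talk about Xiamen."
base = "http://example.org/document/1"

# The whole context string spans 0..len(context).
print(f"{base}#char=0,{len(context)}")
# The annotated word is addressed relative to that context.
print(char_fragment(context, "Xiamen", base))
```

Python strings are indexed by Unicode code point, so the same function works for non-ASCII content as long as the consumer counts the same way.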
  • 9. STORING E-ENTITY ENRICHMENT
       <http://example.org/document/1#char=14,20>
         a nif:String , nif:RFC5147String , nif:Word , nif:Phrase ;
         itsrdf:taIdentRef <http://dbpedia.org/resource/Xiamen> ;
         itsrdf:taClassRef <http://dbpedia.org/ontology/City> ;
         itsrdf:taClassRef <http://dbpedia.org/ontology/Settlement> ;
         itsrdf:taClassRef <http://dbpedia.org/ontology/PopulatedPlace> ;
         itsrdf:taClassRef <http://dbpedia.org/ontology/Place> .
       • NIF allows multiple annotations to be added to the same content
       • No constraints on the structure of annotations
  • 10. KEEPING PROVENANCE
        <http://example.org/document/1#char=0,21>
          …
          nif:wasConvertedFrom <http://example.com/?informat=html&intype=url&input=http://differentday.blogspot.com/2007_01_01_archive.html/&xpath=/html/body[1]/h2[1]/span[1]/text()[1]> .
        • XPath is only an example
        • nif:wasConvertedFrom can hold source-format-specific information
        • Can be the basis for round tripping
  • 11. BENEFITS AND DRAWBACKS OF NIF
        Benefits
        • NIF can store all information produced by the enrichment services
          ◦ e-Entity, e-Link, e-Terminology, e-Translation
        • Via NIF we can chain services easily
          ◦ No constraints on structures: the NIF format constitutes a general annotation structure
        Drawbacks
        • Current tooling does not support heterogeneous input formats
          ◦ Working on that -> integration of Okapi and NIF tooling
        • Size of NIF annotations may be an issue
          ◦ State: currently gathering implementation experience
  • 12. DEMO: COMBINING E-SERVICES VIA NIF
        • Try things yourself at http://api.freme-project.eu/doc/0.1/
        Demo workflow:
        1. Input: text
        2. Processing via e-Entity
        3. Output: NIF, input to step 4
        4. Processing via e-Link
        5. Output: NIF
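The demo workflow above can be sketched as two chained calls, where the NIF returned by e-Entity becomes the request body for e-Link. This is an illustration under assumptions: the two endpoint URLs are taken from the slides, but the `Transport` type, `run_demo_workflow` and `fake_send` are hypothetical names, and the stand-in transport never touches the network.

```python
from typing import Callable

# A transport takes (endpoint URL, request body) and returns the response body.
Transport = Callable[[str, str], str]

E_ENTITY = "http://api.freme-project.eu/0.1/e-entity/dbpedia-spolight"
E_LINK = "http://api.freme-project.eu/0.1/e-link/"


def run_demo_workflow(text: str, send: Transport) -> str:
    """Chain the two services: text -> e-Entity -> NIF -> e-Link -> NIF."""
    nif_with_entities = send(E_ENTITY, text)   # steps 1 to 3
    return send(E_LINK, nif_with_entities)     # steps 4 and 5


def fake_send(endpoint: str, body: str) -> str:
    """Stand-in transport: appends a marker instead of calling the service."""
    return f"{body}\n# processed by {endpoint}"


result = run_demo_workflow("We talk about Xiamen.", fake_send)
print(result)
```

Passing the transport in as a function keeps the chaining logic independent of any HTTP library, so a real client can replace `fake_send` without changing the workflow.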
  • 13. HIDING COMPLEXITY (1/2): NIF AND E-SERVICE USER
        • FREME version 0.1: service endpoints understand only plain text or NIF content
          http://api.freme-project.eu/0.1/e-entity/dbpedia-spolight
        • Future version: support for additional formats by integrating Okapi with NIF
          ◦ Informat: HTML, XML, Word, PDF, …
          ◦ Outformat: NIF; in some cases (HTML, XML, …) roundtripping
        • The API user sets input and output, e.g. via the Accept header
        • NIF is processed internally, “hidden from the user”
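Selecting the output format through the Accept header could look like the following sketch. Only the endpoint and the media type text/turtle come from the slides; the use of POST, the text/plain content type and the helper name `build_request` are assumptions. The request object is built but never sent.

```python
from urllib.request import Request

ENDPOINT = "http://api.freme-project.eu/0.1/e-entity/dbpedia-spolight"


def build_request(text: str, accept: str = "text/turtle") -> Request:
    """Prepare (but do not send) a request whose Accept header picks the output.

    Assumptions: POST as the method and text/plain as the input media type.
    """
    return Request(
        ENDPOINT,
        data=text.encode("utf-8"),
        headers={
            "Content-Type": "text/plain; charset=utf-8",
            "Accept": accept,
        },
        method="POST",
    )


request = build_request("We talk about Xiamen.")
print(request.get_method(), request.full_url)
print(request.get_header("Accept"))
```

With this design the caller never handles NIF explicitly: changing the `accept` argument is the only thing needed to ask for a different serialization.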
  • 14. HIDING COMPLEXITY (2/2): THE CASE OF E-LINK
        • Many users don’t know linked data sources:
          ◦ What type of data is available?
          ◦ What linked data vocabularies are used: NIF, LEMON, …?
          ◦ What queries do I need to get information of type X?
        • FREME e-Link allows them to query linked data without looking at it
          ◦ Input: content plus a query template: “Find me all events close to a given entity”, “Find me all museums close to a given entity”, …
          ◦ Output: content enriched with information relevant to the query, also as JSON-LD
        • Concept of query templates: similar to the “Schematron for information architects” approach, cf. George Bina at XML Prague 2015 http://archive.xmlprague.cz/2015/files/xmlprague-2015-proceedings.pdf#page=199
  • 15. E-LINK EXAMPLE: TEMPLATE “PROVIDE GEO-INFORMATION FOR A GIVEN ENTITY”
        <http://example.org/document/1#char=0,6>
          ...
          nif:anchorOf "Berlin"@en ;
          nif:beginIndex "0" ;
          nif:endIndex "6" ;
          itsrdf:taIdentRef <http://dbpedia.org/resource/Berlin> .
        ...
        <http://dbpedia.org/resource/Berlin>
          itsrdf:taIdentRef <http://dbpedia.org/resource/Berlin> ;
          geo:lat "52.516666" ;
          geo:long "13.383333" .
        http://api.freme-project.eu/0.1/e-link/?outformat=turtle&templateid=1
  • 16. LINKED DATA AND LOCALISATION: LESSONS LEARNED
        • Integration of linked data and tooling: loose coupling wins
          ◦ Localisation tools talking to linked-data-enabled web services
        • Hide complexity in the right manner
          ◦ Cf. e-Link template approach
        • Give people “their output format”, probably JSON
          ◦ JSON-LD to the rescue
        • Linked data world can benefit from localisation tooling
          ◦ Cf. work on Okapi – NIF integration
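The point about giving people “their output format” can be illustrated with plain JSON tooling. The JSON-LD below is hand-written for illustration and is not captured FREME output; it reuses the Berlin coordinates from the e-Link example, and it assumes that the `geo:` prefix stands for the W3C Basic Geo (WGS84) vocabulary, which the slides do not declare.

```python
import json

# Hand-written JSON-LD for illustration only; NOT actual FREME output.
# Assumption: "geo" is the W3C Basic Geo (WGS84) vocabulary.
SAMPLE_JSONLD = """
{
  "@context": {
    "geo": "http://www.w3.org/2003/01/geo/wgs84_pos#"
  },
  "@id": "http://dbpedia.org/resource/Berlin",
  "geo:lat": "52.516666",
  "geo:long": "13.383333"
}
"""

# A consumer with no RDF tooling can read the enrichment as ordinary JSON.
doc = json.loads(SAMPLE_JSONLD)
latitude = float(doc["geo:lat"])
longitude = float(doc["geo:long"])
print(doc["@id"], latitude, longitude)
```

Reading keys directly like this only works when the producer keeps the JSON-LD shape stable; a consumer that needs to be robust against different but equivalent serializations would use a JSON-LD processor instead.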
  • 17. CONTACTS
        Felix Sasaki, Senior Researcher, DFKI / W3C Fellow
        On behalf of the FREME consortium
        E-mail: felix.sasaki@dfki.de

Editor's Notes

  1. Example tasks per e-Service:
     • e-Translation: “Translate from Dutch to English”
     • e-Terminology: “Add terminology annotations”
     • e-Entity: “Identify unique entities”
     • e-Link: “Add information from (linked open) data sources”
     • e-Publishing: “Publish as digital book content”
     • e-Internationalisation: “Use standardised metadata for multilingual content production”
     A key aspect of FREME: it will make it possible to combine data and language technologies via adequate software interfaces (APIs) and graphical user interfaces (GUIs).