SlideShare a Scribd company logo
1 of 12
www.cngl.ie
Linked Data Reuse in the Language Services Industry
David Lewis, Leroy Finn, Dominic Jones
www.cngl.ie
Language & Localisation: Meta-Data and Interoperability
Multilingual
Web
Language
Resource
Community
Language
Technology
Community
Localisation
Industry
Translations
+ Term
€
www.cngl.ie
Content Management-Localisation
Workflow & Interoperability
• Interoperability
roadblocks in high
variety:
– Content
Management
Systems
– Content formats
• Increasingly
dynamic
• Add language
resource curation
– Data driven MT &
text analytics
Create
Content
Publish
Content
Client
Language
Service
Provider
Extract &
Segment
translators
QA
Translate
Content
CMS
CMS/
CVS
editors
Trans+Term
Reuse
CAT
QA tools
Web
CMS
design
Trans/term
Mgmt
MT
www.cngl.ie
Internationalisation Tag Set (ITS)
• Allows Localisation tools to be instructed to treat specific
text in specific ways
• Annotates text in XML and HTML5 documents
• RDF Mapping – NLP Interchange Format
• Principles:
– Minimise disturbance of original content
– Don’t reinvent wheel
– Link to existing meta-data before adding new
• Defined distinct, independent Data Categories
• ITS2.0 is “Dublin Core” of the Multilingual Web
www.cngl.ie
C. Lieske, F. Sasaki 2010
• Here Content is the Data
• So we must annotate textual content
• RDFa and microdata better for
annotating elements
www.cngl.ie
Process Monitoring in RDF
 Better up-front and real-time business decision support:
– Use historic and aggregated metadata for inference
– Content carries its own “audit trail”
 Metadata can be transformed to linked semantic data allowing:
– Inferential queries across data siloes
– New forms of data visualization
– Extensible data relationships
www.cngl.ie
Provenance Data: RDF-based logging
• W3C Provenance WG
• http://www.w3.org/2011/prov/
From: http://www.w3.org/TR/prov-primer/
ITS related entity subclass:
document, segment,
analysed-text, term,
translation, translation-revision
subproperty:
wasTranslatedFrom
www.cngl.ie
CMS to Localisation Workflow Roundtrip
Content
Management Localisation Preparation Translation Management
Source
CMS
Target
CMS
RDF
provenance
store
Named
Entity
Recogniser
–Enrycher
Web-
based
Postediter
MT -
Matrex
CAT
XLIFF
store
Parse, filter,
segment
ITS
+XLIFF
XLIFF/
PROV-O
Workflow
Management
QA
viewer
MT -
Bing
MT –
M4LOC
ITS
+HTML5
+CMIS
ITS
+XLIFF
ITS
+SPARQL
LKR
www.cngl.ie
Post
Edit
Quality
CheckText
Analysis
Terminology
Terminology
ITS
XLIFF
HTML5
CMIS
RDF: PROV-O + NIF
HTML5
CMIS
www.cngl.ie
Active Curation
• Localisation Industry:
– €33bn global industry 12% growth
– Established commercial models for translation and term reuse
– High interoperability costs
– Long tail industry – 99% SMEs
– SMEs struggle to benefit from leveraging reuse
– Challenged to adapt machine translation and text analytics
technology
• Active Curation: On-demand, quality-driven assembly of
training corpora
– Collect human linguistic judgements from across the workflow
– Professional translator/reviewer, crowds, end user
– Already shown 25% MT improvement via in-job retraining
• Linked Language and Localisation Data
– Extending existing tools to generate process provenance data
– Pooling/Bokering of data across value chains
www.cngl.ie
Where Next?
• Help EC and other public bodies to bootstrap data
– per year EC produces 11 million pages in 22 language =>
14 billion triples
• Attribution/license annotation and access control –
Controlled Linked Data
• Increase industrial engagement
– Media localisation – audio, video, images
– Non-localisation MT and TA applications
– Multilingual content personalisation
– YOUR USE CASE HERE
• Join new W3C Community Group:
Multilingual Linked Open Data
http://www.w3.org/community/bpmlod/
www.cngl.ie
Thank You

More Related Content

Viewers also liked

EDF2013: Invited talk Florian Bauer: Unleashing climate and energy knowledge ...
EDF2013: Invited talk Florian Bauer: Unleashing climate and energy knowledge ...EDF2013: Invited talk Florian Bauer: Unleashing climate and energy knowledge ...
EDF2013: Invited talk Florian Bauer: Unleashing climate and energy knowledge ...European Data Forum
 
EDF2012 Nigel Shadbolt - Transparency and Open Data
EDF2012   Nigel Shadbolt - Transparency and Open DataEDF2012   Nigel Shadbolt - Transparency and Open Data
EDF2012 Nigel Shadbolt - Transparency and Open DataEuropean Data Forum
 
EDF2012 Mariana Damova - Factforge
EDF2012   Mariana Damova - FactforgeEDF2012   Mariana Damova - Factforge
EDF2012 Mariana Damova - FactforgeEuropean Data Forum
 
EDF2012 Márta Nagy-Rothengass - Leveraging the data potential in Europe
EDF2012   Márta Nagy-Rothengass - Leveraging the data potential in EuropeEDF2012   Márta Nagy-Rothengass - Leveraging the data potential in Europe
EDF2012 Márta Nagy-Rothengass - Leveraging the data potential in EuropeEuropean Data Forum
 
EDF2013: Selected Talk: František Nonnemann: Re-use of PSI and Personal Data...
 EDF2013: Selected Talk: František Nonnemann: Re-use of PSI and Personal Data... EDF2013: Selected Talk: František Nonnemann: Re-use of PSI and Personal Data...
EDF2013: Selected Talk: František Nonnemann: Re-use of PSI and Personal Data...European Data Forum
 
EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...
EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...
EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...European Data Forum
 
EDF2013: Keynote Stefan Decker: Big Data In Ireland - Linked Data and beyond
EDF2013: Keynote Stefan Decker: Big Data In Ireland - Linked Data and beyondEDF2013: Keynote Stefan Decker: Big Data In Ireland - Linked Data and beyond
EDF2013: Keynote Stefan Decker: Big Data In Ireland - Linked Data and beyondEuropean Data Forum
 
EDF2013: Selected Talk Josep-L. Larriba-Pey: The Linked Data Benchmark Counci...
EDF2013: Selected Talk Josep-L. Larriba-Pey: The Linked Data Benchmark Counci...EDF2013: Selected Talk Josep-L. Larriba-Pey: The Linked Data Benchmark Counci...
EDF2013: Selected Talk Josep-L. Larriba-Pey: The Linked Data Benchmark Counci...European Data Forum
 

Viewers also liked (8)

EDF2013: Invited talk Florian Bauer: Unleashing climate and energy knowledge ...
EDF2013: Invited talk Florian Bauer: Unleashing climate and energy knowledge ...EDF2013: Invited talk Florian Bauer: Unleashing climate and energy knowledge ...
EDF2013: Invited talk Florian Bauer: Unleashing climate and energy knowledge ...
 
EDF2012 Nigel Shadbolt - Transparency and Open Data
EDF2012   Nigel Shadbolt - Transparency and Open DataEDF2012   Nigel Shadbolt - Transparency and Open Data
EDF2012 Nigel Shadbolt - Transparency and Open Data
 
EDF2012 Mariana Damova - Factforge
EDF2012   Mariana Damova - FactforgeEDF2012   Mariana Damova - Factforge
EDF2012 Mariana Damova - Factforge
 
EDF2012 Márta Nagy-Rothengass - Leveraging the data potential in Europe
EDF2012   Márta Nagy-Rothengass - Leveraging the data potential in EuropeEDF2012   Márta Nagy-Rothengass - Leveraging the data potential in Europe
EDF2012 Márta Nagy-Rothengass - Leveraging the data potential in Europe
 
EDF2013: Selected Talk: František Nonnemann: Re-use of PSI and Personal Data...
 EDF2013: Selected Talk: František Nonnemann: Re-use of PSI and Personal Data... EDF2013: Selected Talk: František Nonnemann: Re-use of PSI and Personal Data...
EDF2013: Selected Talk: František Nonnemann: Re-use of PSI and Personal Data...
 
EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...
EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...
EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...
 
EDF2013: Keynote Stefan Decker: Big Data In Ireland - Linked Data and beyond
EDF2013: Keynote Stefan Decker: Big Data In Ireland - Linked Data and beyondEDF2013: Keynote Stefan Decker: Big Data In Ireland - Linked Data and beyond
EDF2013: Keynote Stefan Decker: Big Data In Ireland - Linked Data and beyond
 
EDF2013: Selected Talk Josep-L. Larriba-Pey: The Linked Data Benchmark Counci...
EDF2013: Selected Talk Josep-L. Larriba-Pey: The Linked Data Benchmark Counci...EDF2013: Selected Talk Josep-L. Larriba-Pey: The Linked Data Benchmark Counci...
EDF2013: Selected Talk Josep-L. Larriba-Pey: The Linked Data Benchmark Counci...
 

Similar to EDF2013: Selected talk by David Lewis: Linked Data Reuse in the Language Services Industry

Active Curation of Bi-Text Resources in Commercial Localization Workflows
Active Curation of Bi-Text Resources in Commercial Localization WorkflowsActive Curation of Bi-Text Resources in Commercial Localization Workflows
Active Curation of Bi-Text Resources in Commercial Localization Workflows Dave Lewis
 
The Standards Mosaic Opening the Way to New Technologies
The Standards Mosaic Opening the Way to New TechnologiesThe Standards Mosaic Opening the Way to New Technologies
The Standards Mosaic Opening the Way to New TechnologiesDave Lewis
 
Broadcast Digital Media Technology Trends
Broadcast Digital Media Technology TrendsBroadcast Digital Media Technology Trends
Broadcast Digital Media Technology TrendsKuppa Srinivas
 
Data Treatment MongoDB
Data Treatment MongoDBData Treatment MongoDB
Data Treatment MongoDBNorberto Leite
 
Tackle your Documentation Challenges with the IXIASOFT DITA CMS
Tackle your Documentation Challenges with the IXIASOFT DITA CMSTackle your Documentation Challenges with the IXIASOFT DITA CMS
Tackle your Documentation Challenges with the IXIASOFT DITA CMSIXIASOFT
 
FISL: Content Management Primer
FISL: Content Management PrimerFISL: Content Management Primer
FISL: Content Management PrimerRichard Esplin
 
Extend IBM Enterprise Content Management Solutions with Content Navigator
Extend IBM Enterprise Content Management Solutions with Content NavigatorExtend IBM Enterprise Content Management Solutions with Content Navigator
Extend IBM Enterprise Content Management Solutions with Content NavigatorPerficient, Inc.
 
Lider Reference Model ld4lt session March, 3rd, 2015
Lider Reference Model ld4lt session  March, 3rd, 2015Lider Reference Model ld4lt session  March, 3rd, 2015
Lider Reference Model ld4lt session March, 3rd, 2015Sebastian Hellmann
 
A social computing strategy for an enterprise content ecosystem
A social computing strategy for an enterprise content ecosystemA social computing strategy for an enterprise content ecosystem
A social computing strategy for an enterprise content ecosystemGokul Alex
 
Changing how we think about content
Changing how we think about contentChanging how we think about content
Changing how we think about contentRob Hanna, ECMs
 
Building an effective sharepoint team
Building an effective sharepoint teamBuilding an effective sharepoint team
Building an effective sharepoint teamBaris Bruce Tuncertan
 
Jeroen Kleinhoven (Treparel), Turn Big Content into Business Insights - Data ...
Jeroen Kleinhoven (Treparel), Turn Big Content into Business Insights - Data ...Jeroen Kleinhoven (Treparel), Turn Big Content into Business Insights - Data ...
Jeroen Kleinhoven (Treparel), Turn Big Content into Business Insights - Data ...Cre-Aid
 
Making Web Content Agile
Making Web Content AgileMaking Web Content Agile
Making Web Content AgileScott Abel
 
Gilbane 2009 -- How Can Content Management Software Keep Pace?
Gilbane 2009 -- How Can Content Management Software Keep Pace?Gilbane 2009 -- How Can Content Management Software Keep Pace?
Gilbane 2009 -- How Can Content Management Software Keep Pace?weisinger
 
Jonathan Ralton - Trusting Your KM & ECM Strategy To SharePoint
Jonathan Ralton - Trusting Your KM & ECM Strategy To SharePointJonathan Ralton - Trusting Your KM & ECM Strategy To SharePoint
Jonathan Ralton - Trusting Your KM & ECM Strategy To SharePointARMA International
 
Engineering patterns for implementing data science models on big data platforms
Engineering patterns for implementing data science models on big data platformsEngineering patterns for implementing data science models on big data platforms
Engineering patterns for implementing data science models on big data platformsHisham Arafat
 
LoCloud - Local content in a Europeana cloud
LoCloud - Local content in a Europeana cloudLoCloud - Local content in a Europeana cloud
LoCloud - Local content in a Europeana cloudEuropeana
 
Migrating to Alfresco Part II: The “How” – Tools & Best Practices for Renovat...
Migrating to Alfresco Part II: The “How” – Tools & Best Practices for Renovat...Migrating to Alfresco Part II: The “How” – Tools & Best Practices for Renovat...
Migrating to Alfresco Part II: The “How” – Tools & Best Practices for Renovat...Zia Consulting
 

Similar to EDF2013: Selected talk by David Lewis: Linked Data Reuse in the Language Services Industry (20)

Active Curation of Bi-Text Resources in Commercial Localization Workflows
Active Curation of Bi-Text Resources in Commercial Localization WorkflowsActive Curation of Bi-Text Resources in Commercial Localization Workflows
Active Curation of Bi-Text Resources in Commercial Localization Workflows
 
The Standards Mosaic Opening the Way to New Technologies
The Standards Mosaic Opening the Way to New TechnologiesThe Standards Mosaic Opening the Way to New Technologies
The Standards Mosaic Opening the Way to New Technologies
 
Broadcast Digital Media Technology Trends
Broadcast Digital Media Technology TrendsBroadcast Digital Media Technology Trends
Broadcast Digital Media Technology Trends
 
Data Treatment MongoDB
Data Treatment MongoDBData Treatment MongoDB
Data Treatment MongoDB
 
Tackle your Documentation Challenges with the IXIASOFT DITA CMS
Tackle your Documentation Challenges with the IXIASOFT DITA CMSTackle your Documentation Challenges with the IXIASOFT DITA CMS
Tackle your Documentation Challenges with the IXIASOFT DITA CMS
 
FISL: Content Management Primer
FISL: Content Management PrimerFISL: Content Management Primer
FISL: Content Management Primer
 
Extend IBM Enterprise Content Management Solutions with Content Navigator
Extend IBM Enterprise Content Management Solutions with Content NavigatorExtend IBM Enterprise Content Management Solutions with Content Navigator
Extend IBM Enterprise Content Management Solutions with Content Navigator
 
Lider Reference Model ld4lt session March, 3rd, 2015
Lider Reference Model ld4lt session  March, 3rd, 2015Lider Reference Model ld4lt session  March, 3rd, 2015
Lider Reference Model ld4lt session March, 3rd, 2015
 
A social computing strategy for an enterprise content ecosystem
A social computing strategy for an enterprise content ecosystemA social computing strategy for an enterprise content ecosystem
A social computing strategy for an enterprise content ecosystem
 
Changing how we think about content
Changing how we think about contentChanging how we think about content
Changing how we think about content
 
Building an effective sharepoint team
Building an effective sharepoint teamBuilding an effective sharepoint team
Building an effective sharepoint team
 
Jeroen Kleinhoven (Treparel), Turn Big Content into Business Insights - Data ...
Jeroen Kleinhoven (Treparel), Turn Big Content into Business Insights - Data ...Jeroen Kleinhoven (Treparel), Turn Big Content into Business Insights - Data ...
Jeroen Kleinhoven (Treparel), Turn Big Content into Business Insights - Data ...
 
Making Web Content Agile
Making Web Content AgileMaking Web Content Agile
Making Web Content Agile
 
Gilbane 2009 -- How Can Content Management Software Keep Pace?
Gilbane 2009 -- How Can Content Management Software Keep Pace?Gilbane 2009 -- How Can Content Management Software Keep Pace?
Gilbane 2009 -- How Can Content Management Software Keep Pace?
 
NISO/DCMI Webinar: Metadata for Public Sector Administration
NISO/DCMI Webinar: Metadata for Public Sector AdministrationNISO/DCMI Webinar: Metadata for Public Sector Administration
NISO/DCMI Webinar: Metadata for Public Sector Administration
 
Jonathan Ralton - Trusting Your KM & ECM Strategy To SharePoint
Jonathan Ralton - Trusting Your KM & ECM Strategy To SharePointJonathan Ralton - Trusting Your KM & ECM Strategy To SharePoint
Jonathan Ralton - Trusting Your KM & ECM Strategy To SharePoint
 
Engineering patterns for implementing data science models on big data platforms
Engineering patterns for implementing data science models on big data platformsEngineering patterns for implementing data science models on big data platforms
Engineering patterns for implementing data science models on big data platforms
 
LoCloud - Local content in a Europeana cloud
LoCloud - Local content in a Europeana cloudLoCloud - Local content in a Europeana cloud
LoCloud - Local content in a Europeana cloud
 
SharePoint for Records Management
SharePoint for Records ManagementSharePoint for Records Management
SharePoint for Records Management
 
Migrating to Alfresco Part II: The “How” – Tools & Best Practices for Renovat...
Migrating to Alfresco Part II: The “How” – Tools & Best Practices for Renovat...Migrating to Alfresco Part II: The “How” – Tools & Best Practices for Renovat...
Migrating to Alfresco Part II: The “How” – Tools & Best Practices for Renovat...
 

More from European Data Forum

EDF2014: Ralf-Peter Schaefer, Head of Traffic Product Unit, TomTom, Germany: ...
EDF2014: Ralf-Peter Schaefer, Head of Traffic Product Unit, TomTom, Germany: ...EDF2014: Ralf-Peter Schaefer, Head of Traffic Product Unit, TomTom, Germany: ...
EDF2014: Ralf-Peter Schaefer, Head of Traffic Product Unit, TomTom, Germany: ...European Data Forum
 
EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...
EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...
EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...European Data Forum
 
EDF2014: BIG - NESSI Networking Session: Nuria de Lama, Representative to the...
EDF2014: BIG - NESSI Networking Session: Nuria de Lama, Representative to the...EDF2014: BIG - NESSI Networking Session: Nuria de Lama, Representative to the...
EDF2014: BIG - NESSI Networking Session: Nuria de Lama, Representative to the...European Data Forum
 
EDF2014: BIG - NESSI Networking Session: Intro Presentation
EDF2014: BIG - NESSI Networking Session: Intro PresentationEDF2014: BIG - NESSI Networking Session: Intro Presentation
EDF2014: BIG - NESSI Networking Session: Intro PresentationEuropean Data Forum
 
EDF2014: Kush Wadhwa, Senior Partner, Trilateral Research & Consulting: Addre...
EDF2014: Kush Wadhwa, Senior Partner, Trilateral Research & Consulting: Addre...EDF2014: Kush Wadhwa, Senior Partner, Trilateral Research & Consulting: Addre...
EDF2014: Kush Wadhwa, Senior Partner, Trilateral Research & Consulting: Addre...European Data Forum
 
EDF2014: Adrian Cristal, Barcelona Supercomputing Center, RETHINK big Project...
EDF2014: Adrian Cristal, Barcelona Supercomputing Center, RETHINK big Project...EDF2014: Adrian Cristal, Barcelona Supercomputing Center, RETHINK big Project...
EDF2014: Adrian Cristal, Barcelona Supercomputing Center, RETHINK big Project...European Data Forum
 
EDF2014: Dimitris Vassiliadis, Head of Unit, EXUS Innovation Attractor: From ...
EDF2014: Dimitris Vassiliadis, Head of Unit, EXUS Innovation Attractor: From ...EDF2014: Dimitris Vassiliadis, Head of Unit, EXUS Innovation Attractor: From ...
EDF2014: Dimitris Vassiliadis, Head of Unit, EXUS Innovation Attractor: From ...European Data Forum
 
EDF2014: Rüdiger Eichin, Research Manager at SAP AG, Germany: Deriving Value ...
EDF2014: Rüdiger Eichin, Research Manager at SAP AG, Germany: Deriving Value ...EDF2014: Rüdiger Eichin, Research Manager at SAP AG, Germany: Deriving Value ...
EDF2014: Rüdiger Eichin, Research Manager at SAP AG, Germany: Deriving Value ...European Data Forum
 
EDF2014: Paul Groth, Department of Computer Science & The Network Institute, ...
EDF2014: Paul Groth, Department of Computer Science & The Network Institute, ...EDF2014: Paul Groth, Department of Computer Science & The Network Institute, ...
EDF2014: Paul Groth, Department of Computer Science & The Network Institute, ...European Data Forum
 
EDF2014: Christian Lindemann, Wolters Kluwer Germany & Christian Dirschl, Wol...
EDF2014: Christian Lindemann, Wolters Kluwer Germany & Christian Dirschl, Wol...EDF2014: Christian Lindemann, Wolters Kluwer Germany & Christian Dirschl, Wol...
EDF2014: Christian Lindemann, Wolters Kluwer Germany & Christian Dirschl, Wol...European Data Forum
 
EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...
EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...
EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...European Data Forum
 
EDF2014: Stefan Wrobel, Institute Director, Fraunhofer IAIS / Member of the b...
EDF2014: Stefan Wrobel, Institute Director, Fraunhofer IAIS / Member of the b...EDF2014: Stefan Wrobel, Institute Director, Fraunhofer IAIS / Member of the b...
EDF2014: Stefan Wrobel, Institute Director, Fraunhofer IAIS / Member of the b...European Data Forum
 
EDF2014: Michele Vescovi, Researcher, Semantic & Knowledge Innovation Lab, It...
EDF2014: Michele Vescovi, Researcher, Semantic & Knowledge Innovation Lab, It...EDF2014: Michele Vescovi, Researcher, Semantic & Knowledge Innovation Lab, It...
EDF2014: Michele Vescovi, Researcher, Semantic & Knowledge Innovation Lab, It...European Data Forum
 
EDF2014: Allan Hanbury, Senior Researcher, Vienna University of Technology, A...
EDF2014: Allan Hanbury, Senior Researcher, Vienna University of Technology, A...EDF2014: Allan Hanbury, Senior Researcher, Vienna University of Technology, A...
EDF2014: Allan Hanbury, Senior Researcher, Vienna University of Technology, A...European Data Forum
 
EDF2014: Nikolaos Loutas, Manager at PwC Belgium, Business Models for Linked ...
EDF2014: Nikolaos Loutas, Manager at PwC Belgium, Business Models for Linked ...EDF2014: Nikolaos Loutas, Manager at PwC Belgium, Business Models for Linked ...
EDF2014: Nikolaos Loutas, Manager at PwC Belgium, Business Models for Linked ...European Data Forum
 
EDF2014: Vedran Sabol, Head of the Knowledge Visualisation Area, Know-Center,...
EDF2014: Vedran Sabol, Head of the Knowledge Visualisation Area, Know-Center,...EDF2014: Vedran Sabol, Head of the Knowledge Visualisation Area, Know-Center,...
EDF2014: Vedran Sabol, Head of the Knowledge Visualisation Area, Know-Center,...European Data Forum
 
EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universid...
EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universid...EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universid...
EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universid...European Data Forum
 
EDF2014: Piek Vossen, Professor Computational Lexicology, VU University Amste...
EDF2014: Piek Vossen, Professor Computational Lexicology, VU University Amste...EDF2014: Piek Vossen, Professor Computational Lexicology, VU University Amste...
EDF2014: Piek Vossen, Professor Computational Lexicology, VU University Amste...European Data Forum
 
EDF2014: Taru Rastas, Senior Advisor, Ministry of Communications of Finland: ...
EDF2014: Taru Rastas, Senior Advisor, Ministry of Communications of Finland: ...EDF2014: Taru Rastas, Senior Advisor, Ministry of Communications of Finland: ...
EDF2014: Taru Rastas, Senior Advisor, Ministry of Communications of Finland: ...European Data Forum
 

More from European Data Forum (20)

EDF2014: Ralf-Peter Schaefer, Head of Traffic Product Unit, TomTom, Germany: ...
EDF2014: Ralf-Peter Schaefer, Head of Traffic Product Unit, TomTom, Germany: ...EDF2014: Ralf-Peter Schaefer, Head of Traffic Product Unit, TomTom, Germany: ...
EDF2014: Ralf-Peter Schaefer, Head of Traffic Product Unit, TomTom, Germany: ...
 
Barbato leit ict 15-16-17
Barbato leit ict 15-16-17Barbato leit ict 15-16-17
Barbato leit ict 15-16-17
 
EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...
EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...
EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...
 
EDF2014: BIG - NESSI Networking Session: Nuria de Lama, Representative to the...
EDF2014: BIG - NESSI Networking Session: Nuria de Lama, Representative to the...EDF2014: BIG - NESSI Networking Session: Nuria de Lama, Representative to the...
EDF2014: BIG - NESSI Networking Session: Nuria de Lama, Representative to the...
 
EDF2014: BIG - NESSI Networking Session: Intro Presentation
EDF2014: BIG - NESSI Networking Session: Intro PresentationEDF2014: BIG - NESSI Networking Session: Intro Presentation
EDF2014: BIG - NESSI Networking Session: Intro Presentation
 
EDF2014: Kush Wadhwa, Senior Partner, Trilateral Research & Consulting: Addre...
EDF2014: Kush Wadhwa, Senior Partner, Trilateral Research & Consulting: Addre...EDF2014: Kush Wadhwa, Senior Partner, Trilateral Research & Consulting: Addre...
EDF2014: Kush Wadhwa, Senior Partner, Trilateral Research & Consulting: Addre...
 
EDF2014: Adrian Cristal, Barcelona Supercomputing Center, RETHINK big Project...
EDF2014: Adrian Cristal, Barcelona Supercomputing Center, RETHINK big Project...EDF2014: Adrian Cristal, Barcelona Supercomputing Center, RETHINK big Project...
EDF2014: Adrian Cristal, Barcelona Supercomputing Center, RETHINK big Project...
 
EDF2014: Dimitris Vassiliadis, Head of Unit, EXUS Innovation Attractor: From ...
EDF2014: Dimitris Vassiliadis, Head of Unit, EXUS Innovation Attractor: From ...EDF2014: Dimitris Vassiliadis, Head of Unit, EXUS Innovation Attractor: From ...
EDF2014: Dimitris Vassiliadis, Head of Unit, EXUS Innovation Attractor: From ...
 
EDF2014: Rüdiger Eichin, Research Manager at SAP AG, Germany: Deriving Value ...
EDF2014: Rüdiger Eichin, Research Manager at SAP AG, Germany: Deriving Value ...EDF2014: Rüdiger Eichin, Research Manager at SAP AG, Germany: Deriving Value ...
EDF2014: Rüdiger Eichin, Research Manager at SAP AG, Germany: Deriving Value ...
 
EDF2014: Paul Groth, Department of Computer Science & The Network Institute, ...
EDF2014: Paul Groth, Department of Computer Science & The Network Institute, ...EDF2014: Paul Groth, Department of Computer Science & The Network Institute, ...
EDF2014: Paul Groth, Department of Computer Science & The Network Institute, ...
 
EDF2014: Christian Lindemann, Wolters Kluwer Germany & Christian Dirschl, Wol...
EDF2014: Christian Lindemann, Wolters Kluwer Germany & Christian Dirschl, Wol...EDF2014: Christian Lindemann, Wolters Kluwer Germany & Christian Dirschl, Wol...
EDF2014: Christian Lindemann, Wolters Kluwer Germany & Christian Dirschl, Wol...
 
EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...
EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...
EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...
 
EDF2014: Stefan Wrobel, Institute Director, Fraunhofer IAIS / Member of the b...
EDF2014: Stefan Wrobel, Institute Director, Fraunhofer IAIS / Member of the b...EDF2014: Stefan Wrobel, Institute Director, Fraunhofer IAIS / Member of the b...
EDF2014: Stefan Wrobel, Institute Director, Fraunhofer IAIS / Member of the b...
 
EDF2014: Michele Vescovi, Researcher, Semantic & Knowledge Innovation Lab, It...
EDF2014: Michele Vescovi, Researcher, Semantic & Knowledge Innovation Lab, It...EDF2014: Michele Vescovi, Researcher, Semantic & Knowledge Innovation Lab, It...
EDF2014: Michele Vescovi, Researcher, Semantic & Knowledge Innovation Lab, It...
 
EDF2014: Allan Hanbury, Senior Researcher, Vienna University of Technology, A...
EDF2014: Allan Hanbury, Senior Researcher, Vienna University of Technology, A...EDF2014: Allan Hanbury, Senior Researcher, Vienna University of Technology, A...
EDF2014: Allan Hanbury, Senior Researcher, Vienna University of Technology, A...
 
EDF2014: Nikolaos Loutas, Manager at PwC Belgium, Business Models for Linked ...
EDF2014: Nikolaos Loutas, Manager at PwC Belgium, Business Models for Linked ...EDF2014: Nikolaos Loutas, Manager at PwC Belgium, Business Models for Linked ...
EDF2014: Nikolaos Loutas, Manager at PwC Belgium, Business Models for Linked ...
 
EDF2014: Vedran Sabol, Head of the Knowledge Visualisation Area, Know-Center,...
EDF2014: Vedran Sabol, Head of the Knowledge Visualisation Area, Know-Center,...EDF2014: Vedran Sabol, Head of the Knowledge Visualisation Area, Know-Center,...
EDF2014: Vedran Sabol, Head of the Knowledge Visualisation Area, Know-Center,...
 
EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universid...
EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universid...EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universid...
EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universid...
 
EDF2014: Piek Vossen, Professor Computational Lexicology, VU University Amste...
EDF2014: Piek Vossen, Professor Computational Lexicology, VU University Amste...EDF2014: Piek Vossen, Professor Computational Lexicology, VU University Amste...
EDF2014: Piek Vossen, Professor Computational Lexicology, VU University Amste...
 
EDF2014: Taru Rastas, Senior Advisor, Ministry of Communications of Finland: ...
EDF2014: Taru Rastas, Senior Advisor, Ministry of Communications of Finland: ...EDF2014: Taru Rastas, Senior Advisor, Ministry of Communications of Finland: ...
EDF2014: Taru Rastas, Senior Advisor, Ministry of Communications of Finland: ...
 

Recently uploaded

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 

Recently uploaded (20)

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 

EDF2013: Selected talk by David Lewis: Linked Data Reuse in the Language Services Industry

  • 1. www.cngl.ie Linked Data Reuse in the Language Services Industry David Lewis, Leroy Finn, Dominic Jones
  • 2. www.cngl.ie Language & Localisation: Meta-Data and Interoperability Multilingual Web Language Resource Community Language Technology Community Localisation Industry Translations + Term €
  • 3. www.cngl.ie Content Management-Localisation Workflow & Interoperability • Interoperability roadblocks in high variety: – Content Management Systems – Content formats • Increasingly dynamic • Add language resource curation – Data driven MT & text analytics Create Content Publish Content Client Language Service Provider Extract & Segment translators QA Translate Content CMS CMS/ CVS editors Trans+Term Reuse CAT QA tools Web CMS design Trans/term Mgmt MT
  • 4. www.cngl.ie Internationalisation Tag Set (ITS) • Allows Localisation tools to be instructed to treat specific text in specific ways • Annotates text in XML and HTML5 documents • RDF Mapping – NLP Interchange Format • Principles: – Minimise disturbance of original content – Don’t reinvent wheel – Link to existing meta-data before adding new • Defined distinct, independent Data Categories • ITS2.0 is “Dublin Core” of the Multilingual Web
  • 5. www.cngl.ie C. Lieske, F. Sasaki 2010 • Here Content is the Data • So we must annotate textual content • RDFa and microdata better for annotating elements
  • 6. www.cngl.ie Process Monitoring in RDF  Better up-front and real-time business decision support: – Use historic and aggregated metadata for inference – Content carries its own “audit trail”  Metadata can be transformed to linked semantic data allowing: – Inferential queries across data siloes – New forms of data visualization – Extensible data relationships
  • 7. www.cngl.ie Provenance Data: RDF-based logging • W3C Provenance WG • http://www.w3.org/2011/prov/ From: http://www.w3.org/TR/prov-primer/ ITS related entity subclass: document, segment, analysed-text, term, translation, translation-revision subproperty: wasTranslatedFrom
  • 8. www.cngl.ie CMS to Localisation Workflow Roundtrip Content Management Localisation Preparation Translation Management Source CMS Target CMS RDF provenance store Named Entity Recogniser –Enrycher Web- based Postediter MT - Matrex CAT XLIFF store Parse, filter, segment ITS +XLIFF XLIFF/ PROV-O Workflow Management QA viewer MT - Bing MT – M4LOC ITS +HTML5 +CMIS ITS +XLIFF ITS +SPARQL LKR
  • 10. www.cngl.ie Active Curation • Localisation Industry: – €33bn global industry 12% growth – Established commercial models for translation and term reuse – High interoperability costs – Long tail industry – 99% SMEs – SMEs struggle to benefit from leveraging reuse – Challenged to adapt machine translation and text analytics technology • Active Curation: On-demand, quality-driven assembly of training corpora – Collect human linguistic judgements from across the workflow – Professional translator/reviewer, crowds, end user – Already shown 25% MT improvement via in-job retraining • Linked Language and Localisation Data – Extending existing tools to generate process provenance data – Pooling/Bokering of data across value chains
  • 11. www.cngl.ie Where Next? • Help EC and other public bodies to bootstrap data – per year EC produces 11 million pages in 22 language => 14 billion triples • Attribution/license annotation and access control – Controlled Linked Data • Increase industrial engagement – Media localisation – audio, video, images – Non-localisation MT and TA applications – Multilingual content personalisation – YOUR USE CASE HERE • Join new W3C Community Group: Multilingual Linked Open Data http://www.w3.org/community/bpmlod/

Editor's Notes

  1. Language Resource CurationRelies on specific funded actions Episodic funding, sustainability and coverage concernsLOD promises:Improved sharing and annotation of existing resourcesSome opportunistic curation, e.g. mining DbpediaCan there be better synergy with commercial language services?Use case: Localisation workflows