SlideShare a Scribd company logo
1 of 10
Download to read offline
Open Data Management for
Public Automated Translation
Dave Lewis
CNGL at Trinity College Dublin
•  Open Data on the Web: W3C Semantic Web
standards allow data to be published on Web
–  Fine-grained URI-based inter-linking
–  Extensible meta-data
–  Standard Query APIs
•  Enables a Localization Web
–  Terms and translations become linkable resources
–  Meta-data from L10n workflows adds value
–  Leverage in training Machine Translation and Text
Analytics
The Localization Web
The Localization Web = Decentralised Annotated
Global Translation Memory and Term Base
Linguistic Linked Data
Red
Phonetic form
Form
number
singular
[RED]
Form
plural
[REDES]
Phonetic form
number
Red
Sense
written form
“red”
Sense
written form
“malla”
equivalent
Red
image
Red
Sense Sense
translation
es - en
written form
“red” “network”
written form
Red
written form
Form
gender
femenine
“red”
Use Case: BabelNet.org
Words as Resources on the Web
Barak Obama if the
44th president of the
United State of
America. He was
first elected in 2009.
Barak Obama si el 44 º
presidente de los
Estados Unidos de
América. Ha fue electo
primera vez en 2009.
http:// www.ex.org/obama_en.html
http:// www.ex.org/obama_es.html
The Web of Content The Localization Web
http://data.ex.org/String_0001
http:// data.ex.org/String_0002
Derived
From
Derived
From
Text: “Barak Obama
if the 44th president
of the United State of
America.”
Lang:en
Text:“Barak Obama si
el 44 º presidente de
los Estados Unidos de
América.”
Lang:es
TranslatedBy:Google
Translate
Translated
From
Translation Data
Term: “United State
of America.”
Lang:en
Term:“Estados
Unidos de América.”
Lang:es
Translation
Of
http:// babelnet.org/345621
http:// babelnet.org/57835
Terminology Data
Topic: Barak Obama
Lang: en
BirthDate: 1961-08-04
Spouse: Michelle Obama
Residence: White House
http:// Dbpedia.org/Page/
Barak_Obama
Encyclopaedic Data
Data Management Lifecycles
Publish
Correct
& refine
Lex-
concept
lifecycleCorrect
& refine
Discover
& use
Discover &
use
Correct
& refine
Bitext
lifecycle
Discover
data
(Re)train
-MT
Revise and
annotate
Publish
Content
lifecycle
Publish
I18n &
source QA
Trans
QA
Post-
edit
Automated
translation
Consume Create
•  Assert ownership and attribution,
licensing, access control?
•  Persistent URLs?
•  Open royalty free standards?
•  Indexing is key, federated vs aggregated
data?
•  Third party submission of errors, QA,
corrections? Publish actions taken on
submissions?
Public Automated Translation:
Data Management Needs?
•  Query bitext on:
–  languages, terms, MT engine, MT confidence, QA,
translator qualifications, postedit characteristics?
•  Query lexical-conceptual resources on:
–  language, domains, context, semantic relations,
provenance of lexical/conceptual data?
•  Localisation Web API:
–  HTTP content negotiation (Unicode extensions for
translation?),
–  Resource Format: RDF, TMX, TBX, RDF, JSON-LD,
–  SPARQL end points?
Public Automated Translation:
Data Access Needs?
Liaisons and Consensus Building
ITS2.0	
  
XLIFF	
  
Content	
  Processing	
  	
  (DOM)	
   Linked	
  Data	
  Processing	
  (RDF)	
  
MQM	
  
ITS2.0	
  
Ontology	
  
PROV-­‐O/Global	
  
Intelligent	
  
Content	
  
LinguisHc	
  Linked	
  Data	
  
OCELOT	
  
Use	
  Cases:	
  
Content	
  AnalyHcs;	
  
Corpora	
  curaHon;	
  	
  
Content	
  
enrichment;	
  
Human-­‐MT	
  quality;	
  
Your	
  use	
  case	
  
	
  
Linked	
  Data	
  for	
  
Language	
  Technology	
  
More Information
•  Contact: dave.lewis@cs.tcd.ie
•  https://www.w3.org/International/its/wiki/
Open_Data_Management_for_Public_Automated_Translation_Services
•  http://www.falcon-project.eu
•  See also:
–  Linked Data for Language Technology (LD4LT) W3C
Community Group
–  http://www.w3.org/community/ld4lt/

More Related Content

What's hot

ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...eswcsummerschool
 
Sparql a simple knowledge query
Sparql  a simple knowledge querySparql  a simple knowledge query
Sparql a simple knowledge queryStanley Wang
 
A review of the state of the art in Machine Learning on the Semantic Web
A review of the state of the art in Machine Learning on the Semantic WebA review of the state of the art in Machine Learning on the Semantic Web
A review of the state of the art in Machine Learning on the Semantic WebSimon Price
 
Linked data and the future of libraries
Linked data and the future of librariesLinked data and the future of libraries
Linked data and the future of librariesRegan Harper
 
Linked data MLA 2015
Linked data MLA 2015Linked data MLA 2015
Linked data MLA 2015Cason Snow
 
Linked Data MLA 2015
Linked Data MLA 2015Linked Data MLA 2015
Linked Data MLA 2015Cason Snow
 
Consuming Linked Data by Humans - WWW2010
Consuming Linked Data by Humans - WWW2010Consuming Linked Data by Humans - WWW2010
Consuming Linked Data by Humans - WWW2010Juan Sequeda
 
Metadata Training for Staff and Librarians for the New Data Environment
Metadata Training for Staff and Librarians for the New Data EnvironmentMetadata Training for Staff and Librarians for the New Data Environment
Metadata Training for Staff and Librarians for the New Data EnvironmentDiane Hillmann
 
Linked Open Data and Digital Curation (Islandora)
Linked Open Data and Digital Curation (Islandora)Linked Open Data and Digital Curation (Islandora)
Linked Open Data and Digital Curation (Islandora)Hong (Jenny) Jing
 
Linked Data for Czech Legislation
Linked Data for Czech LegislationLinked Data for Czech Legislation
Linked Data for Czech LegislationMartin Necasky
 
Islandora and Linked Open Data
Islandora and Linked Open Data Islandora and Linked Open Data
Islandora and Linked Open Data eohallor
 

What's hot (20)

ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
 
Sparql a simple knowledge query
Sparql  a simple knowledge querySparql  a simple knowledge query
Sparql a simple knowledge query
 
McDanold-1-jun15
McDanold-1-jun15McDanold-1-jun15
McDanold-1-jun15
 
NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...
NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...
NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...
 
A review of the state of the art in Machine Learning on the Semantic Web
A review of the state of the art in Machine Learning on the Semantic WebA review of the state of the art in Machine Learning on the Semantic Web
A review of the state of the art in Machine Learning on the Semantic Web
 
Semantic Web in Action
Semantic Web in ActionSemantic Web in Action
Semantic Web in Action
 
Linked data and the future of libraries
Linked data and the future of librariesLinked data and the future of libraries
Linked data and the future of libraries
 
Assessing the performance of RDF Engines: Discussing RDF Benchmarks
Assessing the performance of RDF Engines: Discussing RDF Benchmarks Assessing the performance of RDF Engines: Discussing RDF Benchmarks
Assessing the performance of RDF Engines: Discussing RDF Benchmarks
 
Linked data MLA 2015
Linked data MLA 2015Linked data MLA 2015
Linked data MLA 2015
 
Linked Data MLA 2015
Linked Data MLA 2015Linked Data MLA 2015
Linked Data MLA 2015
 
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early AdoptersApril 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
 
April 24, 2013 NISO/DCMI Webinar: Deployment of RDA (Resource Description and...
April 24, 2013 NISO/DCMI Webinar: Deployment of RDA (Resource Description and...April 24, 2013 NISO/DCMI Webinar: Deployment of RDA (Resource Description and...
April 24, 2013 NISO/DCMI Webinar: Deployment of RDA (Resource Description and...
 
Consuming Linked Data by Humans - WWW2010
Consuming Linked Data by Humans - WWW2010Consuming Linked Data by Humans - WWW2010
Consuming Linked Data by Humans - WWW2010
 
Metadata Training for Staff and Librarians for the New Data Environment
Metadata Training for Staff and Librarians for the New Data EnvironmentMetadata Training for Staff and Librarians for the New Data Environment
Metadata Training for Staff and Librarians for the New Data Environment
 
Linked Open Data and Digital Curation (Islandora)
Linked Open Data and Digital Curation (Islandora)Linked Open Data and Digital Curation (Islandora)
Linked Open Data and Digital Curation (Islandora)
 
Snac webinar v3
Snac webinar v3Snac webinar v3
Snac webinar v3
 
Linked Data for Czech Legislation
Linked Data for Czech LegislationLinked Data for Czech Legislation
Linked Data for Czech Legislation
 
NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti...
 NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti... NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti...
NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti...
 
Islandora and Linked Open Data
Islandora and Linked Open Data Islandora and Linked Open Data
Islandora and Linked Open Data
 
Linked library data
Linked library dataLinked library data
Linked library data
 

Viewers also liked

Cara mengutip (TPKI)
Cara mengutip (TPKI)Cara mengutip (TPKI)
Cara mengutip (TPKI)Unnes
 
Simon cronshaw, remix europeana for creative industries talk
Simon cronshaw, remix   europeana for creative industries talkSimon cronshaw, remix   europeana for creative industries talk
Simon cronshaw, remix europeana for creative industries talkEuropeana
 
Land Deals in India
Land Deals in IndiaLand Deals in India
Land Deals in IndiaHitesh Dave
 
Copy of using cc images
Copy of using cc imagesCopy of using cc images
Copy of using cc imagesRachel Printy
 
Measuring and Reporting the Impact of Learning and Development
Measuring and Reporting the Impact of Learning and DevelopmentMeasuring and Reporting the Impact of Learning and Development
Measuring and Reporting the Impact of Learning and DevelopmentCon Sotidis
 
MARILYNS RESUME (3) (1) (2)
MARILYNS RESUME (3) (1) (2)MARILYNS RESUME (3) (1) (2)
MARILYNS RESUME (3) (1) (2)marilyn rivera
 
Top 10 slide tips
Top 10 slide tipsTop 10 slide tips
Top 10 slide tipsmsaldana19
 
Electrical electronic aptitude test 2
Electrical electronic aptitude  test 2 Electrical electronic aptitude  test 2
Electrical electronic aptitude test 2 Kj_ss
 
Achievement test
Achievement testAchievement test
Achievement testmaneeshakm
 

Viewers also liked (14)

BHUDEV SHARMA
BHUDEV SHARMABHUDEV SHARMA
BHUDEV SHARMA
 
Cara mengutip (TPKI)
Cara mengutip (TPKI)Cara mengutip (TPKI)
Cara mengutip (TPKI)
 
2010 01-16-테라스 사이트 구축 제안서-버전01
2010 01-16-테라스 사이트 구축 제안서-버전012010 01-16-테라스 사이트 구축 제안서-버전01
2010 01-16-테라스 사이트 구축 제안서-버전01
 
Simon cronshaw, remix europeana for creative industries talk
Simon cronshaw, remix   europeana for creative industries talkSimon cronshaw, remix   europeana for creative industries talk
Simon cronshaw, remix europeana for creative industries talk
 
AffleNews_Final(8sep)
AffleNews_Final(8sep)AffleNews_Final(8sep)
AffleNews_Final(8sep)
 
Land Deals in India
Land Deals in IndiaLand Deals in India
Land Deals in India
 
Copy of using cc images
Copy of using cc imagesCopy of using cc images
Copy of using cc images
 
Measuring and Reporting the Impact of Learning and Development
Measuring and Reporting the Impact of Learning and DevelopmentMeasuring and Reporting the Impact of Learning and Development
Measuring and Reporting the Impact of Learning and Development
 
MARILYNS RESUME (3) (1) (2)
MARILYNS RESUME (3) (1) (2)MARILYNS RESUME (3) (1) (2)
MARILYNS RESUME (3) (1) (2)
 
Seo tutorial
Seo tutorialSeo tutorial
Seo tutorial
 
Resume
ResumeResume
Resume
 
Top 10 slide tips
Top 10 slide tipsTop 10 slide tips
Top 10 slide tips
 
Electrical electronic aptitude test 2
Electrical electronic aptitude  test 2 Electrical electronic aptitude  test 2
Electrical electronic aptitude test 2
 
Achievement test
Achievement testAchievement test
Achievement test
 

Similar to Open Data Management for Public Automated Translation

Publishing data on the Semantic Web
Publishing data on the Semantic WebPublishing data on the Semantic Web
Publishing data on the Semantic WebPeter Mika
 
Linked data for Enterprise Data Integration
Linked data for Enterprise Data IntegrationLinked data for Enterprise Data Integration
Linked data for Enterprise Data IntegrationSören Auer
 
Paul houle resume
Paul houle resumePaul houle resume
Paul houle resumePaul Houle
 
OpenLink Virtuoso - Management & Decision Makers Overview
OpenLink Virtuoso - Management & Decision Makers OverviewOpenLink Virtuoso - Management & Decision Makers Overview
OpenLink Virtuoso - Management & Decision Makers OverviewKingsley Uyi Idehen
 
PoolParty SKOS and Linked Data
PoolParty SKOS and Linked DataPoolParty SKOS and Linked Data
PoolParty SKOS and Linked DataAndreas Blumauer
 
Active Curation of Bi-Text Resources in Commercial Localization Workflows
Active Curation of Bi-Text Resources in Commercial Localization WorkflowsActive Curation of Bi-Text Resources in Commercial Localization Workflows
Active Curation of Bi-Text Resources in Commercial Localization Workflows Dave Lewis
 
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...cmitch41
 
Linked Data Tutorial
Linked Data TutorialLinked Data Tutorial
Linked Data TutorialSören Auer
 
Schema.org: Where did that come from!
Schema.org: Where did that come from!Schema.org: Where did that come from!
Schema.org: Where did that come from!Richard Wallis
 
Usage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosUsage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosEUCLID project
 
Contextual Computing: Laying a Global Data Foundation
Contextual Computing: Laying a Global Data FoundationContextual Computing: Laying a Global Data Foundation
Contextual Computing: Laying a Global Data FoundationRichard Wallis
 
What happened to the Semantic Web?
What happened to the Semantic Web?What happened to the Semantic Web?
What happened to the Semantic Web?Peter Mika
 
Web 3 Mark Greaves
Web 3 Mark GreavesWeb 3 Mark Greaves
Web 3 Mark GreavesMediabistro
 
Repository for data crawled from multiple social networks
Repository for data crawled from multiple social networksRepository for data crawled from multiple social networks
Repository for data crawled from multiple social networksConstantinos Christofilos
 
Contextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of EntitiesContextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of EntitiesRichard Wallis
 
Semantic web technology
Semantic web technologySemantic web technology
Semantic web technologyStanley Wang
 

Similar to Open Data Management for Public Automated Translation (20)

Publishing data on the Semantic Web
Publishing data on the Semantic WebPublishing data on the Semantic Web
Publishing data on the Semantic Web
 
Linked (Open) Data
Linked (Open) DataLinked (Open) Data
Linked (Open) Data
 
Linked data for Enterprise Data Integration
Linked data for Enterprise Data IntegrationLinked data for Enterprise Data Integration
Linked data for Enterprise Data Integration
 
Paul houle resume
Paul houle resumePaul houle resume
Paul houle resume
 
OpenLink Virtuoso - Management & Decision Makers Overview
OpenLink Virtuoso - Management & Decision Makers OverviewOpenLink Virtuoso - Management & Decision Makers Overview
OpenLink Virtuoso - Management & Decision Makers Overview
 
PoolParty SKOS and Linked Data
PoolParty SKOS and Linked DataPoolParty SKOS and Linked Data
PoolParty SKOS and Linked Data
 
NISO Webinar: Back From the Endangered List: Using Authority Data to Enhance ...
NISO Webinar: Back From the Endangered List: Using Authority Data to Enhance ...NISO Webinar: Back From the Endangered List: Using Authority Data to Enhance ...
NISO Webinar: Back From the Endangered List: Using Authority Data to Enhance ...
 
Active Curation of Bi-Text Resources in Commercial Localization Workflows
Active Curation of Bi-Text Resources in Commercial Localization WorkflowsActive Curation of Bi-Text Resources in Commercial Localization Workflows
Active Curation of Bi-Text Resources in Commercial Localization Workflows
 
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...
 
Linked Data Tutorial
Linked Data TutorialLinked Data Tutorial
Linked Data Tutorial
 
Quick Introduction to the Semantic Web, RDFa & Microformats
Quick Introduction to the Semantic Web, RDFa & MicroformatsQuick Introduction to the Semantic Web, RDFa & Microformats
Quick Introduction to the Semantic Web, RDFa & Microformats
 
Schema.org: Where did that come from!
Schema.org: Where did that come from!Schema.org: Where did that come from!
Schema.org: Where did that come from!
 
Usage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosUsage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application Scenarios
 
Contextual Computing: Laying a Global Data Foundation
Contextual Computing: Laying a Global Data FoundationContextual Computing: Laying a Global Data Foundation
Contextual Computing: Laying a Global Data Foundation
 
What happened to the Semantic Web?
What happened to the Semantic Web?What happened to the Semantic Web?
What happened to the Semantic Web?
 
Web 3 Mark Greaves
Web 3 Mark GreavesWeb 3 Mark Greaves
Web 3 Mark Greaves
 
Repository for data crawled from multiple social networks
Repository for data crawled from multiple social networksRepository for data crawled from multiple social networks
Repository for data crawled from multiple social networks
 
Contextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of EntitiesContextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of Entities
 
Extending Schema.org
Extending Schema.orgExtending Schema.org
Extending Schema.org
 
Semantic web technology
Semantic web technologySemantic web technology
Semantic web technology
 

Recently uploaded

CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 

Recently uploaded (20)

CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 

Open Data Management for Public Automated Translation

  • 1. Open Data Management for Public Automated Translation Dave Lewis CNGL at Trinity College Dublin
  • 2. •  Open Data on the Web: W3C Semantic Web standards allow data to be published on Web –  Fine-grained URI-based inter-linking –  Extensible meta-data –  Standard Query APIs •  Enables a Localization Web –  Terms and translations become linkable resources –  Meta-data from L10n workflows adds value –  Leverage in training Machine Translation and Text Analytics The Localization Web The Localization Web = Decentralised Annotated Global Translation Memory and Term Base
  • 3. Linguistic Linked Data Red Phonetic form Form number singular [RED] Form plural [REDES] Phonetic form number Red Sense written form “red” Sense written form “malla” equivalent Red image Red Sense Sense translation es - en written form “red” “network” written form Red written form Form gender femenine “red”
  • 5. Words as Resources on the Web Barak Obama if the 44th president of the United State of America. He was first elected in 2009. Barak Obama si el 44 º presidente de los Estados Unidos de América. Ha fue electo primera vez en 2009. http:// www.ex.org/obama_en.html http:// www.ex.org/obama_es.html The Web of Content The Localization Web http://data.ex.org/String_0001 http:// data.ex.org/String_0002 Derived From Derived From Text: “Barak Obama if the 44th president of the United State of America.” Lang:en Text:“Barak Obama si el 44 º presidente de los Estados Unidos de América.” Lang:es TranslatedBy:Google Translate Translated From Translation Data Term: “United State of America.” Lang:en Term:“Estados Unidos de América.” Lang:es Translation Of http:// babelnet.org/345621 http:// babelnet.org/57835 Terminology Data Topic: Barak Obama Lang: en BirthDate: 1961-08-04 Spouse: Michelle Obama Residence: White House http:// Dbpedia.org/Page/ Barak_Obama Encyclopaedic Data
  • 6. Data Management Lifecycles Publish Correct & refine Lex- concept lifecycleCorrect & refine Discover & use Discover & use Correct & refine Bitext lifecycle Discover data (Re)train -MT Revise and annotate Publish Content lifecycle Publish I18n & source QA Trans QA Post- edit Automated translation Consume Create
  • 7. •  Assert ownership and attribution, licensing, access control? •  Persistent URLs? •  Open royalty free standards? •  Indexing is key, federated vs aggregated data? •  Third party submission of errors, QA, corrections? Publish actions taken on submissions? Public Automated Translation: Data Management Needs?
  • 8. •  Query bitext on: –  languages, terms, MT engine, MT confidence, QA, translator qualifications, postedit characteristics? •  Query lexical-conceptual resources on: –  language, domains, context, semantic relations, provenance of lexical/conceptual data? •  Localisation Web API: –  HTTP content negotiation (Unicode extensions for translation?), –  Resource Format: RDF, TMX, TBX, RDF, JSON-LD, –  SPARQL end points? Public Automated Translation: Data Access Needs?
  • 9. Liaisons and Consensus Building ITS2.0   XLIFF   Content  Processing    (DOM)   Linked  Data  Processing  (RDF)   MQM   ITS2.0   Ontology   PROV-­‐O/Global   Intelligent   Content   LinguisHc  Linked  Data   OCELOT   Use  Cases:   Content  AnalyHcs;   Corpora  curaHon;     Content   enrichment;   Human-­‐MT  quality;   Your  use  case     Linked  Data  for   Language  Technology  
  • 10. More Information •  Contact: dave.lewis@cs.tcd.ie •  https://www.w3.org/International/its/wiki/ Open_Data_Management_for_Public_Automated_Translation_Services •  http://www.falcon-project.eu •  See also: –  Linked Data for Language Technology (LD4LT) W3C Community Group –  http://www.w3.org/community/ld4lt/