TaaS Workshop 2014, Active Terminology Prompting for SEO and Website Translation, Ioannis Iakovidis, Interverbum Technology

TAUS - The Language Data Network
TAUS - The Language Data NetworkTAUS - The Language Data Network
Wednesday,	
  4	
  June	
  /09:40	
  –	
  10:10	
  
	
  
Ac5ve	
  Terminology	
  Promp5ng	
  for	
  SEO	
  and	
  Website	
  
Transla5on	
  
Ioannis	
  Iakovidis,	
  Interverbum	
  Technology	
  
TaaS	
  Workshop	
  2014	
  
4	
  June,	
  Dublin	
  (Ireland)	
  
The	
  research	
  within	
  the	
  project	
  TaaS	
  leading	
  to	
  these	
  results	
  has	
  received	
  funding	
  from	
  the	
  European	
  Union	
  Seventh	
  Framework	
  Programme	
  (FP7/2007-­‐2013),	
  grant	
  agreement	
  no	
  296312	
  
Federated Active Linguistic data CuratiON	
  
Ioannis Iakovidis, MD, Interverbum Technology
FALCON
In brief…
§  Interverbum Technology est. 1999
§  Today more than 80,000 licensed TermWeb
users
§  Some clients:
Bauer, Euroscript, Arancho, FujiXerox,
Medtronic, Novell, SAS Institute, Scania, SDI
Media, SIS (Swedish Standards Institute),
Xerox, VMware, Salesforce.com, Agilent, The
World Bank
§  Clients served from five offices in Europe, North
America and Asia
Terminology Management for Global Marketing
Web-based Open
architecture
Easy
administration
Visuals and
sounds
TermWeb 3.11
The most powerful
terminology management system
anywhere
COLLABORATE
quick and easy grid-view editing
AUTOMATE
powerful customizable workflows
INTEGRATE
cross-system communication by TermWebIntegrator
EMBELLISH
full media support
EVOLVE
platform-independent, cloud-based, uniquely versatile	
  
TermWeb 3.11
Brings easier collaboration
to the cloud
FALCON in a
nutshell
§  L10n is a Big Data Industry
§  Large-scale, Monetised Reuse of
Translation Memories and Term Bases
§  SMT and Text Analytics now also leverage
these resources
§  Large clients and LSPs curate and add
value to such resources as assets
§  FALCON aims to extend these benefits to
SMEs
Leverage the power of Linked Data for the Long Tail of the Localisation Industry
Multilingual
Web
Language
Resource
Community
Language
Technology
Community
Localisation
Industry
XLIFF1.2
ITS 2.0
XLIFF 2.0
MLW-LOD
NIF-RDF
FALCON
§  Trinity College Dublin (IE)
LOD Mapping and Link Quality
Federated Access Control
L10n Interoperability
§  XTM International (UK)
CAT/L10n management vendor and interoperability
§  Dublin City University (IE)
SMT and text analytics
§  SKAWA Innovation (HU)
Web site translation (EasyLing), crowdsourcing
§  Interverbum Technology (SE)
Terminology Management
FALCON
Consortium
§  Provide an Open Schema and SaaS platform for
exposing language resources as linked data
§  Enable controlled, decentralised sharing of
resources and stand-off value-add annotation
Term or named entity annotation
Translation process provenance and QA
§  Active Curation of resources and value add
§  End-to-end process management
§  On-demand assembly of domain specific LT
training corpora
FALCON
Approach
From: http://www.w3.org/TR/prov-primer/
ITS related entity subclass:
document, segment,
analysed-text, term,
translation, translation-revision
subproperty:
wasTranslatedFrom
W3C Provenance WG
http://www.w3.org/2011/prov/
Provenance-Oriented
Approach
TaaS Workshop 2014, Active Terminology Prompting for SEO and Website Translation, Ioannis Iakovidis, Interverbum Technology
Language
Resource
LOD Store
Public
Language LOD
Resource
Curator
Multilingual Web
Management
(EasyLing)
Translation
Management
(XTM Cloud)
Machine
Translation
(Moses – DCU)
Terminology
Management
(TermWeb)
Client CMS
Text Analytics
(NER – DCU)
L3Data API L3Data API L3Data API L3Data API
UsersSystemsLinkedData
Localisation
Client
Project
Manager
Translators/
Posteditors
Translation
Reviewers
Terminologist
Source
doc
Target
Source
seg
Source
seg
Source
seg
Target
seg
Target
seg
Target
seg
bi-textbi-text
bi-text
bi-textbi-textML
terms
Project
TM
Project
term base
QA meta-
data
QA meta-
dataQA meta-
TB
TB
TB
TB
TB
CMS/SEO	
  CMS/SEO	
  TermWeb	
  
Local termbases with
SEO-scores are linked
via FALCON and
translated in TermWeb
TB
FALCON
TB
TB
TB
TB
TB
TB
Local termbases with
SEO-scores are linked
via FALCON and
translated in TermWeb
The translated term
with the best SEO-
score will be returned to
the local termbase
FALCON CMS/SEO	
  CMS/SEO	
  TermWeb	
  
§  Language Resource Publishers can audit
links used in building other resources,
track ROI
§  Tool Vendors and Integrators expand
markets with more open asset
management offerings
§  SME LSPs gain resource sharing and
pooling opportunities that avoid lock-in
§  LSPs and clients can use Active Curation
to quickly train domain specific SMT and
text analytics components
Value Network of Benefits
Welcome to TermWeb
§  info@termweb.com
§  Interverbumtech.com
§  +46 13 32 98 40 (or any other office)
STOCKHOLM LINKÖPING CHICAGO BOISE SINGAPORE
1 of 16

More Related Content

Viewers also liked(9)

TAUS Post-Editing Productivity GuidelinesTAUS Post-Editing Productivity Guidelines
TAUS Post-Editing Productivity Guidelines
TAUS - The Language Data Network1.2K views
Honyaku Center - TAUS Tokyo Forum 2015Honyaku Center - TAUS Tokyo Forum 2015
Honyaku Center - TAUS Tokyo Forum 2015
TAUS - The Language Data Network728 views
Human Science - TAUS Tokyo Forum 2015Human Science - TAUS Tokyo Forum 2015
Human Science - TAUS Tokyo Forum 2015
TAUS - The Language Data Network542 views
Quality Management in Localization CertificationQuality Management in Localization Certification
Quality Management in Localization Certification
TAUS - The Language Data Network2.5K views

Similar to TaaS Workshop 2014, Active Terminology Prompting for SEO and Website Translation, Ioannis Iakovidis, Interverbum Technology(20)

2004 01 10 Chef Sa V012004 01 10 Chef Sa V01
2004 01 10 Chef Sa V01
jiali zhang1K views
dorsdl2006-arrowdorsdl2006-arrow
dorsdl2006-arrow
guestfbf1e1896 views
Resume_AR Resume_AR
Resume_AR
Ashiqur Rahman158 views
profecia links company profileprofecia links company profile
profecia links company profile
Pratik Batra8 views
ahmed.hassanein_resume_1_11_2016ahmed.hassanein_resume_1_11_2016
ahmed.hassanein_resume_1_11_2016
ahmed hassanein221 views
Reusable Learning Objects: Designing and ArchivingReusable Learning Objects: Designing and Archiving
Reusable Learning Objects: Designing and Archiving
Ishan Abeywardena, Ph.D.2K views
Appnovation Intro DeckAppnovation Intro Deck
Appnovation Intro Deck
Appnovation Technologies895 views

More from TAUS - The Language Data Network(20)

Farmer Lv (TrueTran)Farmer Lv (TrueTran)
Farmer Lv (TrueTran)
TAUS - The Language Data Network182 views
Translation Technology Showcase in ShenzhenTranslation Technology Showcase in Shenzhen
Translation Technology Showcase in Shenzhen
TAUS - The Language Data Network216 views
SDL Trados Studio 2017, Jocelyn He (SDL)SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)
TAUS - The Language Data Network649 views
How we train post-editors - Yongpeng Wei (Lingosail)How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)
TAUS - The Language Data Network181 views
QE integrated in XTM, by Bob Willans (XTM)QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)
TAUS - The Language Data Network649 views

Recently uploaded(20)

TaaS Workshop 2014, Active Terminology Prompting for SEO and Website Translation, Ioannis Iakovidis, Interverbum Technology

  • 1. Wednesday,  4  June  /09:40  –  10:10     Ac5ve  Terminology  Promp5ng  for  SEO  and  Website   Transla5on   Ioannis  Iakovidis,  Interverbum  Technology   TaaS  Workshop  2014   4  June,  Dublin  (Ireland)   The  research  within  the  project  TaaS  leading  to  these  results  has  received  funding  from  the  European  Union  Seventh  Framework  Programme  (FP7/2007-­‐2013),  grant  agreement  no  296312  
  • 2. Federated Active Linguistic data CuratiON   Ioannis Iakovidis, MD, Interverbum Technology FALCON
  • 3. In brief… §  Interverbum Technology est. 1999 §  Today more than 80,000 licensed TermWeb users §  Some clients: Bauer, Euroscript, Arancho, FujiXerox, Medtronic, Novell, SAS Institute, Scania, SDI Media, SIS (Swedish Standards Institute), Xerox, VMware, Salesforce.com, Agilent, The World Bank §  Clients served from five offices in Europe, North America and Asia Terminology Management for Global Marketing
  • 4. Web-based Open architecture Easy administration Visuals and sounds TermWeb 3.11 The most powerful terminology management system anywhere
  • 5. COLLABORATE quick and easy grid-view editing AUTOMATE powerful customizable workflows INTEGRATE cross-system communication by TermWebIntegrator EMBELLISH full media support EVOLVE platform-independent, cloud-based, uniquely versatile   TermWeb 3.11 Brings easier collaboration to the cloud
  • 6. FALCON in a nutshell §  L10n is a Big Data Industry §  Large-scale, Monetised Reuse of Translation Memories and Term Bases §  SMT and Text Analytics now also leverage these resources §  Large clients and LSPs curate and add value to such resources as assets §  FALCON aims to extend these benefits to SMEs Leverage the power of Linked Data for the Long Tail of the Localisation Industry
  • 8. §  Trinity College Dublin (IE) LOD Mapping and Link Quality Federated Access Control L10n Interoperability §  XTM International (UK) CAT/L10n management vendor and interoperability §  Dublin City University (IE) SMT and text analytics §  SKAWA Innovation (HU) Web site translation (EasyLing), crowdsourcing §  Interverbum Technology (SE) Terminology Management FALCON Consortium
  • 9. §  Provide an Open Schema and SaaS platform for exposing language resources as linked data §  Enable controlled, decentralised sharing of resources and stand-off value-add annotation Term or named entity annotation Translation process provenance and QA §  Active Curation of resources and value add §  End-to-end process management §  On-demand assembly of domain specific LT training corpora FALCON Approach
  • 10. From: http://www.w3.org/TR/prov-primer/ ITS related entity subclass: document, segment, analysed-text, term, translation, translation-revision subproperty: wasTranslatedFrom W3C Provenance WG http://www.w3.org/2011/prov/ Provenance-Oriented Approach
  • 12. Language Resource LOD Store Public Language LOD Resource Curator Multilingual Web Management (EasyLing) Translation Management (XTM Cloud) Machine Translation (Moses – DCU) Terminology Management (TermWeb) Client CMS Text Analytics (NER – DCU) L3Data API L3Data API L3Data API L3Data API UsersSystemsLinkedData Localisation Client Project Manager Translators/ Posteditors Translation Reviewers Terminologist Source doc Target Source seg Source seg Source seg Target seg Target seg Target seg bi-textbi-text bi-text bi-textbi-textML terms Project TM Project term base QA meta- data QA meta- dataQA meta-
  • 13. TB TB TB TB TB CMS/SEO  CMS/SEO  TermWeb   Local termbases with SEO-scores are linked via FALCON and translated in TermWeb TB FALCON
  • 14. TB TB TB TB TB TB Local termbases with SEO-scores are linked via FALCON and translated in TermWeb The translated term with the best SEO- score will be returned to the local termbase FALCON CMS/SEO  CMS/SEO  TermWeb  
  • 15. §  Language Resource Publishers can audit links used in building other resources, track ROI §  Tool Vendors and Integrators expand markets with more open asset management offerings §  SME LSPs gain resource sharing and pooling opportunities that avoid lock-in §  LSPs and clients can use Active Curation to quickly train domain specific SMT and text analytics components Value Network of Benefits
  • 16. Welcome to TermWeb §  info@termweb.com §  Interverbumtech.com §  +46 13 32 98 40 (or any other office) STOCKHOLM LINKÖPING CHICAGO BOISE SINGAPORE