SlideShare a Scribd company logo
How to Localize XML Documents: A Workshop on the Do's and
Don'ts of XML Localization
November 2014
http:///
XML dominant!
• HTML/XHTML
• Web Services
• Adobe FrameMaker
• Microsoft Office
• Open Office
• ASP
• XAML
• Java Properties
• DITA
• Standards: TMX, XLIFF, SRX, GMX, TBX, xml:tm
• OAXAL Open Architecture for XML Authoring and Localization
Benefits of XML for L10N
• Separation of form and content
• Should make documents easier to translate
• There are some critical design decisions
• Mistakes can hinder translatability
• XML can bootstrap its own localization
The real significance of XML
• XML is not just another electronic format
• XML is an eXtensible syntax
• XML is a formal IT grammar
• XML is programmable
• XML is can bootrstrap its own localization
Benefits of XML for L10N
Why use XML for Localization
• One input format
• Elegant
• Uses the latest IT technology
• Separation of source and content
• One single data bus
• Open Standards based
• You can use XML assist its own localization
• One extraction + TM + SMT engine
Benefits of XML
Any electronic format not in XML can be converted to XML
• Frame Maker
• RTF
• Microsoft Office pre 2007
• Quark Express
• Windows resource files
• Java resources
• PO/POT
• YAML
• Etc.
And then back into the original format
Do: Use Standard XML Libraries
DO!
• Xerces/Xalan
• MSXML
DO NOT!
• Write your own XML parser
• Write you own serializer
Do: Use XLIFF
XML Localization Interchange File Format
OASIS Standard:
https://www.oasis-open.org/committees/xliff
1. Extract text for translation
2. Use Standard library parser!!!!!
• Java/C++ - Apache Xerces
• .Net – MSXML
3. XLIFF:doc – simplified XLIFF
• XLIFF 1.2 subset
http://code.google.com/p/interoperability-now/
4. Use skeleton file
• Use markers for inline elements
• Use only <g> and <x> inline elements
• Use <mrk> for terminology
XML Document Design Issues
Important points to take into account
Word/Phrase substitution Issues
What works in English/Chinese will not work in most other languages:
English/Chinese languages have an impoverished morphology:
• Please undo the bolt using a spanner.
• Proszę odkręcić śrubę kluczem.
Nominative form: klucz
The real killers: gender and case
• New Ford Model: New Fiesta, New Mondeo, New Focus
• Nowa Ford Fiesta
• Nowe Ford Mondeo
• Nowy Ford Focus
Avoid translatable Entity References
<para>Use a &tool; to release the catch.</para>
Problems:
• Grammatical difficulties
• Parsing difficulties
• Translation memory problems
Solution:
<para>
Use a <tool id="a1098">claw hammer</tool>
to release the CPU retention catch.
</para>
Avoid Word Substitution Mechanisms
<p>
Using a
<keyword conref=”tools.dita#tools/ClawHammer”/>,
remove the CPU from its mount.
</p>
Problems:
• Grammatical difficulties
• Parsing difficulties
• Translation memory problems
Solution:
<para>
Use a <tool id="a1098">claw hammer</tool>
to release the CPU from its mount.
</para>
Incorrect use of Translatable Attributes
<para>
Use a <tool id="a1098" name="claw hammer">
to release the CPU retention catch.
</para>
Problems:
• Grammatical difficulties
• Text flow problems
Solution:
<para>
Use a <tool id="a1098">claw hammer</tool>
to release the CPU retention catch.
</para>
CDATA Sections
<TEMPLATE><![CDATA[<p>Please refer to the
<em>index page
</em> page for further information</p> ]]>
</TEMPLATE>
Problems:
• Grammatical difficulties
• Text flow problems
Solution:
<TEMPLATE>
<dx:p>Please refer to the <dx:em>index page
</dx:em> page for further information</dx:p>
</TEMPLATE>
Infinite Naming Schemes
<?xml version="1.0" ?>
<resources xml:lang="en">
<err001>Cannot open file $1.</err001>
<hint001>Hint: does file $1 exist.</hint001>
<err002>Incorrect value.</err002>
<hint002>Hint: value must be between $1 and $2.</hint002>
<err003>Connection timeout.</err999>
.
.
</resources>
Problems:
• Poor XML practice
• Problems for extraction programs
Solution:
Infinite Naming Schemes contd.
<?xml version="1.0" ?>
<resources xml:lang="en">
<error id="001">
<caption>Cannot open file $1.</caption>
<hint>Does file $1 exist.</hint>
</error>
<error id="002">
<caption>Incorrect value.</caption>
<hint>Value must be between $1 and $2.</hint>
</error>
.
.
</resources>
Solution:
Avoid Processing Instructions
<para>
Use a <?tool name="claw hammer"?> to release
the CPU retention catch.
</para>
Problems:
• Grammatical difficulties
• Pis not guaranteed to survive transformations
• Text flow problems
Solution:
<para>
Use a <tool id="a1098">claw hammer</tool>
to release the CPU retention catch.
</para>
Avoid text in bitmap graphics
Text expansion
Never make assumptions about text length in design
Do: Always use UTF-8 or UTF-16 Encoding
• Avoid code conversion issues
• Always use Unicode encoding
• Not just CJK issues
• Also wingbats and special characters such
as:
• M-dash
• Ndash
• Non break spaces
• Etc.
Do not break text over non-inline elements
<para>
<line>This text should not be</line>
<line>broken this way – the translated
text may well be in a different order.</line>
</para>
Problems:
• Grammatical difficulties
• Against the principles of XML
• Text flow problems
Avoid the use of typographical elements
<para><b>Do not use</b> <br/>
’br’ type elements.
</para>
Problems:
• Grammatical difficulties
• Against the principles of XML
• Text flow problems
Solution:
<para>
<emph>Do not use</emph> 'br' type elements.
</para>
Do not mix xlatable and non-xlatable
<data-items>
<data id="class">
com.xmlintl.data.dataDefDefinition
</data>
<data id="text">
Replace generic data
definitions with specific instances.
</data>
</data-items>
Problems:
• Poor XML practice
• Problems for extraction programs
Solution:
Do not mix xlatable and non-xlatable contd.
<data-items>
<class id="com.xmlintl.data.dataDefinition">
<text>
Replace generic data
definitions with specific instances.
</text>
</class>
</data-items>
Avoid mixed language documents
<para>
<text xml:lang="en">
My hovercraft is full of eels.
</text>
<text xml:lang="fr">
Mon aéroglisseur est plein d'anguilles.
</text>
<text xml:lang="hu">
Légpárnás hajóm tele van angolnákkal.
</text>
<text xml:lang="ja">
私のホバークラフトは鰻で一杯です。
</text>
<text xml:lang="pl">
Mój poduszkowiec jest pełen węgorzy.
</text>
<text xml:lang="es">
Mi aerodeslizador está lleno de anguilas.
</text>
<text xml:lang="zh-CH">
我隻氣墊船裝滿晒鱔.
</text>
<text xml:lang="zh-TW">
我的氣墊船充滿了鱔魚 [我的气垫船充满了鳝鱼]
</text>
</para>
Do: clearly mark non-translatable text
<para>
The following part of this sentence
should
<its:its translate=‘no’>not be
translated</its:its>
at all.
</para>
Core L10 Interoperability Standards
• W3C ITS Document Rules
• ETSI LIS SRX
• ETSI LIS xml:tm
• ETSI LIS TMX
• ETSI LIS TBX
• ETSI LIS GMX
• OASIS XLIFF
• W3C/OASIS DITA (XHTML, DocBook, or any XML Vocabulary)
Putting It All Together
• Open Architecture for XML Authoring and Localization (OAXAL)
– http://wiki.oasis-open.org/oaxal/FrontPage
OAXAL
OAXAL
Localization without Standards
Customer
source
text
source text extract
extracted
text tm process
prepared
text
translatetranslated
text
target texttarget text
merge
target text
QA
True Cost of Translation
OAXAL in Action
Translating English Soccer
Articles into Arabic 24x7
Translating English Soccer
Articles into Arabic 24x7
Flagship website
Flagship website
Browser-Based Workbench
OAXAL In Action
Your opinion is important to us! Please tell us what you thought of the
lecture. We look forward to your feedback via smartphone or tablet under
http://LOC23.honestly.de
or scan the QR code
The feedback tool will be available even after the conference!
• Contact details:
• Andrzej Zydroń
• azydron@xtm-intl.com
• http://www.xtm-intl.com

More Related Content

What's hot

SMWCon Fall 2015 FForms
SMWCon Fall 2015 FFormsSMWCon Fall 2015 FForms
SMWCon Fall 2015 FForms
Open University in the Netherlands
 
Css
CssCss
Getting Started with the Alma API
Getting Started with the Alma APIGetting Started with the Alma API
Getting Started with the Alma API
Kyle Banerjee
 
Introduction to XML
Introduction to XMLIntroduction to XML
Introduction to XML
BG Java EE Course
 
Chapter4
Chapter4Chapter4
Chapter4
Fahad Sheref
 
Decoding and developing the online finding aid
Decoding and developing the online finding aidDecoding and developing the online finding aid
Decoding and developing the online finding aid
kgerber
 
Introduce to XML
Introduce to XMLIntroduce to XML
Introduce to XML
videde_group
 
Fergus Fahey - DRI/ARA(I) Training: Introduction to EAD - Introduction to XML
Fergus Fahey - DRI/ARA(I) Training: Introduction to EAD - Introduction to XMLFergus Fahey - DRI/ARA(I) Training: Introduction to EAD - Introduction to XML
Fergus Fahey - DRI/ARA(I) Training: Introduction to EAD - Introduction to XML
dri_ireland
 
Xml
XmlXml
Xml
Anas Sa
 
XML and XSLT
XML and XSLTXML and XSLT
XML and XSLT
Andrew Savory
 
23xml
23xml23xml
23xml
Adil Jafri
 
Painless OO XML with XML::Pastor
Painless OO XML with XML::PastorPainless OO XML with XML::Pastor
Painless OO XML with XML::Pastor
joelbernstein
 
Xml
XmlXml
CrashCourse: XML technologies
CrashCourse: XML technologiesCrashCourse: XML technologies
CrashCourse: XML technologies
ESRI Bulgaria
 
Introduction to XML and Databases
Introduction to XML and DatabasesIntroduction to XML and Databases
Introduction to XML and Databases
torp42
 
Xml dom
Xml domXml dom
Xml dom
sana mateen
 
XML XSLT
XML XSLTXML XSLT
fundamentals of XML
fundamentals of XMLfundamentals of XML
fundamentals of XML
hamsa nandhini
 
Xml 215-presentation
Xml 215-presentationXml 215-presentation
Xml 215-presentation
Manish Chaurasia
 

What's hot (19)

SMWCon Fall 2015 FForms
SMWCon Fall 2015 FFormsSMWCon Fall 2015 FForms
SMWCon Fall 2015 FForms
 
Css
CssCss
Css
 
Getting Started with the Alma API
Getting Started with the Alma APIGetting Started with the Alma API
Getting Started with the Alma API
 
Introduction to XML
Introduction to XMLIntroduction to XML
Introduction to XML
 
Chapter4
Chapter4Chapter4
Chapter4
 
Decoding and developing the online finding aid
Decoding and developing the online finding aidDecoding and developing the online finding aid
Decoding and developing the online finding aid
 
Introduce to XML
Introduce to XMLIntroduce to XML
Introduce to XML
 
Fergus Fahey - DRI/ARA(I) Training: Introduction to EAD - Introduction to XML
Fergus Fahey - DRI/ARA(I) Training: Introduction to EAD - Introduction to XMLFergus Fahey - DRI/ARA(I) Training: Introduction to EAD - Introduction to XML
Fergus Fahey - DRI/ARA(I) Training: Introduction to EAD - Introduction to XML
 
Xml
XmlXml
Xml
 
XML and XSLT
XML and XSLTXML and XSLT
XML and XSLT
 
23xml
23xml23xml
23xml
 
Painless OO XML with XML::Pastor
Painless OO XML with XML::PastorPainless OO XML with XML::Pastor
Painless OO XML with XML::Pastor
 
Xml
XmlXml
Xml
 
CrashCourse: XML technologies
CrashCourse: XML technologiesCrashCourse: XML technologies
CrashCourse: XML technologies
 
Introduction to XML and Databases
Introduction to XML and DatabasesIntroduction to XML and Databases
Introduction to XML and Databases
 
Xml dom
Xml domXml dom
Xml dom
 
XML XSLT
XML XSLTXML XSLT
XML XSLT
 
fundamentals of XML
fundamentals of XMLfundamentals of XML
fundamentals of XML
 
Xml 215-presentation
Xml 215-presentationXml 215-presentation
Xml 215-presentation
 

Viewers also liked

Xtm webinar presentation xtm system overview
Xtm webinar presentation   xtm system overviewXtm webinar presentation   xtm system overview
Xtm webinar presentation xtm system overview
Andrzej Zydroń MBCS
 
Open Standards
Open StandardsOpen Standards
Open Standards
Andrzej Zydroń MBCS
 
OAXAL
OAXALOAXAL
Falcon
FalconFalcon
The Tipping Point
The Tipping PointThe Tipping Point
The Tipping Point
Andrzej Zydroń MBCS
 
The tipping point
The tipping pointThe tipping point
The tipping point
Andrzej Zydroń MBCS
 
Interverbum falcon-10oct14-az
Interverbum falcon-10oct14-azInterverbum falcon-10oct14-az
Interverbum falcon-10oct14-az
Andrzej Zydroń MBCS
 
DITA and Translation Best Praticices
DITA and Translation Best PraticicesDITA and Translation Best Praticices
DITA and Translation Best Praticices
Andrzej Zydroń MBCS
 
OAXAL
OAXALOAXAL
DITA for Localization
DITA for LocalizationDITA for Localization
DITA for Localization
Andrzej Zydroń MBCS
 
Understanding linport
Understanding linportUnderstanding linport
Understanding linport
Andrzej Zydroń MBCS
 

Viewers also liked (11)

Xtm webinar presentation xtm system overview
Xtm webinar presentation   xtm system overviewXtm webinar presentation   xtm system overview
Xtm webinar presentation xtm system overview
 
Open Standards
Open StandardsOpen Standards
Open Standards
 
OAXAL
OAXALOAXAL
OAXAL
 
Falcon
FalconFalcon
Falcon
 
The Tipping Point
The Tipping PointThe Tipping Point
The Tipping Point
 
The tipping point
The tipping pointThe tipping point
The tipping point
 
Interverbum falcon-10oct14-az
Interverbum falcon-10oct14-azInterverbum falcon-10oct14-az
Interverbum falcon-10oct14-az
 
DITA and Translation Best Praticices
DITA and Translation Best PraticicesDITA and Translation Best Praticices
DITA and Translation Best Praticices
 
OAXAL
OAXALOAXAL
OAXAL
 
DITA for Localization
DITA for LocalizationDITA for Localization
DITA for Localization
 
Understanding linport
Understanding linportUnderstanding linport
Understanding linport
 

Similar to Dos and donts

BITM3730Week5.pptx
BITM3730Week5.pptxBITM3730Week5.pptx
BITM3730Week5.pptx
MattMarino13
 
BITM3730 10-31.pptx
BITM3730 10-31.pptxBITM3730 10-31.pptx
BITM3730 10-31.pptx
MattMarino13
 
BITM3730 10-18.pptx
BITM3730 10-18.pptxBITM3730 10-18.pptx
BITM3730 10-18.pptx
MattMarino13
 
Xml passing in java
Xml passing in javaXml passing in java
1 xml fundamentals
1 xml fundamentals1 xml fundamentals
1 xml fundamentals
Dr.Saranya K.G
 
Java Web Services
Java Web ServicesJava Web Services
Java Web Services
Jussi Pohjolainen
 
Unit iv xml dom
Unit iv xml domUnit iv xml dom
Unit iv xml dom
smitha273566
 
Introduction to xml
Introduction to xmlIntroduction to xml
Introduction to xml
Gtu Booker
 
Unit3wt
Unit3wtUnit3wt
Unit3wt
vamsi krishna
 
Unit3wt
Unit3wtUnit3wt
Unit3wt
vamsitricks
 
Data interchange integration, HTML XML Biological XML DTD
Data interchange integration, HTML XML Biological XML DTDData interchange integration, HTML XML Biological XML DTD
Data interchange integration, HTML XML Biological XML DTD
AnushaMahmood
 
CTDA Workshop on XML and MODS
CTDA Workshop on XML and MODSCTDA Workshop on XML and MODS
CTDA Workshop on XML and MODS
University of Connecticut Libraries
 
XML
XMLXML
Xml basics
Xml basicsXml basics
Xml basics
Kumar
 
Xml
XmlXml
00 introduction
00 introduction00 introduction
00 introduction
Baskarkncet
 
WEB TECHNOLOGIES XML
WEB TECHNOLOGIES XMLWEB TECHNOLOGIES XML
Web Service Workshop - 3 days
Web Service Workshop - 3 daysWeb Service Workshop - 3 days
Web Service Workshop - 3 days
David Ionut
 
Xml presentation
Xml presentationXml presentation
Xml presentation
Miguel Angel Teheran Garcia
 
Xmll
XmllXmll

Similar to Dos and donts (20)

BITM3730Week5.pptx
BITM3730Week5.pptxBITM3730Week5.pptx
BITM3730Week5.pptx
 
BITM3730 10-31.pptx
BITM3730 10-31.pptxBITM3730 10-31.pptx
BITM3730 10-31.pptx
 
BITM3730 10-18.pptx
BITM3730 10-18.pptxBITM3730 10-18.pptx
BITM3730 10-18.pptx
 
Xml passing in java
Xml passing in javaXml passing in java
Xml passing in java
 
1 xml fundamentals
1 xml fundamentals1 xml fundamentals
1 xml fundamentals
 
Java Web Services
Java Web ServicesJava Web Services
Java Web Services
 
Unit iv xml dom
Unit iv xml domUnit iv xml dom
Unit iv xml dom
 
Introduction to xml
Introduction to xmlIntroduction to xml
Introduction to xml
 
Unit3wt
Unit3wtUnit3wt
Unit3wt
 
Unit3wt
Unit3wtUnit3wt
Unit3wt
 
Data interchange integration, HTML XML Biological XML DTD
Data interchange integration, HTML XML Biological XML DTDData interchange integration, HTML XML Biological XML DTD
Data interchange integration, HTML XML Biological XML DTD
 
CTDA Workshop on XML and MODS
CTDA Workshop on XML and MODSCTDA Workshop on XML and MODS
CTDA Workshop on XML and MODS
 
XML
XMLXML
XML
 
Xml basics
Xml basicsXml basics
Xml basics
 
Xml
XmlXml
Xml
 
00 introduction
00 introduction00 introduction
00 introduction
 
WEB TECHNOLOGIES XML
WEB TECHNOLOGIES XMLWEB TECHNOLOGIES XML
WEB TECHNOLOGIES XML
 
Web Service Workshop - 3 days
Web Service Workshop - 3 daysWeb Service Workshop - 3 days
Web Service Workshop - 3 days
 
Xml presentation
Xml presentationXml presentation
Xml presentation
 
Xmll
XmllXmll
Xmll
 

Recently uploaded

7 Best Cloud Hosting Services to Try Out in 2024
7 Best Cloud Hosting Services to Try Out in 20247 Best Cloud Hosting Services to Try Out in 2024
7 Best Cloud Hosting Services to Try Out in 2024
Danica Gill
 
Explore-Insanony: Watch Instagram Stories Secretly
Explore-Insanony: Watch Instagram Stories SecretlyExplore-Insanony: Watch Instagram Stories Secretly
Explore-Insanony: Watch Instagram Stories Secretly
Trending Blogers
 
不能毕业如何获得(USYD毕业证)悉尼大学毕业证成绩单一比一原版制作
不能毕业如何获得(USYD毕业证)悉尼大学毕业证成绩单一比一原版制作不能毕业如何获得(USYD毕业证)悉尼大学毕业证成绩单一比一原版制作
不能毕业如何获得(USYD毕业证)悉尼大学毕业证成绩单一比一原版制作
bseovas
 
Design Thinking NETFLIX using all techniques.pptx
Design Thinking NETFLIX using all techniques.pptxDesign Thinking NETFLIX using all techniques.pptx
Design Thinking NETFLIX using all techniques.pptx
saathvikreddy2003
 
留学学历(UoA毕业证)奥克兰大学毕业证成绩单官方原版办理
留学学历(UoA毕业证)奥克兰大学毕业证成绩单官方原版办理留学学历(UoA毕业证)奥克兰大学毕业证成绩单官方原版办理
留学学历(UoA毕业证)奥克兰大学毕业证成绩单官方原版办理
bseovas
 
manuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaal
manuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaalmanuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaal
manuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaal
wolfsoftcompanyco
 
Gen Z and the marketplaces - let's translate their needs
Gen Z and the marketplaces - let's translate their needsGen Z and the marketplaces - let's translate their needs
Gen Z and the marketplaces - let's translate their needs
Laura Szabó
 
重新申请毕业证书(RMIT毕业证)皇家墨尔本理工大学毕业证成绩单精仿办理
重新申请毕业证书(RMIT毕业证)皇家墨尔本理工大学毕业证成绩单精仿办理重新申请毕业证书(RMIT毕业证)皇家墨尔本理工大学毕业证成绩单精仿办理
重新申请毕业证书(RMIT毕业证)皇家墨尔本理工大学毕业证成绩单精仿办理
vmemo1
 
制作毕业证书(ANU毕业证)莫纳什大学毕业证成绩单官方原版办理
制作毕业证书(ANU毕业证)莫纳什大学毕业证成绩单官方原版办理制作毕业证书(ANU毕业证)莫纳什大学毕业证成绩单官方原版办理
制作毕业证书(ANU毕业证)莫纳什大学毕业证成绩单官方原版办理
cuobya
 
办理新西兰奥克兰大学毕业证学位证书范本原版一模一样
办理新西兰奥克兰大学毕业证学位证书范本原版一模一样办理新西兰奥克兰大学毕业证学位证书范本原版一模一样
办理新西兰奥克兰大学毕业证学位证书范本原版一模一样
xjq03c34
 
办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理
办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理
办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理
uehowe
 
Discover the benefits of outsourcing SEO to India
Discover the benefits of outsourcing SEO to IndiaDiscover the benefits of outsourcing SEO to India
Discover the benefits of outsourcing SEO to India
davidjhones387
 
存档可查的(USC毕业证)南加利福尼亚大学毕业证成绩单制做办理
存档可查的(USC毕业证)南加利福尼亚大学毕业证成绩单制做办理存档可查的(USC毕业证)南加利福尼亚大学毕业证成绩单制做办理
存档可查的(USC毕业证)南加利福尼亚大学毕业证成绩单制做办理
fovkoyb
 
制作原版1:1(Monash毕业证)莫纳什大学毕业证成绩单办理假
制作原版1:1(Monash毕业证)莫纳什大学毕业证成绩单办理假制作原版1:1(Monash毕业证)莫纳什大学毕业证成绩单办理假
制作原版1:1(Monash毕业证)莫纳什大学毕业证成绩单办理假
ukwwuq
 
留学挂科(UofM毕业证)明尼苏达大学毕业证成绩单复刻办理
留学挂科(UofM毕业证)明尼苏达大学毕业证成绩单复刻办理留学挂科(UofM毕业证)明尼苏达大学毕业证成绩单复刻办理
留学挂科(UofM毕业证)明尼苏达大学毕业证成绩单复刻办理
uehowe
 
成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理
成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理
成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理
ysasp1
 
国外证书(Lincoln毕业证)新西兰林肯大学毕业证成绩单不能毕业办理
国外证书(Lincoln毕业证)新西兰林肯大学毕业证成绩单不能毕业办理国外证书(Lincoln毕业证)新西兰林肯大学毕业证成绩单不能毕业办理
国外证书(Lincoln毕业证)新西兰林肯大学毕业证成绩单不能毕业办理
zoowe
 
Ready to Unlock the Power of Blockchain!
Ready to Unlock the Power of Blockchain!Ready to Unlock the Power of Blockchain!
Ready to Unlock the Power of Blockchain!
Toptal Tech
 
可查真实(Monash毕业证)西澳大学毕业证成绩单退学买
可查真实(Monash毕业证)西澳大学毕业证成绩单退学买可查真实(Monash毕业证)西澳大学毕业证成绩单退学买
可查真实(Monash毕业证)西澳大学毕业证成绩单退学买
cuobya
 
Meet up Milano 14 _ Axpo Italia_ Migration from Mule3 (On-prem) to.pdf
Meet up Milano 14 _ Axpo Italia_ Migration from Mule3 (On-prem) to.pdfMeet up Milano 14 _ Axpo Italia_ Migration from Mule3 (On-prem) to.pdf
Meet up Milano 14 _ Axpo Italia_ Migration from Mule3 (On-prem) to.pdf
Florence Consulting
 

Recently uploaded (20)

7 Best Cloud Hosting Services to Try Out in 2024
7 Best Cloud Hosting Services to Try Out in 20247 Best Cloud Hosting Services to Try Out in 2024
7 Best Cloud Hosting Services to Try Out in 2024
 
Explore-Insanony: Watch Instagram Stories Secretly
Explore-Insanony: Watch Instagram Stories SecretlyExplore-Insanony: Watch Instagram Stories Secretly
Explore-Insanony: Watch Instagram Stories Secretly
 
不能毕业如何获得(USYD毕业证)悉尼大学毕业证成绩单一比一原版制作
不能毕业如何获得(USYD毕业证)悉尼大学毕业证成绩单一比一原版制作不能毕业如何获得(USYD毕业证)悉尼大学毕业证成绩单一比一原版制作
不能毕业如何获得(USYD毕业证)悉尼大学毕业证成绩单一比一原版制作
 
Design Thinking NETFLIX using all techniques.pptx
Design Thinking NETFLIX using all techniques.pptxDesign Thinking NETFLIX using all techniques.pptx
Design Thinking NETFLIX using all techniques.pptx
 
留学学历(UoA毕业证)奥克兰大学毕业证成绩单官方原版办理
留学学历(UoA毕业证)奥克兰大学毕业证成绩单官方原版办理留学学历(UoA毕业证)奥克兰大学毕业证成绩单官方原版办理
留学学历(UoA毕业证)奥克兰大学毕业证成绩单官方原版办理
 
manuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaal
manuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaalmanuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaal
manuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaal
 
Gen Z and the marketplaces - let's translate their needs
Gen Z and the marketplaces - let's translate their needsGen Z and the marketplaces - let's translate their needs
Gen Z and the marketplaces - let's translate their needs
 
重新申请毕业证书(RMIT毕业证)皇家墨尔本理工大学毕业证成绩单精仿办理
重新申请毕业证书(RMIT毕业证)皇家墨尔本理工大学毕业证成绩单精仿办理重新申请毕业证书(RMIT毕业证)皇家墨尔本理工大学毕业证成绩单精仿办理
重新申请毕业证书(RMIT毕业证)皇家墨尔本理工大学毕业证成绩单精仿办理
 
制作毕业证书(ANU毕业证)莫纳什大学毕业证成绩单官方原版办理
制作毕业证书(ANU毕业证)莫纳什大学毕业证成绩单官方原版办理制作毕业证书(ANU毕业证)莫纳什大学毕业证成绩单官方原版办理
制作毕业证书(ANU毕业证)莫纳什大学毕业证成绩单官方原版办理
 
办理新西兰奥克兰大学毕业证学位证书范本原版一模一样
办理新西兰奥克兰大学毕业证学位证书范本原版一模一样办理新西兰奥克兰大学毕业证学位证书范本原版一模一样
办理新西兰奥克兰大学毕业证学位证书范本原版一模一样
 
办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理
办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理
办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理
 
Discover the benefits of outsourcing SEO to India
Discover the benefits of outsourcing SEO to IndiaDiscover the benefits of outsourcing SEO to India
Discover the benefits of outsourcing SEO to India
 
存档可查的(USC毕业证)南加利福尼亚大学毕业证成绩单制做办理
存档可查的(USC毕业证)南加利福尼亚大学毕业证成绩单制做办理存档可查的(USC毕业证)南加利福尼亚大学毕业证成绩单制做办理
存档可查的(USC毕业证)南加利福尼亚大学毕业证成绩单制做办理
 
制作原版1:1(Monash毕业证)莫纳什大学毕业证成绩单办理假
制作原版1:1(Monash毕业证)莫纳什大学毕业证成绩单办理假制作原版1:1(Monash毕业证)莫纳什大学毕业证成绩单办理假
制作原版1:1(Monash毕业证)莫纳什大学毕业证成绩单办理假
 
留学挂科(UofM毕业证)明尼苏达大学毕业证成绩单复刻办理
留学挂科(UofM毕业证)明尼苏达大学毕业证成绩单复刻办理留学挂科(UofM毕业证)明尼苏达大学毕业证成绩单复刻办理
留学挂科(UofM毕业证)明尼苏达大学毕业证成绩单复刻办理
 
成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理
成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理
成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理
 
国外证书(Lincoln毕业证)新西兰林肯大学毕业证成绩单不能毕业办理
国外证书(Lincoln毕业证)新西兰林肯大学毕业证成绩单不能毕业办理国外证书(Lincoln毕业证)新西兰林肯大学毕业证成绩单不能毕业办理
国外证书(Lincoln毕业证)新西兰林肯大学毕业证成绩单不能毕业办理
 
Ready to Unlock the Power of Blockchain!
Ready to Unlock the Power of Blockchain!Ready to Unlock the Power of Blockchain!
Ready to Unlock the Power of Blockchain!
 
可查真实(Monash毕业证)西澳大学毕业证成绩单退学买
可查真实(Monash毕业证)西澳大学毕业证成绩单退学买可查真实(Monash毕业证)西澳大学毕业证成绩单退学买
可查真实(Monash毕业证)西澳大学毕业证成绩单退学买
 
Meet up Milano 14 _ Axpo Italia_ Migration from Mule3 (On-prem) to.pdf
Meet up Milano 14 _ Axpo Italia_ Migration from Mule3 (On-prem) to.pdfMeet up Milano 14 _ Axpo Italia_ Migration from Mule3 (On-prem) to.pdf
Meet up Milano 14 _ Axpo Italia_ Migration from Mule3 (On-prem) to.pdf
 

Dos and donts

  • 1. How to Localize XML Documents: A Workshop on the Do's and Don'ts of XML Localization November 2014 http:///
  • 2. XML dominant! • HTML/XHTML • Web Services • Adobe FrameMaker • Microsoft Office • Open Office • ASP • XAML • Java Properties • DITA • Standards: TMX, XLIFF, SRX, GMX, TBX, xml:tm • OAXAL Open Architecture for XML Authoring and Localization
  • 3. Benefits of XML for L10N • Separation of form and content • Should make documents easier to translate • There are some critical design decisions • Mistakes can hinder translatability • XML can bootstrap its own localization
  • 4. The real significance of XML • XML is not just another electronic format • XML is an eXtensible syntax • XML is a formal IT grammar • XML is programmable • XML is can bootrstrap its own localization
  • 5. Benefits of XML for L10N Why use XML for Localization • One input format • Elegant • Uses the latest IT technology • Separation of source and content • One single data bus • Open Standards based • You can use XML assist its own localization • One extraction + TM + SMT engine
  • 6. Benefits of XML Any electronic format not in XML can be converted to XML • Frame Maker • RTF • Microsoft Office pre 2007 • Quark Express • Windows resource files • Java resources • PO/POT • YAML • Etc. And then back into the original format
  • 7. Do: Use Standard XML Libraries DO! • Xerces/Xalan • MSXML DO NOT! • Write your own XML parser • Write you own serializer
  • 8. Do: Use XLIFF XML Localization Interchange File Format OASIS Standard: https://www.oasis-open.org/committees/xliff 1. Extract text for translation 2. Use Standard library parser!!!!! • Java/C++ - Apache Xerces • .Net – MSXML 3. XLIFF:doc – simplified XLIFF • XLIFF 1.2 subset http://code.google.com/p/interoperability-now/ 4. Use skeleton file • Use markers for inline elements • Use only <g> and <x> inline elements • Use <mrk> for terminology
  • 9. XML Document Design Issues Important points to take into account
  • 10. Word/Phrase substitution Issues What works in English/Chinese will not work in most other languages: English/Chinese languages have an impoverished morphology: • Please undo the bolt using a spanner. • Proszę odkręcić śrubę kluczem. Nominative form: klucz The real killers: gender and case • New Ford Model: New Fiesta, New Mondeo, New Focus • Nowa Ford Fiesta • Nowe Ford Mondeo • Nowy Ford Focus
  • 11. Avoid translatable Entity References <para>Use a &tool; to release the catch.</para> Problems: • Grammatical difficulties • Parsing difficulties • Translation memory problems Solution: <para> Use a <tool id="a1098">claw hammer</tool> to release the CPU retention catch. </para>
  • 12. Avoid Word Substitution Mechanisms <p> Using a <keyword conref=”tools.dita#tools/ClawHammer”/>, remove the CPU from its mount. </p> Problems: • Grammatical difficulties • Parsing difficulties • Translation memory problems Solution: <para> Use a <tool id="a1098">claw hammer</tool> to release the CPU from its mount. </para>
  • 13. Incorrect use of Translatable Attributes <para> Use a <tool id="a1098" name="claw hammer"> to release the CPU retention catch. </para> Problems: • Grammatical difficulties • Text flow problems Solution: <para> Use a <tool id="a1098">claw hammer</tool> to release the CPU retention catch. </para>
  • 14. CDATA Sections <TEMPLATE><![CDATA[<p>Please refer to the <em>index page </em> page for further information</p> ]]> </TEMPLATE> Problems: • Grammatical difficulties • Text flow problems Solution: <TEMPLATE> <dx:p>Please refer to the <dx:em>index page </dx:em> page for further information</dx:p> </TEMPLATE>
  • 15. Infinite Naming Schemes <?xml version="1.0" ?> <resources xml:lang="en"> <err001>Cannot open file $1.</err001> <hint001>Hint: does file $1 exist.</hint001> <err002>Incorrect value.</err002> <hint002>Hint: value must be between $1 and $2.</hint002> <err003>Connection timeout.</err999> . . </resources> Problems: • Poor XML practice • Problems for extraction programs Solution:
  • 16. Infinite Naming Schemes contd. <?xml version="1.0" ?> <resources xml:lang="en"> <error id="001"> <caption>Cannot open file $1.</caption> <hint>Does file $1 exist.</hint> </error> <error id="002"> <caption>Incorrect value.</caption> <hint>Value must be between $1 and $2.</hint> </error> . . </resources> Solution:
  • 17. Avoid Processing Instructions <para> Use a <?tool name="claw hammer"?> to release the CPU retention catch. </para> Problems: • Grammatical difficulties • Pis not guaranteed to survive transformations • Text flow problems Solution: <para> Use a <tool id="a1098">claw hammer</tool> to release the CPU retention catch. </para>
  • 18. Avoid text in bitmap graphics
  • 19. Text expansion Never make assumptions about text length in design
  • 20. Do: Always use UTF-8 or UTF-16 Encoding • Avoid code conversion issues • Always use Unicode encoding • Not just CJK issues • Also wingbats and special characters such as: • M-dash • Ndash • Non break spaces • Etc.
  • 21. Do not break text over non-inline elements <para> <line>This text should not be</line> <line>broken this way – the translated text may well be in a different order.</line> </para> Problems: • Grammatical difficulties • Against the principles of XML • Text flow problems
  • 22. Avoid the use of typographical elements <para><b>Do not use</b> <br/> ’br’ type elements. </para> Problems: • Grammatical difficulties • Against the principles of XML • Text flow problems Solution: <para> <emph>Do not use</emph> 'br' type elements. </para>
  • 23. Do not mix xlatable and non-xlatable <data-items> <data id="class"> com.xmlintl.data.dataDefDefinition </data> <data id="text"> Replace generic data definitions with specific instances. </data> </data-items> Problems: • Poor XML practice • Problems for extraction programs Solution:
  • 24. Do not mix xlatable and non-xlatable contd. <data-items> <class id="com.xmlintl.data.dataDefinition"> <text> Replace generic data definitions with specific instances. </text> </class> </data-items>
  • 25. Avoid mixed language documents <para> <text xml:lang="en"> My hovercraft is full of eels. </text> <text xml:lang="fr"> Mon aéroglisseur est plein d'anguilles. </text> <text xml:lang="hu"> Légpárnás hajóm tele van angolnákkal. </text> <text xml:lang="ja"> 私のホバークラフトは鰻で一杯です。 </text> <text xml:lang="pl"> Mój poduszkowiec jest pełen węgorzy. </text> <text xml:lang="es"> Mi aerodeslizador está lleno de anguilas. </text> <text xml:lang="zh-CH"> 我隻氣墊船裝滿晒鱔. </text> <text xml:lang="zh-TW"> 我的氣墊船充滿了鱔魚 [我的气垫船充满了鳝鱼] </text> </para>
  • 26. Do: clearly mark non-translatable text <para> The following part of this sentence should <its:its translate=‘no’>not be translated</its:its> at all. </para>
  • 27. Core L10 Interoperability Standards • W3C ITS Document Rules • ETSI LIS SRX • ETSI LIS xml:tm • ETSI LIS TMX • ETSI LIS TBX • ETSI LIS GMX • OASIS XLIFF • W3C/OASIS DITA (XHTML, DocBook, or any XML Vocabulary)
  • 28. Putting It All Together
  • 29. • Open Architecture for XML Authoring and Localization (OAXAL) – http://wiki.oasis-open.org/oaxal/FrontPage
  • 30. OAXAL
  • 31. OAXAL
  • 32. Localization without Standards Customer source text source text extract extracted text tm process prepared text translatetranslated text target texttarget text merge target text QA
  • 33. True Cost of Translation
  • 41. Your opinion is important to us! Please tell us what you thought of the lecture. We look forward to your feedback via smartphone or tablet under http://LOC23.honestly.de or scan the QR code The feedback tool will be available even after the conference!
  • 42. • Contact details: • Andrzej Zydroń • azydron@xtm-intl.com • http://www.xtm-intl.com