Making meaning in a multilingual world
Tristan Roddis, Cogapp
MCN Minneapolis, November 2015
@tristan_roddis tristanr@cogapp.com
Why bother?
Worldwide USA
Lots more information in 2013 census data!
• Unicode!
OK, I’m convinced.
How?
'THUMBS UP SIGN' (U+1F44D)
'OK HAND SIGN' (U+1F44C)
Backend systems
• TMS now fully multilingual
• Multi Mimsy XG is too
Make sure you understand it
• e.g. The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!) by Joel Spolsky
Make sure processing systems handle it too
• e.g. Python 2 versus Python 3
• Create a Unicode “sandwich”
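The Unicode "sandwich" can be sketched in Python 3 roughly as follows: decode bytes to text at the input boundary, work only with text in the middle, and encode back to bytes at the output boundary. The function name `shout_title` and the choice of UTF-8 are assumptions made for illustration, not something taken from the talk.

```python
def shout_title(raw: bytes, encoding: str = "utf-8") -> bytes:
    """Hypothetical example of a Unicode sandwich: bytes in, str inside, bytes out."""
    # Bottom slice: decode incoming bytes to text as early as possible.
    text = raw.decode(encoding)
    # Filling: all processing works on str (code points), never on raw bytes.
    summary = f"{text.upper()} ({len(text)} characters)"
    # Top slice: encode back to bytes only when the data leaves the program.
    return summary.encode(encoding)


# "Café" is 4 characters but 5 bytes in UTF-8, which is why counting bytes
# (easy to do by accident in Python 2) gives the wrong answer.
raw_title = "Café".encode("utf-8")
result = shout_title(raw_title)
```

In Python 3 the `bytes` and `str` types cannot be mixed by accident, which makes the boundaries of the sandwich much easier to enforce than in Python 2.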
What about a CMS?
• Drupal
• Excellent multilingual support
• Editing interface also translated
• Even better in Drupal 8
Things to consider
• Fallback language
• Translation of categories and taxonomies
• URL translation
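A fallback language usually means a lookup chain: try the visitor's language, then a more general variant, then a site default. The sketch below is a hypothetical, CMS-independent illustration in Python; `pick_translation`, the language codes and the sample labels are all assumptions, and it does not reflect how Drupal implements fallback.

```python
from typing import Mapping, Optional, Sequence


def pick_translation(
    translations: Mapping[str, str],
    preferred: Sequence[str],
    default: str = "en",
) -> Optional[str]:
    """Return the first available translation, falling back to the default language."""
    for lang in [*preferred, default]:
        # Exact match first, e.g. "fr-CA".
        if lang in translations:
            return translations[lang]
        # Then the base language, e.g. "fr" for "fr-CA".
        base = lang.split("-")[0]
        if base in translations:
            return translations[base]
    # Nothing available at all: let the caller decide what to show.
    return None


labels = {"en": "Painting", "fr": "Peinture"}
french_label = pick_translation(labels, ["fr-CA"])   # falls back to "fr"
arabic_label = pick_translation(labels, ["ar"])      # falls back to the default "en"
```

The same chain applies to category and taxonomy terms, which often lag behind the main content in translation.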
Translation workflow
• Drupal: Translation Management Tool
• Third-party services, e.g. Lingotek
• Transifex - free for Open Source
Right-to-left text
• <html dir="rtl">
• &rlm;
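When a template has to choose the `dir` attribute for user-supplied text, one common heuristic is to look for the first strongly directional character. The Python sketch below uses the standard `unicodedata` module; the helper name `base_direction` is hypothetical, and this is a simplification of the full Unicode bidirectional algorithm rather than a replacement for it.

```python
import unicodedata

# RIGHT-TO-LEFT MARK: the invisible character that the &rlm; entity produces.
RLM = "\u200f"


def base_direction(text: str) -> str:
    """Guess 'rtl' or 'ltr' from the first strongly directional character."""
    for ch in text:
        bidi_class = unicodedata.bidirectional(ch)
        if bidi_class in ("R", "AL"):   # Hebrew-style or Arabic-style letters
            return "rtl"
        if bidi_class == "L":           # Latin, Cyrillic, CJK and so on
            return "ltr"
    # Only digits, spaces or punctuation: default to left-to-right.
    return "ltr"


arabic_dir = base_direction("مرحبا")
english_dir = base_direction("Hello")
marked_dir = base_direction(RLM + "!")  # the mark itself counts as strong RTL
```

The last line shows why `&rlm;` helps with mixed directions: the mark is invisible but strongly right-to-left, so it pulls adjacent neutral punctuation to the correct side.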
Mixed directions
CSS
• Use a pre-processor!
• e.g. SASS, LESS
CSS
Arabic numbers
• Western Arabic versus Eastern Arabic numerals
• N.b. read left-to-right!
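Converting between the two digit sets is a one-to-one character mapping, and the most significant digit stays first in the string, consistent with the numbers being read left-to-right. A minimal Python sketch, assuming a hypothetical helper called `to_eastern_arabic`:

```python
WESTERN_DIGITS = "0123456789"
# Eastern Arabic digits occupy U+0660 to U+0669, in the same order as 0 to 9.
EASTERN_DIGITS = "".join(chr(0x0660 + d) for d in range(10))
TO_EASTERN = str.maketrans(WESTERN_DIGITS, EASTERN_DIGITS)


def to_eastern_arabic(number: int) -> str:
    """Render a non-negative integer using Eastern Arabic digits."""
    return str(number).translate(TO_EASTERN)


year = to_eastern_arabic(2015)
# Python's int() accepts any Unicode decimal digits, so the round trip works.
round_trip = int(year)
```

Separators, negative signs and decimal points vary by locale and are deliberately left out of this sketch.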
Images
• Avoid text in images
• Reverse image ordering
Exception: useful to control layout
Beware web fonts
• Unsupported characters, especially for mixed language
Beware web fonts
• Even transliteration can cause problems
• E.g. →
• See this article on diacritics
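A letter with a diacritic can be stored either as one precomposed code point or as a base letter followed by a combining mark, and fonts do not always render the two forms identically. One way to reduce the risk is to normalize text to the composed form (NFC) before output. The Python sketch below uses the standard `unicodedata` module; the helper name `to_composed` is an assumption for illustration.

```python
import unicodedata

composed = "\u00e9"      # é as a single code point
decomposed = "e\u0301"   # e followed by COMBINING ACUTE ACCENT


def to_composed(text: str) -> str:
    """Normalize to NFC so diacritics use precomposed characters where they exist."""
    return unicodedata.normalize("NFC", text)


# The two strings look the same on screen but compare as different
# until they are normalized to the same form.
same_before = composed == decomposed
same_after = to_composed(decomposed) == composed
```

Not every combination has a precomposed form, so normalization reduces font problems rather than eliminating them.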
Beware fonts
Beware fonts
Beware fonts
So what do you get?
Questions?
• tristanr@cogapp.com
• @tristan_roddis


Editor's Notes

  • #3 Worldwide: L1 speakers: 340 million; L2 speakers: 510 million; total: 850 million. Total population: 7.3bn => 12% total, < 5% first language. USA: English: 80%; Spanish: 12.4%; and non-English speakers growing: 1 in 11 in 1980, 1 in 5 in 2013. Increase reach, nationally and internationally.
  • #5 TMS went Unicode in TMS2012
  • #6 Article from 2003 but still very relevant
  • #17 Also: Hijri calendar
  • #20 Can composite images on kiosks to save installing fonts. Also: Welsh singing aliens for the BBC.
  • #22 Mixed language in title. Also n.b. reversed arrows due to reading direction. Viewed in Chrome with Gotham Rounded web font.
  • #23 Same page, viewed in IE9
  • #24 Spoiler: composed characters safer than decomposed
  • #25 Brand guidelines specify a very stylized font
  • #26 Test with native speakers: large blocks of text hard to read
  • #27 So switch to standard Arial for web use