This document discusses Myanmar Unicode and its implementation on operating systems and devices. It provides an overview of Unicode, how Myanmar characters are encoded in Unicode, and the benefits of Unicode for supporting multiple languages and platforms. It also addresses challenges in supporting Myanmar scripts, such as issues with existing fonts like Zawgyi and Ayar, and the need for rendering engines, input methods, and localization to properly display Myanmar on devices.
Myanmar Unicode
MyanmarCharacter set in
the Unicode Block.
Not only Character points but
also how to implement in
storage.
It covers all ethnics
languages in Myanmar.
What is Unicode?
Computers deal with numbers
Characters are assigned numbers
Before Unicode, these numbers were
assigned in many different ways;
Encoding, Code-Pages
Code-Point overloading
Data Corruption/No Interchangeable
11.
What is Unicode?
Unique number for every character
no matter what the platform
no matter what the program
no matter what the language
Easy to determine what the character is and
which language it is from
12.
What is Unicode?
Universal character set
All of the major scripts
Simple and Consistent Manner
Alphabetic, Syllabic & Ideographic
Version 6.1
1,114,112 code points
109,384 characters
fully conformant with the International Standard ISO/IEC
10646:2011
Benefit of Unicode
Can be incorporated into
Desktop Apps
Client-Server Apps
Multi-tiered/WebApps
Significant cost savings
Single implementation targeted across:
Multiple Platforms
Multiple Languages, countries
No re-engineering
Data to be transported through many different systems
No data corruption!
Zawgyi is notUnicode
Non-standard. We can’t use Ethnic languages with Zawgyi.
Not compatible with Burmese standard fonts from Microsoft, Apple and
Linux community.
Zawgyi is a writing system/font for the Burmese script. It is based on
Unicode technology and there is overlap for many of the characters. The
consequence is that a true Unicode compliant font will see some of the
Zawgyi characters but will fail to see others.
Ref; http://ultimategerardm.blogspot.com/2010/10/what-is-in-name-when-it-cant-
be-read.html
17.
Ayar is notUnicode
Unicode is a standard that defines the characters of a script and their
behaviour. As it is a standard, implementing a font for a script and calling it
Unicode means that the specifications have to be followed exactly.
It abuses Unicode by not having the characters in the designated places.
This time it is the Ayar Unicode Group that is the focus of attention. The
Ayar system gets the characters in the right place but there are questions
about the placement of characters that form the syllables. With 12% of
them in the wrong order, this affects around 70% of the words in Myanmar.
Ref; http://ultimategerardm.blogspot.com/2011/03/drama-that-is-unicode-support-
for.html
18.
Localization (L10N)
Processof adapting a product for
a particular Locale : [local custom].
Usually comes after Internationalization : [Process of
creating (or re-engineering)
a system to support multiple locales with a single set of
source code] in the shape of a package of services.
Inside a MyanmarUnicode Font
Opentype accept glyphs in TrueType or
Type1.
Maps character code to glyph. For
Myanmar scripts nos. of glyph requires
are more than nos. of characters
defined.
GSUB Table provide substitution
information
e.g. မှာမှျှ မှွှဖမှြှဖမှြွှ
GPOS Table provide positioning
information
e.g. မှ ို့မှ ုို့ငမှ ြို့
Why Square andDotted Circle
Rendering/Shaping Engine not being supported
Burmese in Display
Not configure well according to System
Font do not have set of complex text rendering rules.
Finally, check your bias enforcement in current
system setting
25.
Input Methods
OperatingSystem will support storage oriented
method. ဖျ တျျာျ်=တဖ ာ်
Keymagic is Free/Open Source Input Method
Software and being supported any keyboard layout
in Windows/Linux/Mac. Even in iPad.
http://keymagic.net
Standards Layout is MyanSan which is available in
Keyman software by Myanmar NLP Lab.
Unicode is not
technologies
font
bad guys
It is practice. productive practice.
Without Unicode, It is dark and uncomputable
such as searching, matching, sorting and naming.
30.
Devices can’t displayBurmese
Vendor do not support Burmese.
Technologies are not ready
Configuration might be wrong.
But Unicode Standards is ready for any devices.
letting you know every devices can’t display
burmese well because shaping will not have
burmese support.