© Zühlke 2014
Tobias Kniep
Effective
Internationalization for
Embedded Systems
26. February 2014
SafetyUsability Acceptance
Coded Character Set
“A mapping from a set of
abstract characters to a set
of integers.”
Ken Lunde,
CJKV Information Proces...
ASCII
128
JIS X 0213:2004
4354
Unicode BMP
65536
Unicode
1114112
Number of characters
ASCII
H e a l t h
0x48 0x65 0x61 0x6C 0x74 0x68
[0x48,0x65,0x61,0x6C,0x74,0x68]
Encoding Form
„A character encoding form plus
byte serialization “ Glossary of the
Unicode Consortium
„Mapping from a char...
Unicode
健 康
0x5065 0x5EB7
UTF-16LE:[0x65,0x50,0xB7,0x5E]?
UTF-8:[0xE5,0x81,0xA5,0xE5,0xBA,0xB7]
UTF-32BE:[0x00,0x00,0x50,0...
Endianess
Little Endian
Big Endian
[0x65,0x50,0xB7,0x5E]
[0x50,0x65,0x5E,0xB7]
健康(0x6550,0xB75E)
敐띞
UTF-8
H 0x48
¥ 0xC2,0xA5
健 0xE5,0x81,0xA5
𠀷 0xF0,0xA0,0x80,0xB7
0x48 01001000
0xC2 11000010
0xE5 11100101
0xF0 11110000
UTF-16
H 0x0048
¥ 0x00A5
健 0x5065
𠀷 0xD840,0xDC37
UTF-32
H 0x00000048
¥ 0x000000A5
健 0x00005065
𠀷 0x00020037
Font
“A collection of glyphs used
for the visual depiction
of character data.”
Glossary of the
Unicode Consortium
Pixel-Matrix
Here:
3 bits/pixel
No scaling
Easy to
display
Predictable
output
Bitmap Font
Large
Outline Font
Lines, Arcs,
CurvesNeeds
Renderer
Scaling
Output hard
to predict
Anti-aliasing
Character
Set Encoding Font
Ken Lunde, CJKV Information Processing
O’Reilly 2009
Contact me
Tobias Kniep
Effective Internationalization for Embedded Systems (Embedded World 2014)
Effective Internationalization for Embedded Systems (Embedded World 2014)
Effective Internationalization for Embedded Systems (Embedded World 2014)
Effective Internationalization for Embedded Systems (Embedded World 2014)
Effective Internationalization for Embedded Systems (Embedded World 2014)
Effective Internationalization for Embedded Systems (Embedded World 2014)
Effective Internationalization for Embedded Systems (Embedded World 2014)
Effective Internationalization for Embedded Systems (Embedded World 2014)
Effective Internationalization for Embedded Systems (Embedded World 2014)
Effective Internationalization for Embedded Systems (Embedded World 2014)
Upcoming SlideShare
Loading in …5
×

Effective Internationalization for Embedded Systems (Embedded World 2014)

1,838 views
1,785 views

Published on

Blogpost "Internationalization for embedded systems" http://blog.zuehlke.com/i18n/

Displaying text in the user’s language is taken for granted in the PC world. However, it is also becoming more common for embedded systems. Even small devices, which may have just had a segment display a couple of years ago, are now often capable of displaying high-quality characters on pixel displays.

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
1,838
On SlideShare
0
From Embeds
0
Number of Embeds
592
Actions
Shares
0
Downloads
21
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Effective Internationalization for Embedded Systems (Embedded World 2014)

  1. 1. © Zühlke 2014 Tobias Kniep Effective Internationalization for Embedded Systems 26. February 2014
  2. 2. SafetyUsability Acceptance
  3. 3. Coded Character Set “A mapping from a set of abstract characters to a set of integers.” Ken Lunde, CJKV Information Processing
  4. 4. ASCII 128 JIS X 0213:2004 4354 Unicode BMP 65536 Unicode 1114112 Number of characters
  5. 5. ASCII H e a l t h 0x48 0x65 0x61 0x6C 0x74 0x68 [0x48,0x65,0x61,0x6C,0x74,0x68]
  6. 6. Encoding Form „A character encoding form plus byte serialization “ Glossary of the Unicode Consortium „Mapping from a character set definition to the actual code units used to represent the data.” Encoding Scheme
  7. 7. Unicode 健 康 0x5065 0x5EB7 UTF-16LE:[0x65,0x50,0xB7,0x5E]? UTF-8:[0xE5,0x81,0xA5,0xE5,0xBA,0xB7] UTF-32BE:[0x00,0x00,0x50,0x65,0x00,0x00,0x5E,0xB7] å?¥åº·
  8. 8. Endianess
  9. 9. Little Endian Big Endian [0x65,0x50,0xB7,0x5E] [0x50,0x65,0x5E,0xB7] 健康(0x6550,0xB75E) 敐띞
  10. 10. UTF-8 H 0x48 ¥ 0xC2,0xA5 健 0xE5,0x81,0xA5 𠀷 0xF0,0xA0,0x80,0xB7 0x48 01001000 0xC2 11000010 0xE5 11100101 0xF0 11110000
  11. 11. UTF-16 H 0x0048 ¥ 0x00A5 健 0x5065 𠀷 0xD840,0xDC37
  12. 12. UTF-32 H 0x00000048 ¥ 0x000000A5 健 0x00005065 𠀷 0x00020037
  13. 13. Font “A collection of glyphs used for the visual depiction of character data.” Glossary of the Unicode Consortium
  14. 14. Pixel-Matrix Here: 3 bits/pixel No scaling Easy to display Predictable output Bitmap Font Large
  15. 15. Outline Font Lines, Arcs, CurvesNeeds Renderer Scaling Output hard to predict
  16. 16. Anti-aliasing
  17. 17. Character Set Encoding Font
  18. 18. Ken Lunde, CJKV Information Processing O’Reilly 2009
  19. 19. Contact me Tobias Kniep

×