Unicodev 1.0๋ฐ•์ผhttp://AnDStudy.comhttp://parkpd.egloos.com
๋ฌธ์ž? ๋ฌธ์ž์…‹? ์ธ์ฝ”๋”ฉ? ํฐํŠธ?๋ฌธ์ž๋Š” ๋Œ€์†Œ๋ฌธ์ž ๊ตฌ๋ณ„์„ ํ•œ๋‹ค. ์˜์–ด ๋ฌธ์ž๋Š” 52 ๊ฐœCharacter Code : ๋ฌธ์ž๋ฅผ ํ‘œํ˜„ํ•˜๋Š” ๋ฐ์ดํ„ฐ๊ฐ’A : 65, B : 66 in ASCII๋ฌธ์ž์…‹(Character Set) : ํ•˜๋‚˜์˜ ์–ธ์–ด๊ถŒ์—์„œ ์‚ฌ์šฉํ•˜๋Š” ์–ธ์–ด๋ฅผ ํ‘œํ˜„ํ•˜๊ธฐ ์œ„ํ•œ ๋ฌธ์ž๋“ค์˜ ์ง‘ํ•ฉ์ธ์ฝ”๋”ฉ: ๋ฌธ์ž์…‹๊ณผCharacter Code ์™€์˜ mappingASCII ๋„ ์ธ์ฝ”๋”ฉ ๋ฐฉ๋ฒ•์˜ ํ•˜๋‚˜ํฐํŠธ : glyphs ์ง‘ํ•ฉ์ผ๋ณธ์–ด : MS_Gothic, MS_Minch์ค‘๊ตญ์–ด : SimSun, PSimsunํฐํŠธglyphs(๊ธ€๋ฆฌํ”„) : ๋ฌธ์ž ํ‘œํ˜„๊ทธ๋ฆผ[๋„์•ˆ] ํ‘œ์ง€, [๊ฑด์ถ•]์žฅ์‹์šฉ ์„ธ๋กœํ™ˆ, [๊ณ ๊ณ ํ•™] ๊ทธ๋ฆผ ๋ฌธ์ž, ์ƒํ˜• ๋ฌธ์žTimes New Roman Bold A : AArial Bold A : A
ASCII26x2(์•ŒํŒŒ๋ฒณ ๋Œ€์†Œ๋ฌธ์ž) + 10(์ˆซ์ž) + ํŠน์ˆ˜๋ฌธ์ž + ํ†ต์ œ๋ฌธ์ž ->128๊ฐœ ์ดํ•˜(2^7)์˜›๋‚  ์›Œ๋“œ์Šคํƒ€์—์„œ๋Š” ๋‚˜๋จธ์ง€ 1 bit ๋ฅผ ์ œ์–ด์šฉ์œผ๋กœ ์‚ฌ์šฉ
์„œ์œ ๋Ÿฝ์œผ๋กœ ๊ฐ„ ASCII์›€๋ผ์šฐํŠธ ๋“ฑ์„ ํ‘œํ˜„ํ•˜๊ธฐ ์œ„ํ•ด 7bit ์— 1bit ์ถ”๊ฐ€ (2^8)ASCII ํ™•์žฅ ๋ฌธ์ž์…‹์„ISO ๊ฐ€ ๊ด€๋ฆฌํ•˜๊ฒŒ ๋จISO/IEC 8859-1 	๋ผํ‹ด-1 ์„œ์œ ๋ŸฝISO/IEC 8859-2 	๋ผํ‹ด-2 ์ค‘์•™์œ ๋Ÿฝ ๋ถ€ํ„ฐ...ISO/IEC 8859-16 ๋ผํ‹ด-10 ๋‚จ๋™์œ ๋Ÿฝ ๊นŒ์ง€
์ผ๋ณธ์œผ๋กœ ๊ฐ„ ๋ฌธ์ž์…‹1๋ฐ”์ดํŠธ๋กœ ์ผ๋ณธ์–ด๋ฅผ ํ‘œํ˜„ํ•˜๊ธฐ๊ธ€์ž๊ฐ€ ๋‘ฅ๊ธ€์–ด ๊ทธ๋ฆฌ๊ธฐ ์–ด๋ ค์šด ํžˆ๋ผ๊ฐ€๋‚˜(ใ‚ใ„ใ†ใˆใŠ) ๋Œ€์‹  ์นดํƒ€๊ฐ€๋‚˜(ใ‚ขใ‚คใ‚ฆใ‚จใ‚ช) ๋ฅผ ๋‚˜๋จธ์ง€ 128 ๋น„ํŠธ ๊ณต๊ฐ„์— ๋„ฃ์ž์˜์–ด์™€ ํฌ๊ธฐ๋ฅผ ๊ฐ™๊ฒŒ ํ•˜๊ธฐ ์œ„ํ•ด "๋ฐ˜๊ฐ(ๅŠ่ง’)๋ฌธ์ž, Half-Width Katakanaโ€œ ์‚ฌ์šฉMBCS - Multi Byte Character Set ๋“ฑ์žฅ์ตœ์ƒ์œ„ ๋น„ํŠธ๊ฐ€ 0 ์ด๋ฉด ASCII Code ๋กœ ํ•ด์„1 ์ด๋ฉด 2 ๋ฐ”์ดํŠธ๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ์ผ๋ณธ์–ด ๋ฌธ์ž์…‹์„ ์ฐพ๋Š”๋‹ค์˜ˆ : 0xA1 0x72 0xA3 0x70 0x52 0xA2 0xA31 ๋ฐ”์ดํŠธ๊ฐ€ 0x00 ~ 0x7F (0~127)๊นŒ์ง€์˜ ๊ฐ’์ด๋ผ๋ฉด ASCII ๋ฌธ์ž์ด๋‹ค.์„œ์œ ๋Ÿฝ์–ด๋Š”0x80 ~ 0xA0 (์˜ˆ์•ฝ๋ฒ”์œ„)๊นŒ์ง€ (128 ~ 160) ๊ณต๊ฐ„์„ ๋™์•„์‹œ์•„ MBCS ๋ฅผ ์œ„ํ•ด์„œ ๋น„์›Œ๋†“์•˜๋‹ค.
ํ•œ๊ตญ ๋ฌธ์ž์…‹- ์™„์„ฑํ˜•๊ณผ ์กฐํ•ฉํ˜•์™„์„ฑํ˜• : ์™„์„ฑํ˜•ํ•œ๊ธ€ 2350์ž, ํ•œ์ž(4884๊ฐœ), ์ˆซ์ž,โ€ฆโ€œ๊ฐ•โ€œ : 0xB0C1 (0xB000 + 0xC0 + 0x1)์กฐํ•ฉํ˜• : ์ดˆ์„ฑ"ใ„ฑ"๊ณผ ์ค‘์„ฑ"ใ…"๋ฅผ ์กฐ๋ฆฝํ•œ โ€œ๊ฐ€โ€ ๋Š” 0x1100,0x1161 ๋กœ ๋‚˜ํƒ€๋‚ผ ์ˆ˜๋„ ์žˆ๋‹ค.์ดˆ์„ฑ โ€˜ใ„ฑโ€™: 0x1100 HANGUL CHOSEONG KIYEOK์ค‘์„ฑ โ€˜ใ…โ€™:0x1161 HANGUL JUNGSEON Aํ™•์žฅ 1bit, ์ดˆ์„ฑ5bit, ์ค‘์„ฑ 5bit, ์ข…์„ฑ 5bit
EUCExtened Unix Code(ํ™•์žฅ ์œ ๋‹‰์Šค ์ฝ”๋“œ)8๋น„ํŠธ ๋ฌธ์ž ์ธ์ฝ”๋”ฉ ๋ฐฉ์‹ISO 2022 ํ‘œ์ค€ ๊ธฐ๋ฐ˜EUC-KR ์€ KS X 1001, KS X 1003 ์‚ฌ์šฉํ•œ๊ธ€ ์™„์„ฑํ˜• ์ธ์ฝ”๋”ฉKS X 1003 ๋Š” ์—ญ์Šฌ๋ž˜์‰ฌ ๋Œ€์‹  \ ์‚ฌ์šฉํ•˜๋Š” ๊ฒƒ๋งŒ ์ œ์™ธํ•˜๋ฉด ASCII ์ฝ”๋“œ์™€ ๋™์ผKS X 1001 ์€ ํ•œ๊ธ€, ๊ทธ๋ฆผ ๋ฌธ์ž, ํ•œ์ž ๋“ฑ์„ ํฌํ•จ128๋ณด๋‹ค ์ž‘์€ ๋ฐ”์ดํŠธ์— KS X 1003 ๋ฐฐ๋‹น128๋ณด๋‹ค ํฌ๊ฑฐ๋‚˜ ๊ฐ™์€ ๋ฐ”์ดํŠธ์— KS X 1001 ๋ฐฐ๋‹น์‹ค์ œ ์‚ฌ์šฉ๊ณต๊ฐ„์ด ์ƒ์œ„๋ฐ”์ดํŠธ 161-254, ํ•˜์œ„๋ฐ”์ดํŠธ 161-254 ๋ฟ์ด์—ˆ๊ธฐ ๋•Œ๋ฌธ์— โ€˜๋˜ โ€™์ด๋‚˜ โ€˜๋ทโ€™ ๊ฐ™์€ ํ•œ๊ธ€์ด ๋น ์ง.
CP949MS ๊ฐ€ KS X 1001 ์— ์—†๋Š” ํ•œ๊ธ€ 8822 ์ž๋ฅผ ์ถ”๊ฐ€ํ•ด EUCKR ๋ฅผ ํ™•์žฅํ•œ ์™„์„ฑํ˜• ์ธ์ฝ”๋”ฉks_c_5601-1987์›๋ž˜๋Š” CodePage๋ฒˆํ˜ธ์˜€์œผ๋‚˜ ์ง€๊ธˆ์€ EUCKR ์˜ ํ™•์žฅํ˜•์ธ ํ•œ๊ธ€ ์ธ์ฝ”๋”ฉ ๋ฐฉ์‹์„ ์ง€์นญํ•˜๋Š” ์ด๋ฆ„์ด ๋˜์—ˆ๋‹คks_c_5601-92 ๋„ ์žˆ๋Š” ๋“ฏ
iso2022-kr ๊ณผ KPS-9566iso2022-krEucKR์„ 7bit ๋งŒ ์‚ฌ์šฉํ•˜๋ฉฐ ํ‘œํ˜„ํ•˜๋Š” ๋ฐฉ์‹์œผ๋กœ RFC1557 ์— ์ •์˜KPS-9566 : ๋ถํ•œ ์œ ์ผ์˜ ๊ณ ์œ  ๋ฌธ์ž์…‹ํ•œ๊ธ€ ๋ชจ์–‘์€ ์šฐ๋ฆฌ๋ณด๋‹ค 300๊ธ€์ž ์ •๋„ ๋งŽ๊ณ ํ•œ์ž๋Š” 200๊ธ€์ž ์ •๋„ ์ ๋‹คํ•œ๊ธ€ ์‹œ์ž‘์ด โ€˜๊ฐ€โ€™ ๊ฐ€ ์•„๋‹Œ โ€˜๊น€์ผ์„ฑ๊น€์ •์ผโ€™ 6 ๊ธ€์ž๊ฐ€ ๋จผ์ € ๋ฐฐ์น˜๋˜์–ด ์žˆ๋‹ค๊ณ ...์ž์Œ ์ •๋ ฌ ์ˆœ์„œใ„ฑใ„ดใ„ทใ„นใ…ใ…‚ใ……ใ…ˆใ…Šใ…‹ใ…Œใ…ใ…Žใ„ฒใ„ธใ…ƒใ…†ใ…‰ใ…‡
Code Page์ •์˜ : OS ์—์„œ ์„ ํƒํ•œ character code ๋“ค์„ ํŠน์ •ํ•œ ์ˆœ์„œ๋กœ ์ •๋ฆฌํ•ด ๋†“์€ ๋ชฉ๋ก(IBM, MS)another name for character encoding(from wikipedia)ํ™œ์„ฑ ์ฝ”๋“œ ํŽ˜์ด์ง€ : 949 (์™„์„ฑํ˜• ํ™•์žฅ)ํ•œ๊ธ€ ์กฐํ•ฉํ˜• : Code Page 1361์˜์–ด : ANSI-437์ด์Šค๋ผ์—˜ : ANSI-862๋กœ์ผ€์ผ utf-8 : 65001์ธ์ฝ”๋”ฉ๋œ ๋ฐ์ดํ„ฐ๋ฅผ ์–ด๋–ป๊ฒŒ ํ•ด์„ํ•  ๊ฒƒ์ธ๊ฐ€ CHCP (change code page)Code Page Identifiershttp://msdn.microsoft.com/en-us/library/dd317756
๋ฌธ์ œ์ ๋‹ค๋ฅธ CodePage์—์„œ ํŒŒ์ผ์„ ์—ด๋ฉด ๊ธ€์ž๊ฐ€ ๊นจ์ ธ ๋ณด์ž„์—ฌ๋Ÿฌ ๋‚˜๋ผ์˜ ๋ฌธ์ž์…‹์„ ๊ฐ™์ด ๋ณด์—ฌ์ค„ ์ˆ˜ ์—†์Œ์†Œํ”„ํŠธ์›จ์–ด๋ฅผ ๋ฐ”์ด๋„ˆ๋ฆฌ ํ•˜๋‚˜๋กœ ์—ฌ๋Ÿฌ ๋‚˜๋ผ์— ํŒ๋งคํ•  ์ˆ˜ ์—†์ŒDOS ์‹œ์ ˆ ์ผ๋ณธ ๊ฒŒ์ž„ ๋•Œ๋ฌธ์— ์ธ์ฝ”๋”ฉ ๋ฐ”๊ฟจ๋‹ค๋ฉด ๋‚˜์ค‘์— ์ธ์ฝ”๋”ฉ์„ ๋Œ๋ ค๋†”์•ผ ํ–ˆ๋‹ค๋ชจ๋“  ๋ฌธ์ž๋ณ„๋กœ ์œ ์ผํ•œ ๊ฐ’์„ ํ• ๋‹นํ•˜๊ณ  ์‹ถ๋‹ค
Unicode ์‹œ์ž‘๋ชจ๋“  ๋ฌธ์ž๋ณ„๋กœ ์œ ์ผํ•œ Character Code ๋ฅผ ์ง€์ •ํ•˜์ž1984๋…„ ISO(๊ตญ์ œํ‘œ์ค€๊ธฐ๊ตฌ)๋Š” ISO 10646 ๊ตญ์ œ ํ‘œ์ค€ ์ฒด๊ฒฐ -> ๋ชจ๋“  ๋ฌธ์ž๋ฅผ 4 ๋ฐ”์ดํŠธ๋กœ1993๋…„ 5์›”๊ทธ๋ฆฌ์Šค ์•„ํ…Œ๋„ค ํšŒ์˜ : ์ตœ์ข… ํ™•์ •Unicode Working Group(1989๋…„)Apple, Xerox, Sun, Microsoft, NeXT : 2 ๋ฐ”์ดํŠธUnicode ์ปจ์†Œ์‹œ์—„์˜ ์ œ์•ˆ ์ผ๋ถ€๋ฅผ ISO ์—์„œ ์ˆ˜์šฉISO 10646-1Universal(Multiple-Octet Coded) Character Set: UCS๋•๋ถ„์— Unicode ๊ฐ€ UCS ์˜ ์„œ๋ธŒ์…‹์ด ๋˜์—ˆ์Œ๊ฐ€์žฅ ์ตœ์‹  ๋ฒ„์ „ ํ‘œ์ค€Unicode 5.2ISO/IEC 10646:2003 plus Amendments 1,2,3,4,5,6
Unicode ๊ตฌ์กฐ๋ฌธ์ž๋ณ„๋กœ ๋ฒˆํ˜ธ(์ฝ”๋“œ ํฌ์ธํŠธ Code Point) ์ง€์ •U+0041U+ ๋Š” Unicode0041 : ์ฝ”๋“œ ํฌ์ธํŠธ ๊ฐ’์œผ๋กœ 16 ์ง„์ˆ˜๋กœ ํ‘œ๊ธฐU+0041 ๋Š” ์˜์–ด ์•ŒํŒŒ๋ฒณ 'Aโ€™U+AC00 : ํ•œ๊ธ€ '๊ฐ€โ€˜U+0000~U+00FF ์˜์—ญ์€ ISO 8859-1 ๋ฌธ์ž์…‹๊ณผ ๋™์ผํ•œ๊ธ€์€ U+AC00 ~ U+D7AF ์˜์—ญ์— ์ •์˜0x10FFFF^2 : 100๋งŒ๊ฐœ ๊ธ€์ž(10๋งŒ๊ฐœ ์‚ฌ์šฉ)
Unicode ์ฒด๊ณ„BMP (Basic multilingual Plane. ๊ธฐ๋ณธ์–ธ์–ดํŒ)์ตœ์ดˆ 65536(2^16) ๊ฐœ์˜ ๋ฌธ์ž ํ• ๋‹น๋˜๋Š” ์˜์—ญ.Unicode 3.0 : 49,194 ๋ฌธ์ž ์ •์˜UCS-2 ๊ณผ ๋™์ผํŠนํžˆ ํ•œ๋ฌธ์—์„œ ํ•„์š”๋ฌธ์ž๊ฐ€ ๋Š˜์–ด๋‚˜๋ฉด์„œ ๋ณด์ถฉ์–ธ์–ดํŒ(Supplementary Plaines)์„ ์ •์˜Unicode 3.1 ์—์„œ๋Š” BMP ์— 2๊ฐœ ๋ฌธ์ž ์ถ”๊ฐ€, ๋ณด์ถฉ์–ธ์–ดํŒ์—44,944 ๊ฐœ ๋ฌธ์ž ์ถ”๊ฐ€์Œํ‘œ,๊ณ ๋Œ€๋ฌธ์ž,ํ•œ์ž(CJK Ideographic Extension B)	CJK : ํ•œ๊ตญ, ์ค‘๊ตญ, ์ผ๋ณธUnicode 3.1: 49,194 + 44,944 = 94,140
UCS ์ฒด๊ณ„Cell : ํ•œ ๊ฐœ์˜ ๋ฌธ์ž๊ฐ€ ํ• ๋‹น๋˜๋Š” ๊ณต๊ฐ„Plane : 256 * 256๊ฐœ์˜ cell ๋ฌถ์Œ  65536(0xFFFF) ๊ฐœ -> UCS-2BMP : Plain 00Group : 256 ๊ฐœ์˜ Plane ๋ฌถ์Œ(7F ๊ฐœ)
Unicode ํ‘œํ˜„'Aโ€™ : U+0041Group 00, Plane 00, Cell 41'๊ฐ€โ€™ : U+AC00Group 00, Plane 00, Cell 41โ™ช : U+1D160Group 00, Plane 01, Cell D160์ฆ‰, Plain ๋ฒˆํ˜ธ 5๋น„ํŠธ, Cell ๋ฒˆํ˜ธ 16๋น„ํŠธ21๋น„ํŠธ ๊ณต๊ฐ„ ์‚ฌ์šฉ
Unicode ์ธ์ฝ”๋”ฉUTF-32UTF-16UTF-8UTF-7email ์šฉUCS-2UCS-4๋ชจ๋“  Unicode ํ‘œํ˜„ํ•  ์ˆ˜ ์žˆ์œผ๋ฏ€๋กœ ์„œ๋กœ ๋ฌด์†์‹ค ๋ณ€ํ™˜ ๊ฐ€๋Šฅ
UTF-32๋ชจ๋“  ๋ฌธ์ž๋ฅผ ์ฝ”๋“œ ํฌ์ธํŠธ ๊ฐ’ ์œ ์ง€ํ•˜๋ฉด์„œ 32 ๋น„ํŠธ๋กœ ๋งŒ๋“ ๋‹ค. (๊ณ ์ •๊ธธ์ด)linux์˜ ๊ฒฝ์šฐ wchar_t์˜ ํฌ๊ธฐ๊ฐ€ 32bit ๋ผ์„œ mbstowcs()๋ฅผ ์ด์šฉํ•ด์„œ ๋ณ€ํ™˜ ํ›„ ๊ณ ์ •๊ธธ์ด ์ธ์ฝ”๋”ฉ์ฒ˜๋Ÿผwcsํ•จ์ˆ˜๋ฅผ ์‚ฌ์šฉํ•˜๋ฉด ๋œ๋‹ค.UCS-4 ์˜ ๋ถ€๋ถ„์ง‘ํ•ฉ(17 ๊ฐœ์˜ ์–ธ์–ดํŒ๋งŒ ์ •์˜)
UTF-16BMP ์˜์—ญ ์•ˆ(U+0000-U+FFFF)์˜ ๋ฌธ์ž๋Š” ๊ทธ๋Œ€๋กœ ํ‘œํ˜„, ๋ฐ–์˜ ๋ฌธ์ž๋Š” ๋ณ€ํ™˜ ํ•„์š” (๊ฐ€๋ณ€๊ธธ์ด)Windows2000 ๊ณผ ์ดํ›„ ๋ฒ„์ „์€ UTF-16 ๊ธฐ๋ฐ˜. ์ด์ „ NT ์ปค๋„์€UCS-2 ๊ธฐ๋ฐ˜Java 2/Java 5๋Š” UCS2/UTF-16์— ์˜์กดUCS-2 ๋ณด๋‹ค ํ™•์žฅ๋œ ๊ฐœ๋…
UTF-16 ๋ณ€ํ™˜ ๊ทœ์น™Surrogate Pair (U+D800~U+DFFF) ์—๋Š” ๋ฌธ์ž ํ• ๋‹น๋˜์–ด ์žˆ์ง€ ์•Š์Œ
UTF-8'Aโ€™ : U+0041๊ฐ™์€ UTF-16 ๋ฅผ char ๋กœ ์ฝ์œผ๋ฉด 00 (null) ๋ฌธ์ž์—ด ๋•Œ๋ฌธ์— ๊ธฐ์กด ํ•จ์ˆ˜๊ฐ€ ์˜ค์ž‘๋™<html><head> <meta http-equiv=โ€œContent-Typeโ€ content=โ€œtext/html;charset=utf-8โ€>Charset๊นŒ์ง€๋Š” ascii๋กœ ์ฝ๊ณ  charset์ฝ์€ ํ›„์— ์ธ์ฝ”๋”ฉ์— ๋งž์ถฐ์„œ ํŒŒ์‹ฑ ์‹œ์ž‘. ๊ทธ๋Ÿฌ๋‹ˆ charset์ด์ „์— unicode์ธ์ฝ”๋”ฉ ๊ธ€์ž๊ฐ€ ๋“ค์–ด๊ฐ€๋ฉด ์•ˆ ๋จ์›น์˜ ์‹ค์งˆ์  ํ‘œ์ค€, ๋งŽ์€ *nix ์‹œ์Šคํ…œ, xml, python ์€ UTF-8 ์„ ๊ฐ€์žฅ ๊ธฐ์ดˆ์ ์ธ ์ธ์ฝ”๋”ฉ์œผ๋กœ ์‚ฌ์šฉ๊ธ€์ž ๊ธธ์ด๋ฅผ ์•Œ๋ ค๋ฉด ์ „์ฒด ๊ธ€์„ ํŒŒ์‹ฑํ•ด์•ผ ํ•จ
Unicode ํ•œ๊ธ€์—์„œ ๋ฐ›์นจ ์•Œ๊ธฐ์œ ๋‹ˆ์ฝ”๋“œ 2.0 : ํ•œ๊ธ€์€ ์ดˆ์„ฑ 19๊ฐœ, ์ค‘์„ฑ 21๊ฐœ, ์ข…์„ฑ 28๊ฐœ(์—†์Œ๋„ ํฌํ•จ)๊ฐ€ ์žˆ๋‹ค. ์ดˆ์„ฑ 19๊ฐœ๋ฅผ 0...18๊นŒ์ง€ ๋ฒˆํ˜ธ๋ฅผ ๋ถ™์ด๊ณ  ์ค‘์„ฑ๋„ 0...20, ์ข…์„ฑ๋„ ์—ญ์‹œ 0...27๊นŒ์ง€ ๋ฒˆํ˜ธ๋ฅผ ๋ถ™์ธ๋‹ค๋ฉด, ์›ํ•˜๋Š” ์ฝ”๋“œ๋Š” 0xAC00 + x*21*28 + y*28 + z (x=์ดˆ์„ฑ๋ฒˆํ˜ธ, y=์ค‘์„ฑ๋ฒˆํ˜ธ, z=์ข…์„ฑ๋ฒˆํ˜ธ)๋กœ ๋งŒ๋“ค ์ˆ˜ ์žˆ๋‹ค. ์ข…์„ฑ์—์„œ 0 ๋ฒˆ์งธ์— ํ•ด๋‹นํ•˜๋Š” ๊ฒƒ์€ '์—†์Œ'์ด๋ฏ€๋กœ ์œ ๋‹ˆ์ฝ”๋“œ๊ฐ’์—์„œ 0xAC00์„ ๋บ€ ํ›„์— 28๋กœ ๋‚˜๋ˆ„์–ด ๋–จ์–ด์ง€๋Š”์ง€ ํ™•์ธํ•˜๋ฉด ๋ฉ๋‹ˆ๋‹ค.http://jof4002.net/Unicodewchar_t* pString = L"๊ฐ€๊ฐ๋‚˜๋“ฏ";cout << (pString[0] - 0xAC00) % 28 << endl;  // 0cout << (pString[1] - 0xAC00) % 28 << endl;  // 1cout << (pString[2] - 0xAC00) % 28 << endl;  // 0cout << (pString[3] - 0xAC00) % 28 << endl;  // 19
Unicode ๋ณ€ํ™˜USES_CONVERSION;pI->SomeFunctionNeedsUnicode(T2OLE(lpszA));๋งคํฌ๋กœ ์ธ์ž๊ฒฐ๊ณผA2CW 	(LPCSTR) 		(LPCWSTR)A2W 		(LPCSTR) 		(LPWSTR)W2CA 	(LPCWSTR) 	(LPCSTR)W2A 		(LPCWSTR) 	(LPSTR)T2COLE 	(LPCTSTR) 		(LPCOLESTR)T2OLE 	(LPCTSTR) 		(LPOLESTR)OLE2CT 	(LPCOLESTR) 	(LPCTSTR)OLE2T 	(LPCOLESTR) 	(LPCSTR)
Unicode in VC++std::locale::global(std::locale("" ));wcin.imbue(locale("korean")); ์™€ wcout.imbue(locale("korean"));wcout.fail() ๋กœํ™•์ธํ•˜๊ณ , wcout.clear();_setmode(_fileno(stdout), _O_U16TEXT);
UTF-16 ๋ฌธ์ž ๊ฐœ์ˆ˜ ๊ตฌํ•˜๊ธฐcode snippet http://dodoubt.tistory.com/40์ฐธ๊ณ 
BOM(Byte Order Mark)ํŒŒ์ผ์ด ์–ด๋–ค ์‹์œผ๋กœ ์ธ์ฝ”๋”ฉ๋˜์–ด ์žˆ๋Š”์ง€ ์•Œ๋ ค์ฃผ๋Š” ํ—ค๋” ์—ญํ• UTF-32, big-endian : 00 00 FE FFUTF-32, little-endian : FF FE 00 00UTF-16, big-endian : FE FFUTF-16, little-endian : FF FEUTF-8 : EF BB BFUTF-8 ์—์„œ๋Š” BOM ์‚ฌ์šฉ์„ ๋ณ„๋กœ ๊ถŒ์žฅํ•˜์ง€ ์•Š์Œ. UTF-8 ์ด ๊ธฐ๋ณธ ์–ธ์–ด๋Š” ASCII ์™€ ํ˜ธํ™˜๋œ๋‹ค๋Š” ์žฅ์ ์ด ์žˆ๋Š”๋ฐ, BOM ์ฒ˜๋ฆฌ๋ฅผ ํ•˜์ง€ ์•Š๋Š” editor ๋‚˜ ์›นํŽ˜์ด์ง€์—์„œ๋Š” BOM ์„ iโ‰ซยฟ ๋กœ ์ถœ๋ ฅํ•  ์ˆ˜ ์žˆ๋‹ค.
Font๋ฌธ์ž -> ์œ ๋‹ˆ์ฝ”๋“œ -> ์œ ๋‹ˆ์ฝ”๋“œ ์ธ์ฝ”๋”ฉ-> ํ™”๋ฉด ๊ทธ๋ฆฌ๊ธฐ์œ ๋‹ˆ์ฝ”๋“œ ํฐํŠธArial Unicode MS(ARIALUNI.TTF, 22,730KB)ํ•จ์ดˆ๋กฑ์ฒด, ํ•œ์ปด๋ฐ”ํƒ• : http://maplestory.pe.kr/1785๊ณ ์ •๊ธธ์ด ํฐํŠธ(Monospace Font)๊ตด๋ฆผ์ฒด, ๋ฐ”ํƒ•์ฒด, ๋‹์›€์ฒด๊ฐ€๋ณ€๊ธธ์ด ํฐํŠธ๊ตด๋ฆผ, ๋ฐ”ํƒ•, ๋‹์›€์ƒ๊ด€์—†์ง€๋งŒ ๋‚˜๋ˆ”๊ณ ๋”• ์ฝ”๋”ฉ๊ธ€๊ผดhttp://dev.naver.com/projects/nanumfont/downloadBitstream Vera Sans Mono + ๋ง‘์€๊ณ ๋”•http://ggotbo.egloos.com/2334938
Console ์—์„œ์˜ ํฐํŠธ[HKEY_CURRENT_USER\Console\%SystemRoot%_system32_cmd.exe]"CodePage"=dword:000001b5"FontSize"=dword:000c0000"FontFamily"=dword:00000036"FontWeight"=dword:00000190"FaceName"=" ๊ตด๋ฆผ์ฒดโ€œ๋ช…๋ น ์ฐฝ์—์„œ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ๋„๋ก ๊ธ€๊ผด์— ๋Œ€ํ•ด ํ•„์š”ํ•œ ์กฐ๊ฑด	fixed-pitch font, not italic font,no negative A or C space	if (TrueType) FF_MODERN else OEM_CHARSEThttp://support.microsoft.com/kb/Q247815๋ช…๋ น ํ”„๋กฌํ”„ํŠธ ๋””ํดํŠธ ํฐํŠธ ๋ฐ”๊พธ๋Š” ๋ฒ•http://pcwinvista.tistory.com/340http://dodoubt.tistory.com/34
charmap
์ด๋ฐ์•„๋ฌธ์ž-> ์œ ๋‹ˆ์ฝ”๋“œ-> ์œ ๋‹ˆ์ฝ”๋“œ ์ธ์ฝ”๋”ฉ-> ํฐํŠธ
์ธ์ฝ”๋”ฉSBCS(Single Byte Character Set)ASCIIMBCS(Multi Byte Character Set)UTF-16, UTF-8๋ฌธ์ž์—ด ๊ธธ์ด๋ฅผ ๋ฐ”๋กœ ์•Œ ์ˆ˜ ์—†๋‹ค.WBCS(Wide Byte Character Set)UTF-32, UCS-2, UCS-4๋ฌธ์ž์…‹๊ณผ์ธ์ฝ”๋”ฉ์ด ๋™์ผSBCD, MBCS, WBCS ๋Š” ์ธ์ฝ”๋”ฉ ๋ฐฉ๋ฒ•์ด์ง€ ์ธ์ฝ”๋”ฉ์ด ์•„๋‹˜
ReferenceUnicode ์˜ ์ดํ•ด โ€“ novo networkshttp://www.novonetworks.com/jamestic/Unicode_1.0.pdf์ง„์ˆ™์˜ ์œ ๋‹ˆ์ฝ”๋“œ ์ž…๋ฌธ์„œhttp://www.kristalinfo.com/K-Lab/unicode/Unicode_intro-kr.htmlMBCS ์™€ ์œ ๋‹ˆ์ฝ”๋“œ	http://www.animalpicturesarchive.com/jinsuk-kim/diary/read.php?2006/0203์œ„ํ‚ค๋ฐฑ๊ณผโ€“ ์œ ๋‹ˆ์ฝ”๋“œ, ์œ ๋‹ˆ์ฝ”๋“œ ๋ฒ”์œ„ ๋ชฉ๋กUnicode 5.2 Character Code Chartshttp://www.unicode.org/charts/์กฐ์—˜ ์˜จ ์†Œํ”„ํŠธ์›จ์–ด : ์œ ๋‹ˆ์ฝ”๋“œ์™€ ๋ฌธ์ž์ง‘ํ•ฉ์— ๋Œ€ํ•œ ๊ณ ์ฐฐhttp://www.joelonsoftware.com/articles/Unicode.htmlCharacter sets and codepageshttp://www.microsoft.com/typography/unicode/cscp.htmhttp://www.microsoft.com/typography/unicode/1250.gifํ•œ๊ธ€ ์ฝ”๋“œํŽ˜์ด์ง€ http://www.unicode.org/charts/PDF/UAC00.pdfKS C 5601 ์™„์„ฑํ˜• ์ฝ”๋“œhttp://zbxe.bluegate.kr/42http://whatisthat.co.kr/6
Referencehttp://jof4002.net/UnicodeVC++ : ์œ ๋‹ˆ์ฝ”๋“œ๋ฅผ ํ‘œ์ค€ ์ถœ๋ ฅ์— ๋‚ด๋ณด๋‚ด๊ธฐhttp://kaistizen.net/EE/index.php/weblog/comments/unicode_hangul_to_stdout/IdeAthinKING - C fileโ€™s orientationhttp://ideathinking.com/blog/?p=108http://ideathinking.com/blog/?p=109rein : ์ธ์ฝ”๋”ฉ๊ณผ ๋ฌธ์ž์ง‘ํ•ฉ: Unicodehttp://rein.kr/blog/archives/280rein : Windows Character Encoding: UCS2? UTF-16?http://rein.kr/blog/archives/585STL string ์‚ฌ์šฉ์‹œ wstring์ผ๋•Œ, ์ถœ๋ ฅ์ด ๋˜์ง€ ์•Š์Šต๋‹ˆ๋‹ค. http://kldp.org/node/93573http://en.wikipedia.org/wiki/Code_pagehttp://gpgstudy.com/gpgiki/์œˆ๋„์šฐ ๋‹ค๊ตญ์–ด ํ”„๋กœ๊ทธ๋ž˜๋ฐMBCS ์™€ UNICODE FAQ ์ •๋ฆฌhttp://mynotepad.tistory.com/67
ReferenceUnicode - (1) ๊ฐœ๋…http://dodoubt.tistory.com/29Standard output์œผ๋กœ unicode๋ฌธ์ž๋ฅผ ์ถœ๋ ฅํ•˜๊ธฐ (Win32 console application)http://dodoubt.tistory.com/35Unicode - (2) UTF-16(wide character) in Windowshttp://dodoubt.tistory.com/36Unicode - (3) UTF-8 in Windowshttp://dodoubt.tistory.com/38Unicode - (4) ๋ฌธ์ž ๊ฐœ์ˆ˜ ๊ตฌํ•˜๊ธฐ, ๋ณ€ํ™˜(convert) code snippethttp://dodoubt.tistory.com/40window command prompt(cmd.exe)์—์„œ ์‚ฌ์šฉํ•˜๋Š” font ์ถ”๊ฐ€ ๋ฐ ๋ณ€๊ฒฝํ•˜๊ธฐhttp://dodoubt.tistory.com/34ASCII and Unicode quotation marks by Markus Kuhn http://www.cl.cam.ac.uk/~mgk25/ucs/quotes.html์œ ๋‹ˆ ์ฝ”๋“œ (๊ตฌ์›์˜ ์—ฌ์‹ ์˜ ๋“ฑ์žฅ?) - ๋ฐ•์šฐ์˜http://web.edunet4u.net/~han0416/%ED%95%98%EB%93%9C%EC%9B%A8%EC%96%B4%20%EA%B0%95%EC%A2%8C/chapter2/uni_code.htmCode2001, a Plane 1 Unicode-based Fonthttp://www.code2000.net/code2001.htmwprintf/wcout and unicode characters in VS2005http://blog.kalmbachnet.de/?postid=98
ReferenceDavid Myriad Rosenbaum's Font Sanctuary (Ugaritic Font)http://davidmyriad.tripod.com/myriads.font.page.htmlhttp://www.alanwood.net/unicode/fonts-middle-eastern.html#ugaritic์™ธ๊ตญ์–ด ์ง€์›์„ ์œ„ํ•œ Unicode ํ™œ์šฉ ๋ฐฉ๋ฒ•http://www.ibm.com/developerworks/kr/library/l-linuni.htmlASCII Tablehttp://www.asciitable.com/์‹ฌ์‹ฌํ• ๋•Œ ์ฝ์–ด๋ณด๋Š” ๋ฌธ์ž์…‹, ์ธ์ฝ”๋”ฉ ์ด์•ผ๊ธฐhttp://blog.daum.net/_blog/tagArticleList.do?blogid=0Idq4&tagName=%EB%AC%B8%EC%9E%90%EC%85%8B#ajax_history_homeํ•œ๊ธ€ ์ธ์ฝ”๋”ฉ ์ด์•ผ๊ธฐ - (1) ASCII, ์™„์„ฑํ˜•, ์กฐํ•ฉํ˜•, EUCKR, CP949http://heyjimin.tistory.com/14ํ•œ๊ธ€ ์ธ์ฝ”๋”ฉ ์ด์•ผ๊ธฐ - (2) ์œ ๋‹ˆ์ฝ”๋“œ, UCS-2, UTF-8, UTF-16http://heyjimin.tistory.com/15http://namoda.springnote.com/pages/2017552์œ ๋‹ˆ์ฝ”๋“œ ๋ณผ ์ˆ˜ ์žˆ๋Š” ์—๋””ํ„ฐ? KORAIShttp://korais.sourceforge.net/screenshots.html

Unicode

  • 1.
  • 3.
    ๋ฌธ์ž? ๋ฌธ์ž์…‹? ์ธ์ฝ”๋”ฉ?ํฐํŠธ?๋ฌธ์ž๋Š” ๋Œ€์†Œ๋ฌธ์ž ๊ตฌ๋ณ„์„ ํ•œ๋‹ค. ์˜์–ด ๋ฌธ์ž๋Š” 52 ๊ฐœCharacter Code : ๋ฌธ์ž๋ฅผ ํ‘œํ˜„ํ•˜๋Š” ๋ฐ์ดํ„ฐ๊ฐ’A : 65, B : 66 in ASCII๋ฌธ์ž์…‹(Character Set) : ํ•˜๋‚˜์˜ ์–ธ์–ด๊ถŒ์—์„œ ์‚ฌ์šฉํ•˜๋Š” ์–ธ์–ด๋ฅผ ํ‘œํ˜„ํ•˜๊ธฐ ์œ„ํ•œ ๋ฌธ์ž๋“ค์˜ ์ง‘ํ•ฉ์ธ์ฝ”๋”ฉ: ๋ฌธ์ž์…‹๊ณผCharacter Code ์™€์˜ mappingASCII ๋„ ์ธ์ฝ”๋”ฉ ๋ฐฉ๋ฒ•์˜ ํ•˜๋‚˜ํฐํŠธ : glyphs ์ง‘ํ•ฉ์ผ๋ณธ์–ด : MS_Gothic, MS_Minch์ค‘๊ตญ์–ด : SimSun, PSimsunํฐํŠธglyphs(๊ธ€๋ฆฌํ”„) : ๋ฌธ์ž ํ‘œํ˜„๊ทธ๋ฆผ[๋„์•ˆ] ํ‘œ์ง€, [๊ฑด์ถ•]์žฅ์‹์šฉ ์„ธ๋กœํ™ˆ, [๊ณ ๊ณ ํ•™] ๊ทธ๋ฆผ ๋ฌธ์ž, ์ƒํ˜• ๋ฌธ์žTimes New Roman Bold A : AArial Bold A : A
  • 4.
    ASCII26x2(์•ŒํŒŒ๋ฒณ ๋Œ€์†Œ๋ฌธ์ž) +10(์ˆซ์ž) + ํŠน์ˆ˜๋ฌธ์ž + ํ†ต์ œ๋ฌธ์ž ->128๊ฐœ ์ดํ•˜(2^7)์˜›๋‚  ์›Œ๋“œ์Šคํƒ€์—์„œ๋Š” ๋‚˜๋จธ์ง€ 1 bit ๋ฅผ ์ œ์–ด์šฉ์œผ๋กœ ์‚ฌ์šฉ
  • 5.
    ์„œ์œ ๋Ÿฝ์œผ๋กœ ๊ฐ„ ASCII์›€๋ผ์šฐํŠธ๋“ฑ์„ ํ‘œํ˜„ํ•˜๊ธฐ ์œ„ํ•ด 7bit ์— 1bit ์ถ”๊ฐ€ (2^8)ASCII ํ™•์žฅ ๋ฌธ์ž์…‹์„ISO ๊ฐ€ ๊ด€๋ฆฌํ•˜๊ฒŒ ๋จISO/IEC 8859-1 ๋ผํ‹ด-1 ์„œ์œ ๋ŸฝISO/IEC 8859-2 ๋ผํ‹ด-2 ์ค‘์•™์œ ๋Ÿฝ ๋ถ€ํ„ฐ...ISO/IEC 8859-16 ๋ผํ‹ด-10 ๋‚จ๋™์œ ๋Ÿฝ ๊นŒ์ง€
  • 6.
    ์ผ๋ณธ์œผ๋กœ ๊ฐ„ ๋ฌธ์ž์…‹1๋ฐ”์ดํŠธ๋กœ์ผ๋ณธ์–ด๋ฅผ ํ‘œํ˜„ํ•˜๊ธฐ๊ธ€์ž๊ฐ€ ๋‘ฅ๊ธ€์–ด ๊ทธ๋ฆฌ๊ธฐ ์–ด๋ ค์šด ํžˆ๋ผ๊ฐ€๋‚˜(ใ‚ใ„ใ†ใˆใŠ) ๋Œ€์‹  ์นดํƒ€๊ฐ€๋‚˜(ใ‚ขใ‚คใ‚ฆใ‚จใ‚ช) ๋ฅผ ๋‚˜๋จธ์ง€ 128 ๋น„ํŠธ ๊ณต๊ฐ„์— ๋„ฃ์ž์˜์–ด์™€ ํฌ๊ธฐ๋ฅผ ๊ฐ™๊ฒŒ ํ•˜๊ธฐ ์œ„ํ•ด "๋ฐ˜๊ฐ(ๅŠ่ง’)๋ฌธ์ž, Half-Width Katakanaโ€œ ์‚ฌ์šฉMBCS - Multi Byte Character Set ๋“ฑ์žฅ์ตœ์ƒ์œ„ ๋น„ํŠธ๊ฐ€ 0 ์ด๋ฉด ASCII Code ๋กœ ํ•ด์„1 ์ด๋ฉด 2 ๋ฐ”์ดํŠธ๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ์ผ๋ณธ์–ด ๋ฌธ์ž์…‹์„ ์ฐพ๋Š”๋‹ค์˜ˆ : 0xA1 0x72 0xA3 0x70 0x52 0xA2 0xA31 ๋ฐ”์ดํŠธ๊ฐ€ 0x00 ~ 0x7F (0~127)๊นŒ์ง€์˜ ๊ฐ’์ด๋ผ๋ฉด ASCII ๋ฌธ์ž์ด๋‹ค.์„œ์œ ๋Ÿฝ์–ด๋Š”0x80 ~ 0xA0 (์˜ˆ์•ฝ๋ฒ”์œ„)๊นŒ์ง€ (128 ~ 160) ๊ณต๊ฐ„์„ ๋™์•„์‹œ์•„ MBCS ๋ฅผ ์œ„ํ•ด์„œ ๋น„์›Œ๋†“์•˜๋‹ค.
  • 7.
    ํ•œ๊ตญ ๋ฌธ์ž์…‹- ์™„์„ฑํ˜•๊ณผ์กฐํ•ฉํ˜•์™„์„ฑํ˜• : ์™„์„ฑํ˜•ํ•œ๊ธ€ 2350์ž, ํ•œ์ž(4884๊ฐœ), ์ˆซ์ž,โ€ฆโ€œ๊ฐ•โ€œ : 0xB0C1 (0xB000 + 0xC0 + 0x1)์กฐํ•ฉํ˜• : ์ดˆ์„ฑ"ใ„ฑ"๊ณผ ์ค‘์„ฑ"ใ…"๋ฅผ ์กฐ๋ฆฝํ•œ โ€œ๊ฐ€โ€ ๋Š” 0x1100,0x1161 ๋กœ ๋‚˜ํƒ€๋‚ผ ์ˆ˜๋„ ์žˆ๋‹ค.์ดˆ์„ฑ โ€˜ใ„ฑโ€™: 0x1100 HANGUL CHOSEONG KIYEOK์ค‘์„ฑ โ€˜ใ…โ€™:0x1161 HANGUL JUNGSEON Aํ™•์žฅ 1bit, ์ดˆ์„ฑ5bit, ์ค‘์„ฑ 5bit, ์ข…์„ฑ 5bit
  • 8.
    EUCExtened Unix Code(ํ™•์žฅ์œ ๋‹‰์Šค ์ฝ”๋“œ)8๋น„ํŠธ ๋ฌธ์ž ์ธ์ฝ”๋”ฉ ๋ฐฉ์‹ISO 2022 ํ‘œ์ค€ ๊ธฐ๋ฐ˜EUC-KR ์€ KS X 1001, KS X 1003 ์‚ฌ์šฉํ•œ๊ธ€ ์™„์„ฑํ˜• ์ธ์ฝ”๋”ฉKS X 1003 ๋Š” ์—ญ์Šฌ๋ž˜์‰ฌ ๋Œ€์‹  \ ์‚ฌ์šฉํ•˜๋Š” ๊ฒƒ๋งŒ ์ œ์™ธํ•˜๋ฉด ASCII ์ฝ”๋“œ์™€ ๋™์ผKS X 1001 ์€ ํ•œ๊ธ€, ๊ทธ๋ฆผ ๋ฌธ์ž, ํ•œ์ž ๋“ฑ์„ ํฌํ•จ128๋ณด๋‹ค ์ž‘์€ ๋ฐ”์ดํŠธ์— KS X 1003 ๋ฐฐ๋‹น128๋ณด๋‹ค ํฌ๊ฑฐ๋‚˜ ๊ฐ™์€ ๋ฐ”์ดํŠธ์— KS X 1001 ๋ฐฐ๋‹น์‹ค์ œ ์‚ฌ์šฉ๊ณต๊ฐ„์ด ์ƒ์œ„๋ฐ”์ดํŠธ 161-254, ํ•˜์œ„๋ฐ”์ดํŠธ 161-254 ๋ฟ์ด์—ˆ๊ธฐ ๋•Œ๋ฌธ์— โ€˜๋˜ โ€™์ด๋‚˜ โ€˜๋ทโ€™ ๊ฐ™์€ ํ•œ๊ธ€์ด ๋น ์ง.
  • 9.
    CP949MS ๊ฐ€ KSX 1001 ์— ์—†๋Š” ํ•œ๊ธ€ 8822 ์ž๋ฅผ ์ถ”๊ฐ€ํ•ด EUCKR ๋ฅผ ํ™•์žฅํ•œ ์™„์„ฑํ˜• ์ธ์ฝ”๋”ฉks_c_5601-1987์›๋ž˜๋Š” CodePage๋ฒˆํ˜ธ์˜€์œผ๋‚˜ ์ง€๊ธˆ์€ EUCKR ์˜ ํ™•์žฅํ˜•์ธ ํ•œ๊ธ€ ์ธ์ฝ”๋”ฉ ๋ฐฉ์‹์„ ์ง€์นญํ•˜๋Š” ์ด๋ฆ„์ด ๋˜์—ˆ๋‹คks_c_5601-92 ๋„ ์žˆ๋Š” ๋“ฏ
  • 10.
    iso2022-kr ๊ณผ KPS-9566iso2022-krEucKR์„7bit ๋งŒ ์‚ฌ์šฉํ•˜๋ฉฐ ํ‘œํ˜„ํ•˜๋Š” ๋ฐฉ์‹์œผ๋กœ RFC1557 ์— ์ •์˜KPS-9566 : ๋ถํ•œ ์œ ์ผ์˜ ๊ณ ์œ  ๋ฌธ์ž์…‹ํ•œ๊ธ€ ๋ชจ์–‘์€ ์šฐ๋ฆฌ๋ณด๋‹ค 300๊ธ€์ž ์ •๋„ ๋งŽ๊ณ ํ•œ์ž๋Š” 200๊ธ€์ž ์ •๋„ ์ ๋‹คํ•œ๊ธ€ ์‹œ์ž‘์ด โ€˜๊ฐ€โ€™ ๊ฐ€ ์•„๋‹Œ โ€˜๊น€์ผ์„ฑ๊น€์ •์ผโ€™ 6 ๊ธ€์ž๊ฐ€ ๋จผ์ € ๋ฐฐ์น˜๋˜์–ด ์žˆ๋‹ค๊ณ ...์ž์Œ ์ •๋ ฌ ์ˆœ์„œใ„ฑใ„ดใ„ทใ„นใ…ใ…‚ใ……ใ…ˆใ…Šใ…‹ใ…Œใ…ใ…Žใ„ฒใ„ธใ…ƒใ…†ใ…‰ใ…‡
  • 11.
    Code Page์ •์˜ :OS ์—์„œ ์„ ํƒํ•œ character code ๋“ค์„ ํŠน์ •ํ•œ ์ˆœ์„œ๋กœ ์ •๋ฆฌํ•ด ๋†“์€ ๋ชฉ๋ก(IBM, MS)another name for character encoding(from wikipedia)ํ™œ์„ฑ ์ฝ”๋“œ ํŽ˜์ด์ง€ : 949 (์™„์„ฑํ˜• ํ™•์žฅ)ํ•œ๊ธ€ ์กฐํ•ฉํ˜• : Code Page 1361์˜์–ด : ANSI-437์ด์Šค๋ผ์—˜ : ANSI-862๋กœ์ผ€์ผ utf-8 : 65001์ธ์ฝ”๋”ฉ๋œ ๋ฐ์ดํ„ฐ๋ฅผ ์–ด๋–ป๊ฒŒ ํ•ด์„ํ•  ๊ฒƒ์ธ๊ฐ€ CHCP (change code page)Code Page Identifiershttp://msdn.microsoft.com/en-us/library/dd317756
  • 12.
    ๋ฌธ์ œ์ ๋‹ค๋ฅธ CodePage์—์„œ ํŒŒ์ผ์„์—ด๋ฉด ๊ธ€์ž๊ฐ€ ๊นจ์ ธ ๋ณด์ž„์—ฌ๋Ÿฌ ๋‚˜๋ผ์˜ ๋ฌธ์ž์…‹์„ ๊ฐ™์ด ๋ณด์—ฌ์ค„ ์ˆ˜ ์—†์Œ์†Œํ”„ํŠธ์›จ์–ด๋ฅผ ๋ฐ”์ด๋„ˆ๋ฆฌ ํ•˜๋‚˜๋กœ ์—ฌ๋Ÿฌ ๋‚˜๋ผ์— ํŒ๋งคํ•  ์ˆ˜ ์—†์ŒDOS ์‹œ์ ˆ ์ผ๋ณธ ๊ฒŒ์ž„ ๋•Œ๋ฌธ์— ์ธ์ฝ”๋”ฉ ๋ฐ”๊ฟจ๋‹ค๋ฉด ๋‚˜์ค‘์— ์ธ์ฝ”๋”ฉ์„ ๋Œ๋ ค๋†”์•ผ ํ–ˆ๋‹ค๋ชจ๋“  ๋ฌธ์ž๋ณ„๋กœ ์œ ์ผํ•œ ๊ฐ’์„ ํ• ๋‹นํ•˜๊ณ  ์‹ถ๋‹ค
  • 13.
    Unicode ์‹œ์ž‘๋ชจ๋“  ๋ฌธ์ž๋ณ„๋กœ์œ ์ผํ•œ Character Code ๋ฅผ ์ง€์ •ํ•˜์ž1984๋…„ ISO(๊ตญ์ œํ‘œ์ค€๊ธฐ๊ตฌ)๋Š” ISO 10646 ๊ตญ์ œ ํ‘œ์ค€ ์ฒด๊ฒฐ -> ๋ชจ๋“  ๋ฌธ์ž๋ฅผ 4 ๋ฐ”์ดํŠธ๋กœ1993๋…„ 5์›”๊ทธ๋ฆฌ์Šค ์•„ํ…Œ๋„ค ํšŒ์˜ : ์ตœ์ข… ํ™•์ •Unicode Working Group(1989๋…„)Apple, Xerox, Sun, Microsoft, NeXT : 2 ๋ฐ”์ดํŠธUnicode ์ปจ์†Œ์‹œ์—„์˜ ์ œ์•ˆ ์ผ๋ถ€๋ฅผ ISO ์—์„œ ์ˆ˜์šฉISO 10646-1Universal(Multiple-Octet Coded) Character Set: UCS๋•๋ถ„์— Unicode ๊ฐ€ UCS ์˜ ์„œ๋ธŒ์…‹์ด ๋˜์—ˆ์Œ๊ฐ€์žฅ ์ตœ์‹  ๋ฒ„์ „ ํ‘œ์ค€Unicode 5.2ISO/IEC 10646:2003 plus Amendments 1,2,3,4,5,6
  • 14.
    Unicode ๊ตฌ์กฐ๋ฌธ์ž๋ณ„๋กœ ๋ฒˆํ˜ธ(์ฝ”๋“œํฌ์ธํŠธ Code Point) ์ง€์ •U+0041U+ ๋Š” Unicode0041 : ์ฝ”๋“œ ํฌ์ธํŠธ ๊ฐ’์œผ๋กœ 16 ์ง„์ˆ˜๋กœ ํ‘œ๊ธฐU+0041 ๋Š” ์˜์–ด ์•ŒํŒŒ๋ฒณ 'Aโ€™U+AC00 : ํ•œ๊ธ€ '๊ฐ€โ€˜U+0000~U+00FF ์˜์—ญ์€ ISO 8859-1 ๋ฌธ์ž์…‹๊ณผ ๋™์ผํ•œ๊ธ€์€ U+AC00 ~ U+D7AF ์˜์—ญ์— ์ •์˜0x10FFFF^2 : 100๋งŒ๊ฐœ ๊ธ€์ž(10๋งŒ๊ฐœ ์‚ฌ์šฉ)
  • 16.
    Unicode ์ฒด๊ณ„BMP (Basicmultilingual Plane. ๊ธฐ๋ณธ์–ธ์–ดํŒ)์ตœ์ดˆ 65536(2^16) ๊ฐœ์˜ ๋ฌธ์ž ํ• ๋‹น๋˜๋Š” ์˜์—ญ.Unicode 3.0 : 49,194 ๋ฌธ์ž ์ •์˜UCS-2 ๊ณผ ๋™์ผํŠนํžˆ ํ•œ๋ฌธ์—์„œ ํ•„์š”๋ฌธ์ž๊ฐ€ ๋Š˜์–ด๋‚˜๋ฉด์„œ ๋ณด์ถฉ์–ธ์–ดํŒ(Supplementary Plaines)์„ ์ •์˜Unicode 3.1 ์—์„œ๋Š” BMP ์— 2๊ฐœ ๋ฌธ์ž ์ถ”๊ฐ€, ๋ณด์ถฉ์–ธ์–ดํŒ์—44,944 ๊ฐœ ๋ฌธ์ž ์ถ”๊ฐ€์Œํ‘œ,๊ณ ๋Œ€๋ฌธ์ž,ํ•œ์ž(CJK Ideographic Extension B) CJK : ํ•œ๊ตญ, ์ค‘๊ตญ, ์ผ๋ณธUnicode 3.1: 49,194 + 44,944 = 94,140
  • 17.
    UCS ์ฒด๊ณ„Cell :ํ•œ ๊ฐœ์˜ ๋ฌธ์ž๊ฐ€ ํ• ๋‹น๋˜๋Š” ๊ณต๊ฐ„Plane : 256 * 256๊ฐœ์˜ cell ๋ฌถ์Œ 65536(0xFFFF) ๊ฐœ -> UCS-2BMP : Plain 00Group : 256 ๊ฐœ์˜ Plane ๋ฌถ์Œ(7F ๊ฐœ)
  • 18.
    Unicode ํ‘œํ˜„'Aโ€™ :U+0041Group 00, Plane 00, Cell 41'๊ฐ€โ€™ : U+AC00Group 00, Plane 00, Cell 41โ™ช : U+1D160Group 00, Plane 01, Cell D160์ฆ‰, Plain ๋ฒˆํ˜ธ 5๋น„ํŠธ, Cell ๋ฒˆํ˜ธ 16๋น„ํŠธ21๋น„ํŠธ ๊ณต๊ฐ„ ์‚ฌ์šฉ
  • 19.
    Unicode ์ธ์ฝ”๋”ฉUTF-32UTF-16UTF-8UTF-7email ์šฉUCS-2UCS-4๋ชจ๋“ Unicode ํ‘œํ˜„ํ•  ์ˆ˜ ์žˆ์œผ๋ฏ€๋กœ ์„œ๋กœ ๋ฌด์†์‹ค ๋ณ€ํ™˜ ๊ฐ€๋Šฅ
  • 20.
    UTF-32๋ชจ๋“  ๋ฌธ์ž๋ฅผ ์ฝ”๋“œํฌ์ธํŠธ ๊ฐ’ ์œ ์ง€ํ•˜๋ฉด์„œ 32 ๋น„ํŠธ๋กœ ๋งŒ๋“ ๋‹ค. (๊ณ ์ •๊ธธ์ด)linux์˜ ๊ฒฝ์šฐ wchar_t์˜ ํฌ๊ธฐ๊ฐ€ 32bit ๋ผ์„œ mbstowcs()๋ฅผ ์ด์šฉํ•ด์„œ ๋ณ€ํ™˜ ํ›„ ๊ณ ์ •๊ธธ์ด ์ธ์ฝ”๋”ฉ์ฒ˜๋Ÿผwcsํ•จ์ˆ˜๋ฅผ ์‚ฌ์šฉํ•˜๋ฉด ๋œ๋‹ค.UCS-4 ์˜ ๋ถ€๋ถ„์ง‘ํ•ฉ(17 ๊ฐœ์˜ ์–ธ์–ดํŒ๋งŒ ์ •์˜)
  • 21.
    UTF-16BMP ์˜์—ญ ์•ˆ(U+0000-U+FFFF)์˜๋ฌธ์ž๋Š” ๊ทธ๋Œ€๋กœ ํ‘œํ˜„, ๋ฐ–์˜ ๋ฌธ์ž๋Š” ๋ณ€ํ™˜ ํ•„์š” (๊ฐ€๋ณ€๊ธธ์ด)Windows2000 ๊ณผ ์ดํ›„ ๋ฒ„์ „์€ UTF-16 ๊ธฐ๋ฐ˜. ์ด์ „ NT ์ปค๋„์€UCS-2 ๊ธฐ๋ฐ˜Java 2/Java 5๋Š” UCS2/UTF-16์— ์˜์กดUCS-2 ๋ณด๋‹ค ํ™•์žฅ๋œ ๊ฐœ๋…
  • 22.
    UTF-16 ๋ณ€ํ™˜ ๊ทœ์น™SurrogatePair (U+D800~U+DFFF) ์—๋Š” ๋ฌธ์ž ํ• ๋‹น๋˜์–ด ์žˆ์ง€ ์•Š์Œ
  • 23.
    UTF-8'Aโ€™ : U+0041๊ฐ™์€UTF-16 ๋ฅผ char ๋กœ ์ฝ์œผ๋ฉด 00 (null) ๋ฌธ์ž์—ด ๋•Œ๋ฌธ์— ๊ธฐ์กด ํ•จ์ˆ˜๊ฐ€ ์˜ค์ž‘๋™<html><head> <meta http-equiv=โ€œContent-Typeโ€ content=โ€œtext/html;charset=utf-8โ€>Charset๊นŒ์ง€๋Š” ascii๋กœ ์ฝ๊ณ  charset์ฝ์€ ํ›„์— ์ธ์ฝ”๋”ฉ์— ๋งž์ถฐ์„œ ํŒŒ์‹ฑ ์‹œ์ž‘. ๊ทธ๋Ÿฌ๋‹ˆ charset์ด์ „์— unicode์ธ์ฝ”๋”ฉ ๊ธ€์ž๊ฐ€ ๋“ค์–ด๊ฐ€๋ฉด ์•ˆ ๋จ์›น์˜ ์‹ค์งˆ์  ํ‘œ์ค€, ๋งŽ์€ *nix ์‹œ์Šคํ…œ, xml, python ์€ UTF-8 ์„ ๊ฐ€์žฅ ๊ธฐ์ดˆ์ ์ธ ์ธ์ฝ”๋”ฉ์œผ๋กœ ์‚ฌ์šฉ๊ธ€์ž ๊ธธ์ด๋ฅผ ์•Œ๋ ค๋ฉด ์ „์ฒด ๊ธ€์„ ํŒŒ์‹ฑํ•ด์•ผ ํ•จ
  • 25.
    Unicode ํ•œ๊ธ€์—์„œ ๋ฐ›์นจ์•Œ๊ธฐ์œ ๋‹ˆ์ฝ”๋“œ 2.0 : ํ•œ๊ธ€์€ ์ดˆ์„ฑ 19๊ฐœ, ์ค‘์„ฑ 21๊ฐœ, ์ข…์„ฑ 28๊ฐœ(์—†์Œ๋„ ํฌํ•จ)๊ฐ€ ์žˆ๋‹ค. ์ดˆ์„ฑ 19๊ฐœ๋ฅผ 0...18๊นŒ์ง€ ๋ฒˆํ˜ธ๋ฅผ ๋ถ™์ด๊ณ  ์ค‘์„ฑ๋„ 0...20, ์ข…์„ฑ๋„ ์—ญ์‹œ 0...27๊นŒ์ง€ ๋ฒˆํ˜ธ๋ฅผ ๋ถ™์ธ๋‹ค๋ฉด, ์›ํ•˜๋Š” ์ฝ”๋“œ๋Š” 0xAC00 + x*21*28 + y*28 + z (x=์ดˆ์„ฑ๋ฒˆํ˜ธ, y=์ค‘์„ฑ๋ฒˆํ˜ธ, z=์ข…์„ฑ๋ฒˆํ˜ธ)๋กœ ๋งŒ๋“ค ์ˆ˜ ์žˆ๋‹ค. ์ข…์„ฑ์—์„œ 0 ๋ฒˆ์งธ์— ํ•ด๋‹นํ•˜๋Š” ๊ฒƒ์€ '์—†์Œ'์ด๋ฏ€๋กœ ์œ ๋‹ˆ์ฝ”๋“œ๊ฐ’์—์„œ 0xAC00์„ ๋บ€ ํ›„์— 28๋กœ ๋‚˜๋ˆ„์–ด ๋–จ์–ด์ง€๋Š”์ง€ ํ™•์ธํ•˜๋ฉด ๋ฉ๋‹ˆ๋‹ค.http://jof4002.net/Unicodewchar_t* pString = L"๊ฐ€๊ฐ๋‚˜๋“ฏ";cout << (pString[0] - 0xAC00) % 28 << endl; // 0cout << (pString[1] - 0xAC00) % 28 << endl; // 1cout << (pString[2] - 0xAC00) % 28 << endl; // 0cout << (pString[3] - 0xAC00) % 28 << endl; // 19
  • 26.
    Unicode ๋ณ€ํ™˜USES_CONVERSION;pI->SomeFunctionNeedsUnicode(T2OLE(lpszA));๋งคํฌ๋กœ ์ธ์ž๊ฒฐ๊ณผA2CW (LPCSTR) (LPCWSTR)A2W (LPCSTR) (LPWSTR)W2CA (LPCWSTR) (LPCSTR)W2A (LPCWSTR) (LPSTR)T2COLE (LPCTSTR) (LPCOLESTR)T2OLE (LPCTSTR) (LPOLESTR)OLE2CT (LPCOLESTR) (LPCTSTR)OLE2T (LPCOLESTR) (LPCSTR)
  • 27.
    Unicode in VC++std::locale::global(std::locale(""));wcin.imbue(locale("korean")); ์™€ wcout.imbue(locale("korean"));wcout.fail() ๋กœํ™•์ธํ•˜๊ณ , wcout.clear();_setmode(_fileno(stdout), _O_U16TEXT);
  • 28.
    UTF-16 ๋ฌธ์ž ๊ฐœ์ˆ˜๊ตฌํ•˜๊ธฐcode snippet http://dodoubt.tistory.com/40์ฐธ๊ณ 
  • 29.
    BOM(Byte Order Mark)ํŒŒ์ผ์ด์–ด๋–ค ์‹์œผ๋กœ ์ธ์ฝ”๋”ฉ๋˜์–ด ์žˆ๋Š”์ง€ ์•Œ๋ ค์ฃผ๋Š” ํ—ค๋” ์—ญํ• UTF-32, big-endian : 00 00 FE FFUTF-32, little-endian : FF FE 00 00UTF-16, big-endian : FE FFUTF-16, little-endian : FF FEUTF-8 : EF BB BFUTF-8 ์—์„œ๋Š” BOM ์‚ฌ์šฉ์„ ๋ณ„๋กœ ๊ถŒ์žฅํ•˜์ง€ ์•Š์Œ. UTF-8 ์ด ๊ธฐ๋ณธ ์–ธ์–ด๋Š” ASCII ์™€ ํ˜ธํ™˜๋œ๋‹ค๋Š” ์žฅ์ ์ด ์žˆ๋Š”๋ฐ, BOM ์ฒ˜๋ฆฌ๋ฅผ ํ•˜์ง€ ์•Š๋Š” editor ๋‚˜ ์›นํŽ˜์ด์ง€์—์„œ๋Š” BOM ์„ iโ‰ซยฟ ๋กœ ์ถœ๋ ฅํ•  ์ˆ˜ ์žˆ๋‹ค.
  • 30.
    Font๋ฌธ์ž -> ์œ ๋‹ˆ์ฝ”๋“œ-> ์œ ๋‹ˆ์ฝ”๋“œ ์ธ์ฝ”๋”ฉ-> ํ™”๋ฉด ๊ทธ๋ฆฌ๊ธฐ์œ ๋‹ˆ์ฝ”๋“œ ํฐํŠธArial Unicode MS(ARIALUNI.TTF, 22,730KB)ํ•จ์ดˆ๋กฑ์ฒด, ํ•œ์ปด๋ฐ”ํƒ• : http://maplestory.pe.kr/1785๊ณ ์ •๊ธธ์ด ํฐํŠธ(Monospace Font)๊ตด๋ฆผ์ฒด, ๋ฐ”ํƒ•์ฒด, ๋‹์›€์ฒด๊ฐ€๋ณ€๊ธธ์ด ํฐํŠธ๊ตด๋ฆผ, ๋ฐ”ํƒ•, ๋‹์›€์ƒ๊ด€์—†์ง€๋งŒ ๋‚˜๋ˆ”๊ณ ๋”• ์ฝ”๋”ฉ๊ธ€๊ผดhttp://dev.naver.com/projects/nanumfont/downloadBitstream Vera Sans Mono + ๋ง‘์€๊ณ ๋”•http://ggotbo.egloos.com/2334938
  • 31.
    Console ์—์„œ์˜ ํฐํŠธ[HKEY_CURRENT_USER\Console\%SystemRoot%_system32_cmd.exe]"CodePage"=dword:000001b5"FontSize"=dword:000c0000"FontFamily"=dword:00000036"FontWeight"=dword:00000190"FaceName"="๊ตด๋ฆผ์ฒดโ€œ๋ช…๋ น ์ฐฝ์—์„œ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ๋„๋ก ๊ธ€๊ผด์— ๋Œ€ํ•ด ํ•„์š”ํ•œ ์กฐ๊ฑด fixed-pitch font, not italic font,no negative A or C space if (TrueType) FF_MODERN else OEM_CHARSEThttp://support.microsoft.com/kb/Q247815๋ช…๋ น ํ”„๋กฌํ”„ํŠธ ๋””ํดํŠธ ํฐํŠธ ๋ฐ”๊พธ๋Š” ๋ฒ•http://pcwinvista.tistory.com/340http://dodoubt.tistory.com/34
  • 32.
  • 33.
  • 34.
    ์ธ์ฝ”๋”ฉSBCS(Single Byte CharacterSet)ASCIIMBCS(Multi Byte Character Set)UTF-16, UTF-8๋ฌธ์ž์—ด ๊ธธ์ด๋ฅผ ๋ฐ”๋กœ ์•Œ ์ˆ˜ ์—†๋‹ค.WBCS(Wide Byte Character Set)UTF-32, UCS-2, UCS-4๋ฌธ์ž์…‹๊ณผ์ธ์ฝ”๋”ฉ์ด ๋™์ผSBCD, MBCS, WBCS ๋Š” ์ธ์ฝ”๋”ฉ ๋ฐฉ๋ฒ•์ด์ง€ ์ธ์ฝ”๋”ฉ์ด ์•„๋‹˜
  • 35.
    ReferenceUnicode ์˜ ์ดํ•ดโ€“ novo networkshttp://www.novonetworks.com/jamestic/Unicode_1.0.pdf์ง„์ˆ™์˜ ์œ ๋‹ˆ์ฝ”๋“œ ์ž…๋ฌธ์„œhttp://www.kristalinfo.com/K-Lab/unicode/Unicode_intro-kr.htmlMBCS ์™€ ์œ ๋‹ˆ์ฝ”๋“œ http://www.animalpicturesarchive.com/jinsuk-kim/diary/read.php?2006/0203์œ„ํ‚ค๋ฐฑ๊ณผโ€“ ์œ ๋‹ˆ์ฝ”๋“œ, ์œ ๋‹ˆ์ฝ”๋“œ ๋ฒ”์œ„ ๋ชฉ๋กUnicode 5.2 Character Code Chartshttp://www.unicode.org/charts/์กฐ์—˜ ์˜จ ์†Œํ”„ํŠธ์›จ์–ด : ์œ ๋‹ˆ์ฝ”๋“œ์™€ ๋ฌธ์ž์ง‘ํ•ฉ์— ๋Œ€ํ•œ ๊ณ ์ฐฐhttp://www.joelonsoftware.com/articles/Unicode.htmlCharacter sets and codepageshttp://www.microsoft.com/typography/unicode/cscp.htmhttp://www.microsoft.com/typography/unicode/1250.gifํ•œ๊ธ€ ์ฝ”๋“œํŽ˜์ด์ง€ http://www.unicode.org/charts/PDF/UAC00.pdfKS C 5601 ์™„์„ฑํ˜• ์ฝ”๋“œhttp://zbxe.bluegate.kr/42http://whatisthat.co.kr/6
  • 36.
    Referencehttp://jof4002.net/UnicodeVC++ : ์œ ๋‹ˆ์ฝ”๋“œ๋ฅผํ‘œ์ค€ ์ถœ๋ ฅ์— ๋‚ด๋ณด๋‚ด๊ธฐhttp://kaistizen.net/EE/index.php/weblog/comments/unicode_hangul_to_stdout/IdeAthinKING - C fileโ€™s orientationhttp://ideathinking.com/blog/?p=108http://ideathinking.com/blog/?p=109rein : ์ธ์ฝ”๋”ฉ๊ณผ ๋ฌธ์ž์ง‘ํ•ฉ: Unicodehttp://rein.kr/blog/archives/280rein : Windows Character Encoding: UCS2? UTF-16?http://rein.kr/blog/archives/585STL string ์‚ฌ์šฉ์‹œ wstring์ผ๋•Œ, ์ถœ๋ ฅ์ด ๋˜์ง€ ์•Š์Šต๋‹ˆ๋‹ค. http://kldp.org/node/93573http://en.wikipedia.org/wiki/Code_pagehttp://gpgstudy.com/gpgiki/์œˆ๋„์šฐ ๋‹ค๊ตญ์–ด ํ”„๋กœ๊ทธ๋ž˜๋ฐMBCS ์™€ UNICODE FAQ ์ •๋ฆฌhttp://mynotepad.tistory.com/67
  • 37.
    ReferenceUnicode - (1)๊ฐœ๋…http://dodoubt.tistory.com/29Standard output์œผ๋กœ unicode๋ฌธ์ž๋ฅผ ์ถœ๋ ฅํ•˜๊ธฐ (Win32 console application)http://dodoubt.tistory.com/35Unicode - (2) UTF-16(wide character) in Windowshttp://dodoubt.tistory.com/36Unicode - (3) UTF-8 in Windowshttp://dodoubt.tistory.com/38Unicode - (4) ๋ฌธ์ž ๊ฐœ์ˆ˜ ๊ตฌํ•˜๊ธฐ, ๋ณ€ํ™˜(convert) code snippethttp://dodoubt.tistory.com/40window command prompt(cmd.exe)์—์„œ ์‚ฌ์šฉํ•˜๋Š” font ์ถ”๊ฐ€ ๋ฐ ๋ณ€๊ฒฝํ•˜๊ธฐhttp://dodoubt.tistory.com/34ASCII and Unicode quotation marks by Markus Kuhn http://www.cl.cam.ac.uk/~mgk25/ucs/quotes.html์œ ๋‹ˆ ์ฝ”๋“œ (๊ตฌ์›์˜ ์—ฌ์‹ ์˜ ๋“ฑ์žฅ?) - ๋ฐ•์šฐ์˜http://web.edunet4u.net/~han0416/%ED%95%98%EB%93%9C%EC%9B%A8%EC%96%B4%20%EA%B0%95%EC%A2%8C/chapter2/uni_code.htmCode2001, a Plane 1 Unicode-based Fonthttp://www.code2000.net/code2001.htmwprintf/wcout and unicode characters in VS2005http://blog.kalmbachnet.de/?postid=98
  • 38.
    ReferenceDavid Myriad Rosenbaum'sFont Sanctuary (Ugaritic Font)http://davidmyriad.tripod.com/myriads.font.page.htmlhttp://www.alanwood.net/unicode/fonts-middle-eastern.html#ugaritic์™ธ๊ตญ์–ด ์ง€์›์„ ์œ„ํ•œ Unicode ํ™œ์šฉ ๋ฐฉ๋ฒ•http://www.ibm.com/developerworks/kr/library/l-linuni.htmlASCII Tablehttp://www.asciitable.com/์‹ฌ์‹ฌํ• ๋•Œ ์ฝ์–ด๋ณด๋Š” ๋ฌธ์ž์…‹, ์ธ์ฝ”๋”ฉ ์ด์•ผ๊ธฐhttp://blog.daum.net/_blog/tagArticleList.do?blogid=0Idq4&tagName=%EB%AC%B8%EC%9E%90%EC%85%8B#ajax_history_homeํ•œ๊ธ€ ์ธ์ฝ”๋”ฉ ์ด์•ผ๊ธฐ - (1) ASCII, ์™„์„ฑํ˜•, ์กฐํ•ฉํ˜•, EUCKR, CP949http://heyjimin.tistory.com/14ํ•œ๊ธ€ ์ธ์ฝ”๋”ฉ ์ด์•ผ๊ธฐ - (2) ์œ ๋‹ˆ์ฝ”๋“œ, UCS-2, UTF-8, UTF-16http://heyjimin.tistory.com/15http://namoda.springnote.com/pages/2017552์œ ๋‹ˆ์ฝ”๋“œ ๋ณผ ์ˆ˜ ์žˆ๋Š” ์—๋””ํ„ฐ? KORAIShttp://korais.sourceforge.net/screenshots.html