SlideShare a Scribd company logo
1
W3C Workshop on Internationalizing SSML
SSML Extension for Korean
Workshop : 2005/11/02 (Wed)
Sang-Jin Kim
sangjin@icu.ac.kr
2
Contents
 Characteristic of Korean
 SSML Extension for Chinese Characters in Korean
 SSML Extension for Homograph Words in Korean
 Conclusion
3
Characteristic of Korean
 Hangul, The Korean Character
 Consists of forty letters
 21 vowels (including 13 diphthongs), and 19 consonants
 Syllable
 V, CV, VC, and CVC (C : consonant, V : vowel)
 Eojeol, the word phrase is different from a phrase in English
 Completely different from Japanese except for the grammatical
structure
 Completely different from Chinese although Korean has
borrowed many Chinese words and some Chinese characters
4
Characteristic of Korean
 Vowels in Hangul, The Korean Character
 Monothong vowels classified according to tongue position and
height
5
Characteristic of Korean
 Consonants in Hangul, The Korean Character
 Consonants classified according to place and manner of
articulation
6
SSML Extension for
Chinese Characters in Korean
 Chinese Characters in Korean
 Present Korean and Japanese use many Chinese Characters
 But, pronunciation of the characters is different
 Same characters is represented differently according to the
country
 These simplified characters are not used in Korea
7
SSML Extension for
Chinese Characters in Korean
 Chinese Characters in Korean
 We can write text only with Korean characters
 Not unusual to use Chinese characters as well
 The pronunciation of the are exactly same
8
SSML Extension for
Chinese Characters in Korean
 Chinese Characters in Korean TTS
 The input text for text-to-speech(TTS) system has to be
converted into a phonetic list
 If Chinese characters are mixed with Korean characters, they
have to be substituted to Korean
 We don’t use all Chinese characters, rather there is a
frequently-used-Chinese-character-list recommended by our
Korean government and its size is 2000
 We need to utilize this list and their pronunciations in the
Korean TTS system, since the pronunciations of them are
different from Chinese and Japanese
9
SSML Extension for
Chinese Characters in Korean
 SSML Extension for Chinese Characters in Korean
 Same characters but different pronunciation in Chinese
Characters according to the country
<lexicon xml:lang=”ko” uri=”http://www.multilingual.org/lexicon.file”>
<lexicon xml:lang=”ko-CN” uri=”http://www.multilingual.org/Chinese_lexicon_freq_KR.file”>
<lexicon xml:lang=”ko-CN” uri=”http://www.multilingual.org/Chinese_lexicon_technical.file”>
<lexicon xml:lang=”ja-KR” uri=”http://www.multilingual.org/Chinese_lexicon_JP.file”>
<lexicon xml:lang=”cn-KR” uri=”http://www.multilingual.org/Chinese_lexicon_CN.file”>
10
SSML Extension for
Homograph Words in Korean
 Homograph Words in Korean
 Same word, different pronunciation, different meaning
 The difference is “duration”
11
SSML Extension for
Homograph Words in Korean
 SSML Extension for Homograph Words in Korean
 Only the difference for these words is the duration in
pronunciation
 necessary to give the duration information to a TTS system for
these kinds of words
 SSML recommendation supports “say-as” element and “sub”
element, these elements cannot handle the above problem
successfully
12
SSML Extension for
Homograph Words in Korean
 SSML Extension for Homograph Words in Korean
 We suggest “tone” tag for this problem
 Attribute values for tone element are ‘long’, ‘short’ and ‘default’
would be enough for Korean.
13
Conclusion
 SSML Extension for Chinese Characters in Korean
 lexicon element doesn’t support “xml:lang” tag
 We suggest xml:lang=“ko”, xml:lang=“ko-CN”, xml:lang=“ja-
KR”, xml:lang=“cn-KR” tags
 SSML Extension for Homograph Words in Korean
 “say-as” and “sub” elements cannot handle homograph
problem successfully
 We suggest “tone” element
 Attribute values, type=“long”, type=“short”, and type=“default”
would be enough for Korean

More Related Content

More from Sampath Samudrala

The_barista_book.pptx
The_barista_book.pptxThe_barista_book.pptx
The_barista_book.pptx
Sampath Samudrala
 
appliedpolymersciences.pdf
appliedpolymersciences.pdfappliedpolymersciences.pdf
appliedpolymersciences.pdf
Sampath Samudrala
 
sugarcanebagasselignin.pdf
sugarcanebagasselignin.pdfsugarcanebagasselignin.pdf
sugarcanebagasselignin.pdf
Sampath Samudrala
 
KineticModelingandMechanismsofAcid-CatalyzedDelignificationofSugarcaneBagasse...
KineticModelingandMechanismsofAcid-CatalyzedDelignificationofSugarcaneBagasse...KineticModelingandMechanismsofAcid-CatalyzedDelignificationofSugarcaneBagasse...
KineticModelingandMechanismsofAcid-CatalyzedDelignificationofSugarcaneBagasse...
Sampath Samudrala
 
fpntc_effective_refs_training_guide_slides_2019-06.pptx
fpntc_effective_refs_training_guide_slides_2019-06.pptxfpntc_effective_refs_training_guide_slides_2019-06.pptx
fpntc_effective_refs_training_guide_slides_2019-06.pptx
Sampath Samudrala
 
mcreferral.ppt
mcreferral.pptmcreferral.ppt
mcreferral.ppt
Sampath Samudrala
 

More from Sampath Samudrala (6)

The_barista_book.pptx
The_barista_book.pptxThe_barista_book.pptx
The_barista_book.pptx
 
appliedpolymersciences.pdf
appliedpolymersciences.pdfappliedpolymersciences.pdf
appliedpolymersciences.pdf
 
sugarcanebagasselignin.pdf
sugarcanebagasselignin.pdfsugarcanebagasselignin.pdf
sugarcanebagasselignin.pdf
 
KineticModelingandMechanismsofAcid-CatalyzedDelignificationofSugarcaneBagasse...
KineticModelingandMechanismsofAcid-CatalyzedDelignificationofSugarcaneBagasse...KineticModelingandMechanismsofAcid-CatalyzedDelignificationofSugarcaneBagasse...
KineticModelingandMechanismsofAcid-CatalyzedDelignificationofSugarcaneBagasse...
 
fpntc_effective_refs_training_guide_slides_2019-06.pptx
fpntc_effective_refs_training_guide_slides_2019-06.pptxfpntc_effective_refs_training_guide_slides_2019-06.pptx
fpntc_effective_refs_training_guide_slides_2019-06.pptx
 
mcreferral.ppt
mcreferral.pptmcreferral.ppt
mcreferral.ppt
 

Recently uploaded

CADAVER AS OUR FIRST TEACHER anatomt in your.pptx
CADAVER AS OUR FIRST TEACHER anatomt in your.pptxCADAVER AS OUR FIRST TEACHER anatomt in your.pptx
CADAVER AS OUR FIRST TEACHER anatomt in your.pptx
fakeloginn69
 
BeMetals Presentation_May_22_2024 .pdf
BeMetals Presentation_May_22_2024   .pdfBeMetals Presentation_May_22_2024   .pdf
BeMetals Presentation_May_22_2024 .pdf
DerekIwanaka1
 
RMD24 | Retail media: hoe zet je dit in als je geen AH of Unilever bent? Heid...
RMD24 | Retail media: hoe zet je dit in als je geen AH of Unilever bent? Heid...RMD24 | Retail media: hoe zet je dit in als je geen AH of Unilever bent? Heid...
RMD24 | Retail media: hoe zet je dit in als je geen AH of Unilever bent? Heid...
BBPMedia1
 
RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...
RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...
RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...
BBPMedia1
 
Search Disrupted Google’s Leaked Documents Rock the SEO World.pdf
Search Disrupted Google’s Leaked Documents Rock the SEO World.pdfSearch Disrupted Google’s Leaked Documents Rock the SEO World.pdf
Search Disrupted Google’s Leaked Documents Rock the SEO World.pdf
Arihant Webtech Pvt. Ltd
 
Sustainability: Balancing the Environment, Equity & Economy
Sustainability: Balancing the Environment, Equity & EconomySustainability: Balancing the Environment, Equity & Economy
Sustainability: Balancing the Environment, Equity & Economy
Operational Excellence Consulting
 
falcon-invoice-discounting-a-premier-platform-for-investors-in-india
falcon-invoice-discounting-a-premier-platform-for-investors-in-indiafalcon-invoice-discounting-a-premier-platform-for-investors-in-india
falcon-invoice-discounting-a-premier-platform-for-investors-in-india
Falcon Invoice Discounting
 
Unveiling the Secrets How Does Generative AI Work.pdf
Unveiling the Secrets How Does Generative AI Work.pdfUnveiling the Secrets How Does Generative AI Work.pdf
Unveiling the Secrets How Does Generative AI Work.pdf
Sam H
 
What are the main advantages of using HR recruiter services.pdf
What are the main advantages of using HR recruiter services.pdfWhat are the main advantages of using HR recruiter services.pdf
What are the main advantages of using HR recruiter services.pdf
HumanResourceDimensi1
 
Digital Transformation in PLM - WHAT and HOW - for distribution.pdf
Digital Transformation in PLM - WHAT and HOW - for distribution.pdfDigital Transformation in PLM - WHAT and HOW - for distribution.pdf
Digital Transformation in PLM - WHAT and HOW - for distribution.pdf
Jos Voskuil
 
Role of Remote Sensing and Monitoring in Mining
Role of Remote Sensing and Monitoring in MiningRole of Remote Sensing and Monitoring in Mining
Role of Remote Sensing and Monitoring in Mining
Naaraayani Minerals Pvt.Ltd
 
Cracking the Workplace Discipline Code Main.pptx
Cracking the Workplace Discipline Code Main.pptxCracking the Workplace Discipline Code Main.pptx
Cracking the Workplace Discipline Code Main.pptx
Workforce Group
 
Cree_Rey_BrandIdentityKit.PDF_PersonalBd
Cree_Rey_BrandIdentityKit.PDF_PersonalBdCree_Rey_BrandIdentityKit.PDF_PersonalBd
Cree_Rey_BrandIdentityKit.PDF_PersonalBd
creerey
 
Maksym Vyshnivetskyi: PMO Quality Management (UA)
Maksym Vyshnivetskyi: PMO Quality Management (UA)Maksym Vyshnivetskyi: PMO Quality Management (UA)
Maksym Vyshnivetskyi: PMO Quality Management (UA)
Lviv Startup Club
 
Memorandum Of Association Constitution of Company.ppt
Memorandum Of Association Constitution of Company.pptMemorandum Of Association Constitution of Company.ppt
Memorandum Of Association Constitution of Company.ppt
seri bangash
 
Set off and carry forward of losses and assessment of individuals.pptx
Set off and carry forward of losses and assessment of individuals.pptxSet off and carry forward of losses and assessment of individuals.pptx
Set off and carry forward of losses and assessment of individuals.pptx
HARSHITHV26
 
Skye Residences | Extended Stay Residences Near Toronto Airport
Skye Residences | Extended Stay Residences Near Toronto AirportSkye Residences | Extended Stay Residences Near Toronto Airport
Skye Residences | Extended Stay Residences Near Toronto Airport
marketingjdass
 
Discover the innovative and creative projects that highlight my journey throu...
Discover the innovative and creative projects that highlight my journey throu...Discover the innovative and creative projects that highlight my journey throu...
Discover the innovative and creative projects that highlight my journey throu...
dylandmeas
 
Business Valuation Principles for Entrepreneurs
Business Valuation Principles for EntrepreneursBusiness Valuation Principles for Entrepreneurs
Business Valuation Principles for Entrepreneurs
Ben Wann
 
Putting the SPARK into Virtual Training.pptx
Putting the SPARK into Virtual Training.pptxPutting the SPARK into Virtual Training.pptx
Putting the SPARK into Virtual Training.pptx
Cynthia Clay
 

Recently uploaded (20)

CADAVER AS OUR FIRST TEACHER anatomt in your.pptx
CADAVER AS OUR FIRST TEACHER anatomt in your.pptxCADAVER AS OUR FIRST TEACHER anatomt in your.pptx
CADAVER AS OUR FIRST TEACHER anatomt in your.pptx
 
BeMetals Presentation_May_22_2024 .pdf
BeMetals Presentation_May_22_2024   .pdfBeMetals Presentation_May_22_2024   .pdf
BeMetals Presentation_May_22_2024 .pdf
 
RMD24 | Retail media: hoe zet je dit in als je geen AH of Unilever bent? Heid...
RMD24 | Retail media: hoe zet je dit in als je geen AH of Unilever bent? Heid...RMD24 | Retail media: hoe zet je dit in als je geen AH of Unilever bent? Heid...
RMD24 | Retail media: hoe zet je dit in als je geen AH of Unilever bent? Heid...
 
RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...
RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...
RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...
 
Search Disrupted Google’s Leaked Documents Rock the SEO World.pdf
Search Disrupted Google’s Leaked Documents Rock the SEO World.pdfSearch Disrupted Google’s Leaked Documents Rock the SEO World.pdf
Search Disrupted Google’s Leaked Documents Rock the SEO World.pdf
 
Sustainability: Balancing the Environment, Equity & Economy
Sustainability: Balancing the Environment, Equity & EconomySustainability: Balancing the Environment, Equity & Economy
Sustainability: Balancing the Environment, Equity & Economy
 
falcon-invoice-discounting-a-premier-platform-for-investors-in-india
falcon-invoice-discounting-a-premier-platform-for-investors-in-indiafalcon-invoice-discounting-a-premier-platform-for-investors-in-india
falcon-invoice-discounting-a-premier-platform-for-investors-in-india
 
Unveiling the Secrets How Does Generative AI Work.pdf
Unveiling the Secrets How Does Generative AI Work.pdfUnveiling the Secrets How Does Generative AI Work.pdf
Unveiling the Secrets How Does Generative AI Work.pdf
 
What are the main advantages of using HR recruiter services.pdf
What are the main advantages of using HR recruiter services.pdfWhat are the main advantages of using HR recruiter services.pdf
What are the main advantages of using HR recruiter services.pdf
 
Digital Transformation in PLM - WHAT and HOW - for distribution.pdf
Digital Transformation in PLM - WHAT and HOW - for distribution.pdfDigital Transformation in PLM - WHAT and HOW - for distribution.pdf
Digital Transformation in PLM - WHAT and HOW - for distribution.pdf
 
Role of Remote Sensing and Monitoring in Mining
Role of Remote Sensing and Monitoring in MiningRole of Remote Sensing and Monitoring in Mining
Role of Remote Sensing and Monitoring in Mining
 
Cracking the Workplace Discipline Code Main.pptx
Cracking the Workplace Discipline Code Main.pptxCracking the Workplace Discipline Code Main.pptx
Cracking the Workplace Discipline Code Main.pptx
 
Cree_Rey_BrandIdentityKit.PDF_PersonalBd
Cree_Rey_BrandIdentityKit.PDF_PersonalBdCree_Rey_BrandIdentityKit.PDF_PersonalBd
Cree_Rey_BrandIdentityKit.PDF_PersonalBd
 
Maksym Vyshnivetskyi: PMO Quality Management (UA)
Maksym Vyshnivetskyi: PMO Quality Management (UA)Maksym Vyshnivetskyi: PMO Quality Management (UA)
Maksym Vyshnivetskyi: PMO Quality Management (UA)
 
Memorandum Of Association Constitution of Company.ppt
Memorandum Of Association Constitution of Company.pptMemorandum Of Association Constitution of Company.ppt
Memorandum Of Association Constitution of Company.ppt
 
Set off and carry forward of losses and assessment of individuals.pptx
Set off and carry forward of losses and assessment of individuals.pptxSet off and carry forward of losses and assessment of individuals.pptx
Set off and carry forward of losses and assessment of individuals.pptx
 
Skye Residences | Extended Stay Residences Near Toronto Airport
Skye Residences | Extended Stay Residences Near Toronto AirportSkye Residences | Extended Stay Residences Near Toronto Airport
Skye Residences | Extended Stay Residences Near Toronto Airport
 
Discover the innovative and creative projects that highlight my journey throu...
Discover the innovative and creative projects that highlight my journey throu...Discover the innovative and creative projects that highlight my journey throu...
Discover the innovative and creative projects that highlight my journey throu...
 
Business Valuation Principles for Entrepreneurs
Business Valuation Principles for EntrepreneursBusiness Valuation Principles for Entrepreneurs
Business Valuation Principles for Entrepreneurs
 
Putting the SPARK into Virtual Training.pptx
Putting the SPARK into Virtual Training.pptxPutting the SPARK into Virtual Training.pptx
Putting the SPARK into Virtual Training.pptx
 

Korea - SSML_workshop.ppt

  • 1. 1 W3C Workshop on Internationalizing SSML SSML Extension for Korean Workshop : 2005/11/02 (Wed) Sang-Jin Kim sangjin@icu.ac.kr
  • 2. 2 Contents  Characteristic of Korean  SSML Extension for Chinese Characters in Korean  SSML Extension for Homograph Words in Korean  Conclusion
  • 3. 3 Characteristic of Korean  Hangul, The Korean Character  Consists of forty letters  21 vowels (including 13 diphthongs), and 19 consonants  Syllable  V, CV, VC, and CVC (C : consonant, V : vowel)  Eojeol, the word phrase is different from a phrase in English  Completely different from Japanese except for the grammatical structure  Completely different from Chinese although Korean has borrowed many Chinese words and some Chinese characters
  • 4. 4 Characteristic of Korean  Vowels in Hangul, The Korean Character  Monothong vowels classified according to tongue position and height
  • 5. 5 Characteristic of Korean  Consonants in Hangul, The Korean Character  Consonants classified according to place and manner of articulation
  • 6. 6 SSML Extension for Chinese Characters in Korean  Chinese Characters in Korean  Present Korean and Japanese use many Chinese Characters  But, pronunciation of the characters is different  Same characters is represented differently according to the country  These simplified characters are not used in Korea
  • 7. 7 SSML Extension for Chinese Characters in Korean  Chinese Characters in Korean  We can write text only with Korean characters  Not unusual to use Chinese characters as well  The pronunciation of the are exactly same
  • 8. 8 SSML Extension for Chinese Characters in Korean  Chinese Characters in Korean TTS  The input text for text-to-speech(TTS) system has to be converted into a phonetic list  If Chinese characters are mixed with Korean characters, they have to be substituted to Korean  We don’t use all Chinese characters, rather there is a frequently-used-Chinese-character-list recommended by our Korean government and its size is 2000  We need to utilize this list and their pronunciations in the Korean TTS system, since the pronunciations of them are different from Chinese and Japanese
  • 9. 9 SSML Extension for Chinese Characters in Korean  SSML Extension for Chinese Characters in Korean  Same characters but different pronunciation in Chinese Characters according to the country <lexicon xml:lang=”ko” uri=”http://www.multilingual.org/lexicon.file”> <lexicon xml:lang=”ko-CN” uri=”http://www.multilingual.org/Chinese_lexicon_freq_KR.file”> <lexicon xml:lang=”ko-CN” uri=”http://www.multilingual.org/Chinese_lexicon_technical.file”> <lexicon xml:lang=”ja-KR” uri=”http://www.multilingual.org/Chinese_lexicon_JP.file”> <lexicon xml:lang=”cn-KR” uri=”http://www.multilingual.org/Chinese_lexicon_CN.file”>
  • 10. 10 SSML Extension for Homograph Words in Korean  Homograph Words in Korean  Same word, different pronunciation, different meaning  The difference is “duration”
  • 11. 11 SSML Extension for Homograph Words in Korean  SSML Extension for Homograph Words in Korean  Only the difference for these words is the duration in pronunciation  necessary to give the duration information to a TTS system for these kinds of words  SSML recommendation supports “say-as” element and “sub” element, these elements cannot handle the above problem successfully
  • 12. 12 SSML Extension for Homograph Words in Korean  SSML Extension for Homograph Words in Korean  We suggest “tone” tag for this problem  Attribute values for tone element are ‘long’, ‘short’ and ‘default’ would be enough for Korean.
  • 13. 13 Conclusion  SSML Extension for Chinese Characters in Korean  lexicon element doesn’t support “xml:lang” tag  We suggest xml:lang=“ko”, xml:lang=“ko-CN”, xml:lang=“ja- KR”, xml:lang=“cn-KR” tags  SSML Extension for Homograph Words in Korean  “say-as” and “sub” elements cannot handle homograph problem successfully  We suggest “tone” element  Attribute values, type=“long”, type=“short”, and type=“default” would be enough for Korean