Classical Chinese Poetry Automation
and
Chinese Text Analysis
Minghui Yu
Agenda
1. About me
2. Classical Chinese poetry automation
• Classical Chinese poetry features
• Challenges
• Progress
3. Chinese text analysis
• Chinese vs English
• As a research tool
About me
• Minghui Yu
- Programmer Analyst, UBC IT (Mon-Fri, 9 AM-5 PM)
- Classical Chinese poetry enthusiast, amateur poem writer (rest of the day)
- MBA (SFU, 2016)
Childhood dream Today’s me
Poetry writing: Art or Science?
Classical Chinese Poetry Features
Conciseness
- Usually short
- e.g., many are only 20 Chinese characters long
Forms(*)
- Rhyming
- Tone
- Vocabulary
*Regulated verse, dating from the early Tang Dynasty (618–907)
A poem example
(L) O L L O
红 豆 生 南 国
L L O O L
春 来 发 几 枝
(O) L L O O
愿 君 多 采 撷
O O O L L
此 物 最 相 思
Tone Pattern:
(L/O)-O-L-L-O
L-L-O-O-L
(L/O)-L-L-O-O
(L/O)-O-O-L-L
L: Level tone
O: Oblique tone
(L/O): Can be either
Rhyming pattern:
枝 Zhi 思 Si
相思 Miss You
Red beans in the South grow.
Spring comes, sprays aglow.
Pick more beans, may thou?
Of love they're the best show.
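The tone pattern above can be checked mechanically. The following is a minimal sketch, not a full implementation: the tone labels are copied by hand from this slide, whereas a real system would look them up in a tone dictionary. The names `POEM`, `PATTERN`, `line_matches`, and `poem_matches` are made up for illustration, and "?" is an assumed shorthand for the slide's (L/O).

```python
# Tone labels copied from the slide: L = level tone, O = oblique tone.
# Assumption: a real system would read these from a tone dictionary.
POEM = [
    ("红豆生南国", "LOLLO"),
    ("春来发几枝", "LLOOL"),
    ("愿君多采撷", "OLLOO"),
    ("此物最相思", "OOOLL"),
]

# "?" stands for the slide's (L/O): either tone is accepted in that slot.
PATTERN = ["?OLLO", "LLOOL", "?LLOO", "?OOLL"]


def line_matches(tones: str, pattern: str) -> bool:
    """Return True if every slot equals the pattern or the pattern slot is '?'."""
    if len(tones) != len(pattern):
        return False
    return all(p == "?" or p == t for t, p in zip(tones, pattern))


def poem_matches(poem, pattern) -> bool:
    """Check a whole poem, line by line, against a tone pattern."""
    if len(poem) != len(pattern):
        return False
    return all(line_matches(tones, pat) for (_, tones), pat in zip(poem, pattern))


if __name__ == "__main__":
    print(poem_matches(POEM, PATTERN))
```

Only the first character of a line is flexible in this pattern, so the check reduces to a position-by-position string comparison.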
Challenges
• Syntax (easy)
- Tone (level or oblique tone)
- Rhyme (e.g., Zhi, Si)
• Semantics (harder)
- Replace "Spring" with "Autumn": "Autumn comes, sprays aglow"?
• Pragmatics (hardest)
- If Autumn, then perhaps chrysanthemum?
- Is chrysanthemum a good metaphor for love?
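The "easy" syntax level can be illustrated with a rhyme check. This is a rough sketch under stated assumptions: `RHYME_GROUP` is a hand-made toy table covering only the rhyming characters that appear in these slides (a real checker would load a complete rhyme dictionary), and `rhymes` is a hypothetical helper name. By default it assumes the rhyme falls on the second and fourth lines, as in the example poem.

```python
# Toy rhyme table (assumption): character -> rhyme group.
# Only the rhyming characters from the slides are listed.
RHYME_GROUP = {
    "枝": "上平四支",
    "思": "上平四支",
    "危": "上平四支",
    "谁": "上平四支",
}


def rhymes(lines, rhyme_positions=(1, 3)):
    """True if the last characters of the chosen lines share one known rhyme group."""
    groups = {RHYME_GROUP.get(lines[i][-1]) for i in rhyme_positions}
    # A character missing from the table yields None and fails the check.
    return len(groups) == 1 and None not in groups


if __name__ == "__main__":
    print(rhymes(["红豆生南国", "春来发几枝", "愿君多采撷", "此物最相思"]))
```

Semantics and pragmatics have no comparable table lookup, which is why they are harder.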
Progress
Word Salad
- Nonsense "poem"
- Syntax okay
Computer-aided; template-based
- More like an advanced proofreading/editing tool
Statistical Modeling(*)
- Case-based reasoning
- Phrase-based
- Sentence Poem Block
Machine Learning(*):
- Recurrent neural networks
- Genetic algorithm
*Both use statistical concepts and techniques extensively; there is no clear boundary between them.
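The "word salad" stage is easy to reproduce: fill each slot of a tone pattern with a random character of the right tone. The sketch below assumes a tiny lexicon (`LEXICON`) built from the characters and tone labels of the example poem earlier in the deck. The function name `word_salad` and the "?" wildcard are illustrative choices, not taken from any of the systems listed above.

```python
import random

# Hypothetical mini-lexicon: characters grouped by tone (L = level, O = oblique),
# taken from the tone-annotated example poem earlier in the deck.
LEXICON = {
    "L": list("红生南春来枝君多相思"),
    "O": list("豆国发几愿采撷此物最"),
}

# Reverse lookup, handy for verifying the output.
TONE_OF = {ch: tone for tone, chars in LEXICON.items() for ch in chars}


def word_salad(pattern, rng):
    """Fill each slot of a tone pattern with a random character of a matching tone."""
    lines = []
    for line_pattern in pattern:
        chars = []
        for slot in line_pattern:
            # "?" means either tone is allowed, so pick one at random.
            tone = rng.choice("LO") if slot == "?" else slot
            chars.append(rng.choice(LEXICON[tone]))
        lines.append("".join(chars))
    return lines


if __name__ == "__main__":
    for line in word_salad(["?OLLO", "LLOOL", "?LLOO", "?OOLL"], random.Random()):
        print(line)
```

The output satisfies the tone pattern by construction but carries no meaning, which is the gap the statistical and machine-learning approaches try to close.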
Goal
Input
Topic:
Sense of urgency (忧患意识)
Keywords:
Great Wall (长城)
Dilapidated (破落)
Rhyme:
Level tone: Zhi (上平四支)
Tone:
(O)OLLO,
LLOOL.
(L)LLOO,
(O)OOLL.
Processing…
Statistical Modeling
Machine Learning
Output
四海承平久,
长城日渐危.
明朝胡犯我,
御虏又凭谁?
The country has been peaceful for long,
The Great Wall is dilapidated and broken down,
Shall enemies pour forces on,
Whom can I depend upon?
By the way, I wrote this poem, not a computer :)
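The input side of this goal can be written down as a small data structure. The sketch below is an assumption-laden illustration: `PoemRequest` and `parse_tone_line` are hypothetical names, and a parenthesised slot such as "(O)" is read as "either tone allowed", following the (L/O) legend on the earlier slide. The generation step itself is not shown.

```python
import re
from dataclasses import dataclass


@dataclass
class PoemRequest:
    """Hypothetical container for the four inputs shown on the slide."""
    topic: str
    keywords: list
    rhyme_group: str
    tone_pattern: list  # one string per line, e.g. "(O)OLLO"


def parse_tone_line(spec: str) -> list:
    """Turn "(O)OLLO," into ["?", "O", "L", "L", "O"].

    Assumption: a parenthesised slot accepts either tone; punctuation is ignored.
    """
    tokens = re.findall(r"\([LO]\)|[LO]", spec)
    return ["?" if token.startswith("(") else token for token in tokens]


request = PoemRequest(
    topic="忧患意识",
    keywords=["长城", "破落"],
    rhyme_group="上平四支",
    tone_pattern=["(O)OLLO,", "LLOOL.", "(L)LLOO,", "(O)OOLL."],
)

# One list of slot constraints per line of the requested poem.
parsed = [parse_tone_line(line) for line in request.tone_pattern]
```

A generator built on top of this would have to satisfy `parsed` and `rhyme_group` (syntax) while also staying on `topic` and using `keywords` (semantics), which is the hard part.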
Not easy
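• 向晚黄鹂隔叶鸣 (Toward evening, an oriole sings from behind the leaves)
• 唤回残梦两三声 (Two or three notes call back the remnants of a dream)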
• 向晚: early evening/late afternoon
• 残梦: messy dreams (memory of dreams)
Messy dreams in the early evening/late afternoon?
Sounds like a semantic error.
But doable
白鹭窥鱼立
Egrets stood, peeping fishes
青山照水开
Water was still, reflecting mountains
夜来风不动
The wind went down by nightfall,
明月见楼台
as the moon came up by the tower.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language
Processing (EMNLP), pages 670–680
Question:
An egret is any of several herons. Some herons hunt at night.
But do egrets hunt at night?
If they do, this poem is good.
Otherwise, it has a semantic error too.
Chinese text analysis
Chinese vs. English text analysis
Similarities
Differences
Source text: 我叫余明辉我老家是安徽省我住在Richmond市
Source text: My name is Yu Minghui My hometown is Anhui Province I live in City of Richmond
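One concrete difference between the two source texts is tokenization: the English sentence splits on spaces, while the Chinese one has no word boundaries and must be segmented first. The sketch below uses forward maximum matching with a hand-made vocabulary (`VOCAB`). Both the vocabulary and the function name `segment` are assumptions for illustration; practical tools, such as those listed in the resources, use much larger dictionaries and statistical models.

```python
# Hypothetical toy vocabulary covering the sample sentence only.
VOCAB = {"我", "叫", "余明辉", "老家", "是", "安徽省", "住在", "市"}


def segment(text, vocab, max_len=4):
    """Forward maximum matching: at each position take the longest known word."""
    tokens = []
    i = 0
    while i < len(text):
        # Keep a run of ASCII letters/digits (e.g. "Richmond") as one token.
        if text[i].isascii() and text[i].isalnum():
            j = i
            while j < len(text) and text[j].isascii() and text[j].isalnum():
                j += 1
            tokens.append(text[i:j])
            i = j
            continue
        # Try the longest candidate first; fall back to a single character.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in vocab:
                tokens.append(candidate)
                i += size
                break
    return tokens


chinese_tokens = segment("我叫余明辉我老家是安徽省我住在Richmond市", VOCAB)
english_tokens = "My name is Yu Minghui My hometown is Anhui Province I live in City of Richmond".split()
```

After segmentation, both languages yield a list of tokens, and the later analysis steps (counting, searching, comparing) look much the same.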
As a research tool
• Literature study
• Help researchers search certain topics more efficiently
• Compare literary works
• Journalism
• Business
• … …
Very similar to English text analysis as a research tool
There are more similarities than dissimilarities
Last but not least
Curiosity
Aspiration
Beauty
Application
Why Classical Chinese poetry automation?
- The pursuit of beauty
- A journey of curiosity
- …
文章本天成
妙手偶得之 (陆游)
A great piece of work was created by Heaven,
A skilled person got it by chance
But
Can an algorithm get it by design?
Once again: poetry writing: Art or Science?
Thank you!
Some resources
Open Source Natural Language Processing (NLP) framework:
• Stanford CoreNLP: https://stanfordnlp.github.io/CoreNLP/
Some Chinese NLP application demos:
• http://bosonnlp.com/
• https://cloud.baidu.com/product/nlp.html
• http://thulac.thunlp.org
• http://www.ltp-cloud.com/
• http://nlp.qq.com/
