SlideShare a Scribd company logo
1 of 45
Human Interface Laboratory
How a Young Speech Researcher
Dived into Computational Linguistics and
Where He Heads Now
2019. 8. 26, @Working Memory
Won Ik Cho
About Me
• 조원익
 B.S. in EE/Mathematics (SNU, ’10~’14)
 Ph.D. student (SNU INMC, ‘14~)
• Academic background
 Interested in mathematics > EE!
 Double major?
• ...
 Early years in Speech processing lab
• Source separation
• Voice activity & endpoint detection
• Automatic music composition
 Currently studying on computational linguistics
1
About Me
2
Early years
• Undergraduate years
 Played guitar alone ... or with friends
3
Early years
• Why started mathematics?
 Long dreamed romance
4
From 2008!
but...
Early years
• Undergraduate design project: Music source separation
 Why? – To automatically extract and transcribe the score of polyphonic
music for guitar orchestration
• And result?
5
Image: http://kimgooni.blog.me/221427899920
Early years
• Could not give up mathematics... (although it dismissed me)
 First, aimed at cryptography laboratory
6
Image top: https://blog.goodaudience.com/cryptography-for-dummies-part-1-a811d4852daa
bottom: https://www.coindesk.com/bitcoin-hits-new-2019-high-above-8900
or...?
Early years
• Epic FAIL!
7
Early years
• How? 어케들왔누
8
Early years
• Signal processing domain revisited
9
EE!
Circuit
Power
Semiconductor
ControlBio
Communication
?
?
System
Early years
• One step toward grand dream of polyphony music transcription?
 Paper survey on ...
• Multiple pitch estimation
• Music grammar
• Automatic music composition
10
Image: https://www.youtube.com/watch?v=TwQybAwL7NY
Early years
• And reality...
11
Image: https://www.lumenvox.com/resources/caseStudies/redmond/redmond-software-3.aspx
Early years
• First paper technical report on rule-based 4-part chorus
composition!
 Done while attending to a lecture on music theory (harmony)
 EXTREMELY HEURISTIC AND NOT NATURAL!
12
Early years
• Started a new government-funded project from April 2017
 Development of free-running speech
recognition technologies for
embedded robot system
13
Image: https://www.musicaslanguage.com/, https://imgur.com/gallery/iWKad22
Dive into computational linguistics
• New task?
 Development of free-running speech recognition technologies for
embedded robot system
 로봇용 free-running 임베디드 자연어 대화음성인식을 위한 원천 기술 개발
• In other words:
 Non wake-up-word based speech understanding system
 ...?
14
오늘 또
떨어졌네
이게 대체
며칠째
파란불이냐
지금 손실이
얼마지
Dive into computational linguistics
• How?
 Related to many aspects of (speaker-dependent) speech recognition
• Speaker-dependency (in terms of a personal assistant)
• Noisy far-talk recognition and beamforming
• Speech intention understanding
– To which utterances should AI react?
15
오늘 또
떨어졌네
이게 대체
며칠째
파란불이냐
지금 손실이
얼마지
Dive into computational linguistics
• It’s about finding an internal intention of a human speech
 And in Korean?
16
Image: Top https://onepageinfo.tistory.com/52
Bottom: https://m.blog.naver.com/barim12/220831241685
데이터도 없는데 어떻게 해요!
일단 만들어!
Dive into computational linguistics
• Need to find what `questions’ and `commands’ are
• Sentence type?
 밥 먹었다
 밥 먹었니
 밥 먹어라
• But...
 너랑 오랜만에 밥 먹고 싶네
 부른지가 언젠데 밥 안 먹냐
 밥 좀 작작 먹어라
• ...?
17
Image: Top https://www.pinterest.co.kr/pin/367887863281568581/?lp=true
Bottom https://www.hkn24.com/news/articleView.html?idxno=69187
Dive into computational linguistics
• WHY??
 The utterance intention should be identified among colloquial
conversations (non-wake-up-word-based!)
 In pragmatics and sentence-level semantics, this is called `speech act’
(Searle, 1976)
• And in some cases, `dialog act’ (Stolcke et al., 2000)
• It is best to use dialog history and prosodic information, if possible
 But only text data (about 800K, unlabeled single utterances) for us
• and need to show the potential for the left years!
 Let’s first choose 20K randomly and make a labeled corpus on:
18
Dive into computational linguistics
• Intonation-dependent utterances
 How to figure out if the utterances is intonation-dependent?
19
천천히 가고 있어! (utterance)
천천 히 가 고 있 어 (transcript)
question
statement
command
?
Dive into computational linguistics
• Intonation-dependent utterances
 Underspecified sentence enders
• -어, -지, -대, -해, -라고, -다며, etc. (differs from –다, -니, -라)
• Sentence type is determined based upon the sentence-final intonations that are
assigned considering the speech act
 Conversation maxim (Levinson, 2000)
• 정보성-원리 Informativeness-principle (단순화 버전)
– 화자: 필요한 것 이상으로 말하지 말라.
» Do not say more than is required (bearing the Q-principle in mind)
– 청자: 화자가 일반적으로 말한 것은 전형적으로 그리고 특칭적으로 해석하라.
» What is generally said is stereotypically and specifically exemplified.
– e.g., 내일 학생회관에서 두시 반에 만나서 얘기해 (질문? x)
20
Dive into computational linguistics
• Introducing phonetic features: Intonation-dependency
 Annotating proper intention for possible cases of intonation
• 기본적으로 문말 억양을 고려함
• 한 가지 intonation에서 여러 intention이 가능한 경우는 ambiguous한 것으로 봄
• 부사, 수일치 등과 관련하여, 서술이 아닌 것으로 해석하기 어색한 것들은 제외함.
• 너무 많은 정보를 담고 있는 문장을 질문으로 판단하는 것을 피함
• Wh-particle들이 의문사의 기능을 하지 않는 경우들에 주의함
• 많은 한국어 문장이 그렇듯, 주어가 생략되어 1,2,3-인칭 등으로 해석할 수 있을 경
우에는, 각각을 대입해 보고, 어색하지 않은 것들로 판단함
• 호격의 유무에 주의함
21
Dive into computational linguistics
• Intention understanding – how?
 Our approach (for Korean)
22
단일 문장인가?
Intonation 정보로
결정 가능한가?
Question set이 있고
청자의 답을 필요로 하는가?
Effective한 To-do list가
청자에게 부여되는가?
No
Yes
No
Yes
요구 (Commands)
수사명령문 (RC)
Full clause를
포함하는가?
No
No
Compound sentence: 힘이 강한 화행에 중점
(서로 다른 문장도 같은 토픽일 때 한 문장으로 간주)
Fragments (FR)
질문 (Questions)
No
Context-dependent (CD)
Yes
Yes
Yes
Intonation 정보가
필요한가?
Yes
Intonation-dependent (ID)
No Questions /
Embedded form
Requirements /
Prohibitions
수사의문문 (RQ)
Target: single sentence
without context
nor punctuation
Otherwise
서술 (Statements)
Dive into computational linguistics
• System overview: Text-based sieving + Speech-aided analysis
 Compatible to text-speech alignment
23
Dive into computational linguistics
• 2017.04~2017.11
 Setup phase
• Study syntax, semantics, and pragmatics for making up an annotation
guideline for colloquial utterances
• 2017.12~2018.08
 Corpus annotation
• Along with undergraduate students (design project)
• 2018.09~2018.11
 Implement classifiers and make a documentation (paper on arXiv)
• Repository for an open sourcing
• 2018.12~
 Modifying & maintaining the repository
• https://github.com/warnikchow/3i4k
 Article (in Korean) on the annotation guideline (DisCog 26:3)
24
The Struggle within
• Project is successful (so far) - But does it bring me a publication?
 Approach #1: First, let’s make a similar guideline for English! (2017.11)
• Manual tagging on Cornell Movie dialog dataset (binary: only obligatory/non-
obligatory)
25
and REJECT! (for a signal processing conference)
... My sense here is that the two positive reviewers come from a linguistics
background, and are happy to see linguistic insights being applied to a new real-
world problem. The two negative reviewers seem more aware of the methods
currently in use for this type of problem. The field today is hugely dominated by
data-driven methods with very few linguistics insights, so I think it's important to
make room for papers like this. But at the same time, I think this is a bridge too
far for people working in this area to really engage with. There were lots of missed
opportunities – in addition to the reviewers' comments, this paper would have
been saved by experiments on a common dataset, like DSTC2 or ATIS. I therefore
hesitantly recommend reject.
The Struggle within
• How about finer categorization and bigger dataset?
 Approach #2: `question’, `command’, and `statement’
26
and REJECT! (for a computational linguistics conference)
... One weakness or point of criticism is that it did not become clear to me whether
the annotated corpus is being made available open source as a corpus for further
study ... I am not entirely convinced that it is the best idea to use abbreviations /
names for features that are so similar to established "academic" terminology for
sentence types. While the intention is obvious to point out the relationship, it might
be a good idea to make the difference more explicit in the names (int, imp, dec) ...
We learn very little about the annotation guidelines, their granularity, publication
status etc. ... We learn little about the prospective application to spoken language
corpora and the expected impact of an application to spoken / phonetic data incl.
phonological features.
The Struggle within
• And the issues solved?
 Approach #3: More on justification
27
and REJECT! (for an AI/ML conference)
... I agree with the main motivation of categorize the utterance in a conversation
based on the expected response of that one who receives the information. This
position could make procedures more dynamic and direct. However, the authors do
not make an argument that the proposed categories are sufficient for a dialogue,
they might be the main but I will suggest to pay attention to clarifications and
continuations which not necessary correspond to an answer or action. ... It is not
clear what shows the results of the classification regarding the proposal of category.
A good classification means that the proposal categories are good? I missing a
guide on how to interpret this relation. ...
The Struggle within
• Should I do it for my own language?
 Approach #4: Similar categorization in Korean, incorporating a new label
regarding acoustic cues for a head-final language
28
and REJECT! (for an NLP conference)
Summary: The paper presents an approach for intention recognition in Korean,
leveraging both text and acoustic information.
Strengths: The approach is relevant for SLU or dialog systems, and addresses issues
of recognition for head-final languages. It also considers acoustic information
specifically in those cases where acoustic cues are the best discriminators.
Weaknesses: The paper would benefit from specific comparison between the
proposed, somewhat complex architecture and other possible alternative models to
justify the system. It was unclear whether the approach was limited to head-final
languages, and Korean in particular, which would yield a narrow result, and if so,why.
The Struggle within
• There were encouraging words though!
29
The Struggle within
• And some unexpected invitations from linguistic venues/journals
30
And to speech again
• Conference to be presented: ICPhS
 International Congress of Phonetic Science
• 8/5-9, Melbourne, Australia
31
(although I couldn’t attend...)
And to speech again
• Prosody-ambiguous statements?
 Problems in: Wh-intervention?
• Needs disambiguation
32
몇 개 가져오래
Should I bring some?
How many should I bring?
They told you to bring some?
And to speech again
• Prosody-ambiguous statements? How about constructing a corpus that contains ONLY the
utterances whose syntactic ambiguity can be resolved by introducing prosody?
# Wh- particles
누구 (nwukwu, who), 뭐 (mwe, what),
어디 (eti, where), 언제 (encey, when),
어떻게 (ettehkey, how), 몇 (myech, how much)
왜 (way, why) was not utilized because it is not used as an existential quantifier
# Predicates
Depend on the wh- particle being adopted
Chosen among 5,800 frequently used lexicons
Pronouns and polarity items were added in some cases
# Reportive particles
Added to form an evidential mood
Induces rhetoricalness for some questions
# Sentence enders (SEs)
SEs with an unfixed role (underspecified SE)
e.g. -래 (ray), -어 (e), -지 (ci)
SEs with a fixed role
e.g. -까 (kka: interrogative)
# Politeness suffix
Attached at the end of a sentence to assign politeness
Restricts rhetoricalness under some circumstances
33
And to speech again
• Prosody-ambiguous statements?
 Created 1,292 sentences
 Constructed 3,552 utterances (with speech intention, under consensus)
34
And to speech again
• Prosody-ambiguous statements?
 Recorded 7,104 utterances (female/male)
35
And to speech again
• Prosody-ambiguous statements?
36
AdvisorPresenter
Editing
Wrote paper
Checked dataset & Recording
Co-author / Equal contribution
Checked dataset
Proofreading
And to speech again
• Prosody-ambiguous statements?
37
And to speech again
• Prosody-ambiguous statements?
 And in future?
• Disambiguation inspired by neuro-scientific phenomenon
38
Interdisciplinary
• Cowork with the friends of similar interest
39
Interdisciplinary
• AI ethics? (ACL workshop topic)
 Measuring gender bias in machine translation
 Originally claimed to deal with the proposal of government project ... but
40
Done and afterward
• Done
 억양 의존성 및 rhetoricalness를 고려한, 음성인식 output 분석에 적합한 일
반언어학적 speech act 분류 방법론 제시
 한국어를 위한 annotation guideline 정립, corpus 구성 및 모델 학습
 한국어 Speech intention의 disambiguation을 위한 corpus 구성
 질문/요구 paraphrasing 위한 parallel corpus 제작 (under progress)
• Afterward?
 Speech disambiguation을 위한 co-attention framework 개량
 대화체/비정형 질문/요구의 structured paraphrasing
 Task-oriented와 non-oriented 간 code switching이 자유로운 dialog
manager 시스템의 개발
41
Done and afterward
• Where do I head now?
42
Image: Top https://phdcomics.com/comics/archive_print.php?comicid=1733
Bottom: https://slideplayer.com/slide/15366786/
Reference
• Searle, John R. A classification of illocutionary acts. Language in society 5.1 (1976): 1-23.
• Stolcke, Andreas, et al. Dialogue act modeling for automatic tagging and recognition of
conversational speech. Computational linguistics 26.3 (2000): 339-373.
• Levinson, Stephen C. Presumptive meanings: The theory of generalized conversational
implicature. MIT press (2000).
• Cho, Won Ik, Hyeon Seung Lee, Ji Won Yoon, Seok Min Kim, and Nam Soo Kim. Speech
intention understanding in a head-final language: A disambiguation utilizing intonation-
dependency. arXiv preprint arXiv:1811.04231 (2018).
• Cho, Won Ik, and Nam Soo Kim. Discourse component-based Korean speech act categorization
to resolve the vagueness in understanding text intention: A computational linguistics
perspective. Discourse and Cognition 26.3 (2019): 227-247. [Korean]
• Cho, Won Ik*, Jeonghwa Cho*, Jeemin Kang, and Nam Soo Kim. Prosody-semantics interface in
Seoul Korean: Corpus for a disambiguation of wh- intervention. Proc. ICPhS (2019): 3902-3906.
43
Thank you!
EndOfPresentation

More Related Content

What's hot

Assessing reading
Assessing readingAssessing reading
Assessing readingmrssuarez
 
Year 9 assignment.imaginative literary transformation
Year 9 assignment.imaginative literary transformationYear 9 assignment.imaginative literary transformation
Year 9 assignment.imaginative literary transformationjennifer_lawrence
 
Personalised statistical writing analysis
Personalised statistical writing analysisPersonalised statistical writing analysis
Personalised statistical writing analysisjohn6938
 
Language Assessment - Assessing Reading Full Description with Picture and Cha...
Language Assessment - Assessing Reading Full Description with Picture and Cha...Language Assessment - Assessing Reading Full Description with Picture and Cha...
Language Assessment - Assessing Reading Full Description with Picture and Cha...EFL Learning
 
English revision 2015
English revision 2015English revision 2015
English revision 2015Emma Sinclair
 
How to assess and test reading
How to assess and test readingHow to assess and test reading
How to assess and test readingLeslie Gomez
 
As 1.1 excellence exemplar
As 1.1 excellence exemplarAs 1.1 excellence exemplar
As 1.1 excellence exemplarsohana1
 
Assessment criteria3ºesophase2ppp copia
Assessment criteria3ºesophase2ppp   copiaAssessment criteria3ºesophase2ppp   copia
Assessment criteria3ºesophase2ppp copiarogapuebla
 

What's hot (15)

Lecture1
Lecture1Lecture1
Lecture1
 
Lecture1
Lecture1Lecture1
Lecture1
 
Amherst 8 09
Amherst 8 09Amherst 8 09
Amherst 8 09
 
Assessing reading
Assessing readingAssessing reading
Assessing reading
 
Year 9 assignment.imaginative literary transformation
Year 9 assignment.imaginative literary transformationYear 9 assignment.imaginative literary transformation
Year 9 assignment.imaginative literary transformation
 
Personalised statistical writing analysis
Personalised statistical writing analysisPersonalised statistical writing analysis
Personalised statistical writing analysis
 
Language Assessment - Assessing Reading Full Description with Picture and Cha...
Language Assessment - Assessing Reading Full Description with Picture and Cha...Language Assessment - Assessing Reading Full Description with Picture and Cha...
Language Assessment - Assessing Reading Full Description with Picture and Cha...
 
Developmental reading
Developmental readingDevelopmental reading
Developmental reading
 
English revision 2015
English revision 2015English revision 2015
English revision 2015
 
Rubrics sample
Rubrics sampleRubrics sample
Rubrics sample
 
How to assess and test reading
How to assess and test readingHow to assess and test reading
How to assess and test reading
 
As 1.1 excellence exemplar
As 1.1 excellence exemplarAs 1.1 excellence exemplar
As 1.1 excellence exemplar
 
Syllabus 9
Syllabus 9Syllabus 9
Syllabus 9
 
Assessment criteria3ºesophase2ppp copia
Assessment criteria3ºesophase2ppp   copiaAssessment criteria3ºesophase2ppp   copia
Assessment criteria3ºesophase2ppp copia
 
Testing Reading
Testing ReadingTesting Reading
Testing Reading
 

Similar to 1908 working memory

Nlp presentation
Nlp presentationNlp presentation
Nlp presentationSurya Sg
 
Teaching Listening
Teaching ListeningTeaching Listening
Teaching Listeningbrandybarter
 
Speaking another language
Speaking another languageSpeaking another language
Speaking another languageyounes Anas
 
Warnikchow - SAIT - 0529
Warnikchow - SAIT - 0529Warnikchow - SAIT - 0529
Warnikchow - SAIT - 0529WarNik Chow
 
MANAGEMENT OF CULTURAL ENTITIES IN SPOKEN ENGLISH: A Discourse Analysis
MANAGEMENT OF  CULTURAL ENTITIES IN SPOKEN ENGLISH:  A Discourse AnalysisMANAGEMENT OF  CULTURAL ENTITIES IN SPOKEN ENGLISH:  A Discourse Analysis
MANAGEMENT OF CULTURAL ENTITIES IN SPOKEN ENGLISH: A Discourse Analysisbharathirajas6
 
Marriage of speech, vision and natural language processing
Marriage of speech, vision and natural language processingMarriage of speech, vision and natural language processing
Marriage of speech, vision and natural language processingYaman Kumar
 
Week 1 class Introduction ntut pragmatics.pdf
Week 1 class Introduction ntut pragmatics.pdfWeek 1 class Introduction ntut pragmatics.pdf
Week 1 class Introduction ntut pragmatics.pdfripipurba11
 
Global accessibility awareness day 2021
Global accessibility awareness day 2021  Global accessibility awareness day 2021
Global accessibility awareness day 2021 Amy Czuba
 
Definiendo el enfoque lfe
Definiendo el enfoque lfeDefiniendo el enfoque lfe
Definiendo el enfoque lfeNatalia Rojo
 
Hortatory exposition 1
Hortatory exposition 1Hortatory exposition 1
Hortatory exposition 1sman 2 mataram
 
Discussion Board Guidelines Students must respond individually
Discussion Board Guidelines Students must respond individuallyDiscussion Board Guidelines Students must respond individually
Discussion Board Guidelines Students must respond individuallyLyndonPelletier761
 
Research writing tips to prevent journal rejection: 5 Ground truths for clear...
Research writing tips to prevent journal rejection: 5 Ground truths for clear...Research writing tips to prevent journal rejection: 5 Ground truths for clear...
Research writing tips to prevent journal rejection: 5 Ground truths for clear...Nigel Daly
 
nlp-01.pptxvvvffffffvvvvvfeddeeddffffffffff
nlp-01.pptxvvvffffffvvvvvfeddeeddffffffffffnlp-01.pptxvvvffffffvvvvvfeddeeddffffffffff
nlp-01.pptxvvvffffffvvvvvfeddeeddffffffffffSushantVyas1
 
Evo research topics to r qs (judith hanks), january 2016 (1)
Evo research topics to r qs (judith hanks), january 2016 (1)Evo research topics to r qs (judith hanks), january 2016 (1)
Evo research topics to r qs (judith hanks), january 2016 (1)ClassResearchEVO
 
Natural Language Processing: L01 introduction
Natural Language Processing: L01 introductionNatural Language Processing: L01 introduction
Natural Language Processing: L01 introductionananth
 

Similar to 1908 working memory (20)

Nlp presentation
Nlp presentationNlp presentation
Nlp presentation
 
Teaching Listening
Teaching ListeningTeaching Listening
Teaching Listening
 
Speaking another language
Speaking another languageSpeaking another language
Speaking another language
 
Warnikchow - SAIT - 0529
Warnikchow - SAIT - 0529Warnikchow - SAIT - 0529
Warnikchow - SAIT - 0529
 
MANAGEMENT OF CULTURAL ENTITIES IN SPOKEN ENGLISH: A Discourse Analysis
MANAGEMENT OF  CULTURAL ENTITIES IN SPOKEN ENGLISH:  A Discourse AnalysisMANAGEMENT OF  CULTURAL ENTITIES IN SPOKEN ENGLISH:  A Discourse Analysis
MANAGEMENT OF CULTURAL ENTITIES IN SPOKEN ENGLISH: A Discourse Analysis
 
Marriage of speech, vision and natural language processing
Marriage of speech, vision and natural language processingMarriage of speech, vision and natural language processing
Marriage of speech, vision and natural language processing
 
Week 1 class Introduction ntut pragmatics.pdf
Week 1 class Introduction ntut pragmatics.pdfWeek 1 class Introduction ntut pragmatics.pdf
Week 1 class Introduction ntut pragmatics.pdf
 
Global accessibility awareness day 2021
Global accessibility awareness day 2021  Global accessibility awareness day 2021
Global accessibility awareness day 2021
 
Definiendo el enfoque lfe
Definiendo el enfoque lfeDefiniendo el enfoque lfe
Definiendo el enfoque lfe
 
Hortatory exposition 1
Hortatory exposition 1Hortatory exposition 1
Hortatory exposition 1
 
Discussion Board Guidelines Students must respond individually
Discussion Board Guidelines Students must respond individuallyDiscussion Board Guidelines Students must respond individually
Discussion Board Guidelines Students must respond individually
 
17. assessments
17. assessments17. assessments
17. assessments
 
Research writing tips to prevent journal rejection: 5 Ground truths for clear...
Research writing tips to prevent journal rejection: 5 Ground truths for clear...Research writing tips to prevent journal rejection: 5 Ground truths for clear...
Research writing tips to prevent journal rejection: 5 Ground truths for clear...
 
Approaches to ESAP Elmira Kocheva
Approaches to ESAP Elmira KochevaApproaches to ESAP Elmira Kocheva
Approaches to ESAP Elmira Kocheva
 
Discourse
Discourse Discourse
Discourse
 
nlp-01.pptxvvvffffffvvvvvfeddeeddffffffffff
nlp-01.pptxvvvffffffvvvvvfeddeeddffffffffffnlp-01.pptxvvvffffffvvvvvfeddeeddffffffffff
nlp-01.pptxvvvffffffvvvvvfeddeeddffffffffff
 
Webinar 2: LIST 4373
Webinar 2: LIST 4373Webinar 2: LIST 4373
Webinar 2: LIST 4373
 
Euro Exam
Euro Exam Euro Exam
Euro Exam
 
Evo research topics to r qs (judith hanks), january 2016 (1)
Evo research topics to r qs (judith hanks), january 2016 (1)Evo research topics to r qs (judith hanks), january 2016 (1)
Evo research topics to r qs (judith hanks), january 2016 (1)
 
Natural Language Processing: L01 introduction
Natural Language Processing: L01 introductionNatural Language Processing: L01 introduction
Natural Language Processing: L01 introduction
 

More from WarNik Chow

2206 FAccT_inperson
2206 FAccT_inperson2206 FAccT_inperson
2206 FAccT_inpersonWarNik Chow
 
2204 Kakao talk on Hate speech dataset
2204 Kakao talk on Hate speech dataset2204 Kakao talk on Hate speech dataset
2204 Kakao talk on Hate speech datasetWarNik Chow
 
2108 [LangCon2021] kosp2e
2108 [LangCon2021] kosp2e2108 [LangCon2021] kosp2e
2108 [LangCon2021] kosp2eWarNik Chow
 
2102 Redone seminar
2102 Redone seminar2102 Redone seminar
2102 Redone seminarWarNik Chow
 
2010 INTERSPEECH
2010 INTERSPEECH 2010 INTERSPEECH
2010 INTERSPEECH WarNik Chow
 
2010 PACLIC - pay attention to categories
2010 PACLIC - pay attention to categories2010 PACLIC - pay attention to categories
2010 PACLIC - pay attention to categoriesWarNik Chow
 
2010 HCLT Hate Speech
2010 HCLT Hate Speech2010 HCLT Hate Speech
2010 HCLT Hate SpeechWarNik Chow
 

More from WarNik Chow (20)

2312 PACLIC
2312 PACLIC2312 PACLIC
2312 PACLIC
 
2311 EAAMO
2311 EAAMO2311 EAAMO
2311 EAAMO
 
2211 HCOMP
2211 HCOMP2211 HCOMP
2211 HCOMP
 
2211 APSIPA
2211 APSIPA2211 APSIPA
2211 APSIPA
 
2211 AACL
2211 AACL2211 AACL
2211 AACL
 
2210 CODI
2210 CODI2210 CODI
2210 CODI
 
2206 FAccT_inperson
2206 FAccT_inperson2206 FAccT_inperson
2206 FAccT_inperson
 
2206 Modupop!
2206 Modupop!2206 Modupop!
2206 Modupop!
 
2204 Kakao talk on Hate speech dataset
2204 Kakao talk on Hate speech dataset2204 Kakao talk on Hate speech dataset
2204 Kakao talk on Hate speech dataset
 
2108 [LangCon2021] kosp2e
2108 [LangCon2021] kosp2e2108 [LangCon2021] kosp2e
2108 [LangCon2021] kosp2e
 
2106 PRSLLS
2106 PRSLLS2106 PRSLLS
2106 PRSLLS
 
2106 JWLLP
2106 JWLLP2106 JWLLP
2106 JWLLP
 
2106 ACM DIS
2106 ACM DIS2106 ACM DIS
2106 ACM DIS
 
2104 Talk @SSU
2104 Talk @SSU2104 Talk @SSU
2104 Talk @SSU
 
2103 ACM FAccT
2103 ACM FAccT2103 ACM FAccT
2103 ACM FAccT
 
2102 Redone seminar
2102 Redone seminar2102 Redone seminar
2102 Redone seminar
 
2011 NLP-OSS
2011 NLP-OSS2011 NLP-OSS
2011 NLP-OSS
 
2010 INTERSPEECH
2010 INTERSPEECH 2010 INTERSPEECH
2010 INTERSPEECH
 
2010 PACLIC - pay attention to categories
2010 PACLIC - pay attention to categories2010 PACLIC - pay attention to categories
2010 PACLIC - pay attention to categories
 
2010 HCLT Hate Speech
2010 HCLT Hate Speech2010 HCLT Hate Speech
2010 HCLT Hate Speech
 

Recently uploaded

Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 

Recently uploaded (20)

Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 

1908 working memory

  • 1. Human Interface Laboratory How a Young Speech Researcher Dived into Computational Linguistics and Where He Heads Now 2019. 8. 26, @Working Memory Won Ik Cho
  • 2. About Me • 조원익  B.S. in EE/Mathematics (SNU, ’10~’14)  Ph.D. student (SNU INMC, ‘14~) • Academic background  Interested in mathematics > EE!  Double major? • ...  Early years in Speech processing lab • Source separation • Voice activity & endpoint detection • Automatic music composition  Currently studying on computational linguistics 1
  • 4. Early years • Undergraduate years  Played guitar alone ... or with friends 3
  • 5. Early years • Why started mathematics?  Long dreamed romance 4 From 2008! but...
  • 6. Early years • Undergraduate design project: Music source separation  Why? – To automatically extract and transcribe the score of polyphonic music for guitar orchestration • And result? 5 Image: http://kimgooni.blog.me/221427899920
  • 7. Early years • Could not give up mathematics... (although it dismissed me)  First, aimed at cryptography laboratory 6 Image top: https://blog.goodaudience.com/cryptography-for-dummies-part-1-a811d4852daa bottom: https://www.coindesk.com/bitcoin-hits-new-2019-high-above-8900 or...?
  • 9. Early years • How? 어케들왔누 8
  • 10. Early years • Signal processing domain revisited 9 EE! Circuit Power Semiconductor ControlBio Communication ? ? System
  • 11. Early years • One step toward grand dream of polyphony music transcription?  Paper survey on ... • Multiple pitch estimation • Music grammar • Automatic music composition 10 Image: https://www.youtube.com/watch?v=TwQybAwL7NY
  • 12. Early years • And reality... 11 Image: https://www.lumenvox.com/resources/caseStudies/redmond/redmond-software-3.aspx
  • 13. Early years • First paper technical report on rule-based 4-part chorus composition!  Done while attending to a lecture on music theory (harmony)  EXTREMELY HEURISTIC AND NOT NATURAL! 12
  • 14. Early years • Started a new government-funded project from April 2017  Development of free-running speech recognition technologies for embedded robot system 13 Image: https://www.musicaslanguage.com/, https://imgur.com/gallery/iWKad22
  • 15. Dive into computational linguistics • New task?  Development of free-running speech recognition technologies for embedded robot system  로봇용 free-running 임베디드 자연어 대화음성인식을 위한 원천 기술 개발 • In other words:  Non wake-up-word based speech understanding system  ...? 14 오늘 또 떨어졌네 이게 대체 며칠째 파란불이냐 지금 손실이 얼마지
  • 16. Dive into computational linguistics • How?  Related to many aspects of (speaker-dependent) speech recognition • Speaker-dependency (in terms of a personal assistant) • Noisy far-talk recognition and beamforming • Speech intention understanding – To which utterances should AI react? 15 오늘 또 떨어졌네 이게 대체 며칠째 파란불이냐 지금 손실이 얼마지
  • 17. Dive into computational linguistics • It’s about finding an internal intention of a human speech  And in Korean? 16 Image: Top https://onepageinfo.tistory.com/52 Bottom: https://m.blog.naver.com/barim12/220831241685 데이터도 없는데 어떻게 해요! 일단 만들어!
  • 18. Dive into computational linguistics • Need to find what `questions’ and `commands’ are • Sentence type?  밥 먹었다  밥 먹었니  밥 먹어라 • But...  너랑 오랜만에 밥 먹고 싶네  부른지가 언젠데 밥 안 먹냐  밥 좀 작작 먹어라 • ...? 17 Image: Top https://www.pinterest.co.kr/pin/367887863281568581/?lp=true Bottom https://www.hkn24.com/news/articleView.html?idxno=69187
  • 19. Dive into computational linguistics • WHY??  The utterance intention should be identified among colloquial conversations (non-wake-up-word-based!)  In pragmatics and sentence-level semantics, this is called `speech act’ (Searle, 1976) • And in some cases, `dialog act’ (Stolcke et al., 2000) • It is best to use dialog history and prosodic information, if possible  But only text data (about 800K, unlabeled single utterances) for us • and need to show the potential for the left years!  Let’s first choose 20K randomly and make a labeled corpus on: 18
  • 20. Dive into computational linguistics • Intonation-dependent utterances  How to figure out if the utterances is intonation-dependent? 19 천천히 가고 있어! (utterance) 천천 히 가 고 있 어 (transcript) question statement command ?
  • 21. Dive into computational linguistics • Intonation-dependent utterances  Underspecified sentence enders • -어, -지, -대, -해, -라고, -다며, etc. (differs from –다, -니, -라) • Sentence type is determined based upon the sentence-final intonations that are assigned considering the speech act  Conversation maxim (Levinson, 2000) • 정보성-원리 Informativeness-principle (단순화 버전) – 화자: 필요한 것 이상으로 말하지 말라. » Do not say more than is required (bearing the Q-principle in mind) – 청자: 화자가 일반적으로 말한 것은 전형적으로 그리고 특칭적으로 해석하라. » What is generally said is stereotypically and specifically exemplified. – e.g., 내일 학생회관에서 두시 반에 만나서 얘기해 (질문? x) 20
  • 22. Dive into computational linguistics • Introducing phonetic features: Intonation-dependency  Annotating proper intention for possible cases of intonation • 기본적으로 문말 억양을 고려함 • 한 가지 intonation에서 여러 intention이 가능한 경우는 ambiguous한 것으로 봄 • 부사, 수일치 등과 관련하여, 서술이 아닌 것으로 해석하기 어색한 것들은 제외함. • 너무 많은 정보를 담고 있는 문장을 질문으로 판단하는 것을 피함 • Wh-particle들이 의문사의 기능을 하지 않는 경우들에 주의함 • 많은 한국어 문장이 그렇듯, 주어가 생략되어 1,2,3-인칭 등으로 해석할 수 있을 경 우에는, 각각을 대입해 보고, 어색하지 않은 것들로 판단함 • 호격의 유무에 주의함 21
  • 23. Dive into computational linguistics • Intention understanding – how?  Our approach (for Korean) 22 단일 문장인가? Intonation 정보로 결정 가능한가? Question set이 있고 청자의 답을 필요로 하는가? Effective한 To-do list가 청자에게 부여되는가? No Yes No Yes 요구 (Commands) 수사명령문 (RC) Full clause를 포함하는가? No No Compound sentence: 힘이 강한 화행에 중점 (서로 다른 문장도 같은 토픽일 때 한 문장으로 간주) Fragments (FR) 질문 (Questions) No Context-dependent (CD) Yes Yes Yes Intonation 정보가 필요한가? Yes Intonation-dependent (ID) No Questions / Embedded form Requirements / Prohibitions 수사의문문 (RQ) Target: single sentence without context nor punctuation Otherwise 서술 (Statements)
  • 24. Dive into computational linguistics • System overview: Text-based sieving + Speech-aided analysis  Compatible to text-speech alignment 23
  • 25. Dive into computational linguistics • 2017.04~2017.11  Setup phase • Study syntax, semantics, and pragmatics for making up an annotation guideline for colloquial utterances • 2017.12~2018.08  Corpus annotation • Along with undergraduate students (design project) • 2018.09~2018.11  Implement classifiers and make a documentation (paper on arXiv) • Repository for an open sourcing • 2018.12~  Modifying & maintaining the repository • https://github.com/warnikchow/3i4k  Article (in Korean) on the annotation guideline (DisCog 26:3) 24
  • 26. The Struggle within • Project is successful (so far) - But does it bring me a publication?  Approach #1: First, let’s make a similar guideline for English! (2017.11) • Manual tagging on Cornell Movie dialog dataset (binary: only obligatory/non- obligatory) 25 and REJECT! (for a signal processing conference) ... My sense here is that the two positive reviewers come from a linguistics background, and are happy to see linguistic insights being applied to a new real- world problem. The two negative reviewers seem more aware of the methods currently in use for this type of problem. The field today is hugely dominated by data-driven methods with very few linguistics insights, so I think it's important to make room for papers like this. But at the same time, I think this is a bridge too far for people working in this area to really engage with. There were lots of missed opportunities – in addition to the reviewers' comments, this paper would have been saved by experiments on a common dataset, like DSTC2 or ATIS. I therefore hesitantly recommend reject.
  • 27. The Struggle within • How about finer categorization and bigger dataset?  Approach #2: `question’, `command’, and `statement’ 26 and REJECT! (for a computational linguistics conference) ... One weakness or point of criticism is that it did not become clear to me whether the annotated corpus is being made available open source as a corpus for further study ... I am not entirely convinced that it is the best idea to use abbreviations / names for features that are so similar to established "academic" terminology for sentence types. While the intention is obvious to point out the relationship, it might be a good idea to make the difference more explicit in the names (int, imp, dec) ... We learn very little about the annotation guidelines, their granularity, publication status etc. ... We learn little about the prospective application to spoken language corpora and the expected impact of an application to spoken / phonetic data incl. phonological features.
  • 28. The Struggle within • And the issues solved?  Approach #3: More on justification 27 and REJECT! (for an AI/ML conference) ... I agree with the main motivation of categorize the utterance in a conversation based on the expected response of that one who receives the information. This position could make procedures more dynamic and direct. However, the authors do not make an argument that the proposed categories are sufficient for a dialogue, they might be the main but I will suggest to pay attention to clarifications and continuations which not necessary correspond to an answer or action. ... It is not clear what shows the results of the classification regarding the proposal of category. A good classification means that the proposal categories are good? I missing a guide on how to interpret this relation. ...
  • 29. The Struggle within • Should I do it for my own language?  Approach #4: Similar categorization in Korean, incorporating a new label regarding acoustic cues for a head-final language 28 and REJECT! (for an NLP conference) Summary: The paper presents an approach for intention recognition in Korean, leveraging both text and acoustic information. Strengths: The approach is relevant for SLU or dialog systems, and addresses issues of recognition for head-final languages. It also considers acoustic information specifically in those cases where acoustic cues are the best discriminators. Weaknesses: The paper would benefit from specific comparison between the proposed, somewhat complex architecture and other possible alternative models to justify the system. It was unclear whether the approach was limited to head-final languages, and Korean in particular, which would yield a narrow result, and if so,why.
  • 30. The Struggle within • There were encouraging words though! 29
  • 31. The Struggle within • And some unexpected invitations from linguistic venues/journals 30
  • 32. And to speech again • Conference to be presented: ICPhS  International Congress of Phonetic Science • 8/5-9, Melbourne, Australia 31 (although I couldn’t attend...)
  • 33. And to speech again • Prosody-ambiguous statements?  Problems in: Wh-intervention? • Needs disambiguation 32 몇 개 가져오래 Should I bring some? How many should I bring? They told you to bring some?
  • 34. And to speech again • Prosody-ambiguous statements? How about constructing a corpus that contains ONLY the utterances whose syntactic ambiguity can be resolved by introducing prosody? # Wh- particles 누구 (nwukwu, who), 뭐 (mwe, what), 어디 (eti, where), 언제 (encey, when), 어떻게 (ettehkey, how), 몇 (myech, how much) 왜 (way, why) was not utilized because it is not used as an existential quantifier # Predicates Depend on the wh- particle being adopted Chosen among 5,800 frequently used lexicons Pronouns and polarity items were added in some cases # Reportive particles Added to form an evidential mood Induces rhetoricalness for some questions # Sentence enders (SEs) SEs with an unfixed role (underspecified SE) e.g. -래 (ray), -어 (e), -지 (ci) SEs with a fixed role e.g. -까 (kka: interrogative) # Politeness suffix Attached at the end of a sentence to assign politeness Restricts rhetoricalness under some circumstances 33
  • 35. And to speech again • Prosody-ambiguous statements?  Created 1,292 sentences  Constructed 3,552 utterances (with speech intention, under consensus) 34
  • 36. And to speech again • Prosody-ambiguous statements?  Recorded 7,104 utterances (female/male) 35
  • 37. And to speech again • Prosody-ambiguous statements? 36 AdvisorPresenter Editing Wrote paper Checked dataset & Recording Co-author / Equal contribution Checked dataset Proofreading
  • 38. And to speech again • Prosody-ambiguous statements? 37
  • 39. And to speech again • Prosody-ambiguous statements?  And in future? • Disambiguation inspired by neuro-scientific phenomenon 38
  • 40. Interdisciplinary • Cowork with the friends of similar interest 39
  • 41. Interdisciplinary • AI ethics? (ACL workshop topic)  Measuring gender bias in machine translation  Originally claimed to deal with the proposal of government project ... but 40
  • 42. Done and afterward • Done  억양 의존성 및 rhetoricalness를 고려한, 음성인식 output 분석에 적합한 일 반언어학적 speech act 분류 방법론 제시  한국어를 위한 annotation guideline 정립, corpus 구성 및 모델 학습  한국어 Speech intention의 disambiguation을 위한 corpus 구성  질문/요구 paraphrasing 위한 parallel corpus 제작 (under progress) • Afterward?  Speech disambiguation을 위한 co-attention framework 개량  대화체/비정형 질문/요구의 structured paraphrasing  Task-oriented와 non-oriented 간 code switching이 자유로운 dialog manager 시스템의 개발 41
  • 43. Done and afterward • Where do I head now? 42 Image: Top https://phdcomics.com/comics/archive_print.php?comicid=1733 Bottom: https://slideplayer.com/slide/15366786/
  • 44. Reference • Searle, John R. A classification of illocutionary acts. Language in society 5.1 (1976): 1-23. • Stolcke, Andreas, et al. Dialogue act modeling for automatic tagging and recognition of conversational speech. Computational linguistics 26.3 (2000): 339-373. • Levinson, Stephen C. Presumptive meanings: The theory of generalized conversational implicature. MIT press (2000). • Cho, Won Ik, Hyeon Seung Lee, Ji Won Yoon, Seok Min Kim, and Nam Soo Kim. Speech intention understanding in a head-final language: A disambiguation utilizing intonation- dependency. arXiv preprint arXiv:1811.04231 (2018). • Cho, Won Ik, and Nam Soo Kim. Discourse component-based Korean speech act categorization to resolve the vagueness in understanding text intention: A computational linguistics perspective. Discourse and Cognition 26.3 (2019): 227-247. [Korean] • Cho, Won Ik*, Jeonghwa Cho*, Jeemin Kang, and Nam Soo Kim. Prosody-semantics interface in Seoul Korean: Corpus for a disambiguation of wh- intervention. Proc. ICPhS (2019): 3902-3906. 43

Editor's Notes

  1. .