Building Widely-Interpretable Semantic Networks for Dialogue Contexts


Emory NLP Weekly - Date: 9/16/2020 - Presenter: Lydia Feng (BS student) https://www.linkedin.com/events/6711760000370515968/


Building Widely-Interpretable
Semantic Networks (WISeN)
for Dialogue Contexts
Emory NLP Weekly
09/16/2020
Lydia Feng

TABLE OF CONTENTS
01 DIALOGUE SPEECH
What it is, how it’s different, and why it matters
02 AVAILABLE DATASETS
Switchboard Corpus, ParlAI, TopicalChat, etc.
03 CHALLENGES
How current NER and AMR schemes are not equipped for dialogue
04 SOLUTIONS
Possible remedies and a timeline for future work

DIALOGUE SPEECH
WHAT IS IT?
- Spontaneous, naturally-occurring human-to-human conversation
HOW IS IT DIFFERENT?
- Imperfect syntax, interruptions, repetition, run-ons, conversational pragmatics
- Structure is ad-hoc, adaptive
WHY DOES IT MATTER?
- NLP tasks have focused largely on well-written text
- Integral to understanding / generating more human-like representations of language

AVAILABLE DIALOGUE DATASETS
SWITCHBOARD
- 1992
- 2,400 total dialogues
- phone conversations
- assigned topics
- U.S.
TOPICAL CHAT
- 10,000+ total dialogues
- instant messaging (mturk)
- based loosely on provided information
- 8 topics
PERSONA CHAT
- 10,000+ total dialogues
- act as a given persona
- 1,000+ personas
- instant messaging (mturk)
CALL HOME
- 1997
- 120 dialogues
- 5-10 minute conversations
- phone conversations
- family and close friends
- U.S.

SWITCHBOARD DATA FOR NLP TASKS
● 227 dialogues from Switchboard
● 10+ utterances
● General topics / widely shared experiences
WEBANNO
● Web-based annotation tool
● Curation functionality

NER tagging in dialogue
● 4 annotators
● doubly-annotated
     A      B
C  79.87  75.78
D  77.94  87.64
     A      B
C  73.86  71.12
D  73.01  83.79
Labeled F1 Score
Unlabeled F1 Score
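A minimal sketch of how labeled and unlabeled F1 between two annotators could be computed, assuming exact span matching. The slides do not state the matching criterion, and the span format and toy annotations below are hypothetical, not taken from the Switchboard data. Labeled F1 requires both the span boundaries and the entity type to agree; unlabeled F1 only compares the boundaries.

```python
from typing import Iterable, Set, Tuple

# Hypothetical span format: (start_token, end_token, entity_label)
Span = Tuple[int, int, str]


def f1(reference: Set, other: Set) -> float:
    """Harmonic mean of precision and recall over exactly matching items."""
    if not reference and not other:
        return 1.0
    true_positives = len(reference & other)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(other)
    recall = true_positives / len(reference)
    return 2 * precision * recall / (precision + recall)


def labeled_f1(a: Iterable[Span], b: Iterable[Span]) -> float:
    """Spans count as a match only if boundaries AND labels agree."""
    return f1(set(a), set(b))


def unlabeled_f1(a: Iterable[Span], b: Iterable[Span]) -> float:
    """Spans count as a match if the boundaries agree; labels are ignored."""
    return f1({(s, e) for s, e, _ in a}, {(s, e) for s, e, _ in b})


# Toy annotations from two hypothetical annotators of the same dialogue.
annotator_a = [(2, 3, "DATE"), (7, 9, "QUANTITY"), (12, 13, "PERSON")]
annotator_c = [(2, 3, "TIME"), (7, 9, "QUANTITY"), (15, 16, "GPE")]

print(f"labeled F1:   {100 * labeled_f1(annotator_a, annotator_c):.2f}")
print(f"unlabeled F1: {100 * unlabeled_f1(annotator_a, annotator_c):.2f}")
```

With exact matching, F1 is symmetric, so it does not matter which annotator is treated as the reference.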

CHALLENGES IN NER TAGGING
Stuttering / Repetition
Colloquial Speech
1. Back in [eighty-seven]...
2. It hit [one hundred two] today.
Date / Time Subjectivity
1. In [two days] I will be home.
2. I will be home for [two days].
3. The first [two days] I was home...
Time references vs duration vs repetition
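The colloquial-speech examples can be made concrete with a small sketch: it pulls the bracketed spans out of an utterance and normalizes spelled-out numbers to integers. The helper names (`bracketed_spans`, `words_to_int`) and the bracket convention as an input format are assumptions for illustration, not part of the annotation scheme in the slides.

```python
import re
from typing import List

UNITS = {
    "zero": 0, "one": 1, "two": 2, "three": 3, "four": 4, "five": 5,
    "six": 6, "seven": 7, "eight": 8, "nine": 9, "ten": 10,
    "eleven": 11, "twelve": 12, "thirteen": 13, "fourteen": 14,
    "fifteen": 15, "sixteen": 16, "seventeen": 17, "eighteen": 18,
    "nineteen": 19,
}
TENS = {
    "twenty": 20, "thirty": 30, "forty": 40, "fifty": 50,
    "sixty": 60, "seventy": 70, "eighty": 80, "ninety": 90,
}


def bracketed_spans(utterance: str) -> List[str]:
    """Return the text of every [bracketed] span in the utterance."""
    return re.findall(r"\[([^\]]+)\]", utterance)


def words_to_int(phrase: str) -> int:
    """Convert a spelled-out number such as 'one hundred two' to an int."""
    total, current = 0, 0
    for word in phrase.lower().replace("-", " ").split():
        if word == "and":
            continue
        if word in UNITS:
            current += UNITS[word]
        elif word in TENS:
            current += TENS[word]
        elif word == "hundred":
            current = max(current, 1) * 100
        elif word == "thousand":
            total += max(current, 1) * 1000
            current = 0
        else:
            raise ValueError(f"not a number word: {word!r}")
    return total + current


for utterance in ["Back in [eighty-seven]...", "It hit [one hundred two] today."]:
    for span in bracketed_spans(utterance):
        print(span, "->", words_to_int(span))
```

Normalizing the value is the easy part. The number alone does not say whether "eighty-seven" is a year or "one hundred two" is a temperature, which is why these spans are hard to tag consistently.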

AMR vs. DDR
Deep Dependency Representation
● Syntax-based
● Every word accounted for
● Unintuitive
● Requires specific linguistic knowledge
Abstract Meaning Representation
● Semantics-based
● Rough meaning of a sentence
● No one-to-one correspondence to an English sentence

CHALLENGES IN AMR
1. IDIOMS
So I think the tests themselves are not really that cut and dried , you know
(s / straightforward
   :polarity -
   :ARG1 (t / test))
2. EXTERNAL ARGUMENTS
-- I think it 's a , I think it 's essential .
(b / be
   :ARG0 (i / it)
   :ARG1 (e / essential))
3. SEMANTICALLY SPARSE
it 's something like that .
(b / be
   :ARG0 (i / it)
   :ARG1 (s / something))
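To make the notation concrete, here is a minimal sketch that reads an AMR written in PENMAN-style notation, such as the idiom example on this slide, into (source, role, target) triples. It is an illustrative hand-written parser that covers only the constructs shown here (nested nodes, roles, and constants like `-`), not a full AMR reader, and `parse_amr` is a hypothetical name.

```python
import re
from typing import List, Tuple

Triple = Tuple[str, str, str]
# Tokens are parentheses, the "/" separator, or any run of other characters.
TOKEN = re.compile(r"[()/]|[^\s()/]+")


def parse_amr(text: str) -> List[Triple]:
    """Parse '(var / concept :role target ...)' into a list of triples."""
    tokens = TOKEN.findall(text)
    triples: List[Triple] = []
    pos = 0

    def next_token() -> str:
        nonlocal pos
        if pos >= len(tokens):
            raise ValueError("unexpected end of input (unbalanced parentheses?)")
        tok = tokens[pos]
        pos += 1
        return tok

    def expect(symbol: str) -> None:
        tok = next_token()
        if tok != symbol:
            raise ValueError(f"expected {symbol!r}, found {tok!r}")

    def atom() -> str:
        tok = next_token()
        if tok in ("(", ")", "/"):
            raise ValueError(f"expected a name, found {tok!r}")
        return tok

    def parse_node() -> str:
        # node := "(" variable "/" concept (role (node | constant))* ")"
        expect("(")
        var = atom()
        expect("/")
        triples.append((var, ":instance", atom()))
        while pos < len(tokens) and tokens[pos] != ")":
            role = atom()
            if not role.startswith(":"):
                raise ValueError(f"expected a role, found {role!r}")
            nested = pos < len(tokens) and tokens[pos] == "("
            target = parse_node() if nested else atom()
            triples.append((var, role, target))
        expect(")")
        return var

    parse_node()
    if pos != len(tokens):
        raise ValueError("trailing tokens after the top-level node")
    return triples


idiom = "(s / straightforward :polarity - :ARG1 (t / test))"
for triple in parse_amr(idiom):
    print(triple)
```

The triples show what the graph keeps and what it drops: "cut and dried" survives only as the single concept `straightforward` with negative polarity, and the hedges "So I think" and "you know" do not appear at all.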

CHALLENGES IN AMR
4. INTERRUPTIONS
you know , that they were going to do all them first .
The executives ?
Uh-huh .
Would n't that be awful if you were --
Which I thought was interesting .
-- if you were using , and , and --
yeah .
-- oh , lose your job and everything .
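One way to picture the interruption problem is to stitch the cut-off fragments back together before annotating. The sketch below assumes that a trailing `--` marks an utterance that was cut off and a leading `--` marks its resumption, as the transcript above appears to do. The slide gives no speaker labels, so the sketch ignores speakers and assumes only one utterance is open at a time; `stitch` is a hypothetical helper.

```python
from typing import List, Optional

MARK = "--"


def stitch(lines: List[str]) -> List[str]:
    """Merge fragments split by '--' markers into single utterances."""
    utterances: List[str] = []
    open_idx: Optional[int] = None  # utterance still waiting to be resumed
    for raw in lines:
        text = raw.strip()
        continues = text.startswith(MARK)
        unfinished = text.endswith(MARK)
        if continues:
            text = text[len(MARK):]
        if unfinished:
            text = text[: len(text) - len(MARK)]
        text = text.strip()

        if continues and open_idx is not None:
            # Resume the cut-off utterance in place.
            utterances[open_idx] = f"{utterances[open_idx]} {text}".strip()
            idx = open_idx
        else:
            utterances.append(text)
            idx = len(utterances) - 1

        if unfinished:
            open_idx = idx
        elif idx == open_idx:
            open_idx = None
    return utterances


transcript = [
    "you know , that they were going to do all them first .",
    "The executives ?",
    "Uh-huh .",
    "Would n't that be awful if you were --",
    "Which I thought was interesting .",
    "-- if you were using , and , and --",
    "yeah .",
    "-- oh , lose your job and everything .",
]

for utterance in stitch(transcript):
    print(utterance)
```

Stitching restores one long utterance out of three fragments, but the result still contains a restart ("if you were if you were using") and the interrupting turns lose their position inside it, so the representation question remains.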

TIMELINE
fall semester
SEPTEMBER
Finish draft of AMR guidelines based on NLP tasks
OCTOBER 1-15
50 dialogues doubly annotated, revise guidelines
OCTOBER 15-31
Complete 227 annotations
NOVEMBER
Coreference resolution
DECEMBER
Prolog

TAKEAWAYS
1. Human language is very different from well-written news article text
2. Current representations just don’t make sense for dialogue speech

