SlideShare a Scribd company logo
1 of 25
Analyzing Human and Machine
Performance In Resolving
Ambiguous Spoken Sentences
Hussein Ghaly1 and Michael Mandel2
1 Graduate Center, City University of New York, 2Brooklyn College, City
University of New York
Motivation
When an ambiguous sentence is spoken, what information does speech have which text alone doesn’t?
Our goal is to examine this information by analyzing human disambiguation of both text and speech for
different types of ambiguities, and developing a model for automatic disambiguation using this
information
Research Summary
- Record sentences containing some ambiguity, with the speaker aware of the correct
interpretation
- Subjects hear or read sentences, predict the correct interpretation
- Analyze acoustic features of each utterance, including multiple recordings of the same sentence
- Develop a Machine Learning approach to predict the intended reading given the acoustic features
Types of Ambiguity
- Lexical Ambiguity (I forgot my bag at the bank)
- Syntactic Ambiguity (old men and women)
- Comma Ambiguity
- PP-attachment
- NP-ambiguity
- Coordination ambiguity … etc
Comma Ambiguity
A woman without her man is nothing.
A woman: without her, man is nothing.
Comma Ambiguity
- Without punctuation (e.g. out of ASR) text can be
ambiguous
- Can the written text be disambiguated by humans?
- Can the spoken sentence be disambiguated by
humans?
PP-Attachment
I saw [the boy with the telescope]
I saw the boy [with the telescope]
me the boy
PP-Attachment
This sentence has two possible interpretations, i.e., a structural ambiguity
me the boy me the boy
PP-Attachment - Early vs. Late Closure
me the boy me the boy
Late Closure Early Closure
late closure is the principle that
new words (or "incoming lexical
items") tend to be associated with
the phrase or clause currently
being processed rather than with
structures farther back in the
sentence. *
*https://www.thoughtco.com/late-closure-sentence-processing-
1691101
Hypotheses
- When there is ambiguity in any sentence and the speaker is aware of the correct reading, they will
convey their knowledge of the correct reading using certain prosodic cues.
- Listeners will be able to use these cues to identify the correct reading better than readers will
- These prosodic cues can be measured and analyzed and used as features for automatic
disambiguation system using machine learning
Previous Research
Psychology - Snedeker and Trueswell, 2003 - Using prosody to avoid ambiguity: Effects of speaker
awareness and referential context.
“informative prosodic cues depend upon speaker's knowledge of the situation: speakers provide prosodic
cues when needed; listeners use these prosodic cues when present.”
Prosodic cues include pauses and word durations, as shown from the utterances of a speaker who is
aware of the intended meaning.
“tap [the frog with the flower]” - modifier
“tap the frog [with the flower]” - instrument
Previous Research
NLP - Levi et al, 2012 - The effect of pitch, intensity and pause duration in punctuation detection
Predicting punctuation from different prosodic cues of speech using neural networks
Cues included: pitch, intensity and pause duration
Achieved a punctuation detection rate of 54%
Data
- We created a collection of 26 constructed sentences (6 pairs of sentences with comma ambiguity
and 7 pairs of sentences with PP-attachment ambiguity)
- We recorded the sentences spoken by a native speaker, each sentence recorded five times (total
130 recording files)
Comma Ambiguity - Speaker Tasks
Record 6 pairs of constructed Comma-ambiguous sentences
Example:
3a: John, said Mary, was the nicest person at the party.
3b: John said Mary was the nicest person at the party.
Comma Ambiguity - Listener Tasks
For each Comma-ambiguous sentence, identify the intended meaning:
Task 1 - Using Text only
Task 2 - Using Audio Only
Example:
Sentence: John, said Mary, was the nicest person at the party.
Question: Who was said to be the nicest person at the party? A- John B- Mary
PP-Attachment Ambiguity - Speaker Tasks
Record 7 pairs of sentences with PP-attachment ambiguity, each pair contains a different preceding
context supporting one reading of the sentence
Example:
4a: One of the boys got a telescope. I saw the boy with the telescope.
4b:- I have a new telescope. I saw the boy with the telescope.
PP-Attachment Ambiguity - Listener Tasks
For the following settings, identify the correct meaning by answering a question. For the last setting,
sentences recordings were trimmed from the previous context.
Who has the telescope? A- The boy B- The speaker
Setting Presentation
Text with context I have a new telescope. I saw the boy with the telescope.
Audio with context
Text without context I saw the boy with the telescope.
Audio without context
Results - Human Evaluation
Ambiguity Modality Accuracy
Comma Text 99.3%
Comma Audio 94.7%
PP-attachment with context Text 93.1%
PP-attachment with context Audio 97.1%
PP-attachment without context Text 52.0%
PP-attachment without context Audio 74.4%
Results - PP-Attachment - Acoustic Analysis
acoustic feature values averaged over the 20 productions of the following sentences
They discussed the mistakes in the second meeting.
The lawyer contested the proceedings in the third hearing.
Late Early
Preposition Duration (ms) 147 143
Preceding silent pauses (ms) 0 48
Intensity (dB) 57.8 56.4
Following NP duration (ms) 579 640
Preceding Silent Pause Following NPPreposition
Acoustic Analysis - Early vs. Late Closure
Early Closure
Late Closure
Results - PP-Attachment - Machine Evaluation
Feature Matrix - Extracted manually from 10 audio files for the sentence “They discussed the mistakes in
the second meeting”.
duration of
preposition
(ms)
preceding
silence
(ms)
following
NP
duration
(ms)
Preposition
Intensity
(dB)
Closure
Type
160 0 690 56.6 early
175 0 660 59.0 late
120 0 470 56.2 late
140 80 620 55.6 early
145 0 600 58.7 late
140 90 635 57.8 early
135 0 510 61.1 late
150 110 600 57.9 early
130 0 620 61.0 late
140 60 580 58.8 early
Machine Evaluation
Using Decision Trees for 20 data points with 5-fold cross-validation: 80% average accuracy in predicting
early vs. late closure
All sentences were using in training and testing each fold
Conclusions
- Humans can disambiguate sentences with comma ambiguity with audio alone almost as well as
with text containing punctuation
- Humans can disambiguate spoken sentences with PP-attachment ambiguity without context, but
cannot disambiguate the same sentences as text
- When speakers are aware of the intended meaning, they can produce sentences in a way that can
- Be disambiguated by listeners, even without context
- Be identified through certain acoustic cues
- Be disambiguated to some extent by machines, initial results are promising
Thank you!
Questions?

More Related Content

What's hot

Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
Yasir Khan
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
Mariana Soffer
 

What's hot (20)

DataFest 2017. Introduction to Natural Language Processing by Rudolf Eremyan
DataFest 2017. Introduction to Natural Language Processing by Rudolf EremyanDataFest 2017. Introduction to Natural Language Processing by Rudolf Eremyan
DataFest 2017. Introduction to Natural Language Processing by Rudolf Eremyan
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Natural Language Processing
Natural Language Processing Natural Language Processing
Natural Language Processing
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Natural Language Processing with Python
Natural Language Processing with PythonNatural Language Processing with Python
Natural Language Processing with Python
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Natural Language Processing and Machine Learning
Natural Language Processing and Machine LearningNatural Language Processing and Machine Learning
Natural Language Processing and Machine Learning
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Nlp
NlpNlp
Nlp
 
Introduction to Natural Language Processing (NLP)
Introduction to Natural Language Processing (NLP)Introduction to Natural Language Processing (NLP)
Introduction to Natural Language Processing (NLP)
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)
 
Natural language processing (Python)
Natural language processing (Python)Natural language processing (Python)
Natural language processing (Python)
 
A Panorama of Natural Language Processing
A Panorama of Natural Language ProcessingA Panorama of Natural Language Processing
A Panorama of Natural Language Processing
 
NLTK - Natural Language Processing in Python
NLTK - Natural Language Processing in PythonNLTK - Natural Language Processing in Python
NLTK - Natural Language Processing in Python
 
Natural Language processing Parts of speech tagging, its classes, and how to ...
Natural Language processing Parts of speech tagging, its classes, and how to ...Natural Language processing Parts of speech tagging, its classes, and how to ...
Natural Language processing Parts of speech tagging, its classes, and how to ...
 
The Role of Natural Language Processing in Information Retrieval
The Role of Natural Language Processing in Information RetrievalThe Role of Natural Language Processing in Information Retrieval
The Role of Natural Language Processing in Information Retrieval
 

Similar to Analyzing Human and Machine Performance In Resolving Ambiguous Spoken Sentences

SPEECH PERCEPTION THEORIES MASLP
SPEECH PERCEPTION THEORIES MASLPSPEECH PERCEPTION THEORIES MASLP
SPEECH PERCEPTION THEORIES MASLP
HimaniBansal15
 
Research Proposal.edited
Research Proposal.editedResearch Proposal.edited
Research Proposal.edited
Kendrik Jones
 
A Neural Probabilistic Language Model.pptx
A Neural Probabilistic Language Model.pptxA Neural Probabilistic Language Model.pptx
A Neural Probabilistic Language Model.pptx
Rama Irsheidat
 
Silent sound interface
Silent sound interfaceSilent sound interface
Silent sound interface
Jeevitha Reddy
 
naturallanguageprocessing1-131229185458-phpapp01.pdf
naturallanguageprocessing1-131229185458-phpapp01.pdfnaturallanguageprocessing1-131229185458-phpapp01.pdf
naturallanguageprocessing1-131229185458-phpapp01.pdf
109MukeshSingh
 

Similar to Analyzing Human and Machine Performance In Resolving Ambiguous Spoken Sentences (20)

Theories of speech perception.pptx
Theories of speech perception.pptxTheories of speech perception.pptx
Theories of speech perception.pptx
 
SPEECH PERCEPTION THEORIES MASLP
SPEECH PERCEPTION THEORIES MASLPSPEECH PERCEPTION THEORIES MASLP
SPEECH PERCEPTION THEORIES MASLP
 
Brain signal seminar
Brain signal seminar Brain signal seminar
Brain signal seminar
 
Isolated English Word Recognition System: Appropriate for Bengali-accented En...
Isolated English Word Recognition System: Appropriate for Bengali-accented En...Isolated English Word Recognition System: Appropriate for Bengali-accented En...
Isolated English Word Recognition System: Appropriate for Bengali-accented En...
 
The Neurophysiology of Speech
The Neurophysiology of SpeechThe Neurophysiology of Speech
The Neurophysiology of Speech
 
natural-language-processing.ppt
natural-language-processing.pptnatural-language-processing.ppt
natural-language-processing.ppt
 
WORD RECOGNITION MASLP
WORD RECOGNITION MASLPWORD RECOGNITION MASLP
WORD RECOGNITION MASLP
 
NLP introduced and in 47 slides Lecture 1.ppt
NLP introduced and in 47 slides Lecture 1.pptNLP introduced and in 47 slides Lecture 1.ppt
NLP introduced and in 47 slides Lecture 1.ppt
 
NLP
NLPNLP
NLP
 
Animal Voice Morphing System
Animal Voice Morphing SystemAnimal Voice Morphing System
Animal Voice Morphing System
 
Research Proposal.edited
Research Proposal.editedResearch Proposal.edited
Research Proposal.edited
 
Talking at the Speed of Neurons
Talking at the Speed of NeuronsTalking at the Speed of Neurons
Talking at the Speed of Neurons
 
Assign
AssignAssign
Assign
 
COMPUTATIONAL APPROACHES TO THE SYNTAX-PROSODY INTERFACE: USING PROSODY TO IM...
COMPUTATIONAL APPROACHES TO THE SYNTAX-PROSODY INTERFACE: USING PROSODY TO IM...COMPUTATIONAL APPROACHES TO THE SYNTAX-PROSODY INTERFACE: USING PROSODY TO IM...
COMPUTATIONAL APPROACHES TO THE SYNTAX-PROSODY INTERFACE: USING PROSODY TO IM...
 
Theories of Speech Perception
Theories of Speech PerceptionTheories of Speech Perception
Theories of Speech Perception
 
A Neural Probabilistic Language Model.pptx
A Neural Probabilistic Language Model.pptxA Neural Probabilistic Language Model.pptx
A Neural Probabilistic Language Model.pptx
 
Silent sound interface
Silent sound interfaceSilent sound interface
Silent sound interface
 
Comprehension and Language (handout version)
Comprehension and Language (handout version)Comprehension and Language (handout version)
Comprehension and Language (handout version)
 
naturallanguageprocessing1-131229185458-phpapp01.pdf
naturallanguageprocessing1-131229185458-phpapp01.pdfnaturallanguageprocessing1-131229185458-phpapp01.pdf
naturallanguageprocessing1-131229185458-phpapp01.pdf
 
TASK-OPTIMIZED DEEP NEURAL NETWORK TO REPLICATE THE HUMAN AUDITORY CORTEX
TASK-OPTIMIZED DEEP NEURAL NETWORK TO REPLICATE THE HUMAN AUDITORY CORTEXTASK-OPTIMIZED DEEP NEURAL NETWORK TO REPLICATE THE HUMAN AUDITORY CORTEX
TASK-OPTIMIZED DEEP NEURAL NETWORK TO REPLICATE THE HUMAN AUDITORY CORTEX
 

Recently uploaded

The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
KarakKing
 

Recently uploaded (20)

80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 

Analyzing Human and Machine Performance In Resolving Ambiguous Spoken Sentences

  • 1. Analyzing Human and Machine Performance In Resolving Ambiguous Spoken Sentences Hussein Ghaly1 and Michael Mandel2 1 Graduate Center, City University of New York, 2Brooklyn College, City University of New York
  • 2.
  • 3. Motivation When an ambiguous sentence is spoken, what information does speech have which text alone doesn’t? Our goal is to examine this information by analyzing human disambiguation of both text and speech for different types of ambiguities, and developing a model for automatic disambiguation using this information
  • 4. Research Summary - Record sentences containing some ambiguity, with the speaker aware of the correct interpretation - Subjects hear or read sentences, predict the correct interpretation - Analyze acoustic features of each utterance, including multiple recordings of the same sentence - Develop a Machine Learning approach to predict the intended reading given the acoustic features
  • 5. Types of Ambiguity - Lexical Ambiguity (I forgot my bag at the bank) - Syntactic Ambiguity (old men and women) - Comma Ambiguity - PP-attachment - NP-ambiguity - Coordination ambiguity … etc
  • 6. Comma Ambiguity A woman without her man is nothing. A woman: without her, man is nothing.
  • 7. Comma Ambiguity - Without punctuation (e.g. out of ASR) text can be ambiguous - Can the written text be disambiguated by humans? - Can the spoken sentence be disambiguated by humans?
  • 8. PP-Attachment I saw [the boy with the telescope] I saw the boy [with the telescope] me the boy
  • 9. PP-Attachment This sentence has two possible interpretations, i.e., a structural ambiguity me the boy me the boy
  • 10. PP-Attachment - Early vs. Late Closure me the boy me the boy Late Closure Early Closure late closure is the principle that new words (or "incoming lexical items") tend to be associated with the phrase or clause currently being processed rather than with structures farther back in the sentence. * *https://www.thoughtco.com/late-closure-sentence-processing- 1691101
  • 11. Hypotheses - When there is ambiguity in any sentence and the speaker is aware of the correct reading, they will convey their knowledge of the correct reading using certain prosodic cues. - Listeners will be able to use these cues to identify the correct reading better than readers will - These prosodic cues can be measured and analyzed and used as features for automatic disambiguation system using machine learning
  • 12. Previous Research Psychology - Snedeker and Trueswell, 2003 - Using prosody to avoid ambiguity: Effects of speaker awareness and referential context. “informative prosodic cues depend upon speaker's knowledge of the situation: speakers provide prosodic cues when needed; listeners use these prosodic cues when present.” Prosodic cues include pauses and word durations, as shown from the utterances of a speaker who is aware of the intended meaning. “tap [the frog with the flower]” - modifier “tap the frog [with the flower]” - instrument
  • 13. Previous Research NLP - Levi et al, 2012 - The effect of pitch, intensity and pause duration in punctuation detection Predicting punctuation from different prosodic cues of speech using neural networks Cues included: pitch, intensity and pause duration Achieved a punctuation detection rate of 54%
  • 14. Data - We created a collection of 26 constructed sentences (6 pairs of sentences with comma ambiguity and 7 pairs of sentences with PP-attachment ambiguity) - We recorded the sentences spoken by a native speaker, each sentence recorded five times (total 130 recording files)
  • 15. Comma Ambiguity - Speaker Tasks Record 6 pairs of constructed Comma-ambiguous sentences Example: 3a: John, said Mary, was the nicest person at the party. 3b: John said Mary was the nicest person at the party.
  • 16. Comma Ambiguity - Listener Tasks For each Comma-ambiguous sentence, identify the intended meaning: Task 1 - Using Text only Task 2 - Using Audio Only Example: Sentence: John, said Mary, was the nicest person at the party. Question: Who was said to be the nicest person at the party? A- John B- Mary
  • 17. PP-Attachment Ambiguity - Speaker Tasks Record 7 pairs of sentences with PP-attachment ambiguity, each pair contains a different preceding context supporting one reading of the sentence Example: 4a: One of the boys got a telescope. I saw the boy with the telescope. 4b:- I have a new telescope. I saw the boy with the telescope.
  • 18. PP-Attachment Ambiguity - Listener Tasks For the following settings, identify the correct meaning by answering a question. For the last setting, sentences recordings were trimmed from the previous context. Who has the telescope? A- The boy B- The speaker Setting Presentation Text with context I have a new telescope. I saw the boy with the telescope. Audio with context Text without context I saw the boy with the telescope. Audio without context
  • 19. Results - Human Evaluation Ambiguity Modality Accuracy Comma Text 99.3% Comma Audio 94.7% PP-attachment with context Text 93.1% PP-attachment with context Audio 97.1% PP-attachment without context Text 52.0% PP-attachment without context Audio 74.4%
  • 20. Results - PP-Attachment - Acoustic Analysis acoustic feature values averaged over the 20 productions of the following sentences They discussed the mistakes in the second meeting. The lawyer contested the proceedings in the third hearing. Late Early Preposition Duration (ms) 147 143 Preceding silent pauses (ms) 0 48 Intensity (dB) 57.8 56.4 Following NP duration (ms) 579 640 Preceding Silent Pause Following NPPreposition
  • 21. Acoustic Analysis - Early vs. Late Closure Early Closure Late Closure
  • 22. Results - PP-Attachment - Machine Evaluation Feature Matrix - Extracted manually from 10 audio files for the sentence “They discussed the mistakes in the second meeting”. duration of preposition (ms) preceding silence (ms) following NP duration (ms) Preposition Intensity (dB) Closure Type 160 0 690 56.6 early 175 0 660 59.0 late 120 0 470 56.2 late 140 80 620 55.6 early 145 0 600 58.7 late 140 90 635 57.8 early 135 0 510 61.1 late 150 110 600 57.9 early 130 0 620 61.0 late 140 60 580 58.8 early
  • 23. Machine Evaluation Using Decision Trees for 20 data points with 5-fold cross-validation: 80% average accuracy in predicting early vs. late closure All sentences were using in training and testing each fold
  • 24. Conclusions - Humans can disambiguate sentences with comma ambiguity with audio alone almost as well as with text containing punctuation - Humans can disambiguate spoken sentences with PP-attachment ambiguity without context, but cannot disambiguate the same sentences as text - When speakers are aware of the intended meaning, they can produce sentences in a way that can - Be disambiguated by listeners, even without context - Be identified through certain acoustic cues - Be disambiguated to some extent by machines, initial results are promising

Editor's Notes

  1. Is it ok to start with a joke?
  2. General Motivation slide before the research summary Why we’re doing this
  3. <div>Icons made by <a href="http://www.freepik.com" title="Freepik">Freepik</a> from <a href="https://www.flaticon.com/" title="Flaticon">www.flaticon.com</a> is licensed by <a href="http://creativecommons.org/licenses/by/3.0/" title="Creative Commons BY 3.0" target="_blank">CC 3.0 BY</a></div> <div>Icons made by <a href="http://www.freepik.com" title="Freepik">Freepik</a> from <a href="https://www.flaticon.com/" title="Flaticon">www.flaticon.com</a> is licensed by <a href="http://creativecommons.org/licenses/by/3.0/" title="Creative Commons BY 3.0" target="_blank">CC 3.0 BY</a></div>
  4. <div>Icons made by <a href="https://www.flaticon.com/authors/scott-de-jonge" title="Scott de Jonge">Scott de Jonge</a> from <a href="https://www.flaticon.com/" title="Flaticon">www.flaticon.com</a> is licensed by <a href="http://creativecommons.org/licenses/by/3.0/" title="Creative Commons BY 3.0" target="_blank">CC 3.0 BY</a></div> <div>Icons made by <a href="http://www.freepik.com" title="Freepik">Freepik</a> from <a href="https://www.flaticon.com/" title="Flaticon">www.flaticon.com</a> is licensed by <a href="http://creativecommons.org/licenses/by/3.0/" title="Creative Commons BY 3.0" target="_blank">CC 3.0 BY</a></div> <div>Icons made by <a href="http://www.freepik.com" title="Freepik">Freepik</a> from <a href="https://www.flaticon.com/" title="Flaticon">www.flaticon.com</a> is licensed by <a href="http://creativecommons.org/licenses/by/3.0/" title="Creative Commons BY 3.0" target="_blank">CC 3.0 BY</a></div> <div>Icons made by <a href="http://www.freepik.com" title="Freepik">Freepik</a> from <a href="https://www.flaticon.com/" title="Flaticon">www.flaticon.com</a> is licensed by <a href="http://creativecommons.org/licenses/by/3.0/" title="Creative Commons BY 3.0" target="_blank">CC 3.0 BY</a></div>
  5. https://www.thoughtco.com/late-closure-sentence-processing-1691101
  6. Show the table, one bullet at a time first 4 2nd one from last - chance Last - above chance
  7. Include parse trees - and audio wave form of both early and late closure - play the file - sort by closure type