SlideShare a Scribd company logo
End-to-End Task-Completion
Neural Dialogue Systems
Xiujun Li Yun-Nung (Vivian) Chen Lihong Li Jianfeng Gao Asli Celikyilmaz
The 8th International Joint Conference on Natural Language Processing
https://github.com/MiuLab/TC-Bot
1
Dialogue Interactions
“I want to talk” Chit-chat
“I have a question” Information consumption
“I need to get this done” Task completion
Task-oriented
Non task-oriented
2
Dialogue Interactions
“I want to talk” Chit-chat
“I have a question” Information consumption
“I need to get this done” Task completion
• What is the paper review schedule?
• Which room is the dialogue tutorial in?
• When is the IJCNLP 2017 conference?
• What does NLP stand for?
3
Dialogue Interactions
“I want to talk” Chit-chat
“I have a question” Information consumption
“I need to get this done” Task completion
• Book me the flight from Seattle to Taipei
• Reserve a table at Din Tai Fung for 5 people, 7PM tonight
• Purchase two movie tickets about Thor 3 at 9PM tonight
4
Goal: navigating users to complete tasks (transactions, API calls, etc.)
Challenges: including multiple information consumption
Task-Completion Dialogues
• Crowdsourced data
– Domain: Movie
– Dialogue act: intent & slot-value pairs
• Problem setting
– User wants to achieve a specific task, i.e. buying a movie ticket
– Agent tries to understand user’s goal via dialogue before booking the ticket
Turn Speaker Intent & Slots Utterance
1 User greeting(greeting=hi) Hi
1 User
request(moviename; city=Houma;
state=Louisiana; date=this week; genre=foreign)
Are there any good movies showing around
Houma Louisiana this week?
2 Agent request(date)
What date would you like me to look for a
reservation
3 User inform(date=9th) How about the 9th
4 Agent inform(result={}; genre=foreign)
Unfortunately here are no foreign movies
playing at this time
5
Rule-Based Agent
• Gather all information via a
series of requests (form-filling)
• Issues
1) No notion about what requests
can be skipped
User goal – Two tickets for “Deadpool” tomorrow
9PM at AMC Pacific Place 11 theater, Seattle.
User: Do you have any ticket for movie ‘Deadpool’ at 9:00 pm?
Agent: Which theater do you want?
User: AMC pacific place 11 theater, please!
Agent: How many tickets do you want?
User: Two!
Agent: What date?
User: Tomorrow!
Agent: Which city do you want?
User: Seattle!
Agent: Do you have any specification for zip code?
User: No, I do not care.
Agent: Do you have any distance constraint?
User: No, I do not care.
Agent: How about video format? 2d? Imax 3d?
User: Any kind is okay.
Agent: Do you have any constraint to price?
User: No!
Agent: Okay, I will issue 2 tickets for you, tomorrow 9:00 pm at
AMC pacific place 11 theater, Seattle, movie ‘Deadpool’.
User: Okay, thanks.
Agent: Thanks!
Wasteful! (very likely
the user does not care)
6
Rule-Based Agent
• Gather all information via a
series of requests (form-filling)
• Issues
1) No notion about what requests
can be skipped
2) No principle way to deal with
errors/uncertainty in language
understanding
User goal – Two tickets for “Hail Caesar” tomorrow
7:15PM at AMC Lowes Oak tree 6, Seattle.
User: Do you any ticket for movie ‘Hail Caesar’ at theater
AMC Lowes Oak Tree 6?
Agent: What start time do you prefer?
User: 7:15 pm Tomorrow
Agent: Which city you are looking for?
User: Seattle
Agent: How many tickets do you want?
User: Two
Agent: Okay, I have issued two tickets for movie ‘Hail Caeser’
tomorrow 7:50 pm at theater AMC Lowes Oak Tree 6, Seattle.
User: Oh, no.
Agent: Thanks!
Task failed!
(cannot correct the errors from proceeding component)
7
Rule-Based Agent
• Gather all information via a
series of requests (form-filling)
• Issues
1) No notion about what requests
can be skipped
2) No principle way to deal with
errors/uncertainty in language
understanding
3) Do not know how to
recommend options if the
user’s goal is not achievable
8
Task-Oriented Dialogue System Framework
Language Understanding
(LU)
Natural Language
Generation (NLG)
Semantic Frame
request_ticket
(moviename=Star Wars;
numberofpeople=5)
System Action/ Policy
request(theater)
Text Response
“Which theater do you prefer?”
Text Input
“Book 5 tickets for movie Star Wars”
Knowledge
Database
Dialogue State Tracking
(DST)
Dialogue Policy
Learning
Dialogue Management (DM)
9
State Representation
Task-Oriented Dialogue System Framework
Language Understanding
(LU)
Natural Language
Generation (NLG)
Dialogue Act
Natural Language
Natural Language
Knowledge
Database
Dialogue State Tracking
(DST)
Dialogue Policy
Learning
Dialogue Management (DM)
Dialogue Act
10
State Representation
User
System
User Simulation in Frame-Level Semantics
User
Dialogue Act
Knowledge
Database
Dialogue State Tracking
(DST)
Dialogue Policy
Learning
Dialogue Management (DM)
Error Model Controller
• Recognition error
• LU error
User Model
User Simulation
System
Dialogue Act
User
Dialogue Act
11
State Representation
User Simulation in Natural Language Level
Natural Language
Generation (NLG)
User Model
User Simulation
System
Dialogue Act
Language
Understanding
(LU)
NL
12
Knowledge
Database
Dialogue State Tracking
(DST)
Dialogue Policy
Learning
Dialogue Management (DM)
State RepresentationUser
Dialogue Act
Task-Completion Neural Dialogue Systems
• Language Understanding
– Joint semantic frame parsing by BLSTM (Hakkani-Tur et al., 2016)
• Dialogue State Tracking
– Available results returned by the formed symbolic query
– Latest user dialogue action
• Dialogue Policy Learning
– Reinforcement learning policy (Mnih et al., 2015)
• Natural Language Generation
– Template-based
– Model-based: semantically-conditioned LSTM generation (SC-LSTM)
(Wen et al., 2015)
13
Supervised
Supervised
Supervised
Reinforcement
+ Reinforcement (fine-tuning)
+ Reinforcement (fine-tuning)
+ Reinforcement (fine-tuning)
wi
<slot>
wi+1
O
EOS
<intent>
wi
<slot>
wi+1
O
EOS
<intent>
End-to-End Neural Dialogue Systems
• LU, DST (neural dialogue system), and NLG (user simulation)
are trained in supervised way
• End-to-end training for dialogue policy learning
Knowledge
Database
Neural
Dialogue
System
User Model
User Simulation
Dialogue
Policy
Natural
Language
w
0
w1 w2
NLG
EOS
User Goal
wi
<slot>
wi+1
O
EOS
<intent>
LU
𝑠𝑡
DST
𝑠1 𝑠2 𝑠 𝑛
𝑎1 𝑎2 𝑎 𝑘
……
…
Dialogue Policy Learning
14
Reinforcement Learning Agent
• Dialogue policy learning
– Deep Q-network: estimate Q-value given the state and action pair
• Reward
– Success: agent answers all the requested slots based on the user’s
constraints, and book the movie tickets within the max turns
– Failure: 1) agent finds no matching movie based on the user’s constraints
or 2) exceed the max turns
15
Experiments
• Rule-based agent
– Actions
• ask question (request)
• answer question (inform)
• give multiple_choice to user
• say “confirm_answer” to user
• say “closing”, “thanks”
– Agent asks the slot in a priority
order, but within a sliding
window (say, size = 2 or 3),
there is randomness.
• RL agent
– Model: Deep Q-Network
– Rewards
• Success: 2 × max_turn
• Fail: - max_turn
• -1 for per turn penalty
– Actions: 45 actions
• i.e. request(starttime), inform(moviename),
confirm(question), etc.
– State transition tuples ( 𝑠𝑡, 𝑎 𝑡, 𝑟𝑡, 𝑠𝑡+1)
– Experience replay (Schaul et al., 2015)
• Priority, dynamic pool
• Starts with a pool of rule-based tuples
16
• Frame-level semantics  Natural language
The RL agent is able to learn how to interact with users to complete tasks more
efficiently and effectively, and outperforms the rule-based agent.
End-to-End Reinforcement Learning17
Language Understanding Impact
• Different error rates in LU
18
Language understanding results are crucial in terms of dialogue-level performance
Intent Error Analysis
• Intent error type
– I0: random
– I1: within group
– I2: between group
• Intent error rate
– I3: 0.00
– I4: 0.10
– I5: 0.20
Intent errors slightly influence the RL system performance
Group 1: greeting(), thanks(), etc
Group 2: inform(xx)
Group 3: request(xx)
Between-group intent errors degrade the system performance more
request_moviename(actor=Robert Downey Jr)
request_year
19
Slot Error Analysis
• Slot error type
– I0: random
– I1: slot deletion
– I2: value substitution
– I3: slot substitution
• Slot error rate
– S4: 0.00
– S5: 0.10
– S6: 0.20
Slot errors significantly degrade the RL system performance
Value substitution has the largest impact on the system performance
request_moviename
(actor=Robert Downey Jr)
director Robert Downey Sr20
Error Comparison
• Intent error rate • Slot error rate
The RL agent has better robustness to intent errors for dialogue-level performance
Slot filling is more important than intent detection in language understanding
21
Human Evaluation
• Users presented with a predefined user goal
• User rating: 1 (worst) to 5 (best) based on both naturalness
and coherence of the dialogue
22
User goal – Two tickets for “Deadpool” tomorrow 9PM at AMC Pacific Place 11 theater, Seattle.
RULE BASED SYSTEM
User: Do you have any ticket for movie ‘Deadpool’ at 9:00 pm?
Agent: Which theater do you want?
User: AMC pacific place 11 theater, please!
Agent: How many tickets do you want?
User: Two!
Agent: What date?
User: Tomorrow!
Agent: Which city do you want?
User: Seattle!
Agent: Do you have any specification for zip code?
User: No, I do not care.
Agent: Do you have any distance constraint?
User: No, I do not care.
Agent: How about video format? 2d? Imax 3d?
User: Any kind is okay.
Agent: Do you have any constraint to price?
User: No!
Agent: Okay, I will issue 2 tickets for you, tomorrow 9:00 pm at AMC
pacific place 11 theater, Seattle, movie ‘Deadpool’.
User: Okay, thanks.
Agent: Thanks!
REINFORCEMENT LEARNING SYSTEM
User: I want two tickets for movie ‘Deadpool’.
Agent: What is the starttime you prefer?
User: 9:00 PM
Agent: Which day?
User: Tomorrow
Agent: Which city?
User: Seattle
Agent: Which theater do you prefer?
User: AMC Pacific Place 11 theater.
Agent: Okay, I will issue 2 tickets for you, tomorrow 9:00 pm
at AMC pacific place 11 theater, Seattle, movie ‘Deadpool’.
User: Okay, thanks.
Agent: Thanks!
Dialogue Examples
Skip the requests the user may not care about to improve efficiency
Issue 1: no notion about what
requests can be skipped
23
User goal – Two tickets for “Hail Caesar” tomorrow 7:15PM at AMC Lowes Oak tree 6, Seattle.
Dialogue Examples
Automatically learn when and what to confirm for better robustness
REINFORCEMENT LEARNING SYSTEM
Agent: What start time do you prefer?
User: 7:15 pm Tomorrow
Agent: Which city you are looking for?
User: Seattle
Agent: Okay, I will issue you tickets for movie ‘Hail Caeser’
tomorrow 7:50 pm at theater AMC Lowes Oak Tree 6, Seattle.
User: Oh, no.
Agent: It is 7:50 pm, right?
User: Two tickets for 7:15 pm, please.
Agent: Okay, I will issue you two tickets for movie ‘Hail Caeser’
tomorrow 7:15 pm at theater AMC Lowes Oak Tree 6, Seattle.
User: Thanks!
Agent: Thanks!
RULE BASED SYSTEM
User: Do you any ticket for movie ‘Hail Caesar’ at theater
AMC Lowes Oak Tree 6?
Agent: What start time do you prefer?
User: 7:15 pm Tomorrow
Agent: Which city you are looking for?
User: Seattle
Agent: How many tickets do you want?
User: Two
Agent: Okay, I have issued two tickets for movie ‘Hail
Caeser’ tomorrow 7:50 pm at theater AMC Lowes Oak
Tree 6, Seattle.
User: Oh, no.
Agent: Thanks!
Issue 2: no principle way to deal
with LU errors/uncertainty
24
Conclusions
• Our neural dialogue system demonstrates
– Robustness
• selecting actions based on uncertainty and confusion by RL
• systematic analysis about language understanding errors on dialogue
system performance
 slot-level errors affect system performance more than intent-level errors
 slot value replacement degrades the performance most
– Flexibility
• first neural dialogue system that allows user-initiated behaviors during
conversations
– Reproducibility
• demonstrate how to evaluate RL dialogue agents using crowdsourced
datasets and simulated users in an end-to-end fashion
• guaranteeing reproducibility and consistent comparisons of competing
methods in an identical setting
25
Thanks for Attention!
Q & A
https://github.com/MiuLab/TC-Bot

More Related Content

What's hot

Log System As Backbone – How We Built the World’s Most Advanced Vector Databa...
Log System As Backbone – How We Built the World’s Most Advanced Vector Databa...Log System As Backbone – How We Built the World’s Most Advanced Vector Databa...
Log System As Backbone – How We Built the World’s Most Advanced Vector Databa...
StreamNative
 
遠赤外線カメラと可視カメラを利用した悪条件下における画像取得
遠赤外線カメラと可視カメラを利用した悪条件下における画像取得遠赤外線カメラと可視カメラを利用した悪条件下における画像取得
遠赤外線カメラと可視カメラを利用した悪条件下における画像取得
Masayuki Tanaka
 
Whoops, The Numbers Are Wrong! Scaling Data Quality @ Netflix
Whoops, The Numbers Are Wrong! Scaling Data Quality @ NetflixWhoops, The Numbers Are Wrong! Scaling Data Quality @ Netflix
Whoops, The Numbers Are Wrong! Scaling Data Quality @ Netflix
DataWorks Summit
 
스타트업을 위한 Confluent 세미나
스타트업을 위한 Confluent 세미나스타트업을 위한 Confluent 세미나
스타트업을 위한 Confluent 세미나
confluent
 
Building a Real-Time Analytics Application with Apache Pulsar and Apache Pinot
Building a Real-Time Analytics Application with  Apache Pulsar and Apache PinotBuilding a Real-Time Analytics Application with  Apache Pulsar and Apache Pinot
Building a Real-Time Analytics Application with Apache Pulsar and Apache Pinot
Altinity Ltd
 
Visual Question Answering (VQA) - CVPR2018動向分析 (CVPR 2018 完全読破チャレンジ報告会)
Visual Question Answering (VQA) - CVPR2018動向分析 (CVPR 2018 完全読破チャレンジ報告会)Visual Question Answering (VQA) - CVPR2018動向分析 (CVPR 2018 完全読破チャレンジ報告会)
Visual Question Answering (VQA) - CVPR2018動向分析 (CVPR 2018 完全読破チャレンジ報告会)
cvpaper. challenge
 
Transformer in Computer Vision
Transformer in Computer VisionTransformer in Computer Vision
Transformer in Computer Vision
Dongmin Choi
 
Collaborative Filtering at Spotify
Collaborative Filtering at SpotifyCollaborative Filtering at Spotify
Collaborative Filtering at Spotify
Erik Bernhardsson
 
Instana - ClickHouse presentation
Instana - ClickHouse presentationInstana - ClickHouse presentation
Instana - ClickHouse presentation
Miel Donkers
 
Deformable DETR Review [CDM]
Deformable DETR Review [CDM]Deformable DETR Review [CDM]
Deformable DETR Review [CDM]
Dongmin Choi
 
Eyeriss Introduction
Eyeriss IntroductionEyeriss Introduction
Eyeriss Introduction
Michael Lee
 
Netflix Recommendations Using Spark + Cassandra (Prasanna Padmanabhan & Roopa...
Netflix Recommendations Using Spark + Cassandra (Prasanna Padmanabhan & Roopa...Netflix Recommendations Using Spark + Cassandra (Prasanna Padmanabhan & Roopa...
Netflix Recommendations Using Spark + Cassandra (Prasanna Padmanabhan & Roopa...
DataStax
 
Deep Learning: a birds eye view
Deep Learning: a birds eye viewDeep Learning: a birds eye view
Deep Learning: a birds eye view
Roelof Pieters
 
Recsys 2014 Tutorial - The Recommender Problem Revisited
Recsys 2014 Tutorial - The Recommender Problem RevisitedRecsys 2014 Tutorial - The Recommender Problem Revisited
Recsys 2014 Tutorial - The Recommender Problem Revisited
Xavier Amatriain
 
You Only Look Once: Unified, Real-Time Object Detection
You Only Look Once: Unified, Real-Time Object DetectionYou Only Look Once: Unified, Real-Time Object Detection
You Only Look Once: Unified, Real-Time Object Detection
DADAJONJURAKUZIEV
 
20190708 bumsookim yolact
20190708 bumsookim yolact20190708 bumsookim yolact
20190708 bumsookim yolact
Brian Kim
 
Deep learning実装の基礎と実践
Deep learning実装の基礎と実践Deep learning実装の基礎と実践
Deep learning実装の基礎と実践
Seiya Tokui
 
Deep Learning for Video: Action Recognition (UPC 2018)
Deep Learning for Video: Action Recognition (UPC 2018)Deep Learning for Video: Action Recognition (UPC 2018)
Deep Learning for Video: Action Recognition (UPC 2018)
Universitat Politècnica de Catalunya
 
AI技術の現状と課題 〜aiboおよび自然言語処理〜
AI技術の現状と課題 〜aiboおよび自然言語処理〜AI技術の現状と課題 〜aiboおよび自然言語処理〜
AI技術の現状と課題 〜aiboおよび自然言語処理〜
Naoki Fujiwara
 
Yurii Pashchenko: Zero-shot learning capabilities of CLIP model from OpenAI
Yurii Pashchenko: Zero-shot learning capabilities of CLIP model from OpenAIYurii Pashchenko: Zero-shot learning capabilities of CLIP model from OpenAI
Yurii Pashchenko: Zero-shot learning capabilities of CLIP model from OpenAI
Lviv Startup Club
 

What's hot (20)

Log System As Backbone – How We Built the World’s Most Advanced Vector Databa...
Log System As Backbone – How We Built the World’s Most Advanced Vector Databa...Log System As Backbone – How We Built the World’s Most Advanced Vector Databa...
Log System As Backbone – How We Built the World’s Most Advanced Vector Databa...
 
遠赤外線カメラと可視カメラを利用した悪条件下における画像取得
遠赤外線カメラと可視カメラを利用した悪条件下における画像取得遠赤外線カメラと可視カメラを利用した悪条件下における画像取得
遠赤外線カメラと可視カメラを利用した悪条件下における画像取得
 
Whoops, The Numbers Are Wrong! Scaling Data Quality @ Netflix
Whoops, The Numbers Are Wrong! Scaling Data Quality @ NetflixWhoops, The Numbers Are Wrong! Scaling Data Quality @ Netflix
Whoops, The Numbers Are Wrong! Scaling Data Quality @ Netflix
 
스타트업을 위한 Confluent 세미나
스타트업을 위한 Confluent 세미나스타트업을 위한 Confluent 세미나
스타트업을 위한 Confluent 세미나
 
Building a Real-Time Analytics Application with Apache Pulsar and Apache Pinot
Building a Real-Time Analytics Application with  Apache Pulsar and Apache PinotBuilding a Real-Time Analytics Application with  Apache Pulsar and Apache Pinot
Building a Real-Time Analytics Application with Apache Pulsar and Apache Pinot
 
Visual Question Answering (VQA) - CVPR2018動向分析 (CVPR 2018 完全読破チャレンジ報告会)
Visual Question Answering (VQA) - CVPR2018動向分析 (CVPR 2018 完全読破チャレンジ報告会)Visual Question Answering (VQA) - CVPR2018動向分析 (CVPR 2018 完全読破チャレンジ報告会)
Visual Question Answering (VQA) - CVPR2018動向分析 (CVPR 2018 完全読破チャレンジ報告会)
 
Transformer in Computer Vision
Transformer in Computer VisionTransformer in Computer Vision
Transformer in Computer Vision
 
Collaborative Filtering at Spotify
Collaborative Filtering at SpotifyCollaborative Filtering at Spotify
Collaborative Filtering at Spotify
 
Instana - ClickHouse presentation
Instana - ClickHouse presentationInstana - ClickHouse presentation
Instana - ClickHouse presentation
 
Deformable DETR Review [CDM]
Deformable DETR Review [CDM]Deformable DETR Review [CDM]
Deformable DETR Review [CDM]
 
Eyeriss Introduction
Eyeriss IntroductionEyeriss Introduction
Eyeriss Introduction
 
Netflix Recommendations Using Spark + Cassandra (Prasanna Padmanabhan & Roopa...
Netflix Recommendations Using Spark + Cassandra (Prasanna Padmanabhan & Roopa...Netflix Recommendations Using Spark + Cassandra (Prasanna Padmanabhan & Roopa...
Netflix Recommendations Using Spark + Cassandra (Prasanna Padmanabhan & Roopa...
 
Deep Learning: a birds eye view
Deep Learning: a birds eye viewDeep Learning: a birds eye view
Deep Learning: a birds eye view
 
Recsys 2014 Tutorial - The Recommender Problem Revisited
Recsys 2014 Tutorial - The Recommender Problem RevisitedRecsys 2014 Tutorial - The Recommender Problem Revisited
Recsys 2014 Tutorial - The Recommender Problem Revisited
 
You Only Look Once: Unified, Real-Time Object Detection
You Only Look Once: Unified, Real-Time Object DetectionYou Only Look Once: Unified, Real-Time Object Detection
You Only Look Once: Unified, Real-Time Object Detection
 
20190708 bumsookim yolact
20190708 bumsookim yolact20190708 bumsookim yolact
20190708 bumsookim yolact
 
Deep learning実装の基礎と実践
Deep learning実装の基礎と実践Deep learning実装の基礎と実践
Deep learning実装の基礎と実践
 
Deep Learning for Video: Action Recognition (UPC 2018)
Deep Learning for Video: Action Recognition (UPC 2018)Deep Learning for Video: Action Recognition (UPC 2018)
Deep Learning for Video: Action Recognition (UPC 2018)
 
AI技術の現状と課題 〜aiboおよび自然言語処理〜
AI技術の現状と課題 〜aiboおよび自然言語処理〜AI技術の現状と課題 〜aiboおよび自然言語処理〜
AI技術の現状と課題 〜aiboおよび自然言語処理〜
 
Yurii Pashchenko: Zero-shot learning capabilities of CLIP model from OpenAI
Yurii Pashchenko: Zero-shot learning capabilities of CLIP model from OpenAIYurii Pashchenko: Zero-shot learning capabilities of CLIP model from OpenAI
Yurii Pashchenko: Zero-shot learning capabilities of CLIP model from OpenAI
 

Similar to End-to-End Task-Completion Neural Dialogue Systems

Li Deng at AI Frontiers: Three Generations of Spoken Dialogue Systems (Bots)
Li Deng at AI Frontiers: Three Generations of Spoken Dialogue Systems (Bots)Li Deng at AI Frontiers: Three Generations of Spoken Dialogue Systems (Bots)
Li Deng at AI Frontiers: Three Generations of Spoken Dialogue Systems (Bots)
AI Frontiers
 
2017 Tutorial - Deep Learning for Dialogue Systems
2017 Tutorial - Deep Learning for Dialogue Systems2017 Tutorial - Deep Learning for Dialogue Systems
2017 Tutorial - Deep Learning for Dialogue Systems
MLReview
 
End-to-End Joint Learning of Natural Language Understanding and Dialogue Manager
End-to-End Joint Learning of Natural Language Understanding and Dialogue ManagerEnd-to-End Joint Learning of Natural Language Understanding and Dialogue Manager
End-to-End Joint Learning of Natural Language Understanding and Dialogue Manager
Yun-Nung (Vivian) Chen
 
Deep Learning for Dialogue Modeling - NTHU
Deep Learning for Dialogue Modeling - NTHUDeep Learning for Dialogue Modeling - NTHU
Deep Learning for Dialogue Modeling - NTHU
Yun-Nung (Vivian) Chen
 
Deep Learning for Dialogue Systems
Deep Learning for Dialogue SystemsDeep Learning for Dialogue Systems
Deep Learning for Dialogue Systems
Yun-Nung (Vivian) Chen
 
Chatbots
ChatbotsChatbots
SXSW 2015 Shredding Wireframes: Intro to Rapid Prototyping
SXSW 2015 Shredding Wireframes: Intro to Rapid PrototypingSXSW 2015 Shredding Wireframes: Intro to Rapid Prototyping
SXSW 2015 Shredding Wireframes: Intro to Rapid Prototyping
Kyle Outlaw
 
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Association for Computational Linguistics
 
Games for Analysis of Technologies in Human-Intensive Systems - Dr. Tim Dasey...
Games for Analysis of Technologies in Human-Intensive Systems - Dr. Tim Dasey...Games for Analysis of Technologies in Human-Intensive Systems - Dr. Tim Dasey...
Games for Analysis of Technologies in Human-Intensive Systems - Dr. Tim Dasey...
SeriousGamesAssoc
 
AAMAS-2006 TANDEM Design Method (poster format)
AAMAS-2006 TANDEM Design Method (poster format)AAMAS-2006 TANDEM Design Method (poster format)
AAMAS-2006 TANDEM Design Method (poster format)
Steve Goschnick
 
CarolinaCon Presentation on Streaming Analytics
CarolinaCon Presentation on Streaming AnalyticsCarolinaCon Presentation on Streaming Analytics
CarolinaCon Presentation on Streaming Analytics
John Eberhardt
 
Identifying MMORPG Bots: A Traffic Analysis Approach
Identifying MMORPG Bots: A Traffic Analysis ApproachIdentifying MMORPG Bots: A Traffic Analysis Approach
Identifying MMORPG Bots: A Traffic Analysis Approach
Academia Sinica
 
One Day for Bot 一天搞懂聊天機器人
One Day for Bot 一天搞懂聊天機器人One Day for Bot 一天搞懂聊天機器人
One Day for Bot 一天搞懂聊天機器人
Yun-Nung (Vivian) Chen
 
[系列活動] 一天搞懂對話機器人
[系列活動] 一天搞懂對話機器人[系列活動] 一天搞懂對話機器人
[系列活動] 一天搞懂對話機器人
台灣資料科學年會
 
CrowdQ: Crowdsourced Query Understanding
CrowdQ: Crowdsourced Query UnderstandingCrowdQ: Crowdsourced Query Understanding
CrowdQ: Crowdsourced Query Understanding
eXascale Infolab
 
402 w2
402 w2402 w2
UX STRAT Europe 2019: Zhaochang He, VMware
UX STRAT Europe 2019: Zhaochang He, VMwareUX STRAT Europe 2019: Zhaochang He, VMware
UX STRAT Europe 2019: Zhaochang He, VMware
UX STRAT
 
Towards End-to-End Reinforcement Learning of Dialogue Agents for Information ...
Towards End-to-End Reinforcement Learning of Dialogue Agents for Information ...Towards End-to-End Reinforcement Learning of Dialogue Agents for Information ...
Towards End-to-End Reinforcement Learning of Dialogue Agents for Information ...
Yun-Nung (Vivian) Chen
 
Chatbots 101
Chatbots 101Chatbots 101
Chatbots 101
Venu Vasudevan
 
Analyzing behavioral data for improving search experience
Analyzing behavioral data for improving search experienceAnalyzing behavioral data for improving search experience
Analyzing behavioral data for improving search experiencePavel Serdyukov
 

Similar to End-to-End Task-Completion Neural Dialogue Systems (20)

Li Deng at AI Frontiers: Three Generations of Spoken Dialogue Systems (Bots)
Li Deng at AI Frontiers: Three Generations of Spoken Dialogue Systems (Bots)Li Deng at AI Frontiers: Three Generations of Spoken Dialogue Systems (Bots)
Li Deng at AI Frontiers: Three Generations of Spoken Dialogue Systems (Bots)
 
2017 Tutorial - Deep Learning for Dialogue Systems
2017 Tutorial - Deep Learning for Dialogue Systems2017 Tutorial - Deep Learning for Dialogue Systems
2017 Tutorial - Deep Learning for Dialogue Systems
 
End-to-End Joint Learning of Natural Language Understanding and Dialogue Manager
End-to-End Joint Learning of Natural Language Understanding and Dialogue ManagerEnd-to-End Joint Learning of Natural Language Understanding and Dialogue Manager
End-to-End Joint Learning of Natural Language Understanding and Dialogue Manager
 
Deep Learning for Dialogue Modeling - NTHU
Deep Learning for Dialogue Modeling - NTHUDeep Learning for Dialogue Modeling - NTHU
Deep Learning for Dialogue Modeling - NTHU
 
Deep Learning for Dialogue Systems
Deep Learning for Dialogue SystemsDeep Learning for Dialogue Systems
Deep Learning for Dialogue Systems
 
Chatbots
ChatbotsChatbots
Chatbots
 
SXSW 2015 Shredding Wireframes: Intro to Rapid Prototyping
SXSW 2015 Shredding Wireframes: Intro to Rapid PrototypingSXSW 2015 Shredding Wireframes: Intro to Rapid Prototyping
SXSW 2015 Shredding Wireframes: Intro to Rapid Prototyping
 
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
 
Games for Analysis of Technologies in Human-Intensive Systems - Dr. Tim Dasey...
Games for Analysis of Technologies in Human-Intensive Systems - Dr. Tim Dasey...Games for Analysis of Technologies in Human-Intensive Systems - Dr. Tim Dasey...
Games for Analysis of Technologies in Human-Intensive Systems - Dr. Tim Dasey...
 
AAMAS-2006 TANDEM Design Method (poster format)
AAMAS-2006 TANDEM Design Method (poster format)AAMAS-2006 TANDEM Design Method (poster format)
AAMAS-2006 TANDEM Design Method (poster format)
 
CarolinaCon Presentation on Streaming Analytics
CarolinaCon Presentation on Streaming AnalyticsCarolinaCon Presentation on Streaming Analytics
CarolinaCon Presentation on Streaming Analytics
 
Identifying MMORPG Bots: A Traffic Analysis Approach
Identifying MMORPG Bots: A Traffic Analysis ApproachIdentifying MMORPG Bots: A Traffic Analysis Approach
Identifying MMORPG Bots: A Traffic Analysis Approach
 
One Day for Bot 一天搞懂聊天機器人
One Day for Bot 一天搞懂聊天機器人One Day for Bot 一天搞懂聊天機器人
One Day for Bot 一天搞懂聊天機器人
 
[系列活動] 一天搞懂對話機器人
[系列活動] 一天搞懂對話機器人[系列活動] 一天搞懂對話機器人
[系列活動] 一天搞懂對話機器人
 
CrowdQ: Crowdsourced Query Understanding
CrowdQ: Crowdsourced Query UnderstandingCrowdQ: Crowdsourced Query Understanding
CrowdQ: Crowdsourced Query Understanding
 
402 w2
402 w2402 w2
402 w2
 
UX STRAT Europe 2019: Zhaochang He, VMware
UX STRAT Europe 2019: Zhaochang He, VMwareUX STRAT Europe 2019: Zhaochang He, VMware
UX STRAT Europe 2019: Zhaochang He, VMware
 
Towards End-to-End Reinforcement Learning of Dialogue Agents for Information ...
Towards End-to-End Reinforcement Learning of Dialogue Agents for Information ...Towards End-to-End Reinforcement Learning of Dialogue Agents for Information ...
Towards End-to-End Reinforcement Learning of Dialogue Agents for Information ...
 
Chatbots 101
Chatbots 101Chatbots 101
Chatbots 101
 
Analyzing behavioral data for improving search experience
Analyzing behavioral data for improving search experienceAnalyzing behavioral data for improving search experience
Analyzing behavioral data for improving search experience
 

More from Yun-Nung (Vivian) Chen

How the Context Matters Language and Interaction in Dialogues
How the Context Matters Language and Interaction in DialoguesHow the Context Matters Language and Interaction in Dialogues
How the Context Matters Language and Interaction in Dialogues
Yun-Nung (Vivian) Chen
 
Chatbot的智慧與靈魂
Chatbot的智慧與靈魂Chatbot的智慧與靈魂
Chatbot的智慧與靈魂
Yun-Nung (Vivian) Chen
 
Language Empowering Intelligent Assistants (CHT)
Language Empowering Intelligent Assistants (CHT)Language Empowering Intelligent Assistants (CHT)
Language Empowering Intelligent Assistants (CHT)
Yun-Nung (Vivian) Chen
 
End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Lan...
End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Lan...End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Lan...
End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Lan...
Yun-Nung (Vivian) Chen
 
Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken...
Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken...Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken...
Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken...
Yun-Nung (Vivian) Chen
 
Automatic Key Term Extraction from Spoken Course Lectures
Automatic Key Term Extraction from Spoken Course LecturesAutomatic Key Term Extraction from Spoken Course Lectures
Automatic Key Term Extraction from Spoken Course Lectures
Yun-Nung (Vivian) Chen
 
Detecting Actionable Items in Meetings by Convolutional Deep Structured Seman...
Detecting Actionable Items in Meetings by Convolutional Deep Structured Seman...Detecting Actionable Items in Meetings by Convolutional Deep Structured Seman...
Detecting Actionable Items in Meetings by Convolutional Deep Structured Seman...
Yun-Nung (Vivian) Chen
 
Automatic Key Term Extraction and Summarization from Spoken Course Lectures
Automatic Key Term Extraction and Summarization from Spoken Course LecturesAutomatic Key Term Extraction and Summarization from Spoken Course Lectures
Automatic Key Term Extraction and Summarization from Spoken Course Lectures
Yun-Nung (Vivian) Chen
 
Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken...
Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken...Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken...
Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken...
Yun-Nung (Vivian) Chen
 
An Intelligent Assistant for High-Level Task Understanding
An Intelligent Assistant for High-Level Task UnderstandingAn Intelligent Assistant for High-Level Task Understanding
An Intelligent Assistant for High-Level Task Understanding
Yun-Nung (Vivian) Chen
 
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
Yun-Nung (Vivian) Chen
 
"Sorry, I didn't get that!" - Statistical Learning from Dialogues for Intelli...
"Sorry, I didn't get that!" - Statistical Learning from Dialogues for Intelli..."Sorry, I didn't get that!" - Statistical Learning from Dialogues for Intelli...
"Sorry, I didn't get that!" - Statistical Learning from Dialogues for Intelli...
Yun-Nung (Vivian) Chen
 
Statistical Learning from Dialogues for Intelligent Assistants
Statistical Learning from Dialogues for Intelligent AssistantsStatistical Learning from Dialogues for Intelligent Assistants
Statistical Learning from Dialogues for Intelligent Assistants
Yun-Nung (Vivian) Chen
 
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
Yun-Nung (Vivian) Chen
 

More from Yun-Nung (Vivian) Chen (14)

How the Context Matters Language and Interaction in Dialogues
How the Context Matters Language and Interaction in DialoguesHow the Context Matters Language and Interaction in Dialogues
How the Context Matters Language and Interaction in Dialogues
 
Chatbot的智慧與靈魂
Chatbot的智慧與靈魂Chatbot的智慧與靈魂
Chatbot的智慧與靈魂
 
Language Empowering Intelligent Assistants (CHT)
Language Empowering Intelligent Assistants (CHT)Language Empowering Intelligent Assistants (CHT)
Language Empowering Intelligent Assistants (CHT)
 
End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Lan...
End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Lan...End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Lan...
End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Lan...
 
Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken...
Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken...Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken...
Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken...
 
Automatic Key Term Extraction from Spoken Course Lectures
Automatic Key Term Extraction from Spoken Course LecturesAutomatic Key Term Extraction from Spoken Course Lectures
Automatic Key Term Extraction from Spoken Course Lectures
 
Detecting Actionable Items in Meetings by Convolutional Deep Structured Seman...
Detecting Actionable Items in Meetings by Convolutional Deep Structured Seman...Detecting Actionable Items in Meetings by Convolutional Deep Structured Seman...
Detecting Actionable Items in Meetings by Convolutional Deep Structured Seman...
 
Automatic Key Term Extraction and Summarization from Spoken Course Lectures
Automatic Key Term Extraction and Summarization from Spoken Course LecturesAutomatic Key Term Extraction and Summarization from Spoken Course Lectures
Automatic Key Term Extraction and Summarization from Spoken Course Lectures
 
Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken...
Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken...Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken...
Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken...
 
An Intelligent Assistant for High-Level Task Understanding
An Intelligent Assistant for High-Level Task UnderstandingAn Intelligent Assistant for High-Level Task Understanding
An Intelligent Assistant for High-Level Task Understanding
 
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
 
"Sorry, I didn't get that!" - Statistical Learning from Dialogues for Intelli...
"Sorry, I didn't get that!" - Statistical Learning from Dialogues for Intelli..."Sorry, I didn't get that!" - Statistical Learning from Dialogues for Intelli...
"Sorry, I didn't get that!" - Statistical Learning from Dialogues for Intelli...
 
Statistical Learning from Dialogues for Intelligent Assistants
Statistical Learning from Dialogues for Intelligent AssistantsStatistical Learning from Dialogues for Intelligent Assistants
Statistical Learning from Dialogues for Intelligent Assistants
 
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
 

Recently uploaded

Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Ramesh Iyer
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
Elena Simperl
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
Frank van Harmelen
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Product School
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
Alison B. Lowndes
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 

Recently uploaded (20)

Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 

End-to-End Task-Completion Neural Dialogue Systems

  • 1. End-to-End Task-Completion Neural Dialogue Systems Xiujun Li Yun-Nung (Vivian) Chen Lihong Li Jianfeng Gao Asli Celikyilmaz The 8th International Joint Conference on Natural Language Processing https://github.com/MiuLab/TC-Bot 1
  • 2. Dialogue Interactions “I want to talk” Chit-chat “I have a question” Information consumption “I need to get this done” Task completion Task-oriented Non task-oriented 2
  • 3. Dialogue Interactions “I want to talk” Chit-chat “I have a question” Information consumption “I need to get this done” Task completion • What is the paper review schedule? • Which room is the dialogue tutorial in? • When is the IJCNLP 2017 conference? • What does NLP stand for? 3
  • 4. Dialogue Interactions “I want to talk” Chit-chat “I have a question” Information consumption “I need to get this done” Task completion • Book me the flight from Seattle to Taipei • Reserve a table at Din Tai Fung for 5 people, 7PM tonight • Purchase two movie tickets about Thor 3 at 9PM tonight 4 Goal: navigating users to complete tasks (transactions, API calls, etc.) Challenges: including multiple information consumption
  • 5. Task-Completion Dialogues • Crowdsourced data – Domain: Movie – Dialogue act: intent & slot-value pairs • Problem setting – User wants to achieve a specific task, i.e. buying a movie ticket – Agent tries to understand user’s goal via dialogue before booking the ticket Turn Speaker Intent & Slots Utterance 1 User greeting(greeting=hi) Hi 1 User request(moviename; city=Houma; state=Louisiana; date=this week; genre=foreign) Are there any good movies showing around Houma Louisiana this week? 2 Agent request(date) What date would you like me to look for a reservation 3 User inform(date=9th) How about the 9th 4 Agent inform(result={}; genre=foreign) Unfortunately here are no foreign movies playing at this time 5
  • 6. Rule-Based Agent • Gather all information via a series of requests (form-filling) • Issues 1) No notion about what requests can be skipped User goal – Two tickets for “Deadpool” tomorrow 9PM at AMC Pacific Place 11 theater, Seattle. User: Do you have any ticket for movie ‘Deadpool’ at 9:00 pm? Agent: Which theater do you want? User: AMC pacific place 11 theater, please! Agent: How many tickets do you want? User: Two! Agent: What date? User: Tomorrow! Agent: Which city do you want? User: Seattle! Agent: Do you have any specification for zip code? User: No, I do not care. Agent: Do you have any distance constraint? User: No, I do not care. Agent: How about video format? 2d? Imax 3d? User: Any kind is okay. Agent: Do you have any constraint to price? User: No! Agent: Okay, I will issue 2 tickets for you, tomorrow 9:00 pm at AMC pacific place 11 theater, Seattle, movie ‘Deadpool’. User: Okay, thanks. Agent: Thanks! Wasteful! (very likely the user does not care) 6
  • 7. Rule-Based Agent • Gather all information via a series of requests (form-filling) • Issues 1) No notion about what requests can be skipped 2) No principle way to deal with errors/uncertainty in language understanding User goal – Two tickets for “Hail Caesar” tomorrow 7:15PM at AMC Lowes Oak tree 6, Seattle. User: Do you any ticket for movie ‘Hail Caesar’ at theater AMC Lowes Oak Tree 6? Agent: What start time do you prefer? User: 7:15 pm Tomorrow Agent: Which city you are looking for? User: Seattle Agent: How many tickets do you want? User: Two Agent: Okay, I have issued two tickets for movie ‘Hail Caeser’ tomorrow 7:50 pm at theater AMC Lowes Oak Tree 6, Seattle. User: Oh, no. Agent: Thanks! Task failed! (cannot correct the errors from proceeding component) 7
  • 8. Rule-Based Agent • Gather all information via a series of requests (form-filling) • Issues 1) No notion about what requests can be skipped 2) No principle way to deal with errors/uncertainty in language understanding 3) Do not know how to recommend options if the user’s goal is not achievable 8
  • 9. Task-Oriented Dialogue System Framework Language Understanding (LU) Natural Language Generation (NLG) Semantic Frame request_ticket (moviename=Star Wars; numberofpeople=5) System Action/ Policy request(theater) Text Response “Which theater do you prefer?” Text Input “Book 5 tickets for movie Star Wars” Knowledge Database Dialogue State Tracking (DST) Dialogue Policy Learning Dialogue Management (DM) 9 State Representation
  • 10. Task-Oriented Dialogue System Framework Language Understanding (LU) Natural Language Generation (NLG) Dialogue Act Natural Language Natural Language Knowledge Database Dialogue State Tracking (DST) Dialogue Policy Learning Dialogue Management (DM) Dialogue Act 10 State Representation User System
  • 11. User Simulation in Frame-Level Semantics User Dialogue Act Knowledge Database Dialogue State Tracking (DST) Dialogue Policy Learning Dialogue Management (DM) Error Model Controller • Recognition error • LU error User Model User Simulation System Dialogue Act User Dialogue Act 11 State Representation
  • 12. User Simulation in Natural Language Level Natural Language Generation (NLG) User Model User Simulation System Dialogue Act Language Understanding (LU) NL 12 Knowledge Database Dialogue State Tracking (DST) Dialogue Policy Learning Dialogue Management (DM) State RepresentationUser Dialogue Act
  • 13. Task-Completion Neural Dialogue Systems • Language Understanding – Joint semantic frame parsing by BLSTM (Hakkani-Tur et al., 2016) • Dialogue State Tracking – Available results returned by the formed symbolic query – Latest user dialogue action • Dialogue Policy Learning – Reinforcement learning policy (Mnih et al., 2015) • Natural Language Generation – Template-based – Model-based: semantically-conditioned LSTM generation (SC-LSTM) (Wen et al., 2015) 13 Supervised Supervised Supervised Reinforcement + Reinforcement (fine-tuning) + Reinforcement (fine-tuning) + Reinforcement (fine-tuning)
  • 14. wi <slot> wi+1 O EOS <intent> wi <slot> wi+1 O EOS <intent> End-to-End Neural Dialogue Systems • LU, DST (neural dialogue system), and NLG (user simulation) are trained in supervised way • End-to-end training for dialogue policy learning Knowledge Database Neural Dialogue System User Model User Simulation Dialogue Policy Natural Language w 0 w1 w2 NLG EOS User Goal wi <slot> wi+1 O EOS <intent> LU 𝑠𝑡 DST 𝑠1 𝑠2 𝑠 𝑛 𝑎1 𝑎2 𝑎 𝑘 …… … Dialogue Policy Learning 14
  • 15. Reinforcement Learning Agent • Dialogue policy learning – Deep Q-network: estimate Q-value given the state and action pair • Reward – Success: agent answers all the requested slots based on the user’s constraints, and book the movie tickets within the max turns – Failure: 1) agent finds no matching movie based on the user’s constraints or 2) exceed the max turns 15
  • 16. Experiments • Rule-based agent – Actions • ask question (request) • answer question (inform) • give multiple_choice to user • say “confirm_answer” to user • say “closing”, “thanks” – Agent asks the slot in a priority order, but within a sliding window (say, size = 2 or 3), there is randomness. • RL agent – Model: Deep Q-Network – Rewards • Success: 2 × max_turn • Fail: - max_turn • -1 for per turn penalty – Actions: 45 actions • i.e. request(starttime), inform(moviename), confirm(question), etc. – State transition tuples ( 𝑠𝑡, 𝑎 𝑡, 𝑟𝑡, 𝑠𝑡+1) – Experience replay (Schaul et al., 2015) • Priority, dynamic pool • Starts with a pool of rule-based tuples 16
  • 17. • Frame-level semantics  Natural language The RL agent is able to learn how to interact with users to complete tasks more efficiently and effectively, and outperforms the rule-based agent. End-to-End Reinforcement Learning17
  • 18. Language Understanding Impact • Different error rates in LU 18 Language understanding results are crucial in terms of dialogue-level performance
  • 19. Intent Error Analysis • Intent error type – I0: random – I1: within group – I2: between group • Intent error rate – I3: 0.00 – I4: 0.10 – I5: 0.20 Intent errors slightly influence the RL system performance Group 1: greeting(), thanks(), etc Group 2: inform(xx) Group 3: request(xx) Between-group intent errors degrade the system performance more request_moviename(actor=Robert Downey Jr) request_year 19
  • 20. Slot Error Analysis • Slot error type – I0: random – I1: slot deletion – I2: value substitution – I3: slot substitution • Slot error rate – S4: 0.00 – S5: 0.10 – S6: 0.20 Slot errors significantly degrade the RL system performance Value substitution has the largest impact on the system performance request_moviename (actor=Robert Downey Jr) director Robert Downey Sr20
  • 21. Error Comparison • Intent error rate • Slot error rate The RL agent has better robustness to intent errors for dialogue-level performance Slot filling is more important than intent detection in language understanding 21
  • 22. Human Evaluation • Users presented with a predefined user goal • User rating: 1 (worst) to 5 (best) based on both naturalness and coherence of the dialogue 22
  • 23. User goal – Two tickets for “Deadpool” tomorrow 9PM at AMC Pacific Place 11 theater, Seattle. RULE BASED SYSTEM User: Do you have any ticket for movie ‘Deadpool’ at 9:00 pm? Agent: Which theater do you want? User: AMC pacific place 11 theater, please! Agent: How many tickets do you want? User: Two! Agent: What date? User: Tomorrow! Agent: Which city do you want? User: Seattle! Agent: Do you have any specification for zip code? User: No, I do not care. Agent: Do you have any distance constraint? User: No, I do not care. Agent: How about video format? 2d? Imax 3d? User: Any kind is okay. Agent: Do you have any constraint to price? User: No! Agent: Okay, I will issue 2 tickets for you, tomorrow 9:00 pm at AMC pacific place 11 theater, Seattle, movie ‘Deadpool’. User: Okay, thanks. Agent: Thanks! REINFORCEMENT LEARNING SYSTEM User: I want two tickets for movie ‘Deadpool’. Agent: What is the starttime you prefer? User: 9:00 PM Agent: Which day? User: Tomorrow Agent: Which city? User: Seattle Agent: Which theater do you prefer? User: AMC Pacific Place 11 theater. Agent: Okay, I will issue 2 tickets for you, tomorrow 9:00 pm at AMC pacific place 11 theater, Seattle, movie ‘Deadpool’. User: Okay, thanks. Agent: Thanks! Dialogue Examples Skip the requests the user may not care about to improve efficiency Issue 1: no notion about what requests can be skipped 23
  • 24. User goal – Two tickets for “Hail Caesar” tomorrow 7:15PM at AMC Lowes Oak tree 6, Seattle. Dialogue Examples Automatically learn when and what to confirm for better robustness REINFORCEMENT LEARNING SYSTEM Agent: What start time do you prefer? User: 7:15 pm Tomorrow Agent: Which city you are looking for? User: Seattle Agent: Okay, I will issue you tickets for movie ‘Hail Caeser’ tomorrow 7:50 pm at theater AMC Lowes Oak Tree 6, Seattle. User: Oh, no. Agent: It is 7:50 pm, right? User: Two tickets for 7:15 pm, please. Agent: Okay, I will issue you two tickets for movie ‘Hail Caeser’ tomorrow 7:15 pm at theater AMC Lowes Oak Tree 6, Seattle. User: Thanks! Agent: Thanks! RULE BASED SYSTEM User: Do you any ticket for movie ‘Hail Caesar’ at theater AMC Lowes Oak Tree 6? Agent: What start time do you prefer? User: 7:15 pm Tomorrow Agent: Which city you are looking for? User: Seattle Agent: How many tickets do you want? User: Two Agent: Okay, I have issued two tickets for movie ‘Hail Caeser’ tomorrow 7:50 pm at theater AMC Lowes Oak Tree 6, Seattle. User: Oh, no. Agent: Thanks! Issue 2: no principle way to deal with LU errors/uncertainty 24
  • 25. Conclusions • Our neural dialogue system demonstrates – Robustness • selecting actions based on uncertainty and confusion by RL • systematic analysis about language understanding errors on dialogue system performance  slot-level errors affect system performance more than intent-level errors  slot value replacement degrades the performance most – Flexibility • first neural dialogue system that allows user-initiated behaviors during conversations – Reproducibility • demonstrate how to evaluate RL dialogue agents using crowdsourced datasets and simulated users in an end-to-end fashion • guaranteeing reproducibility and consistent comparisons of competing methods in an identical setting 25
  • 26. Thanks for Attention! Q & A https://github.com/MiuLab/TC-Bot

Editor's Notes

  1. General goal, of course involves treating natural language as a knowledge representation language. So MR involves deriving structured information from free text, and then doing something with it. But the work that’s been done in this space represents a tangle of different agendas. Let’s unpack them a bit. Where does it make sense for MS to focus its efforts in this space? For 1, doesn’t drive mainstream AI research. Hobby strand, though maybe good for PR purposes. Not a good driver of research/product. For 2 & 3 the user already has some idea of what the information need is. For 4, the user is overwhelmed by the complexity of the problem/solution space, needs help navigating.
  2. General goal, of course involves treating natural language as a knowledge representation language. So MR involves deriving structured information from free text, and then doing something with it. But the work that’s been done in this space represents a tangle of different agendas. Let’s unpack them a bit. Where does it make sense for MS to focus its efforts in this space? For 1, doesn’t drive mainstream AI research. Hobby strand, though maybe good for PR purposes. Not a good driver of research/product. For 2 & 3 the user already has some idea of what the information need is. For 4, the user is overwhelmed by the complexity of the problem/solution space, needs help navigating.
  3. General goal, of course involves treating natural language as a knowledge representation language. So MR involves deriving structured information from free text, and then doing something with it. But the work that’s been done in this space represents a tangle of different agendas. Let’s unpack them a bit. Where does it make sense for MS to focus its efforts in this space? For 1, doesn’t drive mainstream AI research. Hobby strand, though maybe good for PR purposes. Not a good driver of research/product. For 2 & 3 the user already has some idea of what the information need is. For 4, the user is overwhelmed by the complexity of the problem/solution space, needs help navigating.