SlideShare a Scribd company logo
1 of 30
Ming Sun
Yun-Nung Chen
Alexander I. Rudnicky
School of Computer Science
Carnegie Mellon University
An Intelligent Assistant for High-Level
Task Understanding
Many devices, many apps…
2
 Rich functionality
is now available
 How well can it
support user
activities?
3
Book a restaurant!
I can do Yelp!
4
Show me bus to China Palace.
I know Maps!
5
Tell kids we will have Chinese
tonight.
I will Text!
Multi-domain Dialog – Current
System
System: What can I do for you?
User: Could you arrange a dinner for me and my friends?
System: Sorry I don’t understand that. What can I do for you?
User: Can I book a table for three in Täkō downtown for this
Friday?
…
System: OK. What can I do for you next?
User: Show me the bus from here.
System: Where is your destination please?
User: Täkō downtown Pittsburgh.
…
System: What should I do next?
User: Send the bus route to Carrie and Peter.
System: OK. Sending the following message to Carrie and Peter:6
Cannot handle complex intention
Passively support cross-domain dialo
No shared context
Multi-domain Dialog – Human Assistant
Assistant: What can I do for you?
User: Could you arrange a dinner for me and my friends?
Assistant : What kind of food do you prefer?
User: Mexican?
Assistant : How about Täkō? I can book a table for you.
User: Sounds good! Can I take a bus there?
Assistant : 61 A/B/C/D can take you there. Do you want to send
this to your friends?
User: Great! Send it to Carrie and Peter.
Assistant: OK. The bus route 61 has been sent.
7
Understand complex intentions
Actively support cross-domain dialo
Maintain context
Intention Understanding
Yes Dad!
Special Team 1!
8
Plan a dinner!
Intention Understanding
Find a restaurant!
9
Yelp!
Intention Understanding
Find a bus route!
10
Maps!
Intention Understanding
Message Agnes & Edith!
11
Messenger!
Intention Understanding
Yes Dad!
Special Team 2!
12
Set up meeting!
Approach
 Step 1: Observe human user perform multi-
domain tasks
 Step 2: Learn to assist at task level
 Map an activity description to a set of domain apps
 Interact at the task level
13
Data Collection 1 – Smart Phone
Wednesday
17:08 – 17:14
CMU
Messenge
r
Gmail Browser Calendar
Schedule a visit to CMU Lab
 Log app invocation + time/date/location
 Separate log into episodes if there is 3
minute inactivity
14
Data Collection 2 – Wizard-of-Oz
15
Find me an Indian place near
CMU.
Yuva India is
nearby.
Monday
10:08 – 10:15
Home
Yelp Maps
Messeng
er
Schedule a lunch with
David.
Music
Data Collection 2 – Wizard-of-Oz
16
When is the next bus to school? In 10 min, 61C.
Monday
10:08 – 10:15
Home
Yelp Maps
Messeng
er
Schedule a lunch with
David.
Music
Data Collection 2 – Wizard-of-Oz
17
Tell David to meet me there in
15 min.
Message sent.
Monday
10:08 – 10:15
Home
Yelp Maps
Messeng
er
Schedule a lunch with
David.
Music
Corpus
 533 real-life multi-domain interactions from 14
real users
 12 native English speakers (2 non-)
 4 males & 10 females
 Mean age: 31
 Total # unique apps: 130 (Mean = 19/user)
18
Resources Examples Usage
App sequences Yelp->Maps->Messenger structure/arrangement
Task descriptions “Schedule a lunch with David” nature of the
intention,
language reference
User utterances “Find me an Indian place near
CMU.”
language reference
Meta data Monday, 10:08 – 10:15, Home contexts of the tasks
Intention
Realization
Model
• [Yelp, Maps, Uber]
• November, weekday, afternoon,
office
• “Try to arrange evening out”
• [United Airlines, AirBnB,
Calendar]
• September, weekend,
morning, home
• “Planning a trip to California”
• [TripAdvisor, United
Airlines]
• July, weekend, morning,
home
• “I was planning a trip to
Oregon”
“Plan a weekend in
Virginia”
• […, …, …]
• …
•“Shared picture to
Alexis”
Infer:
1) Supportive Domains: United Airlines, TripAdvisor, AirBnB
2) Summarization: “plan trip”
19
Find similar past experience
 Cluster-based:
 K-means clustering on user generated language
 Neighbor-based:
 KNN
1
2
3
Cluster-
based
Neighbor-
based
20
Yelp
Yelp
OpenTabl
e
Yelp
Maps
Maps
Maps
Maps
Messeng
er
Email
Email
Email
Realize domains from past
experience
 Representative Sequence
 Multi-label Classification
App sequences of similar
experience
(ROVER)
21
Some Obstacles to Remove
 Language-mismatch
 Solution: Query Enrichment (QryEnr)
 [“shoot”, “photo”] -> [“shoot”, “take”, “photo”, “picture”]
 word2vec, GoogleNews model
 App-mismatch
 Solution: App Similarity (AppSim)
 Functionality space (derived from app descriptions) to
identify apps
 Data-driven: doc2vec on app store texts
 Rule-based: app package name
 Knowledge-driven: Google Play similar app suggestions
22
23
Gap between Generic and
Personalized Models
QryEnr, AppSim, QryEnr+AppSim reduce the gap of F1
0
5
10
15
20
25
30
35
Compare different AppSim
0
10
20
30
40
50
60
70
Precision Recall F1
Baseline Data Knowledge Rule Combine
24
Compare different AppSim
 Combining three approaches performs the best
 Knowledge-driven and data-driven have low
coverage among (manufacture) apps
 Rule-based is better than the other two individual
approaches
25
Learning to talk at the task level
 Techniques:
 (Extractive/abstractive) summarization
 Key phrase extraction [RAKE]
 User study:
 Key phrase extraction + user generated language
 Ranked list of key phrases + user’s binary judgment
1. solutions online
2. project file
3. Google drive
4. math problems
5. physics homework
6. answers online
…
[descriptions]
Looking up math problems.
Now open a browser.
Go to slader.com.
Doing physics homework.
…
[utterances]
Go to my Google drive.
Look up kinematic equations.
Now open my calculator so I can plug in numbers.
…
26
1. solutions online
2. project file
3. Google drive
4. math problems
5. physics homework
6. answers online
…
Learning to talk at the task level
 Metrics
 Mean Reciprocal Rank (MRR)
 Result:
 MRR ~0.6
 understandable verbal reference show up in top 2 of the ranked list
[descriptions]
Looking up math problems.
Now open a browser.
Go to slader.com.
Doing physics homework.
…
[utterances]
Go to my Google drive.
Look up kinematic equations.
Now open my calculator so I can plug in numbers.
…
27
Summary
 Collected real-life cross-domain interactions from
real users
 HELPR: a framework to learn assistance at the
task level
 Suggest a set of supportive domains to accomplish
the task
 Personalized model > Generic model
 The gap can be reduced by QryEnr + AppSim
 Generate language reference to communicate
verbally at task level
28
HELPR demo
 Interface
 HELPR display
 GoogleASR
 Android TTS
 HELPR server
 User models
29
Thank you
 Questions?
30

More Related Content

What's hot

2017 Tutorial - Deep Learning for Dialogue Systems
2017 Tutorial - Deep Learning for Dialogue Systems2017 Tutorial - Deep Learning for Dialogue Systems
2017 Tutorial - Deep Learning for Dialogue SystemsMLReview
 
Detecting Actionable Items in Meetings by Convolutional Deep Structured Seman...
Detecting Actionable Items in Meetings by Convolutional Deep Structured Seman...Detecting Actionable Items in Meetings by Convolutional Deep Structured Seman...
Detecting Actionable Items in Meetings by Convolutional Deep Structured Seman...Yun-Nung (Vivian) Chen
 
Deep Dialog System Review
Deep Dialog System ReviewDeep Dialog System Review
Deep Dialog System ReviewNguyen Quang
 
Dilek Hakkani-Tur at AI Frontiers: Conversational machines: Deep Learning for...
Dilek Hakkani-Tur at AI Frontiers: Conversational machines: Deep Learning for...Dilek Hakkani-Tur at AI Frontiers: Conversational machines: Deep Learning for...
Dilek Hakkani-Tur at AI Frontiers: Conversational machines: Deep Learning for...AI Frontiers
 
How the Context Matters Language and Interaction in Dialogues
How the Context Matters Language and Interaction in DialoguesHow the Context Matters Language and Interaction in Dialogues
How the Context Matters Language and Interaction in DialoguesYun-Nung (Vivian) Chen
 
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...Yun-Nung (Vivian) Chen
 
Statistical Learning from Dialogues for Intelligent Assistants
Statistical Learning from Dialogues for Intelligent AssistantsStatistical Learning from Dialogues for Intelligent Assistants
Statistical Learning from Dialogues for Intelligent AssistantsYun-Nung (Vivian) Chen
 
"Sorry, I didn't get that!" - Statistical Learning from Dialogues for Intelli...
"Sorry, I didn't get that!" - Statistical Learning from Dialogues for Intelli..."Sorry, I didn't get that!" - Statistical Learning from Dialogues for Intelli...
"Sorry, I didn't get that!" - Statistical Learning from Dialogues for Intelli...Yun-Nung (Vivian) Chen
 
Omar Tawakol at AI Frontiers: The Rise Of Voice-Activated Assistants In The W...
Omar Tawakol at AI Frontiers: The Rise Of Voice-Activated Assistants In The W...Omar Tawakol at AI Frontiers: The Rise Of Voice-Activated Assistants In The W...
Omar Tawakol at AI Frontiers: The Rise Of Voice-Activated Assistants In The W...AI Frontiers
 
One Day for Bot 一天搞懂聊天機器人
One Day for Bot 一天搞懂聊天機器人One Day for Bot 一天搞懂聊天機器人
One Day for Bot 一天搞懂聊天機器人Yun-Nung (Vivian) Chen
 
Lukasz Kaiser at AI Frontiers: How Deep Learning Quietly Revolutionized NLP
Lukasz Kaiser at AI Frontiers: How Deep Learning Quietly Revolutionized NLPLukasz Kaiser at AI Frontiers: How Deep Learning Quietly Revolutionized NLP
Lukasz Kaiser at AI Frontiers: How Deep Learning Quietly Revolutionized NLPAI Frontiers
 
Li Deng at AI Frontiers: Three Generations of Spoken Dialogue Systems (Bots)
Li Deng at AI Frontiers: Three Generations of Spoken Dialogue Systems (Bots)Li Deng at AI Frontiers: Three Generations of Spoken Dialogue Systems (Bots)
Li Deng at AI Frontiers: Three Generations of Spoken Dialogue Systems (Bots)AI Frontiers
 
Recent Advances in NLP
  Recent Advances in NLP  Recent Advances in NLP
Recent Advances in NLPAnuj Gupta
 
Talk from NVidia Developer Connect
Talk from NVidia Developer ConnectTalk from NVidia Developer Connect
Talk from NVidia Developer ConnectAnuj Gupta
 
Anthiil Inside workshop on NLP
Anthiil Inside workshop on NLPAnthiil Inside workshop on NLP
Anthiil Inside workshop on NLPSatyam Saxena
 
Multiskill Conversational AI
Multiskill Conversational AIMultiskill Conversational AI
Multiskill Conversational AIDaniel Kornev
 
[221]똑똑한 인공지능 dj 비서 clova music
[221]똑똑한 인공지능 dj 비서 clova music[221]똑똑한 인공지능 dj 비서 clova music
[221]똑똑한 인공지능 dj 비서 clova musicNAVER D2
 

What's hot (18)

2017 Tutorial - Deep Learning for Dialogue Systems
2017 Tutorial - Deep Learning for Dialogue Systems2017 Tutorial - Deep Learning for Dialogue Systems
2017 Tutorial - Deep Learning for Dialogue Systems
 
Chatbot的智慧與靈魂
Chatbot的智慧與靈魂Chatbot的智慧與靈魂
Chatbot的智慧與靈魂
 
Detecting Actionable Items in Meetings by Convolutional Deep Structured Seman...
Detecting Actionable Items in Meetings by Convolutional Deep Structured Seman...Detecting Actionable Items in Meetings by Convolutional Deep Structured Seman...
Detecting Actionable Items in Meetings by Convolutional Deep Structured Seman...
 
Deep Dialog System Review
Deep Dialog System ReviewDeep Dialog System Review
Deep Dialog System Review
 
Dilek Hakkani-Tur at AI Frontiers: Conversational machines: Deep Learning for...
Dilek Hakkani-Tur at AI Frontiers: Conversational machines: Deep Learning for...Dilek Hakkani-Tur at AI Frontiers: Conversational machines: Deep Learning for...
Dilek Hakkani-Tur at AI Frontiers: Conversational machines: Deep Learning for...
 
How the Context Matters Language and Interaction in Dialogues
How the Context Matters Language and Interaction in DialoguesHow the Context Matters Language and Interaction in Dialogues
How the Context Matters Language and Interaction in Dialogues
 
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
 
Statistical Learning from Dialogues for Intelligent Assistants
Statistical Learning from Dialogues for Intelligent AssistantsStatistical Learning from Dialogues for Intelligent Assistants
Statistical Learning from Dialogues for Intelligent Assistants
 
"Sorry, I didn't get that!" - Statistical Learning from Dialogues for Intelli...
"Sorry, I didn't get that!" - Statistical Learning from Dialogues for Intelli..."Sorry, I didn't get that!" - Statistical Learning from Dialogues for Intelli...
"Sorry, I didn't get that!" - Statistical Learning from Dialogues for Intelli...
 
Omar Tawakol at AI Frontiers: The Rise Of Voice-Activated Assistants In The W...
Omar Tawakol at AI Frontiers: The Rise Of Voice-Activated Assistants In The W...Omar Tawakol at AI Frontiers: The Rise Of Voice-Activated Assistants In The W...
Omar Tawakol at AI Frontiers: The Rise Of Voice-Activated Assistants In The W...
 
One Day for Bot 一天搞懂聊天機器人
One Day for Bot 一天搞懂聊天機器人One Day for Bot 一天搞懂聊天機器人
One Day for Bot 一天搞懂聊天機器人
 
Lukasz Kaiser at AI Frontiers: How Deep Learning Quietly Revolutionized NLP
Lukasz Kaiser at AI Frontiers: How Deep Learning Quietly Revolutionized NLPLukasz Kaiser at AI Frontiers: How Deep Learning Quietly Revolutionized NLP
Lukasz Kaiser at AI Frontiers: How Deep Learning Quietly Revolutionized NLP
 
Li Deng at AI Frontiers: Three Generations of Spoken Dialogue Systems (Bots)
Li Deng at AI Frontiers: Three Generations of Spoken Dialogue Systems (Bots)Li Deng at AI Frontiers: Three Generations of Spoken Dialogue Systems (Bots)
Li Deng at AI Frontiers: Three Generations of Spoken Dialogue Systems (Bots)
 
Recent Advances in NLP
  Recent Advances in NLP  Recent Advances in NLP
Recent Advances in NLP
 
Talk from NVidia Developer Connect
Talk from NVidia Developer ConnectTalk from NVidia Developer Connect
Talk from NVidia Developer Connect
 
Anthiil Inside workshop on NLP
Anthiil Inside workshop on NLPAnthiil Inside workshop on NLP
Anthiil Inside workshop on NLP
 
Multiskill Conversational AI
Multiskill Conversational AIMultiskill Conversational AI
Multiskill Conversational AI
 
[221]똑똑한 인공지능 dj 비서 clova music
[221]똑똑한 인공지능 dj 비서 clova music[221]똑똑한 인공지능 dj 비서 clova music
[221]똑똑한 인공지능 dj 비서 clova music
 

Viewers also liked

Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...Yun-Nung (Vivian) Chen
 
Automatic Key Term Extraction and Summarization from Spoken Course Lectures
Automatic Key Term Extraction and Summarization from Spoken Course LecturesAutomatic Key Term Extraction and Summarization from Spoken Course Lectures
Automatic Key Term Extraction and Summarization from Spoken Course LecturesYun-Nung (Vivian) Chen
 
Human Activities as Linked Data
Human Activities as Linked DataHuman Activities as Linked Data
Human Activities as Linked DataPaolo Pareti
 
significance of theory for nursing as a profession
significance of theory for nursing as a professionsignificance of theory for nursing as a profession
significance of theory for nursing as a professionJohn Christian Villanueva
 
1st Seminar- Intelligent Agent for Medium-Level Artificial Intelligence in Re...
1st Seminar- Intelligent Agent for Medium-Level Artificial Intelligence in Re...1st Seminar- Intelligent Agent for Medium-Level Artificial Intelligence in Re...
1st Seminar- Intelligent Agent for Medium-Level Artificial Intelligence in Re...Muhamad Hesham
 
青云虚拟机部署私有Docker Registry
青云虚拟机部署私有Docker Registry青云虚拟机部署私有Docker Registry
青云虚拟机部署私有Docker RegistryZhichao Liang
 
RでMplusがもっと便利にーmplusAutomationパッケージー #Hiroshimar05
RでMplusがもっと便利にーmplusAutomationパッケージー #Hiroshimar05 RでMplusがもっと便利にーmplusAutomationパッケージー #Hiroshimar05
RでMplusがもっと便利にーmplusAutomationパッケージー #Hiroshimar05 Masaru Tokuoka
 
微软Bot framework简介
微软Bot framework简介微软Bot framework简介
微软Bot framework简介Zhichao Liang
 
From data to numbers to knowledge: semantic embeddings By Alvaro Barbero
From data to numbers to knowledge: semantic embeddings By Alvaro BarberoFrom data to numbers to knowledge: semantic embeddings By Alvaro Barbero
From data to numbers to knowledge: semantic embeddings By Alvaro BarberoBig Data Spain
 
Automatic Key Term Extraction from Spoken Course Lectures
Automatic Key Term Extraction from Spoken Course LecturesAutomatic Key Term Extraction from Spoken Course Lectures
Automatic Key Term Extraction from Spoken Course LecturesYun-Nung (Vivian) Chen
 
Are we reaching a Data Science Singularity? How Cognitive Computing is emergi...
Are we reaching a Data Science Singularity? How Cognitive Computing is emergi...Are we reaching a Data Science Singularity? How Cognitive Computing is emergi...
Are we reaching a Data Science Singularity? How Cognitive Computing is emergi...Big Data Spain
 
Intelligent Agent Perception
Intelligent Agent PerceptionIntelligent Agent Perception
Intelligent Agent PerceptionMolly Maymar
 
Los MOOC ventajas y desventajas
Los MOOC ventajas y desventajasLos MOOC ventajas y desventajas
Los MOOC ventajas y desventajasluz deluna
 
Cascon 2016 Keynote: Disrupting Developer Productivity One Bot at a Time
Cascon 2016 Keynote: Disrupting Developer Productivity One Bot at a TimeCascon 2016 Keynote: Disrupting Developer Productivity One Bot at a Time
Cascon 2016 Keynote: Disrupting Developer Productivity One Bot at a TimeMargaret-Anne Storey
 
Intelligent agent
Intelligent agent Intelligent agent
Intelligent agent Arvind sahu
 
Harm van Seijen, Research Scientist, Maluuba at MLconf SF 2016
Harm van Seijen, Research Scientist, Maluuba at MLconf SF 2016Harm van Seijen, Research Scientist, Maluuba at MLconf SF 2016
Harm van Seijen, Research Scientist, Maluuba at MLconf SF 2016MLconf
 
Artificial Intelligence Chapter two agents
Artificial Intelligence Chapter two agentsArtificial Intelligence Chapter two agents
Artificial Intelligence Chapter two agentsEhsan Nowrouzi
 

Viewers also liked (20)

Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
 
Automatic Key Term Extraction and Summarization from Spoken Course Lectures
Automatic Key Term Extraction and Summarization from Spoken Course LecturesAutomatic Key Term Extraction and Summarization from Spoken Course Lectures
Automatic Key Term Extraction and Summarization from Spoken Course Lectures
 
Human Activities as Linked Data
Human Activities as Linked DataHuman Activities as Linked Data
Human Activities as Linked Data
 
thesis_palogiannidi
thesis_palogiannidithesis_palogiannidi
thesis_palogiannidi
 
significance of theory for nursing as a profession
significance of theory for nursing as a professionsignificance of theory for nursing as a profession
significance of theory for nursing as a profession
 
1st Seminar- Intelligent Agent for Medium-Level Artificial Intelligence in Re...
1st Seminar- Intelligent Agent for Medium-Level Artificial Intelligence in Re...1st Seminar- Intelligent Agent for Medium-Level Artificial Intelligence in Re...
1st Seminar- Intelligent Agent for Medium-Level Artificial Intelligence in Re...
 
青云虚拟机部署私有Docker Registry
青云虚拟机部署私有Docker Registry青云虚拟机部署私有Docker Registry
青云虚拟机部署私有Docker Registry
 
On using monolingual corpora in neural machine translation
On using monolingual corpora in neural machine translationOn using monolingual corpora in neural machine translation
On using monolingual corpora in neural machine translation
 
RでMplusがもっと便利にーmplusAutomationパッケージー #Hiroshimar05
RでMplusがもっと便利にーmplusAutomationパッケージー #Hiroshimar05 RでMplusがもっと便利にーmplusAutomationパッケージー #Hiroshimar05
RでMplusがもっと便利にーmplusAutomationパッケージー #Hiroshimar05
 
Deep Learning for Machine Translation
Deep Learning for Machine TranslationDeep Learning for Machine Translation
Deep Learning for Machine Translation
 
微软Bot framework简介
微软Bot framework简介微软Bot framework简介
微软Bot framework简介
 
From data to numbers to knowledge: semantic embeddings By Alvaro Barbero
From data to numbers to knowledge: semantic embeddings By Alvaro BarberoFrom data to numbers to knowledge: semantic embeddings By Alvaro Barbero
From data to numbers to knowledge: semantic embeddings By Alvaro Barbero
 
Automatic Key Term Extraction from Spoken Course Lectures
Automatic Key Term Extraction from Spoken Course LecturesAutomatic Key Term Extraction from Spoken Course Lectures
Automatic Key Term Extraction from Spoken Course Lectures
 
Are we reaching a Data Science Singularity? How Cognitive Computing is emergi...
Are we reaching a Data Science Singularity? How Cognitive Computing is emergi...Are we reaching a Data Science Singularity? How Cognitive Computing is emergi...
Are we reaching a Data Science Singularity? How Cognitive Computing is emergi...
 
Intelligent Agent Perception
Intelligent Agent PerceptionIntelligent Agent Perception
Intelligent Agent Perception
 
Los MOOC ventajas y desventajas
Los MOOC ventajas y desventajasLos MOOC ventajas y desventajas
Los MOOC ventajas y desventajas
 
Cascon 2016 Keynote: Disrupting Developer Productivity One Bot at a Time
Cascon 2016 Keynote: Disrupting Developer Productivity One Bot at a TimeCascon 2016 Keynote: Disrupting Developer Productivity One Bot at a Time
Cascon 2016 Keynote: Disrupting Developer Productivity One Bot at a Time
 
Intelligent agent
Intelligent agent Intelligent agent
Intelligent agent
 
Harm van Seijen, Research Scientist, Maluuba at MLconf SF 2016
Harm van Seijen, Research Scientist, Maluuba at MLconf SF 2016Harm van Seijen, Research Scientist, Maluuba at MLconf SF 2016
Harm van Seijen, Research Scientist, Maluuba at MLconf SF 2016
 
Artificial Intelligence Chapter two agents
Artificial Intelligence Chapter two agentsArtificial Intelligence Chapter two agents
Artificial Intelligence Chapter two agents
 

Similar to An Intelligent Assistant for High-Level Task Understanding

Designing for People
Designing for PeopleDesigning for People
Designing for PeopleSarah Horton
 
UX Tutorial Session Day 2
UX Tutorial Session Day 2UX Tutorial Session Day 2
UX Tutorial Session Day 2Fergus Roche
 
Missioinformatics - How Information Technology Contribute to Mission Study
Missioinformatics - How Information Technology Contribute to Mission StudyMissioinformatics - How Information Technology Contribute to Mission Study
Missioinformatics - How Information Technology Contribute to Mission StudyHokuto Ide
 
Strategies for integrating people and machine learning in online systems
Strategies for integrating people and machine learning in online systems Strategies for integrating people and machine learning in online systems
Strategies for integrating people and machine learning in online systems Jason Laska
 
Lessonplan exploring
Lessonplan exploringLessonplan exploring
Lessonplan exploringSusan Ferdon
 
Anytime, Anywhere, Any Device: Developing a Mobile Website for Your Library
Anytime, Anywhere, Any Device: Developing a Mobile Website for Your LibraryAnytime, Anywhere, Any Device: Developing a Mobile Website for Your Library
Anytime, Anywhere, Any Device: Developing a Mobile Website for Your LibraryMobile Libraries
 
08-Task Analysis.pdf
08-Task Analysis.pdf08-Task Analysis.pdf
08-Task Analysis.pdfshreyakale11
 
Directions Type out the answers to all questions on a separate .docx
Directions Type out the answers to all questions on a separate .docxDirections Type out the answers to all questions on a separate .docx
Directions Type out the answers to all questions on a separate .docxvickeylintern
 
GI Learner Local Gent
GI Learner Local GentGI Learner Local Gent
GI Learner Local GentGeoBlogs
 
Community Visioning Workshop Preview
Community Visioning Workshop PreviewCommunity Visioning Workshop Preview
Community Visioning Workshop PreviewHeartland2050
 
Turning On The Lights
Turning On The LightsTurning On The Lights
Turning On The Lightsnbruno
 
Brightfind world usability day 2016 full deck final
Brightfind world usability day 2016   full deck finalBrightfind world usability day 2016   full deck final
Brightfind world usability day 2016 full deck finalBrightfind
 
Dyson c mobile learning lesson plan 7330
Dyson c mobile learning lesson plan 7330Dyson c mobile learning lesson plan 7330
Dyson c mobile learning lesson plan 7330Christy Dyson
 
Leader Keynote Pp Final
Leader Keynote Pp FinalLeader Keynote Pp Final
Leader Keynote Pp Finaldjleu
 
Kevin Byrne’s Presentation: Graticules & Google – Online Learning About Susta...
Kevin Byrne’s Presentation: Graticules & Google – Online Learning About Susta...Kevin Byrne’s Presentation: Graticules & Google – Online Learning About Susta...
Kevin Byrne’s Presentation: Graticules & Google – Online Learning About Susta...J. Kevin Byrne
 
[KDD 2018 tutorial] End to-end goal-oriented question answering systems
[KDD 2018 tutorial] End to-end goal-oriented question answering systems[KDD 2018 tutorial] End to-end goal-oriented question answering systems
[KDD 2018 tutorial] End to-end goal-oriented question answering systemsQi He
 
Designing Mobile Search: A workshop for eBay Classifieds Group
Designing Mobile Search: A workshop for eBay Classifieds GroupDesigning Mobile Search: A workshop for eBay Classifieds Group
Designing Mobile Search: A workshop for eBay Classifieds GroupTyler Tate
 

Similar to An Intelligent Assistant for High-Level Task Understanding (20)

Iui 1.0 clean
Iui 1.0  cleanIui 1.0  clean
Iui 1.0 clean
 
Designing for People
Designing for PeopleDesigning for People
Designing for People
 
UX Tutorial Session Day 2
UX Tutorial Session Day 2UX Tutorial Session Day 2
UX Tutorial Session Day 2
 
Missioinformatics - How Information Technology Contribute to Mission Study
Missioinformatics - How Information Technology Contribute to Mission StudyMissioinformatics - How Information Technology Contribute to Mission Study
Missioinformatics - How Information Technology Contribute to Mission Study
 
Strategies for integrating people and machine learning in online systems
Strategies for integrating people and machine learning in online systems Strategies for integrating people and machine learning in online systems
Strategies for integrating people and machine learning in online systems
 
Lessonplan exploring
Lessonplan exploringLessonplan exploring
Lessonplan exploring
 
Anytime, Anywhere, Any Device: Developing a Mobile Website for Your Library
Anytime, Anywhere, Any Device: Developing a Mobile Website for Your LibraryAnytime, Anywhere, Any Device: Developing a Mobile Website for Your Library
Anytime, Anywhere, Any Device: Developing a Mobile Website for Your Library
 
08-Task Analysis.pdf
08-Task Analysis.pdf08-Task Analysis.pdf
08-Task Analysis.pdf
 
Directions Type out the answers to all questions on a separate .docx
Directions Type out the answers to all questions on a separate .docxDirections Type out the answers to all questions on a separate .docx
Directions Type out the answers to all questions on a separate .docx
 
GI Learner Local Gent
GI Learner Local GentGI Learner Local Gent
GI Learner Local Gent
 
Community Visioning Workshop Preview
Community Visioning Workshop PreviewCommunity Visioning Workshop Preview
Community Visioning Workshop Preview
 
Turning On The Lights
Turning On The LightsTurning On The Lights
Turning On The Lights
 
Brightfind world usability day 2016 full deck final
Brightfind world usability day 2016   full deck finalBrightfind world usability day 2016   full deck final
Brightfind world usability day 2016 full deck final
 
Dyson c mobile learning lesson plan 7330
Dyson c mobile learning lesson plan 7330Dyson c mobile learning lesson plan 7330
Dyson c mobile learning lesson plan 7330
 
Leader Keynote Pp Final
Leader Keynote Pp FinalLeader Keynote Pp Final
Leader Keynote Pp Final
 
How to supervise a thesis in NLP in the ChatGPT era? By Laure Soulier
How to supervise a thesis in NLP in the ChatGPT era? By Laure SoulierHow to supervise a thesis in NLP in the ChatGPT era? By Laure Soulier
How to supervise a thesis in NLP in the ChatGPT era? By Laure Soulier
 
Resume
ResumeResume
Resume
 
Kevin Byrne’s Presentation: Graticules & Google – Online Learning About Susta...
Kevin Byrne’s Presentation: Graticules & Google – Online Learning About Susta...Kevin Byrne’s Presentation: Graticules & Google – Online Learning About Susta...
Kevin Byrne’s Presentation: Graticules & Google – Online Learning About Susta...
 
[KDD 2018 tutorial] End to-end goal-oriented question answering systems
[KDD 2018 tutorial] End to-end goal-oriented question answering systems[KDD 2018 tutorial] End to-end goal-oriented question answering systems
[KDD 2018 tutorial] End to-end goal-oriented question answering systems
 
Designing Mobile Search: A workshop for eBay Classifieds Group
Designing Mobile Search: A workshop for eBay Classifieds GroupDesigning Mobile Search: A workshop for eBay Classifieds Group
Designing Mobile Search: A workshop for eBay Classifieds Group
 

Recently uploaded

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 

Recently uploaded (20)

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 

An Intelligent Assistant for High-Level Task Understanding

  • 1. Ming Sun Yun-Nung Chen Alexander I. Rudnicky School of Computer Science Carnegie Mellon University An Intelligent Assistant for High-Level Task Understanding
  • 2. Many devices, many apps… 2  Rich functionality is now available  How well can it support user activities?
  • 3. 3 Book a restaurant! I can do Yelp!
  • 4. 4 Show me bus to China Palace. I know Maps!
  • 5. 5 Tell kids we will have Chinese tonight. I will Text!
  • 6. Multi-domain Dialog – Current System System: What can I do for you? User: Could you arrange a dinner for me and my friends? System: Sorry I don’t understand that. What can I do for you? User: Can I book a table for three in Täkō downtown for this Friday? … System: OK. What can I do for you next? User: Show me the bus from here. System: Where is your destination please? User: Täkō downtown Pittsburgh. … System: What should I do next? User: Send the bus route to Carrie and Peter. System: OK. Sending the following message to Carrie and Peter:6 Cannot handle complex intention Passively support cross-domain dialo No shared context
  • 7. Multi-domain Dialog – Human Assistant Assistant: What can I do for you? User: Could you arrange a dinner for me and my friends? Assistant : What kind of food do you prefer? User: Mexican? Assistant : How about Täkō? I can book a table for you. User: Sounds good! Can I take a bus there? Assistant : 61 A/B/C/D can take you there. Do you want to send this to your friends? User: Great! Send it to Carrie and Peter. Assistant: OK. The bus route 61 has been sent. 7 Understand complex intentions Actively support cross-domain dialo Maintain context
  • 8. Intention Understanding Yes Dad! Special Team 1! 8 Plan a dinner!
  • 9. Intention Understanding Find a restaurant! 9 Yelp!
  • 10. Intention Understanding Find a bus route! 10 Maps!
  • 11. Intention Understanding Message Agnes & Edith! 11 Messenger!
  • 12. Intention Understanding Yes Dad! Special Team 2! 12 Set up meeting!
  • 13. Approach  Step 1: Observe human user perform multi- domain tasks  Step 2: Learn to assist at task level  Map an activity description to a set of domain apps  Interact at the task level 13
  • 14. Data Collection 1 – Smart Phone Wednesday 17:08 – 17:14 CMU Messenge r Gmail Browser Calendar Schedule a visit to CMU Lab  Log app invocation + time/date/location  Separate log into episodes if there is 3 minute inactivity 14
  • 15. Data Collection 2 – Wizard-of-Oz 15 Find me an Indian place near CMU. Yuva India is nearby. Monday 10:08 – 10:15 Home Yelp Maps Messeng er Schedule a lunch with David. Music
  • 16. Data Collection 2 – Wizard-of-Oz 16 When is the next bus to school? In 10 min, 61C. Monday 10:08 – 10:15 Home Yelp Maps Messeng er Schedule a lunch with David. Music
  • 17. Data Collection 2 – Wizard-of-Oz 17 Tell David to meet me there in 15 min. Message sent. Monday 10:08 – 10:15 Home Yelp Maps Messeng er Schedule a lunch with David. Music
  • 18. Corpus  533 real-life multi-domain interactions from 14 real users  12 native English speakers (2 non-)  4 males & 10 females  Mean age: 31  Total # unique apps: 130 (Mean = 19/user) 18 Resources Examples Usage App sequences Yelp->Maps->Messenger structure/arrangement Task descriptions “Schedule a lunch with David” nature of the intention, language reference User utterances “Find me an Indian place near CMU.” language reference Meta data Monday, 10:08 – 10:15, Home contexts of the tasks
  • 19. Intention Realization Model • [Yelp, Maps, Uber] • November, weekday, afternoon, office • “Try to arrange evening out” • [United Airlines, AirBnB, Calendar] • September, weekend, morning, home • “Planning a trip to California” • [TripAdvisor, United Airlines] • July, weekend, morning, home • “I was planning a trip to Oregon” “Plan a weekend in Virginia” • […, …, …] • … •“Shared picture to Alexis” Infer: 1) Supportive Domains: United Airlines, TripAdvisor, AirBnB 2) Summarization: “plan trip” 19
  • 20. Find similar past experience  Cluster-based:  K-means clustering on user generated language  Neighbor-based:  KNN 1 2 3 Cluster- based Neighbor- based 20
  • 21. Yelp Yelp OpenTabl e Yelp Maps Maps Maps Maps Messeng er Email Email Email Realize domains from past experience  Representative Sequence  Multi-label Classification App sequences of similar experience (ROVER) 21
  • 22. Some Obstacles to Remove  Language-mismatch  Solution: Query Enrichment (QryEnr)  [“shoot”, “photo”] -> [“shoot”, “take”, “photo”, “picture”]  word2vec, GoogleNews model  App-mismatch  Solution: App Similarity (AppSim)  Functionality space (derived from app descriptions) to identify apps  Data-driven: doc2vec on app store texts  Rule-based: app package name  Knowledge-driven: Google Play similar app suggestions 22
  • 23. 23 Gap between Generic and Personalized Models QryEnr, AppSim, QryEnr+AppSim reduce the gap of F1 0 5 10 15 20 25 30 35
  • 24. Compare different AppSim 0 10 20 30 40 50 60 70 Precision Recall F1 Baseline Data Knowledge Rule Combine 24
  • 25. Compare different AppSim  Combining three approaches performs the best  Knowledge-driven and data-driven have low coverage among (manufacture) apps  Rule-based is better than the other two individual approaches 25
  • 26. Learning to talk at the task level  Techniques:  (Extractive/abstractive) summarization  Key phrase extraction [RAKE]  User study:  Key phrase extraction + user generated language  Ranked list of key phrases + user’s binary judgment 1. solutions online 2. project file 3. Google drive 4. math problems 5. physics homework 6. answers online … [descriptions] Looking up math problems. Now open a browser. Go to slader.com. Doing physics homework. … [utterances] Go to my Google drive. Look up kinematic equations. Now open my calculator so I can plug in numbers. … 26
  • 27. 1. solutions online 2. project file 3. Google drive 4. math problems 5. physics homework 6. answers online … Learning to talk at the task level  Metrics  Mean Reciprocal Rank (MRR)  Result:  MRR ~0.6  understandable verbal reference show up in top 2 of the ranked list [descriptions] Looking up math problems. Now open a browser. Go to slader.com. Doing physics homework. … [utterances] Go to my Google drive. Look up kinematic equations. Now open my calculator so I can plug in numbers. … 27
  • 28. Summary  Collected real-life cross-domain interactions from real users  HELPR: a framework to learn assistance at the task level  Suggest a set of supportive domains to accomplish the task  Personalized model > Generic model  The gap can be reduced by QryEnr + AppSim  Generate language reference to communicate verbally at task level 28
  • 29. HELPR demo  Interface  HELPR display  GoogleASR  Android TTS  HELPR server  User models 29

Editor's Notes

  1. Many agents… but just a collection of silo apps