Evorus: A Crowd-Powered Conversational Assistant Built to Automate Itself Over Time
Apr. 26, 2018•0 likes•947 views
Download to read offline
Report
Technology
Evorus: A Crowd-Powered Conversational Assistant Built to Automate Itself Over Time.
Ting-Hao K. Huang, Joseph Chee Chang, Jeffrey P. Bigham.
In Proceedings of Conference on Human Factors in Computing Systems 2018 (CHI 2018), 2018, Montréal, Canada.
Evorus: A Crowd-Powered Conversational Assistant Built to Automate Itself Over Time
1. 1
Live Note/QA: http://tinyurl.com/Evorus
Evorus:
A Crowd-Powered
Conversational Assistant
Built to Automate Itself Over Time
Ting-Hao (Kenneth) Huang*
Joseph Chee Chang
Jeffrey P. Bigham
[ Question / Feedback: http://tinyurl.com/Evorus ]
*Kenneth will join in Fall 2018
7. 7
Live Note/QA: http://tinyurl.com/Evorus
What just
happened?
• Open Conversation
• Multi-turn interaction
• Multiple domains
• Personalized
• Coherent dialog
• Mix of task-oriented
and social conversation
9. 9
Live Note/QA: http://tinyurl.com/Evorus
Existing Approaches to
Open Conversation
• Combining multiple task-oriented dialog systems
• DialPort (Zhao, et al., 2016)
• End-to-end framework for dialogue systems
• Serban, et al. 2016; Li, et al. 2017
• Adapting a model to many other domains
• Walker, et al., 2007; Sun, et al., 2016
• Chit-chat systems (social bot)
• Hold social conversations (Banchs, et al., 2012)
• Still a very hard problem…
10. 10
Live Note/QA: http://tinyurl.com/Evorus
Existing Approaches to
Open Conversation
• Combining multiple task-oriented dialog systems
• DialPort (Zhao, et al., 2016)
• End-to-end framework for dialogue systems
• Serban, et al. 2016; Li, et al. 2017
• Adapting a model to many other domains
• Walker, et al., 2007; Sun, et al., 2016
• Chit-chat systems (social bot)
• Hold social conversations (Banchs, et al., 2012)
• Still a very hard problem…
MIT Technology Review
Feb 27, 2018
16. 16
Live Note/QA: http://tinyurl.com/Evorus
•Crowd workers collectively hold a
conversation by:
1. Propose Responses
2. Vote Responses
3. Take Notes
Reward points for
each action
"Is there anything else I can help you with?": Challenges in Deploying an On-Demand
Crowd-Powered Conversational Agent. Huang, et al. HCOMP 2016.
Chorus: A Crowd-Powered
Conversation Assistant
30. 30
Live Note/QA: http://tinyurl.com/Evorus
Ranking Chatbots: Performance & Topic
Topic Similarity
User Message
Domain of
the Chatbot
Hey what should
I eat in Montreal?
Find me some
good restaurants !
Where can I get
Chinese food?
Example
Triggering
Message
31. 31
Live Note/QA: http://tinyurl.com/Evorus
Ranking Chatbots: Performance & Topic
Topic Similarity
User Message
Domain of
the Chatbot
Hey what should
I eat in Montreal?
Example
Triggering
Message
Find me some
good restaurants !
Where can I get
Chinese food?
Topic
Similarity
36. 36
Live Note/QA: http://tinyurl.com/Evorus
Find the Best Confidence Threshold
• High Threshold
• Only vote when pretty sure
• High precision, but little benefit
• Low Threshold
• Nearly always vote
• Grant agreement bonus by mistake
• Damage conversation quality
39. 39
Live Note/QA: http://tinyurl.com/Evorus
Automating Open Conversations
• Setup
• A 5-month-long deployment, 80 Users
• 4 chatbots + 1 voting bot
• Result
• Automated responses were chosen 12.44% of the time.
• Human upvotes were reduced by 13.81%.
• The cost of each message is reduced by 32.76%.
• Conversation quality and user
satisfaction level remains.
• Conversation Quality: Satisfaction,
Clarity, Responsiveness, Comfort
(Liu, et al., 2010)
40. 40
Live Note/QA: http://tinyurl.com/Evorus
Conclusion & Future Work
• Automating some part of an open conversation is
possible.
• Future Work
• 1000+ Chatbots!
• Deploy Chorus as an Open Research Platform
• Chorus on Smart Devices
41. 41
Live Note/QA: http://tinyurl.com/Evorus
Conclusion & Future Work
• Automating some part of an open conversation is
possible.
• Future Work
• 1000+ Chatbots!
• Deploy Chorus as an Open Research Platform
• Chorus on Smart Devices
@windx0303
KennethHuang.cc
Ting-Hao (Kenneth) Huang
QA: http://tinyurl.com/Evorus