SlideShare a Scribd company logo
Using machine learning
for customer service
@neal_lathia
January 2021 - Data Talks Club
Monzo is a UK bank
Rewind to 2019, when we
started a project on chat bots
● What were we trying to achieve? How can
we measure this?
● How can we design, build, and iterate on a
bot? How can we demonstrate that it is
working?
● How did we ship this system? How did we
trade off between system design and
machine learning?
Framing the problem
What do customers need and
how do we help them?
Customer needs
information
Customer needs an
action
Self-Service is available Service via humans
Does my card work
abroad?
Can I have a bank
statement?
What is the status of
the cheque that I sent
you?
I don’t recognise this
transaction, can you
investigate?
Self-service is the fastest way to get help for a subset of
queries and enables agents to focus on where they are
really needed
Customer needs
information
Customer needs an
action
Self-Service is available Service via humans
Does my card work
abroad?
Can I have a bank
statement?
What is the status of
the cheque that I sent
you?
I don’t recognise this
transaction, can you
investigate?
How can we get a sense of
the size of these problems?
Several different metrics available:
● How many customers are getting in touch?
(Overall contact rate)
● How many customers are getting in touch
about specific topics? (Contact rate by
conversation tag)
● How many customers are using
self-service product features compared to
those getting in touch about the problem?
Iteration #1:
Search for article
recommendations
Our app already had a search
feature
⬅ Search query
⬅ Used to be an entry point to
chat
⬅ Help article search results
⬅ Prompt for information
⬅ Recommendations (search
results)
⬅ Self-selected chat closure
⬅ The chat first turn (this could
be more than one message)
Our first experiment: add search into chat
Why take this approach?
● The product team could focus on adding all of the
components to make this work
○ Typing detection
○ New types of chat bubbles
○ Auto-close if the customer abandons the chat for a
long time
● The analysts could set up the new metrics for this
system
○ Self-selected (& auto) chat close rate
○ Customers proceeding through to chat
● We didn’t need to ship any new ML systems
○ We ran our first BERT experiment, which showed a
+9% increase in self-service rate compared to our
in-house model
We ran our first BERT
experiment
● We didn’t need to ship any new ML systems
● The product team could focus on adding all of the
components to make this work
○ Typing detection
○ New types of chat bubbles
○ Auto-close if the customer abandons the chat for a
long time
● The analysts could set up the new metrics for this
system
○ Self-selected (& auto) chat close rate
○ Customers proceeding through to chat
Iteration #2:
Giving customers
answers for some
queries
Why change the approach?
Article recommendation is working okay, but it still requires
customers to put in the effort to find out if the suggestions
are relevant
We could identify specific topics that our
customers talked to us about that were:
● Valuable - we get a lot of queries about the
topic
● Eligible for self service - there is a simple
and direct call to action
● Predictable - we could train classifiers to
detect the topic!
⬅ Prompt for information
⬅ Predefined answer with a
direct call to action
⬅ Self-selected chat closure
⬅ The chat first turn (this could
be more than one message)
How can we test this?
We did not want to build out an entire system (weeks of
work) before validating whether this could action work. So
we started with one topic.
● Fine-tune a BERT model based on tagged
conversations
● Deploy this model in shadow mode to
monitor the prediction quality
● If the accuracy looks good, deploy this topic!
Combine giving answers (for this topic) with
giving article suggestions (for everything
else).
Offline validation
Precision
Out-of-sample
precision
Card replacement 0.86 0.69
Card not arrived 0.93 0.60
Update details 0.93 0.52 😫
... ... ...
We deployed a number of these models in shadow
prediction mode. This allows it to make predictions on
live data, but it’s not sending customers any answers.
Large set of noisy tags
Small set of re-labelled
examples
Hi there! How can I transfer money into a savings pot?
System design
Why consider this angle?
We wanted to collaborate!
● Having one huge multi-class model means
retraining everything when we want to add
a new topic
● It is very difficult for two Machine Learning
Scientists to work on one model
collaboratively
A bank built with thousands
of microservices
https://qconlondon.com/london2020/presentation/modern-banking-1500-microservices
Orchestrator
Classifier for A
Classifier for B
Classifier for C
Winner!
Orchestrator
Classifier for A
Classifier for B
Classifier for C
Winner!
Rules that determine
eligibility for topic C
Classifier for D
Thanks!
@neal_lathia
Some extra writing about
these topics
● Customer service is full of machine
learning problems
● Shadow mode deployments
● Machine learning, faster

More Related Content

What's hot

Research Updates from Rasa: Transformers in NLU and Dialogue
Research Updates from Rasa: Transformers in NLU and DialogueResearch Updates from Rasa: Transformers in NLU and Dialogue
Research Updates from Rasa: Transformers in NLU and Dialogue
Rasa Technologies
 
Writing Good User Stories (Hint: It's not about writing)
Writing Good User Stories (Hint: It's not about writing)Writing Good User Stories (Hint: It's not about writing)
Writing Good User Stories (Hint: It's not about writing)
one80
 
Shaking up Sales
Shaking up SalesShaking up Sales
Shaking up Sales
Giorgio Buccilli
 
Stories, defects and tasks
Stories, defects and tasksStories, defects and tasks
Stories, defects and tasksWalther Lalk
 
Thoughts on machine translation
Thoughts on machine translationThoughts on machine translation
Thoughts on machine translation
Lucía Guerrero
 
How Our Team Uses Rasa to Learn from Real Conversations | Rasa Summit
How Our Team Uses Rasa to Learn from Real Conversations | Rasa SummitHow Our Team Uses Rasa to Learn from Real Conversations | Rasa Summit
How Our Team Uses Rasa to Learn from Real Conversations | Rasa Summit
Rasa Technologies
 
How to Humanize Customer Experience in this Robotic Age
How to Humanize Customer Experience in this Robotic AgeHow to Humanize Customer Experience in this Robotic Age
How to Humanize Customer Experience in this Robotic Age
Niharika Gupta
 
Phone Survey Software: The All You Can Know Guide
Phone Survey Software: The All You Can Know GuidePhone Survey Software: The All You Can Know Guide
Phone Survey Software: The All You Can Know Guide
Niharika Gupta
 

What's hot (8)

Research Updates from Rasa: Transformers in NLU and Dialogue
Research Updates from Rasa: Transformers in NLU and DialogueResearch Updates from Rasa: Transformers in NLU and Dialogue
Research Updates from Rasa: Transformers in NLU and Dialogue
 
Writing Good User Stories (Hint: It's not about writing)
Writing Good User Stories (Hint: It's not about writing)Writing Good User Stories (Hint: It's not about writing)
Writing Good User Stories (Hint: It's not about writing)
 
Shaking up Sales
Shaking up SalesShaking up Sales
Shaking up Sales
 
Stories, defects and tasks
Stories, defects and tasksStories, defects and tasks
Stories, defects and tasks
 
Thoughts on machine translation
Thoughts on machine translationThoughts on machine translation
Thoughts on machine translation
 
How Our Team Uses Rasa to Learn from Real Conversations | Rasa Summit
How Our Team Uses Rasa to Learn from Real Conversations | Rasa SummitHow Our Team Uses Rasa to Learn from Real Conversations | Rasa Summit
How Our Team Uses Rasa to Learn from Real Conversations | Rasa Summit
 
How to Humanize Customer Experience in this Robotic Age
How to Humanize Customer Experience in this Robotic AgeHow to Humanize Customer Experience in this Robotic Age
How to Humanize Customer Experience in this Robotic Age
 
Phone Survey Software: The All You Can Know Guide
Phone Survey Software: The All You Can Know GuidePhone Survey Software: The All You Can Know Guide
Phone Survey Software: The All You Can Know Guide
 

Similar to Using machine learning for customer service (Data Talks Club)

it-Build-a-Chatbot-Proof-of-Concept-Executive-Brief-V1.pptx
it-Build-a-Chatbot-Proof-of-Concept-Executive-Brief-V1.pptxit-Build-a-Chatbot-Proof-of-Concept-Executive-Brief-V1.pptx
it-Build-a-Chatbot-Proof-of-Concept-Executive-Brief-V1.pptx
Carlos Olivares
 
Live predictions with schemaless data at scale. MLMU Kosice, Exponea
Live predictions with schemaless data at scale. MLMU Kosice, ExponeaLive predictions with schemaless data at scale. MLMU Kosice, Exponea
Live predictions with schemaless data at scale. MLMU Kosice, Exponea
Data Science Club
 
Demystifying ML/AI
Demystifying ML/AIDemystifying ML/AI
Demystifying ML/AI
Matthew Reynolds
 
How to Present Test Results to Inspire Action
How to Present Test Results to Inspire ActionHow to Present Test Results to Inspire Action
How to Present Test Results to Inspire Action
Jason Packer
 
Advanced Integrations of MuleSoft with ChatGTP
Advanced Integrations of MuleSoft with ChatGTPAdvanced Integrations of MuleSoft with ChatGTP
Advanced Integrations of MuleSoft with ChatGTP
NeerajKumar1965
 
08.2023_Marketo Engage User Groups Housekeeping Slides-merged.pdf
08.2023_Marketo Engage User Groups Housekeeping Slides-merged.pdf08.2023_Marketo Engage User Groups Housekeeping Slides-merged.pdf
08.2023_Marketo Engage User Groups Housekeeping Slides-merged.pdf
JackSiebert
 
How to Avoid Common Mistakes in Product by Cake Product Manager
How to Avoid Common Mistakes in Product by Cake Product ManagerHow to Avoid Common Mistakes in Product by Cake Product Manager
How to Avoid Common Mistakes in Product by Cake Product Manager
Product School
 
Intro to Data Analytics with Oscar's Director of Product
 Intro to Data Analytics with Oscar's Director of Product Intro to Data Analytics with Oscar's Director of Product
Intro to Data Analytics with Oscar's Director of Product
Product School
 
A Detailed Explanation On The Marking Criteria For Ielts Writing Task 1 ...
A Detailed Explanation On The Marking Criteria For Ielts Writing Task 1 ...A Detailed Explanation On The Marking Criteria For Ielts Writing Task 1 ...
A Detailed Explanation On The Marking Criteria For Ielts Writing Task 1 ...
Shannon Olsen
 
Pin the tail on the metric v01 2016 oct
Pin the tail on the metric v01 2016 octPin the tail on the metric v01 2016 oct
Pin the tail on the metric v01 2016 oct
Steven Martin
 
The five essential steps to building a data product
The five essential steps to building a data productThe five essential steps to building a data product
The five essential steps to building a data product
Birst
 
IoT Product Design and Prototyping
IoT Product Design and PrototypingIoT Product Design and Prototyping
IoT Product Design and Prototyping
Dr. Shivananda Koteshwar
 
#NoEstimates - Stop lying to yourself and your customers, and stop estimating
#NoEstimates - Stop lying to yourself and your customers, and stop estimating#NoEstimates - Stop lying to yourself and your customers, and stop estimating
#NoEstimates - Stop lying to yourself and your customers, and stop estimating
gerardbeckerleg
 
How to Master Product Management Case Studies by fmr Groupon PM
How to Master Product Management Case Studies by fmr Groupon PMHow to Master Product Management Case Studies by fmr Groupon PM
How to Master Product Management Case Studies by fmr Groupon PM
Product School
 
Dialogflow Chat Experiences Best Practices for Intent Detection // Measuring ...
Dialogflow Chat Experiences Best Practices for Intent Detection // Measuring ...Dialogflow Chat Experiences Best Practices for Intent Detection // Measuring ...
Dialogflow Chat Experiences Best Practices for Intent Detection // Measuring ...
Grid Dynamics
 
BAD Toolkit - Berlin Overview
BAD Toolkit - Berlin OverviewBAD Toolkit - Berlin Overview
BAD Toolkit - Berlin Overview
BAD-gcrossley
 
Practical AI use cases in Customer Service
Practical AI use cases in Customer ServicePractical AI use cases in Customer Service
Practical AI use cases in Customer Service
Denys Holovatyi
 
ODSC East 2020 : Continuous_learning_systems
ODSC East 2020 : Continuous_learning_systemsODSC East 2020 : Continuous_learning_systems
ODSC East 2020 : Continuous_learning_systems
Anuj Gupta
 
Machine Learning Product Managers Meetup Event
Machine Learning Product Managers Meetup EventMachine Learning Product Managers Meetup Event
Machine Learning Product Managers Meetup Event
Benjamin Schulte
 
Doing Analytics Right - Designing and Automating Analytics
Doing Analytics Right - Designing and Automating AnalyticsDoing Analytics Right - Designing and Automating Analytics
Doing Analytics Right - Designing and Automating Analytics
Tasktop
 

Similar to Using machine learning for customer service (Data Talks Club) (20)

it-Build-a-Chatbot-Proof-of-Concept-Executive-Brief-V1.pptx
it-Build-a-Chatbot-Proof-of-Concept-Executive-Brief-V1.pptxit-Build-a-Chatbot-Proof-of-Concept-Executive-Brief-V1.pptx
it-Build-a-Chatbot-Proof-of-Concept-Executive-Brief-V1.pptx
 
Live predictions with schemaless data at scale. MLMU Kosice, Exponea
Live predictions with schemaless data at scale. MLMU Kosice, ExponeaLive predictions with schemaless data at scale. MLMU Kosice, Exponea
Live predictions with schemaless data at scale. MLMU Kosice, Exponea
 
Demystifying ML/AI
Demystifying ML/AIDemystifying ML/AI
Demystifying ML/AI
 
How to Present Test Results to Inspire Action
How to Present Test Results to Inspire ActionHow to Present Test Results to Inspire Action
How to Present Test Results to Inspire Action
 
Advanced Integrations of MuleSoft with ChatGTP
Advanced Integrations of MuleSoft with ChatGTPAdvanced Integrations of MuleSoft with ChatGTP
Advanced Integrations of MuleSoft with ChatGTP
 
08.2023_Marketo Engage User Groups Housekeeping Slides-merged.pdf
08.2023_Marketo Engage User Groups Housekeeping Slides-merged.pdf08.2023_Marketo Engage User Groups Housekeeping Slides-merged.pdf
08.2023_Marketo Engage User Groups Housekeeping Slides-merged.pdf
 
How to Avoid Common Mistakes in Product by Cake Product Manager
How to Avoid Common Mistakes in Product by Cake Product ManagerHow to Avoid Common Mistakes in Product by Cake Product Manager
How to Avoid Common Mistakes in Product by Cake Product Manager
 
Intro to Data Analytics with Oscar's Director of Product
 Intro to Data Analytics with Oscar's Director of Product Intro to Data Analytics with Oscar's Director of Product
Intro to Data Analytics with Oscar's Director of Product
 
A Detailed Explanation On The Marking Criteria For Ielts Writing Task 1 ...
A Detailed Explanation On The Marking Criteria For Ielts Writing Task 1 ...A Detailed Explanation On The Marking Criteria For Ielts Writing Task 1 ...
A Detailed Explanation On The Marking Criteria For Ielts Writing Task 1 ...
 
Pin the tail on the metric v01 2016 oct
Pin the tail on the metric v01 2016 octPin the tail on the metric v01 2016 oct
Pin the tail on the metric v01 2016 oct
 
The five essential steps to building a data product
The five essential steps to building a data productThe five essential steps to building a data product
The five essential steps to building a data product
 
IoT Product Design and Prototyping
IoT Product Design and PrototypingIoT Product Design and Prototyping
IoT Product Design and Prototyping
 
#NoEstimates - Stop lying to yourself and your customers, and stop estimating
#NoEstimates - Stop lying to yourself and your customers, and stop estimating#NoEstimates - Stop lying to yourself and your customers, and stop estimating
#NoEstimates - Stop lying to yourself and your customers, and stop estimating
 
How to Master Product Management Case Studies by fmr Groupon PM
How to Master Product Management Case Studies by fmr Groupon PMHow to Master Product Management Case Studies by fmr Groupon PM
How to Master Product Management Case Studies by fmr Groupon PM
 
Dialogflow Chat Experiences Best Practices for Intent Detection // Measuring ...
Dialogflow Chat Experiences Best Practices for Intent Detection // Measuring ...Dialogflow Chat Experiences Best Practices for Intent Detection // Measuring ...
Dialogflow Chat Experiences Best Practices for Intent Detection // Measuring ...
 
BAD Toolkit - Berlin Overview
BAD Toolkit - Berlin OverviewBAD Toolkit - Berlin Overview
BAD Toolkit - Berlin Overview
 
Practical AI use cases in Customer Service
Practical AI use cases in Customer ServicePractical AI use cases in Customer Service
Practical AI use cases in Customer Service
 
ODSC East 2020 : Continuous_learning_systems
ODSC East 2020 : Continuous_learning_systemsODSC East 2020 : Continuous_learning_systems
ODSC East 2020 : Continuous_learning_systems
 
Machine Learning Product Managers Meetup Event
Machine Learning Product Managers Meetup EventMachine Learning Product Managers Meetup Event
Machine Learning Product Managers Meetup Event
 
Doing Analytics Right - Designing and Automating Analytics
Doing Analytics Right - Designing and Automating AnalyticsDoing Analytics Right - Designing and Automating Analytics
Doing Analytics Right - Designing and Automating Analytics
 

More from Neal Lathia

AI & Personalised Experiences
AI & Personalised ExperiencesAI & Personalised Experiences
AI & Personalised Experiences
Neal Lathia
 
Opportunities & Challenges in Personalised Travel
Opportunities & Challenges in Personalised TravelOpportunities & Challenges in Personalised Travel
Opportunities & Challenges in Personalised Travel
Neal Lathia
 
Bootstrapping a Destination Recommendation Engine
Bootstrapping a Destination Recommendation EngineBootstrapping a Destination Recommendation Engine
Bootstrapping a Destination Recommendation Engine
Neal Lathia
 
Machine Learning for Product Managers
Machine Learning for Product ManagersMachine Learning for Product Managers
Machine Learning for Product Managers
Neal Lathia
 
Mining Smartphone Data (with Python)
Mining Smartphone Data (with Python)Mining Smartphone Data (with Python)
Mining Smartphone Data (with Python)
Neal Lathia
 
Happier and Healthier with Smartphone Data
Happier and Healthier with Smartphone DataHappier and Healthier with Smartphone Data
Happier and Healthier with Smartphone Data
Neal Lathia
 
Data Science in Digital Health
Data Science in Digital HealthData Science in Digital Health
Data Science in Digital Health
Neal Lathia
 
Using Smartphones to Measure (and Intervene in) Daily Life
Using Smartphones to Measure (and Intervene in) Daily LifeUsing Smartphones to Measure (and Intervene in) Daily Life
Using Smartphones to Measure (and Intervene in) Daily Life
Neal Lathia
 
Analysing Daily Behaviours with Large-Scale Smartphone Data
Analysing Daily Behaviours with Large-Scale Smartphone DataAnalysing Daily Behaviours with Large-Scale Smartphone Data
Analysing Daily Behaviours with Large-Scale Smartphone Data
Neal Lathia
 
Cambridge Quantified Self Meetup
Cambridge Quantified Self MeetupCambridge Quantified Self Meetup
Cambridge Quantified Self Meetup
Neal Lathia
 
Data Science in #mHealth
Data Science in #mHealthData Science in #mHealth
Data Science in #mHealth
Neal Lathia
 
Tube Star: Crowd-Sourced Experiences on Public Transport
Tube Star: Crowd-Sourced Experiences on Public Transport Tube Star: Crowd-Sourced Experiences on Public Transport
Tube Star: Crowd-Sourced Experiences on Public Transport
Neal Lathia
 
Emotion Sense: From Design to Deployment
Emotion Sense: From Design to DeploymentEmotion Sense: From Design to Deployment
Emotion Sense: From Design to Deployment
Neal Lathia
 
Opportunities and Challenges of Using Smartphones for Health Monitoring and I...
Opportunities and Challenges of Using Smartphones for Health Monitoring and I...Opportunities and Challenges of Using Smartphones for Health Monitoring and I...
Opportunities and Challenges of Using Smartphones for Health Monitoring and I...
Neal Lathia
 
Using Smartphones to Research Daily Life
Using Smartphones to Research Daily LifeUsing Smartphones to Research Daily Life
Using Smartphones to Research Daily LifeNeal Lathia
 
The Ubhave Framework
The Ubhave FrameworkThe Ubhave Framework
The Ubhave Framework
Neal Lathia
 
Contextual Dissonance: Design Bias in Sensor-Based Experience Sampling Methods
Contextual Dissonance: Design Bias in Sensor-Based Experience Sampling MethodsContextual Dissonance: Design Bias in Sensor-Based Experience Sampling Methods
Contextual Dissonance: Design Bias in Sensor-Based Experience Sampling Methods
Neal Lathia
 
The Ubhave Project (Part 1/2)
The Ubhave Project (Part 1/2)The Ubhave Project (Part 1/2)
The Ubhave Project (Part 1/2)
Neal Lathia
 
Open Source Smartphone Libraries for Computational Social Science
Open Source Smartphone Libraries for Computational Social Science Open Source Smartphone Libraries for Computational Social Science
Open Source Smartphone Libraries for Computational Social Science
Neal Lathia
 
Building Systems to Capture, Measure, and Use Emotions and Personality
Building Systems to Capture, Measure, and Use Emotions and PersonalityBuilding Systems to Capture, Measure, and Use Emotions and Personality
Building Systems to Capture, Measure, and Use Emotions and Personality
Neal Lathia
 

More from Neal Lathia (20)

AI & Personalised Experiences
AI & Personalised ExperiencesAI & Personalised Experiences
AI & Personalised Experiences
 
Opportunities & Challenges in Personalised Travel
Opportunities & Challenges in Personalised TravelOpportunities & Challenges in Personalised Travel
Opportunities & Challenges in Personalised Travel
 
Bootstrapping a Destination Recommendation Engine
Bootstrapping a Destination Recommendation EngineBootstrapping a Destination Recommendation Engine
Bootstrapping a Destination Recommendation Engine
 
Machine Learning for Product Managers
Machine Learning for Product ManagersMachine Learning for Product Managers
Machine Learning for Product Managers
 
Mining Smartphone Data (with Python)
Mining Smartphone Data (with Python)Mining Smartphone Data (with Python)
Mining Smartphone Data (with Python)
 
Happier and Healthier with Smartphone Data
Happier and Healthier with Smartphone DataHappier and Healthier with Smartphone Data
Happier and Healthier with Smartphone Data
 
Data Science in Digital Health
Data Science in Digital HealthData Science in Digital Health
Data Science in Digital Health
 
Using Smartphones to Measure (and Intervene in) Daily Life
Using Smartphones to Measure (and Intervene in) Daily LifeUsing Smartphones to Measure (and Intervene in) Daily Life
Using Smartphones to Measure (and Intervene in) Daily Life
 
Analysing Daily Behaviours with Large-Scale Smartphone Data
Analysing Daily Behaviours with Large-Scale Smartphone DataAnalysing Daily Behaviours with Large-Scale Smartphone Data
Analysing Daily Behaviours with Large-Scale Smartphone Data
 
Cambridge Quantified Self Meetup
Cambridge Quantified Self MeetupCambridge Quantified Self Meetup
Cambridge Quantified Self Meetup
 
Data Science in #mHealth
Data Science in #mHealthData Science in #mHealth
Data Science in #mHealth
 
Tube Star: Crowd-Sourced Experiences on Public Transport
Tube Star: Crowd-Sourced Experiences on Public Transport Tube Star: Crowd-Sourced Experiences on Public Transport
Tube Star: Crowd-Sourced Experiences on Public Transport
 
Emotion Sense: From Design to Deployment
Emotion Sense: From Design to DeploymentEmotion Sense: From Design to Deployment
Emotion Sense: From Design to Deployment
 
Opportunities and Challenges of Using Smartphones for Health Monitoring and I...
Opportunities and Challenges of Using Smartphones for Health Monitoring and I...Opportunities and Challenges of Using Smartphones for Health Monitoring and I...
Opportunities and Challenges of Using Smartphones for Health Monitoring and I...
 
Using Smartphones to Research Daily Life
Using Smartphones to Research Daily LifeUsing Smartphones to Research Daily Life
Using Smartphones to Research Daily Life
 
The Ubhave Framework
The Ubhave FrameworkThe Ubhave Framework
The Ubhave Framework
 
Contextual Dissonance: Design Bias in Sensor-Based Experience Sampling Methods
Contextual Dissonance: Design Bias in Sensor-Based Experience Sampling MethodsContextual Dissonance: Design Bias in Sensor-Based Experience Sampling Methods
Contextual Dissonance: Design Bias in Sensor-Based Experience Sampling Methods
 
The Ubhave Project (Part 1/2)
The Ubhave Project (Part 1/2)The Ubhave Project (Part 1/2)
The Ubhave Project (Part 1/2)
 
Open Source Smartphone Libraries for Computational Social Science
Open Source Smartphone Libraries for Computational Social Science Open Source Smartphone Libraries for Computational Social Science
Open Source Smartphone Libraries for Computational Social Science
 
Building Systems to Capture, Measure, and Use Emotions and Personality
Building Systems to Capture, Measure, and Use Emotions and PersonalityBuilding Systems to Capture, Measure, and Use Emotions and Personality
Building Systems to Capture, Measure, and Use Emotions and Personality
 

Recently uploaded

CME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional ElectiveCME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional Elective
karthi keyan
 
ASME IX(9) 2007 Full Version .pdf
ASME IX(9)  2007 Full Version       .pdfASME IX(9)  2007 Full Version       .pdf
ASME IX(9) 2007 Full Version .pdf
AhmedHussein950959
 
MCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdfMCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdf
Osamah Alsalih
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
JoytuBarua2
 
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&BDesign and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Sreedhar Chowdam
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
Kamal Acharya
 
Fundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptxFundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptx
manasideore6
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
Kamal Acharya
 
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdfHybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
fxintegritypublishin
 
weather web application report.pdf
weather web application report.pdfweather web application report.pdf
weather web application report.pdf
Pratik Pawar
 
English lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdfEnglish lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdf
BrazilAccount1
 
Standard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - NeometrixStandard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - Neometrix
Neometrix_Engineering_Pvt_Ltd
 
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
obonagu
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
R&R Consult
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
bakpo1
 
AP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specificAP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specific
BrazilAccount1
 
block diagram and signal flow graph representation
block diagram and signal flow graph representationblock diagram and signal flow graph representation
block diagram and signal flow graph representation
Divya Somashekar
 
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
MdTanvirMahtab2
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
Kerry Sado
 
HYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generationHYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generation
Robbie Edward Sayers
 

Recently uploaded (20)

CME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional ElectiveCME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional Elective
 
ASME IX(9) 2007 Full Version .pdf
ASME IX(9)  2007 Full Version       .pdfASME IX(9)  2007 Full Version       .pdf
ASME IX(9) 2007 Full Version .pdf
 
MCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdfMCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdf
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
 
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&BDesign and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
 
Fundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptxFundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptx
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
 
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdfHybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
 
weather web application report.pdf
weather web application report.pdfweather web application report.pdf
weather web application report.pdf
 
English lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdfEnglish lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdf
 
Standard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - NeometrixStandard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - Neometrix
 
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
 
AP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specificAP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specific
 
block diagram and signal flow graph representation
block diagram and signal flow graph representationblock diagram and signal flow graph representation
block diagram and signal flow graph representation
 
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
 
HYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generationHYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generation
 

Using machine learning for customer service (Data Talks Club)

  • 1. Using machine learning for customer service @neal_lathia January 2021 - Data Talks Club
  • 2. Monzo is a UK bank
  • 3. Rewind to 2019, when we started a project on chat bots ● What were we trying to achieve? How can we measure this? ● How can we design, build, and iterate on a bot? How can we demonstrate that it is working? ● How did we ship this system? How did we trade off between system design and machine learning?
  • 5. What do customers need and how do we help them? Customer needs information Customer needs an action Self-Service is available Service via humans Does my card work abroad? Can I have a bank statement? What is the status of the cheque that I sent you? I don’t recognise this transaction, can you investigate?
  • 6. Self-service is the fastest way to get help for a subset of queries and enables agents to focus on where they are really needed Customer needs information Customer needs an action Self-Service is available Service via humans Does my card work abroad? Can I have a bank statement? What is the status of the cheque that I sent you? I don’t recognise this transaction, can you investigate?
  • 7. How can we get a sense of the size of these problems? Several different metrics available: ● How many customers are getting in touch? (Overall contact rate) ● How many customers are getting in touch about specific topics? (Contact rate by conversation tag) ● How many customers are using self-service product features compared to those getting in touch about the problem?
  • 8. Iteration #1: Search for article recommendations
  • 9. Our app already had a search feature ⬅ Search query ⬅ Used to be an entry point to chat ⬅ Help article search results
  • 10. ⬅ Prompt for information ⬅ Recommendations (search results) ⬅ Self-selected chat closure ⬅ The chat first turn (this could be more than one message) Our first experiment: add search into chat
  • 11. Why take this approach? ● The product team could focus on adding all of the components to make this work ○ Typing detection ○ New types of chat bubbles ○ Auto-close if the customer abandons the chat for a long time ● The analysts could set up the new metrics for this system ○ Self-selected (& auto) chat close rate ○ Customers proceeding through to chat ● We didn’t need to ship any new ML systems ○ We ran our first BERT experiment, which showed a +9% increase in self-service rate compared to our in-house model
  • 12. We ran our first BERT experiment ● We didn’t need to ship any new ML systems ● The product team could focus on adding all of the components to make this work ○ Typing detection ○ New types of chat bubbles ○ Auto-close if the customer abandons the chat for a long time ● The analysts could set up the new metrics for this system ○ Self-selected (& auto) chat close rate ○ Customers proceeding through to chat
  • 14. Why change the approach? Article recommendation is working okay, but it still requires customers to put in the effort to find out if the suggestions are relevant We could identify specific topics that our customers talked to us about that were: ● Valuable - we get a lot of queries about the topic ● Eligible for self service - there is a simple and direct call to action ● Predictable - we could train classifiers to detect the topic!
  • 15. ⬅ Prompt for information ⬅ Predefined answer with a direct call to action ⬅ Self-selected chat closure ⬅ The chat first turn (this could be more than one message)
  • 16. How can we test this? We did not want to build out an entire system (weeks of work) before validating whether this could action work. So we started with one topic. ● Fine-tune a BERT model based on tagged conversations ● Deploy this model in shadow mode to monitor the prediction quality ● If the accuracy looks good, deploy this topic! Combine giving answers (for this topic) with giving article suggestions (for everything else).
  • 17. Offline validation Precision Out-of-sample precision Card replacement 0.86 0.69 Card not arrived 0.93 0.60 Update details 0.93 0.52 😫 ... ... ... We deployed a number of these models in shadow prediction mode. This allows it to make predictions on live data, but it’s not sending customers any answers.
  • 18. Large set of noisy tags Small set of re-labelled examples Hi there! How can I transfer money into a savings pot?
  • 20. Why consider this angle? We wanted to collaborate! ● Having one huge multi-class model means retraining everything when we want to add a new topic ● It is very difficult for two Machine Learning Scientists to work on one model collaboratively
  • 21. A bank built with thousands of microservices https://qconlondon.com/london2020/presentation/modern-banking-1500-microservices
  • 22. Orchestrator Classifier for A Classifier for B Classifier for C Winner!
  • 23. Orchestrator Classifier for A Classifier for B Classifier for C Winner! Rules that determine eligibility for topic C Classifier for D
  • 25. Some extra writing about these topics ● Customer service is full of machine learning problems ● Shadow mode deployments ● Machine learning, faster