How to Effectively Test Your Chatbot | Rasa Summit

Rasa Technologies
Rasa TechnologiesRasa Technologies
How to
”Effectively” ”Test”
your Chatbot
Soumya Mukherjee
Director QA, DevOps & AIML
Apty.IO
How are we doing our QA today
• Testing is Blackbox for testers
• Mostly manual testing done in organization
• Conversational flow testing
• Small Talk
• Fallback checks
• Integrations
• Automation done on UI and API layer
• Testing is mostly done on same training data
• Models are trained by engineers and are not being
monitored by QA
• There are analytics tools available to monitor but it
needs technical expertise for the QA
• Result : More than 90% times bot breaks (no one
understands when it will break), most of them fallback
and get stuck - once bot is stuck it is stuck
Q ?
A
What are the issues in QA ?
• Bots are evolving and continuous story creation is a problem
• No tool manage story coverage
• Your training data may not correspond to new stories or vice versa (it’s a
mismatch) – most org keep training on the same data
• Most automation tools offers record and playback (My stories are
already written how to port is the question)
What are the issues in QA ?
• No (unified) centralized dashboard present where QA can check (everything is quite scattered)
• Intent Matching
• Entity Testing – Slot identification
• Entity Testing – Entity Validation
• Confidence score
• Confusion Matrix along with Precision/Recall/F1-Score
• No easy way to reset the failed bot !
• Bot versioning is a mess and A/B testing becomes difficult
• Multilingual bot QA is a challenge (have to make 2 separate bots)
• High confidence score is also a problem as your bot will only predict same thing (if the data is same
for multiple intents then it will predict the one with highest confidence score – may be incorrect)
How to make sure your bot never breaks ?
How to make your test effective ?
• Create scenarios for happy path, contextual questions, digressions, domain
specific questions, stateless conversations
• Map proper entities for common scenarios (example bus fee, tuition fee) –
flow should change with entities in the stories
• Automated tests should consume all stories and run them each time as part
of regression testing
• Story coverage visualization
• For Manual Testing use Bot emulation product (like RasaX, Botfront) to test
How to make your test effective ?
• Central dashboarding including :
• Confusion matrix, Precision, Recall and F1-Score
• Cumulative accuracy profile
• Cross validation results
• Perform Exhaustive testing (bot resiliency), Integration checks across
platforms, Webhooks
• Perform fault tolerance testing by performing performance testing (bot
response, session management) & security testing (api interaction,
typing speed check, punctuations, typo errors)
Other KPIs to track
• Activity Volume
• Bounce rate
• Retention rate
• Open sessions count
• Session times (conversation length)
• Goal completion rate
• User feedback (sentiments)
• Fallback rate (Confusion rate, reset rate & Human takeover rate)
Thanks
@QASoumya
Linkedin.com/in/mukherjeesoumya
1 of 8

Recommended

Building an End-to-End Test Automation Pipeline for Conversational AI | Rasa ... by
Building an End-to-End Test Automation Pipeline for Conversational AI | Rasa ...Building an End-to-End Test Automation Pipeline for Conversational AI | Rasa ...
Building an End-to-End Test Automation Pipeline for Conversational AI | Rasa ...Rasa Technologies
156 views4 slides
Building an AI Assistant Factory - Rasa Summit 2021 by
Building an AI Assistant Factory - Rasa Summit 2021Building an AI Assistant Factory - Rasa Summit 2021
Building an AI Assistant Factory - Rasa Summit 2021Rasa Technologies
119 views18 slides
Six Steps to Conversation Driven Development by
Six Steps to Conversation Driven DevelopmentSix Steps to Conversation Driven Development
Six Steps to Conversation Driven DevelopmentRasa Technologies
355 views37 slides
BOTS TESTING BOTS: From manual to automated testing for conversational AI by
BOTS TESTING BOTS: From manual to automated testing for conversational AIBOTS TESTING BOTS: From manual to automated testing for conversational AI
BOTS TESTING BOTS: From manual to automated testing for conversational AIDatabricks
256 views12 slides
Deploy your Rasa Chatbots like a Boss with DevOps | Rasa Summit 2021 by
Deploy your Rasa Chatbots like a Boss with DevOps | Rasa Summit 2021Deploy your Rasa Chatbots like a Boss with DevOps | Rasa Summit 2021
Deploy your Rasa Chatbots like a Boss with DevOps | Rasa Summit 2021Rasa Technologies
184 views24 slides
Rasa Open Source - What's next? by
Rasa Open Source - What's next?Rasa Open Source - What's next?
Rasa Open Source - What's next?Rasa Technologies
155 views26 slides

More Related Content

What's hot

AI and Python: Developing a Conversational Interface using Python by
AI and Python: Developing a Conversational Interface using PythonAI and Python: Developing a Conversational Interface using Python
AI and Python: Developing a Conversational Interface using Pythonamyiris
28.7K views98 slides
Introduction to Aspect Oriented Programming by
Introduction to Aspect Oriented ProgrammingIntroduction to Aspect Oriented Programming
Introduction to Aspect Oriented ProgrammingYan Cui
8.6K views53 slides
Chatbot Tutorial - Create your first bot with Xatkit by
Chatbot Tutorial - Create your first bot with Xatkit Chatbot Tutorial - Create your first bot with Xatkit
Chatbot Tutorial - Create your first bot with Xatkit Jordi Cabot
1.8K views41 slides
Aspect Oriented Programing - Introduction by
Aspect Oriented Programing - IntroductionAspect Oriented Programing - Introduction
Aspect Oriented Programing - IntroductionVenkaiah Chowdary Koneru
1.1K views13 slides
Code Review tool for personal effectiveness and waste analysis by
Code Review tool for personal effectiveness and waste analysisCode Review tool for personal effectiveness and waste analysis
Code Review tool for personal effectiveness and waste analysisMikalai Alimenkou
1.4K views24 slides
Webinar: How to Use Integrated Version Control in Rasa X by
Webinar: How to Use Integrated Version Control in Rasa XWebinar: How to Use Integrated Version Control in Rasa X
Webinar: How to Use Integrated Version Control in Rasa XRasa Technologies
247 views25 slides

What's hot(20)

AI and Python: Developing a Conversational Interface using Python by amyiris
AI and Python: Developing a Conversational Interface using PythonAI and Python: Developing a Conversational Interface using Python
AI and Python: Developing a Conversational Interface using Python
amyiris28.7K views
Introduction to Aspect Oriented Programming by Yan Cui
Introduction to Aspect Oriented ProgrammingIntroduction to Aspect Oriented Programming
Introduction to Aspect Oriented Programming
Yan Cui8.6K views
Chatbot Tutorial - Create your first bot with Xatkit by Jordi Cabot
Chatbot Tutorial - Create your first bot with Xatkit Chatbot Tutorial - Create your first bot with Xatkit
Chatbot Tutorial - Create your first bot with Xatkit
Jordi Cabot1.8K views
Code Review tool for personal effectiveness and waste analysis by Mikalai Alimenkou
Code Review tool for personal effectiveness and waste analysisCode Review tool for personal effectiveness and waste analysis
Code Review tool for personal effectiveness and waste analysis
Mikalai Alimenkou1.4K views
Webinar: How to Use Integrated Version Control in Rasa X by Rasa Technologies
Webinar: How to Use Integrated Version Control in Rasa XWebinar: How to Use Integrated Version Control in Rasa X
Webinar: How to Use Integrated Version Control in Rasa X
Rasa Technologies247 views
DevOps & Technical Agility: From Theory to Practice by Lemi Orhan Ergin
DevOps & Technical Agility: From Theory to PracticeDevOps & Technical Agility: From Theory to Practice
DevOps & Technical Agility: From Theory to Practice
Lemi Orhan Ergin1.3K views
Developing Intelligent Chatbots using RASA, OW2con'19, June 12-13, 2019 in Paris by OW2
Developing Intelligent Chatbots using RASA, OW2con'19, June 12-13, 2019 in ParisDeveloping Intelligent Chatbots using RASA, OW2con'19, June 12-13, 2019 in Paris
Developing Intelligent Chatbots using RASA, OW2con'19, June 12-13, 2019 in Paris
OW2885 views
When you get lost in api testing #ForumPHP by Paula Čučuk
When you get lost in api testing #ForumPHPWhen you get lost in api testing #ForumPHP
When you get lost in api testing #ForumPHP
Paula Čučuk879 views
Best Practices for a Repeatable Shift-Left Commitment by Applause
Best Practices for a Repeatable Shift-Left CommitmentBest Practices for a Repeatable Shift-Left Commitment
Best Practices for a Repeatable Shift-Left Commitment
Applause88 views
Skillshare - From Noob to Tech CEO - nov 7th, 2011 by Kareem Amin
Skillshare - From Noob to Tech CEO - nov 7th, 2011Skillshare - From Noob to Tech CEO - nov 7th, 2011
Skillshare - From Noob to Tech CEO - nov 7th, 2011
Kareem Amin430 views
Kaiser Permanente CSUN 2018 by Mark Stimson
Kaiser Permanente CSUN 2018Kaiser Permanente CSUN 2018
Kaiser Permanente CSUN 2018
Mark Stimson318 views
The 7 minute accessibility assessment and app rating system by Aidan Tierney
The 7 minute accessibility assessment and app rating systemThe 7 minute accessibility assessment and app rating system
The 7 minute accessibility assessment and app rating system
Aidan Tierney2.9K views
Introduction to Aspect Oriented Programming (DDD South West 4.0) by Yan Cui
Introduction to Aspect Oriented Programming (DDD South West 4.0)Introduction to Aspect Oriented Programming (DDD South West 4.0)
Introduction to Aspect Oriented Programming (DDD South West 4.0)
Yan Cui3.8K views
Writing Testable Code in SharePoint by Tim McCarthy
Writing Testable Code in SharePointWriting Testable Code in SharePoint
Writing Testable Code in SharePoint
Tim McCarthy686 views
Research Updates from Rasa: Transformers in NLU and Dialogue by Rasa Technologies
Research Updates from Rasa: Transformers in NLU and DialogueResearch Updates from Rasa: Transformers in NLU and Dialogue
Research Updates from Rasa: Transformers in NLU and Dialogue
Rasa Technologies712 views
Low-code vs Model-Driven Engineering by Jordi Cabot
Low-code vs Model-Driven EngineeringLow-code vs Model-Driven Engineering
Low-code vs Model-Driven Engineering
Jordi Cabot16.5K views
Android application development part2 by Mayank Bhatt
Android application development part2Android application development part2
Android application development part2
Mayank Bhatt42 views
Elements of a Test Framework by SmartBear
Elements of a Test FrameworkElements of a Test Framework
Elements of a Test Framework
SmartBear1K views

Similar to How to Effectively Test Your Chatbot | Rasa Summit

Thomas Haver - Mobile Testing.pdf by
Thomas Haver - Mobile Testing.pdfThomas Haver - Mobile Testing.pdf
Thomas Haver - Mobile Testing.pdfQA or the Highway
2 views34 slides
QAorHighway2016 by
QAorHighway2016QAorHighway2016
QAorHighway2016Bhupesh Dahal
189 views33 slides
Creating testing tools to support development by
Creating testing tools to support developmentCreating testing tools to support development
Creating testing tools to support developmentChema del Barco
383 views53 slides
Test automation lesson by
Test automation lessonTest automation lesson
Test automation lessonSadaaki Emura
184 views24 slides
Test Automation Architecture That Works by Bhupesh Dahal by
Test Automation Architecture That Works by Bhupesh DahalTest Automation Architecture That Works by Bhupesh Dahal
Test Automation Architecture That Works by Bhupesh DahalQA or the Highway
1.2K views29 slides
Karishma Kolli – Myth Busters on Test Automation by
Karishma Kolli – Myth Busters on Test AutomationKarishma Kolli – Myth Busters on Test Automation
Karishma Kolli – Myth Busters on Test AutomationPractiTest
231 views19 slides

Similar to How to Effectively Test Your Chatbot | Rasa Summit(20)

Creating testing tools to support development by Chema del Barco
Creating testing tools to support developmentCreating testing tools to support development
Creating testing tools to support development
Chema del Barco383 views
Test Automation Architecture That Works by Bhupesh Dahal by QA or the Highway
Test Automation Architecture That Works by Bhupesh DahalTest Automation Architecture That Works by Bhupesh Dahal
Test Automation Architecture That Works by Bhupesh Dahal
QA or the Highway1.2K views
Karishma Kolli – Myth Busters on Test Automation by PractiTest
Karishma Kolli – Myth Busters on Test AutomationKarishma Kolli – Myth Busters on Test Automation
Karishma Kolli – Myth Busters on Test Automation
PractiTest231 views
CV_Sachin_11Years_Automation_Performance by Sachin Kodagali
CV_Sachin_11Years_Automation_PerformanceCV_Sachin_11Years_Automation_Performance
CV_Sachin_11Years_Automation_Performance
Sachin Kodagali498 views
Automated Testing but like for PowerShell (April 2012) by Rob Reynolds
Automated Testing but like for PowerShell (April 2012)Automated Testing but like for PowerShell (April 2012)
Automated Testing but like for PowerShell (April 2012)
Rob Reynolds639 views
Test team dynamics, Антон Мужайло by Sigma Software
Test team dynamics, Антон МужайлоTest team dynamics, Антон Мужайло
Test team dynamics, Антон Мужайло
Sigma Software191 views
Testing Conversational AI by Shama Ugale
Testing Conversational AITesting Conversational AI
Testing Conversational AI
Shama Ugale416 views
How to scale your Test Automation by Klaus Salchner
How to scale your Test AutomationHow to scale your Test Automation
How to scale your Test Automation
Klaus Salchner799 views
Why test automation projects are failing by Igor Khrol
Why test automation projects are failingWhy test automation projects are failing
Why test automation projects are failing
Igor Khrol12.9K views
Start with passing tests (tdd for bugs) v0.5 (22 sep 2016) by Dinis Cruz
Start with passing tests (tdd for bugs) v0.5 (22 sep 2016)Start with passing tests (tdd for bugs) v0.5 (22 sep 2016)
Start with passing tests (tdd for bugs) v0.5 (22 sep 2016)
Dinis Cruz1.5K views
SauceCon 2017: Making Your Mobile App Automatable by Sauce Labs
SauceCon 2017: Making Your Mobile App AutomatableSauceCon 2017: Making Your Mobile App Automatable
SauceCon 2017: Making Your Mobile App Automatable
Sauce Labs181 views
A Sampling of Tools by Dawn Code
A Sampling of ToolsA Sampling of Tools
A Sampling of Tools
Dawn Code46 views
Winning the battle against Automated testing by Elena Laskavaia
Winning the battle against Automated testingWinning the battle against Automated testing
Winning the battle against Automated testing
Elena Laskavaia65 views
How to Go Codeless for Automated Mobile App Testing by Applause
How to Go Codeless for Automated Mobile App TestingHow to Go Codeless for Automated Mobile App Testing
How to Go Codeless for Automated Mobile App Testing
Applause76 views
Automated Acceptance Test Practices and Pitfalls by Wyn B. Van Devanter
Automated Acceptance Test Practices and PitfallsAutomated Acceptance Test Practices and Pitfalls
Automated Acceptance Test Practices and Pitfalls
Wyn B. Van Devanter2.6K views

More from Rasa Technologies

Beyond Sentiment Analysis: Creating Engaging Conversational Experiences throu... by
Beyond Sentiment Analysis: Creating Engaging Conversational Experiences throu...Beyond Sentiment Analysis: Creating Engaging Conversational Experiences throu...
Beyond Sentiment Analysis: Creating Engaging Conversational Experiences throu...Rasa Technologies
250 views22 slides
End-to-end dialogue systems, or a feature which wasn’t meant to happen | Rasa... by
End-to-end dialogue systems, or a feature which wasn’t meant to happen | Rasa...End-to-end dialogue systems, or a feature which wasn’t meant to happen | Rasa...
End-to-end dialogue systems, or a feature which wasn’t meant to happen | Rasa...Rasa Technologies
194 views17 slides
Voice First: Ready Your Content to Serve 50% of Global Searches | Rasa Summit... by
Voice First: Ready Your Content to Serve 50% of Global Searches | Rasa Summit...Voice First: Ready Your Content to Serve 50% of Global Searches | Rasa Summit...
Voice First: Ready Your Content to Serve 50% of Global Searches | Rasa Summit...Rasa Technologies
112 views48 slides
The missing link: How AI can help create a safer society and better businesse... by
The missing link: How AI can help create a safer society and better businesse...The missing link: How AI can help create a safer society and better businesse...
The missing link: How AI can help create a safer society and better businesse...Rasa Technologies
120 views14 slides
Boss - Bringing More Diversity to Tech | Rasa Summit by
Boss - Bringing More Diversity to Tech | Rasa SummitBoss - Bringing More Diversity to Tech | Rasa Summit
Boss - Bringing More Diversity to Tech | Rasa SummitRasa Technologies
116 views25 slides
How Our Team Uses Rasa to Learn from Real Conversations | Rasa Summit by
How Our Team Uses Rasa to Learn from Real Conversations | Rasa SummitHow Our Team Uses Rasa to Learn from Real Conversations | Rasa Summit
How Our Team Uses Rasa to Learn from Real Conversations | Rasa SummitRasa Technologies
136 views27 slides

More from Rasa Technologies(20)

Beyond Sentiment Analysis: Creating Engaging Conversational Experiences throu... by Rasa Technologies
Beyond Sentiment Analysis: Creating Engaging Conversational Experiences throu...Beyond Sentiment Analysis: Creating Engaging Conversational Experiences throu...
Beyond Sentiment Analysis: Creating Engaging Conversational Experiences throu...
Rasa Technologies250 views
End-to-end dialogue systems, or a feature which wasn’t meant to happen | Rasa... by Rasa Technologies
End-to-end dialogue systems, or a feature which wasn’t meant to happen | Rasa...End-to-end dialogue systems, or a feature which wasn’t meant to happen | Rasa...
End-to-end dialogue systems, or a feature which wasn’t meant to happen | Rasa...
Rasa Technologies194 views
Voice First: Ready Your Content to Serve 50% of Global Searches | Rasa Summit... by Rasa Technologies
Voice First: Ready Your Content to Serve 50% of Global Searches | Rasa Summit...Voice First: Ready Your Content to Serve 50% of Global Searches | Rasa Summit...
Voice First: Ready Your Content to Serve 50% of Global Searches | Rasa Summit...
Rasa Technologies112 views
The missing link: How AI can help create a safer society and better businesse... by Rasa Technologies
The missing link: How AI can help create a safer society and better businesse...The missing link: How AI can help create a safer society and better businesse...
The missing link: How AI can help create a safer society and better businesse...
Rasa Technologies120 views
Boss - Bringing More Diversity to Tech | Rasa Summit by Rasa Technologies
Boss - Bringing More Diversity to Tech | Rasa SummitBoss - Bringing More Diversity to Tech | Rasa Summit
Boss - Bringing More Diversity to Tech | Rasa Summit
Rasa Technologies116 views
How Our Team Uses Rasa to Learn from Real Conversations | Rasa Summit by Rasa Technologies
How Our Team Uses Rasa to Learn from Real Conversations | Rasa SummitHow Our Team Uses Rasa to Learn from Real Conversations | Rasa Summit
How Our Team Uses Rasa to Learn from Real Conversations | Rasa Summit
Rasa Technologies136 views
Applying Conversational AI in the Enterprise by Rasa Technologies
Applying Conversational AI in the EnterpriseApplying Conversational AI in the Enterprise
Applying Conversational AI in the Enterprise
Rasa Technologies118 views
STAR: A Schema-Guided Dialog Dataset for Transfer Learning | Rasa Summit 2021 by Rasa Technologies
STAR: A Schema-Guided Dialog Dataset for Transfer Learning | Rasa Summit 2021 STAR: A Schema-Guided Dialog Dataset for Transfer Learning | Rasa Summit 2021
STAR: A Schema-Guided Dialog Dataset for Transfer Learning | Rasa Summit 2021
Rasa Technologies215 views
Continuous Improvement of Conversational AI in Production | Rasa Summit by Rasa Technologies
Continuous Improvement of Conversational AI in Production | Rasa SummitContinuous Improvement of Conversational AI in Production | Rasa Summit
Continuous Improvement of Conversational AI in Production | Rasa Summit
Rasa Technologies130 views
Ethnobots: Reimagining Chatbots as Ethnographic Research Tools | Rasa Summit ... by Rasa Technologies
Ethnobots: Reimagining Chatbots as Ethnographic Research Tools | Rasa Summit ...Ethnobots: Reimagining Chatbots as Ethnographic Research Tools | Rasa Summit ...
Ethnobots: Reimagining Chatbots as Ethnographic Research Tools | Rasa Summit ...
The State of Conversation Design - Designing for the Conversational Future by Rasa Technologies
The State of Conversation Design - Designing for the Conversational FutureThe State of Conversation Design - Designing for the Conversational Future
The State of Conversation Design - Designing for the Conversational Future
Rasa Technologies348 views
What’s next in CDD: Intent Clashes and Selective Confidence | Rasa Summit 2021 by Rasa Technologies
What’s next in CDD: Intent Clashes and Selective Confidence | Rasa Summit 2021What’s next in CDD: Intent Clashes and Selective Confidence | Rasa Summit 2021
What’s next in CDD: Intent Clashes and Selective Confidence | Rasa Summit 2021
Rasa Technologies170 views
Conversational Teams: Moving Fast at Scale | Rasa Summit 2021 by Rasa Technologies
Conversational Teams: Moving Fast at Scale | Rasa Summit 2021Conversational Teams: Moving Fast at Scale | Rasa Summit 2021
Conversational Teams: Moving Fast at Scale | Rasa Summit 2021
Rasa Technologies173 views
Rasa Developer Summit - Bing Liu - Interactive Learning of Task-Oriented Dial... by Rasa Technologies
Rasa Developer Summit - Bing Liu - Interactive Learning of Task-Oriented Dial...Rasa Developer Summit - Bing Liu - Interactive Learning of Task-Oriented Dial...
Rasa Developer Summit - Bing Liu - Interactive Learning of Task-Oriented Dial...
Rasa Technologies770 views
Rasa Developer Summit - Josh Converse, Dynamic Offset - Three Part Harmony: H... by Rasa Technologies
Rasa Developer Summit - Josh Converse, Dynamic Offset - Three Part Harmony: H...Rasa Developer Summit - Josh Converse, Dynamic Offset - Three Part Harmony: H...
Rasa Developer Summit - Josh Converse, Dynamic Offset - Three Part Harmony: H...
Rasa Developer Summit - Praneeth Gubbala, NLP Engineer, Sam's Club at Walmart... by Rasa Technologies
Rasa Developer Summit - Praneeth Gubbala, NLP Engineer, Sam's Club at Walmart...Rasa Developer Summit - Praneeth Gubbala, NLP Engineer, Sam's Club at Walmart...
Rasa Developer Summit - Praneeth Gubbala, NLP Engineer, Sam's Club at Walmart...
Rasa Technologies837 views
Rasa Developer Summit - Tom Bocklisch, Rasa - Product Updates from Rasa by Rasa Technologies
Rasa Developer Summit - Tom Bocklisch, Rasa - Product Updates from RasaRasa Developer Summit - Tom Bocklisch, Rasa - Product Updates from Rasa
Rasa Developer Summit - Tom Bocklisch, Rasa - Product Updates from Rasa
Rasa Technologies669 views
Rasa Developer Summit - Alan Nichol, Rasa - Welcome & Intro by Rasa Technologies
Rasa Developer Summit - Alan Nichol, Rasa - Welcome & IntroRasa Developer Summit - Alan Nichol, Rasa - Welcome & Intro
Rasa Developer Summit - Alan Nichol, Rasa - Welcome & Intro
Rasa Technologies470 views
Rasa Developer Summit - Juste Petraityte, Rasa - Rasa Community Updates & Out... by Rasa Technologies
Rasa Developer Summit - Juste Petraityte, Rasa - Rasa Community Updates & Out...Rasa Developer Summit - Juste Petraityte, Rasa - Rasa Community Updates & Out...
Rasa Developer Summit - Juste Petraityte, Rasa - Rasa Community Updates & Out...
Rasa Technologies597 views

Recently uploaded

SAP Automation Using Bar Code and FIORI.pdf by
SAP Automation Using Bar Code and FIORI.pdfSAP Automation Using Bar Code and FIORI.pdf
SAP Automation Using Bar Code and FIORI.pdfVirendra Rai, PMP
19 views38 slides
DALI Basics Course 2023 by
DALI Basics Course  2023DALI Basics Course  2023
DALI Basics Course 2023Ivory Egg
14 views12 slides
Info Session November 2023.pdf by
Info Session November 2023.pdfInfo Session November 2023.pdf
Info Session November 2023.pdfAleksandraKoprivica4
10 views15 slides
AMAZON PRODUCT RESEARCH.pdf by
AMAZON PRODUCT RESEARCH.pdfAMAZON PRODUCT RESEARCH.pdf
AMAZON PRODUCT RESEARCH.pdfJerikkLaureta
15 views13 slides
Java Platform Approach 1.0 - Picnic Meetup by
Java Platform Approach 1.0 - Picnic MeetupJava Platform Approach 1.0 - Picnic Meetup
Java Platform Approach 1.0 - Picnic MeetupRick Ossendrijver
25 views39 slides
TouchLog: Finger Micro Gesture Recognition Using Photo-Reflective Sensors by
TouchLog: Finger Micro Gesture Recognition  Using Photo-Reflective SensorsTouchLog: Finger Micro Gesture Recognition  Using Photo-Reflective Sensors
TouchLog: Finger Micro Gesture Recognition Using Photo-Reflective Sensorssugiuralab
15 views15 slides

Recently uploaded(20)

SAP Automation Using Bar Code and FIORI.pdf by Virendra Rai, PMP
SAP Automation Using Bar Code and FIORI.pdfSAP Automation Using Bar Code and FIORI.pdf
SAP Automation Using Bar Code and FIORI.pdf
DALI Basics Course 2023 by Ivory Egg
DALI Basics Course  2023DALI Basics Course  2023
DALI Basics Course 2023
Ivory Egg14 views
AMAZON PRODUCT RESEARCH.pdf by JerikkLaureta
AMAZON PRODUCT RESEARCH.pdfAMAZON PRODUCT RESEARCH.pdf
AMAZON PRODUCT RESEARCH.pdf
JerikkLaureta15 views
TouchLog: Finger Micro Gesture Recognition Using Photo-Reflective Sensors by sugiuralab
TouchLog: Finger Micro Gesture Recognition  Using Photo-Reflective SensorsTouchLog: Finger Micro Gesture Recognition  Using Photo-Reflective Sensors
TouchLog: Finger Micro Gesture Recognition Using Photo-Reflective Sensors
sugiuralab15 views
Igniting Next Level Productivity with AI-Infused Data Integration Workflows by Safe Software
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Safe Software225 views
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive by Network Automation Forum
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLiveAutomating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Data-centric AI and the convergence of data and model engineering: opportunit... by Paolo Missier
Data-centric AI and the convergence of data and model engineering:opportunit...Data-centric AI and the convergence of data and model engineering:opportunit...
Data-centric AI and the convergence of data and model engineering: opportunit...
Paolo Missier34 views
Unit 1_Lecture 2_Physical Design of IoT.pdf by StephenTec
Unit 1_Lecture 2_Physical Design of IoT.pdfUnit 1_Lecture 2_Physical Design of IoT.pdf
Unit 1_Lecture 2_Physical Design of IoT.pdf
StephenTec11 views
Business Analyst Series 2023 - Week 3 Session 5 by DianaGray10
Business Analyst Series 2023 -  Week 3 Session 5Business Analyst Series 2023 -  Week 3 Session 5
Business Analyst Series 2023 - Week 3 Session 5
DianaGray10209 views
Attacking IoT Devices from a Web Perspective - Linux Day by Simone Onofri
Attacking IoT Devices from a Web Perspective - Linux Day Attacking IoT Devices from a Web Perspective - Linux Day
Attacking IoT Devices from a Web Perspective - Linux Day
Simone Onofri15 views
1st parposal presentation.pptx by i238212
1st parposal presentation.pptx1st parposal presentation.pptx
1st parposal presentation.pptx
i2382129 views
Case Study Copenhagen Energy and Business Central.pdf by Aitana
Case Study Copenhagen Energy and Business Central.pdfCase Study Copenhagen Energy and Business Central.pdf
Case Study Copenhagen Energy and Business Central.pdf
Aitana12 views
HTTP headers that make your website go faster - devs.gent November 2023 by Thijs Feryn
HTTP headers that make your website go faster - devs.gent November 2023HTTP headers that make your website go faster - devs.gent November 2023
HTTP headers that make your website go faster - devs.gent November 2023
Thijs Feryn19 views
Perth MeetUp November 2023 by Michael Price
Perth MeetUp November 2023 Perth MeetUp November 2023
Perth MeetUp November 2023
Michael Price15 views

How to Effectively Test Your Chatbot | Rasa Summit

  • 1. How to ”Effectively” ”Test” your Chatbot Soumya Mukherjee Director QA, DevOps & AIML Apty.IO
  • 2. How are we doing our QA today • Testing is Blackbox for testers • Mostly manual testing done in organization • Conversational flow testing • Small Talk • Fallback checks • Integrations • Automation done on UI and API layer • Testing is mostly done on same training data • Models are trained by engineers and are not being monitored by QA • There are analytics tools available to monitor but it needs technical expertise for the QA • Result : More than 90% times bot breaks (no one understands when it will break), most of them fallback and get stuck - once bot is stuck it is stuck Q ? A
  • 3. What are the issues in QA ? • Bots are evolving and continuous story creation is a problem • No tool manage story coverage • Your training data may not correspond to new stories or vice versa (it’s a mismatch) – most org keep training on the same data • Most automation tools offers record and playback (My stories are already written how to port is the question)
  • 4. What are the issues in QA ? • No (unified) centralized dashboard present where QA can check (everything is quite scattered) • Intent Matching • Entity Testing – Slot identification • Entity Testing – Entity Validation • Confidence score • Confusion Matrix along with Precision/Recall/F1-Score • No easy way to reset the failed bot ! • Bot versioning is a mess and A/B testing becomes difficult • Multilingual bot QA is a challenge (have to make 2 separate bots) • High confidence score is also a problem as your bot will only predict same thing (if the data is same for multiple intents then it will predict the one with highest confidence score – may be incorrect) How to make sure your bot never breaks ?
  • 5. How to make your test effective ? • Create scenarios for happy path, contextual questions, digressions, domain specific questions, stateless conversations • Map proper entities for common scenarios (example bus fee, tuition fee) – flow should change with entities in the stories • Automated tests should consume all stories and run them each time as part of regression testing • Story coverage visualization • For Manual Testing use Bot emulation product (like RasaX, Botfront) to test
  • 6. How to make your test effective ? • Central dashboarding including : • Confusion matrix, Precision, Recall and F1-Score • Cumulative accuracy profile • Cross validation results • Perform Exhaustive testing (bot resiliency), Integration checks across platforms, Webhooks • Perform fault tolerance testing by performing performance testing (bot response, session management) & security testing (api interaction, typing speed check, punctuations, typo errors)
  • 7. Other KPIs to track • Activity Volume • Bounce rate • Retention rate • Open sessions count • Session times (conversation length) • Goal completion rate • User feedback (sentiments) • Fallback rate (Confusion rate, reset rate & Human takeover rate)