SlideShare a Scribd company logo
A NewVoice in Customer Experience
Introduction
Shop ByVoice is a product developed by Firebird Summit, Inc. for purposes of applying voice-driven
customer interface to a standard shopping experience.
The demonstration details included here are for the purposes of showcasing the technical capabilities
of the system, including how it can be compared to standard, commercially available voice solutions.
Custom integration with a commerce platform and adaptation to retailer-specific shopping lexicon is
done during implementation.
For a live demonstration or additional questions, please contact the Firebird Summit team at
info@firebirdsummit.com
Limitations of commercially available voice solutions
Say the phrase “I want Illy Espresso…”
Alexa hears:
Google hears:
Watson hears:
Microsoft hears:
Siri hears:
Standard commercial Voice
Technology is only the start of an
actual voice-based solution for
business-specific needs
Voice engines for
product search
The strength of SBV technology is in
enhancing the accuracy of underlying
technical capabilities of commercial
engines.
Even when a specific product is not found
by a standard voice solution, SBV’s
custom engine and algorithms can still
produce accurate results.
The addition of background noise can make a significant difference in the quality of results from various engines.
Metrics included herein all demonstrate these differences by showing test results with and without added noise.
Shop ByVoice for a
specific product: Illy
Espresso
Using our custom engine and full-context
search, SBV is able to return specific
results (with uncommon proper nouns)
with 95% accuracy.
Say the phrase “I want Illy Espresso…”
Contextual
relevance
Context matters.
Each of the engines includes natively-
imbedded assumptions about how to
interpret different words.
Meat = Meet – unless there is enough
context to change the interpretation.
But these are generic to average uses,
and are not directly customizable to an
individual business context.
No
Context
Minimal
Context Specific
Context
Solving the Problem
Wreck a nice beach?
Reckon eyes peach?
Recognize speech?
Acoustic Model
Basic Accoustic Model (Commercial Voice Service) +
Ecommerce Domain Acoustic Model +
Dynamic Custom Acoustic Model =>
Dynamic Custom Domain Acoustic Model
Language Model
Basic Language Model (Commercial Voice Service) +
Customer Domain Analysis +
Ecommerce Domain Language Model +
Dynamic Custom Language Model =>
Dynamic Custom Domain Language Model
User
Speech
System
Text
Commercial voice
technologies are the
beginning of a solution
– not the whole solution!
Voice Engine
Word Error Rate
(WER)
Not all voice engines are created equally
to work in all circumstances.
Word error rate* is the calculation of an
engine's native capability to understand
contextually relevant, business-specific
language.
*WER technical definition and calculations described in detail in Appendix
Shop by voice for a generic product: Meat
Select product to add
to cart by number
Returns popular
results next
Browse through more
options
Shop By Voice uses a
customer’s order history and
different types of preferences
to intelligently return search
results for products with large
numbers of possibilities.
Just ask and Shop By Voice will
read the titles of returned
results aloud.
Returns previously
purchased product
first
Returns prioritized
‘favorites’ second
Returns regular
‘favorites’ next
Voice engine
recall rate
Voice engine’s native ability to accurately
recognize spoken language.
Shop byVoice Reporting Dashboard
SBV Administration Dashboard
shows voice-driven data about
users, products, orders and devices.
Native responsive design works on computer, tablet and mobile screen sizes.
Thank you for your interest!
Contact info@firebirdsummit.com to arrange
a live demo or ask questions.
Appendix
Engine comparison metrics and testing methodology
Test description and assumptions
We ran 1000 * 7 * 4 = 28000 tests for determining the
current level of SBV accuracy. Our testing set contained
common search queries for grocery stores. Most phrases
are single words or two words queries. Most of the tested
phrases are common words that are used also in articles,
human speech, web.
Example of tested phrases: aloe, cranberries, pork, stain
remover, vegetarian, fish.
We calculated the widely known metrics for speech
recognition engines:
• Word error rate
• Word accuracy (not included here)
• Recall rate
We also tested algorithms inside our system. Metrics that
show the quality of our system we measure as
percentage of Found Products depending on Automatic
Speech Recognition (ASR) results.
We used IBM voice synthesis to emulate speech. We used
male and female voices, with and without noise, with
American and British English accents.
Usage of robotic voice here is a system-naive approach
which is far from representative in real world scenarios.
This naive assumption significantly improves ASR results
and metric values can not be used as real world
indicators, but this approach gives opportunity to rate
ASR engines and to compare them relatively.
Our comparison model included Shop ByVoice, and six
commercially available engines:
1. Google
2. Microsoft
3. IBMWatson
4. Open Source
5. Apple’s Siri
6. AmazonAlexa
Word Error Rate
Word error rate is a common metric of the performance
of a speech recognition or machine translation system.
This performance calculation is computed by comparing
a reference transcription with the transcription output by
the speech recognizer. In simple words,WER shows the
number of transformations needed to be applied to ASR
hypothesis to receive a reference. It is the most accurate
metric to compare ASRs. From this comparison it is
possible to compute the number of errors, which typically
belong to 3 categories:
1. Insertions I (when in the output of the ASR it is
present a word not present in the reference)
2. Deletions D (a word is missed in the ASR output)
3. Substitutions S (a word is confused with another
one)
Word error rate can then be computed as:
where
• S is the number of substitutions,
• D is the number of deletions,
• I is the number of insertions,
• C is the number of the corrects,
• N is the number of words in the reference transcription.
The main issue in computing this score is the required
alignment between the 2 word sequences. This can be
obtained through dynamic programming, using the so-
called Levenstein distance.
Recall and Found Product Metrics
Recall (information retrieval)
This is a metric that represents a ratio
of correctly recognized words (H) to
the total number of words in reference
(N).This metric is used to measure ASR
performance. However this metric
does not count the amount of noise
that can be generated by an ASR and
can invert or undermine the phrase
context.
Found Product Rate
Found products is calculated as ratio of
correctly found* products to all
products (all products here equal the
number of tests).The product is
considered correctly found if the
product returned by the search engine
based on the reference phrase which is
equal to the product that is returned by
the search engine based on the ASR
hypothesis phrase.
*Search engine used is a native ecommerce, on-site engine across all ASRs to establish consistent results. Typical on-site ecommerce search
engines include enhanced results management for common needs, such as misspellings, related words, suggested alternatives, etc.

More Related Content

Similar to Shop By Voice Product Overview

Build Intelligent Apps Using AI Services
Build Intelligent Apps Using AI ServicesBuild Intelligent Apps Using AI Services
Build Intelligent Apps Using AI Services
Amazon Web Services
 
Big Data LDN 2017: Applied AI using Cognitive Services
Big Data LDN 2017: Applied AI using Cognitive ServicesBig Data LDN 2017: Applied AI using Cognitive Services
Big Data LDN 2017: Applied AI using Cognitive Services
Matt Stubbs
 
Improving Customer Experience: Enhanced Customer Insights Using Natural Langu...
Improving Customer Experience: Enhanced Customer Insights Using Natural Langu...Improving Customer Experience: Enhanced Customer Insights Using Natural Langu...
Improving Customer Experience: Enhanced Customer Insights Using Natural Langu...
Amazon Web Services
 
Top 10 Best Speech Recognition Software
Top 10 Best Speech Recognition Software Top 10 Best Speech Recognition Software
Top 10 Best Speech Recognition Software
Jame Williamson
 
Build Intelligent Apps Using AI Services.pdf
Build Intelligent Apps Using AI Services.pdfBuild Intelligent Apps Using AI Services.pdf
Build Intelligent Apps Using AI Services.pdf
Amazon Web Services
 
Optimizing Healthcare Call Centers with Natural Language Understanding (HLC30...
Optimizing Healthcare Call Centers with Natural Language Understanding (HLC30...Optimizing Healthcare Call Centers with Natural Language Understanding (HLC30...
Optimizing Healthcare Call Centers with Natural Language Understanding (HLC30...
Amazon Web Services
 
Essential Elements of Excellent Multilingual Search
Essential Elements of Excellent Multilingual SearchEssential Elements of Excellent Multilingual Search
Essential Elements of Excellent Multilingual Search
andrew_paulsen
 
Build_Intelligent_Apps_Using_AI_Services.pdf
Build_Intelligent_Apps_Using_AI_Services.pdfBuild_Intelligent_Apps_Using_AI_Services.pdf
Build_Intelligent_Apps_Using_AI_Services.pdf
Amazon Web Services
 
Introduction to AI/ML with AWS
Introduction to AI/ML with AWSIntroduction to AI/ML with AWS
Introduction to AI/ML with AWS
Suman Debnath
 
General Speereo Technology
General Speereo TechnologyGeneral Speereo Technology
General Speereo Technology
Daniel Ischenko
 
Why Should Businesses Set Up An IVR- Interactive Voice Response?
Why Should Businesses Set Up An IVR- Interactive Voice Response?Why Should Businesses Set Up An IVR- Interactive Voice Response?
Why Should Businesses Set Up An IVR- Interactive Voice Response?
USDSI
 
Speech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceSpeech Recognition in Artificail Inteligence
Speech Recognition in Artificail Inteligence
Ilhaan Marwat
 
Build Intelligent Apps with Amazon ML
Build Intelligent Apps with Amazon ML Build Intelligent Apps with Amazon ML
Build Intelligent Apps with Amazon ML
Amazon Web Services
 
Sviluppare applicazioni voice-first con AWS e Amazon Alexa
Sviluppare applicazioni voice-first con AWS e Amazon AlexaSviluppare applicazioni voice-first con AWS e Amazon Alexa
Sviluppare applicazioni voice-first con AWS e Amazon Alexa
Amazon Web Services
 
AWS_AI_Services_2023_edition_1679400999.pdf
AWS_AI_Services_2023_edition_1679400999.pdfAWS_AI_Services_2023_edition_1679400999.pdf
AWS_AI_Services_2023_edition_1679400999.pdf
abcdsidprats
 
AWS의 새로운 언어, 음성, 텍스트 처리 인공지능 서비스::Vikram Anbazhagan::AWS Summit Seoul 2018
AWS의 새로운 언어, 음성, 텍스트 처리 인공지능 서비스::Vikram Anbazhagan::AWS Summit Seoul 2018AWS의 새로운 언어, 음성, 텍스트 처리 인공지능 서비스::Vikram Anbazhagan::AWS Summit Seoul 2018
AWS의 새로운 언어, 음성, 텍스트 처리 인공지능 서비스::Vikram Anbazhagan::AWS Summit Seoul 2018
Amazon Web Services Korea
 
IBM Cloud Artificial Intelligence : A Comprehensive Overview
IBM Cloud Artificial Intelligence : A Comprehensive OverviewIBM Cloud Artificial Intelligence : A Comprehensive Overview
IBM Cloud Artificial Intelligence : A Comprehensive Overview
Satyajit Panda
 
Build Intelligent Apps with Amazon ML - Language Services - BDA302 - Chicago ...
Build Intelligent Apps with Amazon ML - Language Services - BDA302 - Chicago ...Build Intelligent Apps with Amazon ML - Language Services - BDA302 - Chicago ...
Build Intelligent Apps with Amazon ML - Language Services - BDA302 - Chicago ...
Amazon Web Services
 
MCL308_Using a Digital Assistant in the Enterprise for Business Productivity
MCL308_Using a Digital Assistant in the Enterprise for Business ProductivityMCL308_Using a Digital Assistant in the Enterprise for Business Productivity
MCL308_Using a Digital Assistant in the Enterprise for Business Productivity
Amazon Web Services
 
Strata - Final_IB_02_17
Strata - Final_IB_02_17Strata - Final_IB_02_17
Strata - Final_IB_02_17
Irina Borisova
 

Similar to Shop By Voice Product Overview (20)

Build Intelligent Apps Using AI Services
Build Intelligent Apps Using AI ServicesBuild Intelligent Apps Using AI Services
Build Intelligent Apps Using AI Services
 
Big Data LDN 2017: Applied AI using Cognitive Services
Big Data LDN 2017: Applied AI using Cognitive ServicesBig Data LDN 2017: Applied AI using Cognitive Services
Big Data LDN 2017: Applied AI using Cognitive Services
 
Improving Customer Experience: Enhanced Customer Insights Using Natural Langu...
Improving Customer Experience: Enhanced Customer Insights Using Natural Langu...Improving Customer Experience: Enhanced Customer Insights Using Natural Langu...
Improving Customer Experience: Enhanced Customer Insights Using Natural Langu...
 
Top 10 Best Speech Recognition Software
Top 10 Best Speech Recognition Software Top 10 Best Speech Recognition Software
Top 10 Best Speech Recognition Software
 
Build Intelligent Apps Using AI Services.pdf
Build Intelligent Apps Using AI Services.pdfBuild Intelligent Apps Using AI Services.pdf
Build Intelligent Apps Using AI Services.pdf
 
Optimizing Healthcare Call Centers with Natural Language Understanding (HLC30...
Optimizing Healthcare Call Centers with Natural Language Understanding (HLC30...Optimizing Healthcare Call Centers with Natural Language Understanding (HLC30...
Optimizing Healthcare Call Centers with Natural Language Understanding (HLC30...
 
Essential Elements of Excellent Multilingual Search
Essential Elements of Excellent Multilingual SearchEssential Elements of Excellent Multilingual Search
Essential Elements of Excellent Multilingual Search
 
Build_Intelligent_Apps_Using_AI_Services.pdf
Build_Intelligent_Apps_Using_AI_Services.pdfBuild_Intelligent_Apps_Using_AI_Services.pdf
Build_Intelligent_Apps_Using_AI_Services.pdf
 
Introduction to AI/ML with AWS
Introduction to AI/ML with AWSIntroduction to AI/ML with AWS
Introduction to AI/ML with AWS
 
General Speereo Technology
General Speereo TechnologyGeneral Speereo Technology
General Speereo Technology
 
Why Should Businesses Set Up An IVR- Interactive Voice Response?
Why Should Businesses Set Up An IVR- Interactive Voice Response?Why Should Businesses Set Up An IVR- Interactive Voice Response?
Why Should Businesses Set Up An IVR- Interactive Voice Response?
 
Speech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceSpeech Recognition in Artificail Inteligence
Speech Recognition in Artificail Inteligence
 
Build Intelligent Apps with Amazon ML
Build Intelligent Apps with Amazon ML Build Intelligent Apps with Amazon ML
Build Intelligent Apps with Amazon ML
 
Sviluppare applicazioni voice-first con AWS e Amazon Alexa
Sviluppare applicazioni voice-first con AWS e Amazon AlexaSviluppare applicazioni voice-first con AWS e Amazon Alexa
Sviluppare applicazioni voice-first con AWS e Amazon Alexa
 
AWS_AI_Services_2023_edition_1679400999.pdf
AWS_AI_Services_2023_edition_1679400999.pdfAWS_AI_Services_2023_edition_1679400999.pdf
AWS_AI_Services_2023_edition_1679400999.pdf
 
AWS의 새로운 언어, 음성, 텍스트 처리 인공지능 서비스::Vikram Anbazhagan::AWS Summit Seoul 2018
AWS의 새로운 언어, 음성, 텍스트 처리 인공지능 서비스::Vikram Anbazhagan::AWS Summit Seoul 2018AWS의 새로운 언어, 음성, 텍스트 처리 인공지능 서비스::Vikram Anbazhagan::AWS Summit Seoul 2018
AWS의 새로운 언어, 음성, 텍스트 처리 인공지능 서비스::Vikram Anbazhagan::AWS Summit Seoul 2018
 
IBM Cloud Artificial Intelligence : A Comprehensive Overview
IBM Cloud Artificial Intelligence : A Comprehensive OverviewIBM Cloud Artificial Intelligence : A Comprehensive Overview
IBM Cloud Artificial Intelligence : A Comprehensive Overview
 
Build Intelligent Apps with Amazon ML - Language Services - BDA302 - Chicago ...
Build Intelligent Apps with Amazon ML - Language Services - BDA302 - Chicago ...Build Intelligent Apps with Amazon ML - Language Services - BDA302 - Chicago ...
Build Intelligent Apps with Amazon ML - Language Services - BDA302 - Chicago ...
 
MCL308_Using a Digital Assistant in the Enterprise for Business Productivity
MCL308_Using a Digital Assistant in the Enterprise for Business ProductivityMCL308_Using a Digital Assistant in the Enterprise for Business Productivity
MCL308_Using a Digital Assistant in the Enterprise for Business Productivity
 
Strata - Final_IB_02_17
Strata - Final_IB_02_17Strata - Final_IB_02_17
Strata - Final_IB_02_17
 

More from Alora Chistiakoff

Tackling the Tough Stuff: Consulting, Training & Coaching for Startup Perform...
Tackling the Tough Stuff: Consulting, Training & Coaching for Startup Perform...Tackling the Tough Stuff: Consulting, Training & Coaching for Startup Perform...
Tackling the Tough Stuff: Consulting, Training & Coaching for Startup Perform...
Alora Chistiakoff
 
Alora's Quest
Alora's QuestAlora's Quest
Alora's Quest
Alora Chistiakoff
 
Ecommerce Pre-Kick-Off Discovery Questions
Ecommerce Pre-Kick-Off Discovery QuestionsEcommerce Pre-Kick-Off Discovery Questions
Ecommerce Pre-Kick-Off Discovery Questions
Alora Chistiakoff
 
Developing a Product Feedback Program with Existing Clients
Developing a Product Feedback Program with Existing ClientsDeveloping a Product Feedback Program with Existing Clients
Developing a Product Feedback Program with Existing Clients
Alora Chistiakoff
 
Where to Place the Product Manager within an Organization
Where to Place the Product Manager within an OrganizationWhere to Place the Product Manager within an Organization
Where to Place the Product Manager within an Organization
Alora Chistiakoff
 
Getting LinkedIn
Getting LinkedInGetting LinkedIn
Getting LinkedIn
Alora Chistiakoff
 
Technical PM Presentation
Technical PM PresentationTechnical PM Presentation
Technical PM Presentation
Alora Chistiakoff
 

More from Alora Chistiakoff (7)

Tackling the Tough Stuff: Consulting, Training & Coaching for Startup Perform...
Tackling the Tough Stuff: Consulting, Training & Coaching for Startup Perform...Tackling the Tough Stuff: Consulting, Training & Coaching for Startup Perform...
Tackling the Tough Stuff: Consulting, Training & Coaching for Startup Perform...
 
Alora's Quest
Alora's QuestAlora's Quest
Alora's Quest
 
Ecommerce Pre-Kick-Off Discovery Questions
Ecommerce Pre-Kick-Off Discovery QuestionsEcommerce Pre-Kick-Off Discovery Questions
Ecommerce Pre-Kick-Off Discovery Questions
 
Developing a Product Feedback Program with Existing Clients
Developing a Product Feedback Program with Existing ClientsDeveloping a Product Feedback Program with Existing Clients
Developing a Product Feedback Program with Existing Clients
 
Where to Place the Product Manager within an Organization
Where to Place the Product Manager within an OrganizationWhere to Place the Product Manager within an Organization
Where to Place the Product Manager within an Organization
 
Getting LinkedIn
Getting LinkedInGetting LinkedIn
Getting LinkedIn
 
Technical PM Presentation
Technical PM PresentationTechnical PM Presentation
Technical PM Presentation
 

Recently uploaded

How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
Mariano Tinti
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
Zilliz
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
Zilliz
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
Edge AI and Vision Alliance
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
DianaGray10
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 

Recently uploaded (20)

How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 

Shop By Voice Product Overview

  • 1. A NewVoice in Customer Experience
  • 2. Introduction Shop ByVoice is a product developed by Firebird Summit, Inc. for purposes of applying voice-driven customer interface to a standard shopping experience. The demonstration details included here are for the purposes of showcasing the technical capabilities of the system, including how it can be compared to standard, commercially available voice solutions. Custom integration with a commerce platform and adaptation to retailer-specific shopping lexicon is done during implementation. For a live demonstration or additional questions, please contact the Firebird Summit team at info@firebirdsummit.com
  • 3. Limitations of commercially available voice solutions Say the phrase “I want Illy Espresso…” Alexa hears: Google hears: Watson hears: Microsoft hears: Siri hears: Standard commercial Voice Technology is only the start of an actual voice-based solution for business-specific needs
  • 4. Voice engines for product search The strength of SBV technology is in enhancing the accuracy of underlying technical capabilities of commercial engines. Even when a specific product is not found by a standard voice solution, SBV’s custom engine and algorithms can still produce accurate results. The addition of background noise can make a significant difference in the quality of results from various engines. Metrics included herein all demonstrate these differences by showing test results with and without added noise.
  • 5. Shop ByVoice for a specific product: Illy Espresso Using our custom engine and full-context search, SBV is able to return specific results (with uncommon proper nouns) with 95% accuracy. Say the phrase “I want Illy Espresso…”
  • 6. Contextual relevance Context matters. Each of the engines includes natively- imbedded assumptions about how to interpret different words. Meat = Meet – unless there is enough context to change the interpretation. But these are generic to average uses, and are not directly customizable to an individual business context. No Context Minimal Context Specific Context
  • 7. Solving the Problem Wreck a nice beach? Reckon eyes peach? Recognize speech? Acoustic Model Basic Accoustic Model (Commercial Voice Service) + Ecommerce Domain Acoustic Model + Dynamic Custom Acoustic Model => Dynamic Custom Domain Acoustic Model Language Model Basic Language Model (Commercial Voice Service) + Customer Domain Analysis + Ecommerce Domain Language Model + Dynamic Custom Language Model => Dynamic Custom Domain Language Model User Speech System Text Commercial voice technologies are the beginning of a solution – not the whole solution!
  • 8. Voice Engine Word Error Rate (WER) Not all voice engines are created equally to work in all circumstances. Word error rate* is the calculation of an engine's native capability to understand contextually relevant, business-specific language. *WER technical definition and calculations described in detail in Appendix
  • 9. Shop by voice for a generic product: Meat Select product to add to cart by number Returns popular results next Browse through more options Shop By Voice uses a customer’s order history and different types of preferences to intelligently return search results for products with large numbers of possibilities. Just ask and Shop By Voice will read the titles of returned results aloud. Returns previously purchased product first Returns prioritized ‘favorites’ second Returns regular ‘favorites’ next
  • 10. Voice engine recall rate Voice engine’s native ability to accurately recognize spoken language.
  • 11. Shop byVoice Reporting Dashboard SBV Administration Dashboard shows voice-driven data about users, products, orders and devices. Native responsive design works on computer, tablet and mobile screen sizes.
  • 12. Thank you for your interest! Contact info@firebirdsummit.com to arrange a live demo or ask questions.
  • 13. Appendix Engine comparison metrics and testing methodology
  • 14. Test description and assumptions We ran 1000 * 7 * 4 = 28000 tests for determining the current level of SBV accuracy. Our testing set contained common search queries for grocery stores. Most phrases are single words or two words queries. Most of the tested phrases are common words that are used also in articles, human speech, web. Example of tested phrases: aloe, cranberries, pork, stain remover, vegetarian, fish. We calculated the widely known metrics for speech recognition engines: • Word error rate • Word accuracy (not included here) • Recall rate We also tested algorithms inside our system. Metrics that show the quality of our system we measure as percentage of Found Products depending on Automatic Speech Recognition (ASR) results. We used IBM voice synthesis to emulate speech. We used male and female voices, with and without noise, with American and British English accents. Usage of robotic voice here is a system-naive approach which is far from representative in real world scenarios. This naive assumption significantly improves ASR results and metric values can not be used as real world indicators, but this approach gives opportunity to rate ASR engines and to compare them relatively. Our comparison model included Shop ByVoice, and six commercially available engines: 1. Google 2. Microsoft 3. IBMWatson 4. Open Source 5. Apple’s Siri 6. AmazonAlexa
  • 15. Word Error Rate Word error rate is a common metric of the performance of a speech recognition or machine translation system. This performance calculation is computed by comparing a reference transcription with the transcription output by the speech recognizer. In simple words,WER shows the number of transformations needed to be applied to ASR hypothesis to receive a reference. It is the most accurate metric to compare ASRs. From this comparison it is possible to compute the number of errors, which typically belong to 3 categories: 1. Insertions I (when in the output of the ASR it is present a word not present in the reference) 2. Deletions D (a word is missed in the ASR output) 3. Substitutions S (a word is confused with another one) Word error rate can then be computed as: where • S is the number of substitutions, • D is the number of deletions, • I is the number of insertions, • C is the number of the corrects, • N is the number of words in the reference transcription. The main issue in computing this score is the required alignment between the 2 word sequences. This can be obtained through dynamic programming, using the so- called Levenstein distance.
  • 16. Recall and Found Product Metrics Recall (information retrieval) This is a metric that represents a ratio of correctly recognized words (H) to the total number of words in reference (N).This metric is used to measure ASR performance. However this metric does not count the amount of noise that can be generated by an ASR and can invert or undermine the phrase context. Found Product Rate Found products is calculated as ratio of correctly found* products to all products (all products here equal the number of tests).The product is considered correctly found if the product returned by the search engine based on the reference phrase which is equal to the product that is returned by the search engine based on the ASR hypothesis phrase. *Search engine used is a native ecommerce, on-site engine across all ASRs to establish consistent results. Typical on-site ecommerce search engines include enhanced results management for common needs, such as misspellings, related words, suggested alternatives, etc.