SlideShare a Scribd company logo
1 of 12
A Whole New Way to Interact: Voice User
Interface
https://fibonalabs.com/
Introduction
What is a Voice User Interface? Is it better than our typical methods of
interacting with our devices? In this note, you’ll be able to grasp what is it, how
it works, what’s the tech behind it and why is it future-proof.
Evolution of Interface and Interactions
Initially, when we started to interact with our devices, started with using a
physical device to see the interaction on screen, for eg; using a mouse and a
keyboard to interact with the monitor, using physical keys to interact with a
mobile, then we had a new way of interaction with touch screen phones,
tablets, and laptops, where touching the element could help in interaction. Now,
most of the devices support VUI communications either using a physical trigger
Fun Fact
The volume knob of a speaker is designed in such a way where the user has to
go from ”Left to Right”, which is also related to the motion in which humans
read, write, and clock functions.
Unimodal
A Unimodal interface is a platform where only one of the five human senses is
being used. Be it either listening, touching, seeing, speaking, etc.
Amazon Echo, Google Home, Apple HomePod, etc devices only require a
human voice to function, most of these devices don’t have a screen where
users can touch and interact. When the user says “Okay Google, switch on the
lights”, only the voice of a person is being used for a task.
Multimodal
A multimodal Interface is a platform where more than one human sense is
being used for interaction. It can be a combination of two or more senses. For
e.g.; Siri on iPhones, Google Home Hub, where the users speak to the device
as well as touch the screen to interact with it. Even the infotainment systems of
our vehicles, after asking the assistant of our car to take us to a place, we need
to see the screen which shows the route on the map.
What is VUI?
Voice User Interface (VUI) is another way of interacting with a device where the
users have to use their voice to get the job done. It is an interface for speech
recognition applications. A new way to interact with smart devices to
We have new solutions in the market which support VUI, to name a few;
Amazon Alexa, Siri by Apple, Google Assistant, Cortana by Microsoft. Omega
by I.M plus. These solutions for VUI are used by consumers daily in today’s
time. From scheduling a call to ordering even a cigarette lighter, everything can
be done by just using our voice. In the future, AI will be so smart, it will seem
like we are talking to another human being, almost like a human-to-human
conversation.
Why VUI is Better?
Nowadays, almost all of the world is familiar with GUI (Graphical User
Interface) where we touch a screen to interact, but devices like Amazon Echo,
Google Home, and HomePod by Apple have taken a new leap, where we can
complete a task by just using our voice.
VUI allows us to be efficient enough and do multitask. But how?
Let’s say, John is driving a car, he wants to know the route to the nearest
subway station, he can trigger a voice command by saying; Okay Google, take
me to the nearest train station. Google will help John by showing directions
using google maps on his car’s infotainment system or on phone, and by
dictating the route through the assistant’s voice, google will allow John to focus
on the road and help him to reach his destination safely.
The structure of VUI
Voice command has structured anatomy through which the AI figures out the
exact and correct step to be taken for landing at the optimum result.
..Wake Word
Wake word refers to the trigger word/phrase to activate the voice interface to
perform a task. When our device detects its wake word, it records the next
spoken request and sends a recording of the user's request to Web Services.
For eg; “Okay Google”, “Hey Siri”, “Alexa”.
..Utterance
An utterance is a phrase where the device reacts to what the user phrases the
request.
For eg; “Play Classical Music”.
..Variable
Variable is the type of utterance what the user wanted the device to perform, it
For eg; “Play CLASSICAL Music”
..Invocation
Invocation is the platform where the action happens, whether it is a proprietary
platform or a third-party platform.
For eg; “Play Classical Music on Spotify
In the End
Today, Voice User Interface is a significant part of a tech roadmap for
businesses. Irrespective of the industry, businesses are realizing the benefits
that VUIs bring in and are cashing in upon it. Given the complexity, designing a
VUI requires know-how and experience with computer science, human
psychology, and linguistics, along with cognitive learning.
THANK YOU
A Whole New Way To Interact: Voice User Interface
A Whole New Way To Interact: Voice User Interface

More Related Content

Similar to A Whole New Way To Interact: Voice User Interface

Similar to A Whole New Way To Interact: Voice User Interface (20)

What do you understand by voice user interface design (VUI).pptx
What do you understand by voice user interface design (VUI).pptxWhat do you understand by voice user interface design (VUI).pptx
What do you understand by voice user interface design (VUI).pptx
 
Wake-up-word speech recognition using GPS on smart phone
Wake-up-word speech recognition using GPS on smart phoneWake-up-word speech recognition using GPS on smart phone
Wake-up-word speech recognition using GPS on smart phone
 
Evolution of Voice AI,.pptx
Evolution of Voice AI,.pptxEvolution of Voice AI,.pptx
Evolution of Voice AI,.pptx
 
A comprehensive guide to conversational interface (CI)
A comprehensive guide to conversational interface (CI)A comprehensive guide to conversational interface (CI)
A comprehensive guide to conversational interface (CI)
 
Speakeasy 04 2017
Speakeasy 04 2017Speakeasy 04 2017
Speakeasy 04 2017
 
Mobile UX Design
Mobile UX DesignMobile UX Design
Mobile UX Design
 
Mobile UX Design
Mobile UX DesignMobile UX Design
Mobile UX Design
 
Mobile UX Design
Mobile UX DesignMobile UX Design
Mobile UX Design
 
ppt project pk.pptx
ppt project pk.pptxppt project pk.pptx
ppt project pk.pptx
 
The Future of User Interfaces
The Future of User InterfacesThe Future of User Interfaces
The Future of User Interfaces
 
ch03.ppt
ch03.pptch03.ppt
ch03.ppt
 
Voice search getting louder
Voice search getting louderVoice search getting louder
Voice search getting louder
 
Future of interface design 2010
Future of interface design 2010Future of interface design 2010
Future of interface design 2010
 
VOICE AI PREDICTED FUTURE TRENDS
VOICE AI PREDICTED FUTURE TRENDSVOICE AI PREDICTED FUTURE TRENDS
VOICE AI PREDICTED FUTURE TRENDS
 
A.I. in the Enterprise: Computer Speech
A.I. in the Enterprise: Computer SpeechA.I. in the Enterprise: Computer Speech
A.I. in the Enterprise: Computer Speech
 
ARTIFICIAL.pptx
ARTIFICIAL.pptxARTIFICIAL.pptx
ARTIFICIAL.pptx
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Voice recognition
Voice recognitionVoice recognition
Voice recognition
 
Smartphone
SmartphoneSmartphone
Smartphone
 
TISBI
TISBITISBI
TISBI
 

More from Fibonalabs

More from Fibonalabs (20)

Data Sharing Between Child and Parent Components in AngularJS
Data Sharing Between Child and Parent Components in AngularJSData Sharing Between Child and Parent Components in AngularJS
Data Sharing Between Child and Parent Components in AngularJS
 
A Complete Guide to Building a Ground-Breaking UX Design Strategy
A Complete Guide to Building a Ground-Breaking UX Design StrategyA Complete Guide to Building a Ground-Breaking UX Design Strategy
A Complete Guide to Building a Ground-Breaking UX Design Strategy
 
React Class Components vs Functional Components: Which is Better?
React Class Components vs Functional Components: Which is Better?React Class Components vs Functional Components: Which is Better?
React Class Components vs Functional Components: Which is Better?
 
Measures to ensure Cyber Security in a serverless environment
Measures to ensure Cyber Security in a serverless environmentMeasures to ensure Cyber Security in a serverless environment
Measures to ensure Cyber Security in a serverless environment
 
Simplifying CRUD operations using budibase
Simplifying CRUD operations using budibaseSimplifying CRUD operations using budibase
Simplifying CRUD operations using budibase
 
How to implement Micro-frontends using Qiankun
How to implement Micro-frontends using QiankunHow to implement Micro-frontends using Qiankun
How to implement Micro-frontends using Qiankun
 
Different Cloud Computing Services Used At Fibonalabs
Different Cloud Computing Services Used At FibonalabsDifferent Cloud Computing Services Used At Fibonalabs
Different Cloud Computing Services Used At Fibonalabs
 
How Can A Startup Benefit From Collaborating With A UX Design Partner
How Can A Startup Benefit From Collaborating With A UX Design PartnerHow Can A Startup Benefit From Collaborating With A UX Design Partner
How Can A Startup Benefit From Collaborating With A UX Design Partner
 
How to make React Applications SEO-friendly
How to make React Applications SEO-friendlyHow to make React Applications SEO-friendly
How to make React Applications SEO-friendly
 
10 Heuristic Principles
10 Heuristic Principles10 Heuristic Principles
10 Heuristic Principles
 
Push Notifications: How to add them to a Flutter App
Push Notifications: How to add them to a Flutter AppPush Notifications: How to add them to a Flutter App
Push Notifications: How to add them to a Flutter App
 
Key Skills Required for Data Engineering
Key Skills Required for Data EngineeringKey Skills Required for Data Engineering
Key Skills Required for Data Engineering
 
Ways for UX Design Iterations: Innovate Faster & Better
Ways for UX Design Iterations: Innovate Faster & BetterWays for UX Design Iterations: Innovate Faster & Better
Ways for UX Design Iterations: Innovate Faster & Better
 
Factors that could impact conversion rate in UX Design
Factors that could impact conversion rate in UX DesignFactors that could impact conversion rate in UX Design
Factors that could impact conversion rate in UX Design
 
Information Architecture in UX: To offer Delightful and Meaningful User Exper...
Information Architecture in UX: To offer Delightful and Meaningful User Exper...Information Architecture in UX: To offer Delightful and Meaningful User Exper...
Information Architecture in UX: To offer Delightful and Meaningful User Exper...
 
Cloud Computing Architecture: Components, Importance, and Tips
Cloud Computing Architecture: Components, Importance, and TipsCloud Computing Architecture: Components, Importance, and Tips
Cloud Computing Architecture: Components, Importance, and Tips
 
Choose the Best Agile Product Development Method for a Successful Business
Choose the Best Agile Product Development Method for a Successful BusinessChoose the Best Agile Product Development Method for a Successful Business
Choose the Best Agile Product Development Method for a Successful Business
 
Atomic Design: Effective Way of Designing UI
Atomic Design: Effective Way of Designing UIAtomic Design: Effective Way of Designing UI
Atomic Design: Effective Way of Designing UI
 
Agile Software Development with Scrum_ A Complete Guide to The Steps in Agile...
Agile Software Development with Scrum_ A Complete Guide to The Steps in Agile...Agile Software Development with Scrum_ A Complete Guide to The Steps in Agile...
Agile Software Development with Scrum_ A Complete Guide to The Steps in Agile...
 
7 Psychology Theories in UX to Provide Better User Experience
7 Psychology Theories in UX to Provide Better User Experience7 Psychology Theories in UX to Provide Better User Experience
7 Psychology Theories in UX to Provide Better User Experience
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 

A Whole New Way To Interact: Voice User Interface

  • 1. A Whole New Way to Interact: Voice User Interface https://fibonalabs.com/
  • 2.
  • 3. Introduction What is a Voice User Interface? Is it better than our typical methods of interacting with our devices? In this note, you’ll be able to grasp what is it, how it works, what’s the tech behind it and why is it future-proof. Evolution of Interface and Interactions Initially, when we started to interact with our devices, started with using a physical device to see the interaction on screen, for eg; using a mouse and a keyboard to interact with the monitor, using physical keys to interact with a mobile, then we had a new way of interaction with touch screen phones, tablets, and laptops, where touching the element could help in interaction. Now, most of the devices support VUI communications either using a physical trigger
  • 4. Fun Fact The volume knob of a speaker is designed in such a way where the user has to go from ”Left to Right”, which is also related to the motion in which humans read, write, and clock functions. Unimodal A Unimodal interface is a platform where only one of the five human senses is being used. Be it either listening, touching, seeing, speaking, etc. Amazon Echo, Google Home, Apple HomePod, etc devices only require a human voice to function, most of these devices don’t have a screen where users can touch and interact. When the user says “Okay Google, switch on the lights”, only the voice of a person is being used for a task.
  • 5. Multimodal A multimodal Interface is a platform where more than one human sense is being used for interaction. It can be a combination of two or more senses. For e.g.; Siri on iPhones, Google Home Hub, where the users speak to the device as well as touch the screen to interact with it. Even the infotainment systems of our vehicles, after asking the assistant of our car to take us to a place, we need to see the screen which shows the route on the map. What is VUI? Voice User Interface (VUI) is another way of interacting with a device where the users have to use their voice to get the job done. It is an interface for speech recognition applications. A new way to interact with smart devices to
  • 6. We have new solutions in the market which support VUI, to name a few; Amazon Alexa, Siri by Apple, Google Assistant, Cortana by Microsoft. Omega by I.M plus. These solutions for VUI are used by consumers daily in today’s time. From scheduling a call to ordering even a cigarette lighter, everything can be done by just using our voice. In the future, AI will be so smart, it will seem like we are talking to another human being, almost like a human-to-human conversation. Why VUI is Better? Nowadays, almost all of the world is familiar with GUI (Graphical User Interface) where we touch a screen to interact, but devices like Amazon Echo, Google Home, and HomePod by Apple have taken a new leap, where we can complete a task by just using our voice.
  • 7. VUI allows us to be efficient enough and do multitask. But how? Let’s say, John is driving a car, he wants to know the route to the nearest subway station, he can trigger a voice command by saying; Okay Google, take me to the nearest train station. Google will help John by showing directions using google maps on his car’s infotainment system or on phone, and by dictating the route through the assistant’s voice, google will allow John to focus on the road and help him to reach his destination safely. The structure of VUI Voice command has structured anatomy through which the AI figures out the exact and correct step to be taken for landing at the optimum result. ..Wake Word
  • 8. Wake word refers to the trigger word/phrase to activate the voice interface to perform a task. When our device detects its wake word, it records the next spoken request and sends a recording of the user's request to Web Services. For eg; “Okay Google”, “Hey Siri”, “Alexa”. ..Utterance An utterance is a phrase where the device reacts to what the user phrases the request. For eg; “Play Classical Music”. ..Variable Variable is the type of utterance what the user wanted the device to perform, it
  • 9. For eg; “Play CLASSICAL Music” ..Invocation Invocation is the platform where the action happens, whether it is a proprietary platform or a third-party platform. For eg; “Play Classical Music on Spotify In the End Today, Voice User Interface is a significant part of a tech roadmap for businesses. Irrespective of the industry, businesses are realizing the benefits that VUIs bring in and are cashing in upon it. Given the complexity, designing a VUI requires know-how and experience with computer science, human psychology, and linguistics, along with cognitive learning.