SlideShare a Scribd company logo
1 of 77
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Where’s Jarvis?
The Future of Voice
Recognition and Natural
Language User Interfaces.
Crispin Reedy, Versay Solutions
@crispinTX crispinreedy.com
#UXPA2016
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
From the session description
• What is voice recognition?
• What is natural language understanding?
• What are the common technologies in the market
today?
• How does this fit with IoT?
• What are design considerations / methods to
evaluate these types of interfaces?
• Implied: Should I speech-enable my ___?
• Bonus Q: Why doesn’t it work the way we want it
to, and when will it?
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Should I Speech-Enable My ___?
Iron Man 2: Marvel Studios, Paramount Pictures
Star Trek Voyager: Paramount Television
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
“Tomato soup”
“Tomato soup.
Ok, what kind?”
“Just plain”
“Coming right
up!”
Implicit
confirmation
Second level-open
ended prompting
Cultural context: plain = hot
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Terms & Technologies
• Speech Recognition
• Natural Language Understanding
• Voice Verification (Biometrics)
• Text to Speech
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Speech Recognition “ASR”
“See the cat.”
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Natural Language Understanding
• Extracting meaning from natural text
“Hello, yes,
I’d like to
pay my
water bill.
Can you
help me with
that?
Intent =
BillPay
Entity
(Bill Type) =
Water
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Voice Verification
“My voice is
my password.”
“Authenticated.
Welcome, Mr.
Smith.”
✓
Text To Speech
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
What Is Good TTS?
• Phonemes change based on location
• “Cat”
• “Alligator”
• Elision
• “I’m. Awaiting. You.”
• “I’m awaiting you.”
• Intonation
• “Do you want coffee?”
• “Do you want soda, tea, or coffee?”
• Most TTS isn’t “Movie Quality”
IMDB
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
SSML Example
SSML
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Speech Recognition
• Hands-free command /
control
• Dictation
• Input text
• Small form factor
device, etc.
Text To Speech
• Output text dynamically
• Respond to input
• Useful when no
display is available
Natural Language
Understanding
• Necessary for all
language-based input
• Extract meaning
• Parse large volumes of
text
Voice Verification
• Security
ASR
Application
Data
• Sign-In
• Interaction
• Request
• Action
• Meaning
• Access Data
• Output
TTS
NLU
Voice
prints
Verifi-
cation
ASR
Application
Data
• Sign-In
• Interaction
• Request
• Action
• Meaning
• Access Data
• Output
TTS
NLU
Voice
prints
Verifi-
cation
Touch
Keyboard
Manage I/O Modality
Determine Meaning in
Context
Visual
Context!
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
ASR
World
Knowledge
Semantics
Syntax
Lexicon
Morphology
Phonetics
Acoustics
Linguistics
Physiology
Concepts
Phrases
Words
Phonemes
Sounds
ASR
NLU
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Speech is ambiguous
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Language is ambiguous
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Everything is ambiguous
Speaker Independence
Speaker
Dependent
Multiple
Speakers
Speaker
Independent
Isolated Words
Connected
Words
Natural Speech
10 words
1000 words
100,000 words
Unlimited
VocabularySize
Humanlike
AUDREY: Automatic Digit
Recognizer
Bell Labs 1952
X — states
y — possible
observations
a — state transition
probabilities
b — output
probabilities
"HiddenMarkovModel" by Tdunningvectorization: Wikimedia
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Training
Speech
Recognition
Engine
Acoustic
Model
SLM and/or
Grammar
Pronunciation
Model
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Utterance
Noise
Levels?
Barge-In?
Feature
Extraction
Endpointing
Speech
Recognition
Engine
Grammar or SLM
Probabilities
n:best list
Literal return
Tokens
Recognition Event
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Early Commercial Adoptions
• Interactive Voice Response
• “Those Phone Menus”
• Server-based ASR
• Nuance
• Microsoft
• Voice-Enabled Handheld Devices
• Industrial / Productivity applications
• Device-based ASR
• Network not needed
Note: Call center
is still an
important
customer
touchpoint!
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Today’s Speech Agents vs. APIs
• Siri / Apple APIs
• Cortana / Cortana APIs
• Google Now / Google Voice Actions
• Amazon Echo (Alexa) / AVS API
• Jibo
• Ubi / Ubi Kit
• Assistant.ai / Api.ai
Alexa Skill vs. Amazon Voice Service
Amazon.com
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Alexa Skill Example
Amazon.com
Amazon.com
Capitol One.com
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
NLU
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Natural Language Understanding
• Parsing input to extract meaning
• Covers a large field
• Commands
• Automatic classification of emails
• Newspaper articles, large chunks of text
• Bots
• Conversational agents
• Messaging apps
• Personal assistants
• Input could be via speech or via text
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Levels of Meaning
Too Broad / Ambiguous Too MuchJust Right
“I’m having a problem
with my account.”
“Well, I was
looking at my
bill, because I
do that every
week, and I was
reviewing
everything on
there, and I
saw…”
“I’m seeing an
unusual charge
on my bill.”
“How can I help you?”
NLU Tasks
http://www.conversational-technologies.com/nldemos/nlDemos.html
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Intents and Entities
• “I’d like to transfer $50 from my checking account
to my savings account.”
• ACTION = Transfer (Intent)
• FROM_ACCOUNT = Checking (Entity)
• TO_ACCOUNT = Savings (Entity)
• AMOUNT = $50 (Entity)
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
NLU APIs
• API.ai
• Alexa
• Microsoft LUIS
• Wit.ai
• Google Voice Actions
• Etc.
Today’s NLU APIs
• Microsoft LUIS (part of Project Oxford)
Microsoft.com
Today’s NLU APIs
API.ai|
• API.ai
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
The Future Is Here
• DNN (Deep Neural Networks)
• Being applied to both ASR and NLU problems
• Requires large amounts of data to train the models
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
What’s The Glue Here?
Consistency
Across
Contexts?
“Omnichannel CX”
Data
Is
Everywhere
State Chart XML?
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
ASR vs. NLU: Wrap Up
ASR
• Spoken aloud
• Requires some NLU
even if it’s hand-crafted
(tagging)
• Useful in hands-free,
eyes-free contexts
NLU
• Focuses on meaning
extraction
• Could be used for chat
bots, etc.
• Machine learning to
train models
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Design Considerations
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Design Considerations
• What are you trying to build?
• What’s your platform?
• Existing guidelines / research
• User testing is key
• Especially if you’re trying to do something complicated
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Should I Speech-Enable My ___?
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
What’s Your ASR/NLU Platform?
Write an app (skill) for
an agent such as
Cortana / Alexa
Use cloud APIs to add
ASR / NLU to your app /
device / page / gadget
Download software and
use full-featured
capabilities for more robust
recognition on a specific
device
Build your own
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Network Availability
• Simply irritating… or totally unusable?
“What’s on my
calendar today?
“Sorry, I can’t
complete that request
right now.”
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Appropriate Modality?
• Voice Only? Voice + Display?
• Is it possible for the user to switch modalities?
• Or would switching potentially be dangerous?
“How long is the
flight from Dallas to
Seattle?
“I’ve got a few results
to show you.”
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Is State Maintained?
• Does your platform support a multiple-stage
interaction?
• Does it remember what you did previously?
“Who is Barack Obama?”
“Barack Obama is the 44th
president of the United
States.”
“How old is he?”
“I’m sorry, I don’t understand
your question.”
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Wake-Up Words
• How many of these “Agents”
will we be talking to?
“Jibo, take a picture.”
“Alexa, play music.”
“OK Google, set the
temperature to 77
degrees.”
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
System Personality
• Are you writing for an “Agent”
who has an existing style?
• What if your skill or app doesn’t
match that style?
• If not, should you create one?
“Hi, I’m Julie!”
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Context
• Real-world context
• Digital context
• How much does your app
know about where you are
and what it can do?
“When I get home,
remind me to take
out the trash.”
“I’m sorry, your calendar
doesn’t support location-
based reminders.”
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
What Are You Trying To Recognize?
• Long utterances work
better than short ones
• Letter names require extra
work
“Start a session”
“Got it”
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
And So Much More….
• What will you do when the
recognizer just can’t get it?
“I want my…. BARK
BARK BARK Timmy STOP
THAT NOW GET
DOWN!”
????
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Existing Guidelines / Research
• Caveat: Best practices evolved in one modality (e.g.
voice-only) may not apply the same way in another
(e.g. combined voice + touch)
• But they could be adapted
• Association for Voice Interaction Design (AVIxD.org)
• Wiki
• Peer-Reviewed Journal
• Virtual “Brown Bags”
• Academic Sources, Books
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
AVIxD.org
CUI Working Group is actively recruiting!
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Specific Example: “Help”
Voice XML
Standard
(2004)
“Help” should
be a global
command
AVIxD Wiki
(2014)
Stop using
“Help” as a
global
Agent API
Doc
(2015)
Offer “Help”
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Specific Example: “Help”
• Designers who tune applications have seen that the
word “help” is a known “False Attractor”
• Other things that you say which are short get recognized
as “help”
• People don’t voluntarily come up with “help”
unless they are prompted
• Give callers a context specific command only
where help may truly be needed, and call it
something besides "help”
• System: Say or enter your account number, or say, where
do I find it.
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Special Case: Car
• “Distracted Driver” is a hot topic!
• Richard Young, Wayne State University
• Paper: “Safe Interaction For Drivers”
• “Visual-Manual Mode” – What we do today
• “Auditory-Vocal Mode” – Speech only. NO GUI.
• “Mixed Mode” – Speech and GUI being used together
• Finding: If you give someone a graphic interface,
they’re going to look at it
• And take their eyes off the road
Design Documents
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Usability Studies / Research
• Special Challenges
• Technical setup
• Phone tap / Recording both sides
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions Warner Bros.
Early Stage Voice Only Prototype
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Should I Speech-Enable My ___?
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
What’s the Use Case?
• Enabling application
• User can’t do it any other way
• New tasks
• Enhancing application
• User can do it now
• But speech makes it better
• Faster
• Safer
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
API-Based
Device-
Based
Roll Your
Own /
Open-
Source
• Flexibility
• Power
• Customization
• Time
• Difficulty
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Cloud vs. Downloadable / Embedded
• Easy to get started
• Lightweight
• Not much specialized
knowledge
• Customizable
• Probably better recognition
• Can be device-specific
• More features
• Higher powered
• May require specialized
knowledge
– Speech scientist
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Open Source ASR
• CMU Sphinx
• pocketsphinx
• Kaldi
• http://kaldi-asr.org/
• Github
• New updates include some pretty interesting stuff (DNN)
• Requires:
• Corpus
• Tech know-how
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Should I Speech-Enable My ___?
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Should I Speech-Enable My ___?
Maybe
Iron Man 2: Marvel Studios, Paramount Pictures
Where’s Jarvis?
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Where’s Jarvis?
Gesture
Based
Interface
Artificial
Intelligence
Voice Based
Interface
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Where’s Jarvis?
ASR
NLU
Voice Design
Context
#UXPA2016Session Survey: http://www.uxpa2016.org/sessionsurvey?sessionid=321© 2016 Versay Solutions
Resources
• Handout / Web page

More Related Content

What's hot

Speech recognition final presentation
Speech recognition final presentationSpeech recognition final presentation
Speech recognition final presentationhimanshubhatti
 
Mind reading computer
Mind reading computerMind reading computer
Mind reading computerAnushri Patil
 
Virtual personal assistant
Virtual personal assistantVirtual personal assistant
Virtual personal assistantShubham Bhalekar
 
Computer science seminar topics
Computer science seminar topicsComputer science seminar topics
Computer science seminar topics123seminarsonly
 
Speech Recognition by Iqbal
Speech Recognition by IqbalSpeech Recognition by Iqbal
Speech Recognition by IqbalIqbal
 
Personal voice assistant - jarvis
Personal voice assistant - jarvisPersonal voice assistant - jarvis
Personal voice assistant - jarvisKana Ram Yadav
 
Mind reading computer report
Mind reading computer reportMind reading computer report
Mind reading computer reportIshan Khan
 
Speech Recognition by Iqbal
Speech Recognition by IqbalSpeech Recognition by Iqbal
Speech Recognition by IqbalIqbal
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition TechnologyAamir-sheriff
 
Speech recognition An overview
Speech recognition An overviewSpeech recognition An overview
Speech recognition An overviewsajanazoya
 
Install ubuntu
Install ubuntuInstall ubuntu
Install ubuntupramoddps
 
Text to-speech & voice recognition
Text to-speech & voice recognitionText to-speech & voice recognition
Text to-speech & voice recognitionMark Williams
 
Virtual Personal Assistant
Virtual Personal AssistantVirtual Personal Assistant
Virtual Personal Assistantsohaildanish
 

What's hot (20)

Jarvis
JarvisJarvis
Jarvis
 
Speech recognition final presentation
Speech recognition final presentationSpeech recognition final presentation
Speech recognition final presentation
 
Speaker Recognition
Speaker RecognitionSpeaker Recognition
Speaker Recognition
 
Jarvis
JarvisJarvis
Jarvis
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Mind reading computer
Mind reading computerMind reading computer
Mind reading computer
 
Jarvisproject
JarvisprojectJarvisproject
Jarvisproject
 
Virtual personal assistant
Virtual personal assistantVirtual personal assistant
Virtual personal assistant
 
Computer science seminar topics
Computer science seminar topicsComputer science seminar topics
Computer science seminar topics
 
mobile Os
mobile Osmobile Os
mobile Os
 
Speech Recognition by Iqbal
Speech Recognition by IqbalSpeech Recognition by Iqbal
Speech Recognition by Iqbal
 
Personal voice assistant - jarvis
Personal voice assistant - jarvisPersonal voice assistant - jarvis
Personal voice assistant - jarvis
 
HISTORY OF COMPUTER
HISTORY OF COMPUTERHISTORY OF COMPUTER
HISTORY OF COMPUTER
 
Mind reading computer report
Mind reading computer reportMind reading computer report
Mind reading computer report
 
Speech Recognition by Iqbal
Speech Recognition by IqbalSpeech Recognition by Iqbal
Speech Recognition by Iqbal
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
Speech recognition An overview
Speech recognition An overviewSpeech recognition An overview
Speech recognition An overview
 
Install ubuntu
Install ubuntuInstall ubuntu
Install ubuntu
 
Text to-speech & voice recognition
Text to-speech & voice recognitionText to-speech & voice recognition
Text to-speech & voice recognition
 
Virtual Personal Assistant
Virtual Personal AssistantVirtual Personal Assistant
Virtual Personal Assistant
 

Viewers also liked

Jarvis learning process theory
Jarvis learning process theoryJarvis learning process theory
Jarvis learning process theoryGroup C
 
Artificial Intelligence Presentation
Artificial Intelligence PresentationArtificial Intelligence Presentation
Artificial Intelligence Presentationlpaviglianiti
 
Mini Jarvis Final Presentation
Mini Jarvis Final PresentationMini Jarvis Final Presentation
Mini Jarvis Final PresentationYash Sanghavi
 
Jarvis' model of adult learning
Jarvis' model of adult learningJarvis' model of adult learning
Jarvis' model of adult learningguest7428ab
 
Peter Jarvis' model of the learning process
Peter Jarvis' model of the learning processPeter Jarvis' model of the learning process
Peter Jarvis' model of the learning processJames Atherton
 
Voice interfaces
Voice interfacesVoice interfaces
Voice interfacesSam Machin
 
Evangelizing and Designing Voice User Interface: Adopting VUI in a GUI world
 Evangelizing and Designing Voice User Interface: Adopting VUI in a GUI world Evangelizing and Designing Voice User Interface: Adopting VUI in a GUI world
Evangelizing and Designing Voice User Interface: Adopting VUI in a GUI worldStephen Gay
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognitionCharu Joshi
 
Speech Recognition System By Matlab
Speech Recognition System By MatlabSpeech Recognition System By Matlab
Speech Recognition System By MatlabAnkit Gujrati
 
Insights & Opportunities in the Mobile Age
Insights & Opportunities in the Mobile AgeInsights & Opportunities in the Mobile Age
Insights & Opportunities in the Mobile AgeFlurry, Inc.
 
NFC Patent Analysis_Report-II
NFC Patent Analysis_Report-IINFC Patent Analysis_Report-II
NFC Patent Analysis_Report-IIHarshad Karmarkar
 
Voice/Speech recognition in mobile devices
Voice/Speech recognition in mobile devicesVoice/Speech recognition in mobile devices
Voice/Speech recognition in mobile devicesHarshad Karmarkar
 
Advances in Voice & Speech Recognition
Advances in Voice & Speech RecognitionAdvances in Voice & Speech Recognition
Advances in Voice & Speech RecognitionRising Media, Inc.
 
Speech recognition system
Speech recognition systemSpeech recognition system
Speech recognition systemRipal Ranpara
 
Voice Interfaces Usergroup Berlin - 05-10-2016 : Kay Lerch on Morse-Coder skill
Voice Interfaces Usergroup Berlin - 05-10-2016 : Kay Lerch on Morse-Coder skillVoice Interfaces Usergroup Berlin - 05-10-2016 : Kay Lerch on Morse-Coder skill
Voice Interfaces Usergroup Berlin - 05-10-2016 : Kay Lerch on Morse-Coder skillKay Lerch
 
Speech Recognition, Text to Speech, and Voice Interfaces
Speech Recognition, Text to Speech, and Voice InterfacesSpeech Recognition, Text to Speech, and Voice Interfaces
Speech Recognition, Text to Speech, and Voice InterfacesChristiana Vasquez
 
How to Succeed With Rewarded Video Ads
How to Succeed With Rewarded Video AdsHow to Succeed With Rewarded Video Ads
How to Succeed With Rewarded Video AdsSohan Maheshwar
 

Viewers also liked (20)

Burns ppt--jarvis
Burns ppt--jarvisBurns ppt--jarvis
Burns ppt--jarvis
 
Jarvis learning process theory
Jarvis learning process theoryJarvis learning process theory
Jarvis learning process theory
 
Artificial Intelligence Presentation
Artificial Intelligence PresentationArtificial Intelligence Presentation
Artificial Intelligence Presentation
 
Mini Jarvis Final Presentation
Mini Jarvis Final PresentationMini Jarvis Final Presentation
Mini Jarvis Final Presentation
 
Jarvis' model of adult learning
Jarvis' model of adult learningJarvis' model of adult learning
Jarvis' model of adult learning
 
Peter Jarvis' model of the learning process
Peter Jarvis' model of the learning processPeter Jarvis' model of the learning process
Peter Jarvis' model of the learning process
 
Voice interfaces
Voice interfacesVoice interfaces
Voice interfaces
 
Evangelizing and Designing Voice User Interface: Adopting VUI in a GUI world
 Evangelizing and Designing Voice User Interface: Adopting VUI in a GUI world Evangelizing and Designing Voice User Interface: Adopting VUI in a GUI world
Evangelizing and Designing Voice User Interface: Adopting VUI in a GUI world
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognition
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
Speech Recognition System By Matlab
Speech Recognition System By MatlabSpeech Recognition System By Matlab
Speech Recognition System By Matlab
 
Insights & Opportunities in the Mobile Age
Insights & Opportunities in the Mobile AgeInsights & Opportunities in the Mobile Age
Insights & Opportunities in the Mobile Age
 
NFC Patent Analysis_Report-II
NFC Patent Analysis_Report-IINFC Patent Analysis_Report-II
NFC Patent Analysis_Report-II
 
Voice/Speech recognition in mobile devices
Voice/Speech recognition in mobile devicesVoice/Speech recognition in mobile devices
Voice/Speech recognition in mobile devices
 
Thin Film Batteries
Thin Film BatteriesThin Film Batteries
Thin Film Batteries
 
Advances in Voice & Speech Recognition
Advances in Voice & Speech RecognitionAdvances in Voice & Speech Recognition
Advances in Voice & Speech Recognition
 
Speech recognition system
Speech recognition systemSpeech recognition system
Speech recognition system
 
Voice Interfaces Usergroup Berlin - 05-10-2016 : Kay Lerch on Morse-Coder skill
Voice Interfaces Usergroup Berlin - 05-10-2016 : Kay Lerch on Morse-Coder skillVoice Interfaces Usergroup Berlin - 05-10-2016 : Kay Lerch on Morse-Coder skill
Voice Interfaces Usergroup Berlin - 05-10-2016 : Kay Lerch on Morse-Coder skill
 
Speech Recognition, Text to Speech, and Voice Interfaces
Speech Recognition, Text to Speech, and Voice InterfacesSpeech Recognition, Text to Speech, and Voice Interfaces
Speech Recognition, Text to Speech, and Voice Interfaces
 
How to Succeed With Rewarded Video Ads
How to Succeed With Rewarded Video AdsHow to Succeed With Rewarded Video Ads
How to Succeed With Rewarded Video Ads
 

Similar to Where's Jarvis? The future of Voice Recognition and Natural Language User Interfaces.

Who's Using Our Product? A Story of Enterprise UX Research
Who's Using Our Product? A Story of Enterprise UX ResearchWho's Using Our Product? A Story of Enterprise UX Research
Who's Using Our Product? A Story of Enterprise UX ResearchUXPA International
 
UXPA 2016 - Using UX Skills to Shape Your Career
UXPA 2016 - Using UX Skills to Shape Your CareerUXPA 2016 - Using UX Skills to Shape Your Career
UXPA 2016 - Using UX Skills to Shape Your CareerAmanda Stockwell
 
Using UX skills to craft your career
Using UX skills to craft your careerUsing UX skills to craft your career
Using UX skills to craft your careerUXPA International
 
Voice Recognition and Natural Language - Dallas TechFest 2016
Voice Recognition and Natural Language - Dallas TechFest 2016Voice Recognition and Natural Language - Dallas TechFest 2016
Voice Recognition and Natural Language - Dallas TechFest 2016Crispin Reedy
 
What can social psychology teach us about (better) UX research?
What can social psychology teach us about (better) UX research?What can social psychology teach us about (better) UX research?
What can social psychology teach us about (better) UX research?UXPA International
 
UX Research within an Agile Design and Development Sprint Cycle
UX Research within an Agile Design and Development Sprint CycleUX Research within an Agile Design and Development Sprint Cycle
UX Research within an Agile Design and Development Sprint CycleUXPA International
 
Design Jams! How to run creative sessions with the people who use your product.
Design Jams! How to run creative sessions with the people who use your product.Design Jams! How to run creative sessions with the people who use your product.
Design Jams! How to run creative sessions with the people who use your product.UXPA International
 
Mature Products: The Cycle of UX Reinvention UXPA 2016
Mature Products: The Cycle of UX Reinvention UXPA 2016Mature Products: The Cycle of UX Reinvention UXPA 2016
Mature Products: The Cycle of UX Reinvention UXPA 2016Carol Smith
 
Mature Products: The Cycle of UX Reinvention
Mature Products: The Cycle of UX ReinventionMature Products: The Cycle of UX Reinvention
Mature Products: The Cycle of UX ReinventionUXPA International
 
Strategic User Experience Management
Strategic User Experience ManagementStrategic User Experience Management
Strategic User Experience ManagementUXPA International
 
Conversational User Interfaces, Past and Future
Conversational User Interfaces, Past and FutureConversational User Interfaces, Past and Future
Conversational User Interfaces, Past and FutureCrispin Reedy
 
UserZoom & UXPA Present a Webinar: Build a Better Experience
UserZoom & UXPA Present a Webinar: Build a Better ExperienceUserZoom & UXPA Present a Webinar: Build a Better Experience
UserZoom & UXPA Present a Webinar: Build a Better ExperienceUserZoom
 
Re-use and Recycle: Building sustainable relationships with your users
Re-use and Recycle: Building sustainable relationships with your usersRe-use and Recycle: Building sustainable relationships with your users
Re-use and Recycle: Building sustainable relationships with your usersUXPA International
 
Presumptive Design: "It's not research! We're getting stuff done!"
Presumptive Design: "It's not research! We're getting stuff done!"Presumptive Design: "It's not research! We're getting stuff done!"
Presumptive Design: "It's not research! We're getting stuff done!"UXPA International
 
Prototyping - 4 Strategic Factors for Designers - UXPA 2016
Prototyping - 4 Strategic Factors for Designers - UXPA 2016Prototyping - 4 Strategic Factors for Designers - UXPA 2016
Prototyping - 4 Strategic Factors for Designers - UXPA 2016Lyle Kantrovich
 
Prototyping - 4 Strategic Factors for Designers
Prototyping - 4 Strategic Factors for DesignersPrototyping - 4 Strategic Factors for Designers
Prototyping - 4 Strategic Factors for DesignersUXPA International
 
The UX Toolbelt for Developers
The UX Toolbelt for DevelopersThe UX Toolbelt for Developers
The UX Toolbelt for DevelopersSarah Dutkiewicz
 
Incorporating UX into Your Projects
Incorporating UX into Your ProjectsIncorporating UX into Your Projects
Incorporating UX into Your ProjectsKarl Kaufmann
 
Under the Knife: Plastic Surgery for Classic Software
Under the Knife: Plastic Surgery for Classic SoftwareUnder the Knife: Plastic Surgery for Classic Software
Under the Knife: Plastic Surgery for Classic SoftwareUXPA International
 
The Journey Towards Continuous Deployment
The Journey Towards Continuous DeploymentThe Journey Towards Continuous Deployment
The Journey Towards Continuous DeploymentBrian Mericle
 

Similar to Where's Jarvis? The future of Voice Recognition and Natural Language User Interfaces. (20)

Who's Using Our Product? A Story of Enterprise UX Research
Who's Using Our Product? A Story of Enterprise UX ResearchWho's Using Our Product? A Story of Enterprise UX Research
Who's Using Our Product? A Story of Enterprise UX Research
 
UXPA 2016 - Using UX Skills to Shape Your Career
UXPA 2016 - Using UX Skills to Shape Your CareerUXPA 2016 - Using UX Skills to Shape Your Career
UXPA 2016 - Using UX Skills to Shape Your Career
 
Using UX skills to craft your career
Using UX skills to craft your careerUsing UX skills to craft your career
Using UX skills to craft your career
 
Voice Recognition and Natural Language - Dallas TechFest 2016
Voice Recognition and Natural Language - Dallas TechFest 2016Voice Recognition and Natural Language - Dallas TechFest 2016
Voice Recognition and Natural Language - Dallas TechFest 2016
 
What can social psychology teach us about (better) UX research?
What can social psychology teach us about (better) UX research?What can social psychology teach us about (better) UX research?
What can social psychology teach us about (better) UX research?
 
UX Research within an Agile Design and Development Sprint Cycle
UX Research within an Agile Design and Development Sprint CycleUX Research within an Agile Design and Development Sprint Cycle
UX Research within an Agile Design and Development Sprint Cycle
 
Design Jams! How to run creative sessions with the people who use your product.
Design Jams! How to run creative sessions with the people who use your product.Design Jams! How to run creative sessions with the people who use your product.
Design Jams! How to run creative sessions with the people who use your product.
 
Mature Products: The Cycle of UX Reinvention UXPA 2016
Mature Products: The Cycle of UX Reinvention UXPA 2016Mature Products: The Cycle of UX Reinvention UXPA 2016
Mature Products: The Cycle of UX Reinvention UXPA 2016
 
Mature Products: The Cycle of UX Reinvention
Mature Products: The Cycle of UX ReinventionMature Products: The Cycle of UX Reinvention
Mature Products: The Cycle of UX Reinvention
 
Strategic User Experience Management
Strategic User Experience ManagementStrategic User Experience Management
Strategic User Experience Management
 
Conversational User Interfaces, Past and Future
Conversational User Interfaces, Past and FutureConversational User Interfaces, Past and Future
Conversational User Interfaces, Past and Future
 
UserZoom & UXPA Present a Webinar: Build a Better Experience
UserZoom & UXPA Present a Webinar: Build a Better ExperienceUserZoom & UXPA Present a Webinar: Build a Better Experience
UserZoom & UXPA Present a Webinar: Build a Better Experience
 
Re-use and Recycle: Building sustainable relationships with your users
Re-use and Recycle: Building sustainable relationships with your usersRe-use and Recycle: Building sustainable relationships with your users
Re-use and Recycle: Building sustainable relationships with your users
 
Presumptive Design: "It's not research! We're getting stuff done!"
Presumptive Design: "It's not research! We're getting stuff done!"Presumptive Design: "It's not research! We're getting stuff done!"
Presumptive Design: "It's not research! We're getting stuff done!"
 
Prototyping - 4 Strategic Factors for Designers - UXPA 2016
Prototyping - 4 Strategic Factors for Designers - UXPA 2016Prototyping - 4 Strategic Factors for Designers - UXPA 2016
Prototyping - 4 Strategic Factors for Designers - UXPA 2016
 
Prototyping - 4 Strategic Factors for Designers
Prototyping - 4 Strategic Factors for DesignersPrototyping - 4 Strategic Factors for Designers
Prototyping - 4 Strategic Factors for Designers
 
The UX Toolbelt for Developers
The UX Toolbelt for DevelopersThe UX Toolbelt for Developers
The UX Toolbelt for Developers
 
Incorporating UX into Your Projects
Incorporating UX into Your ProjectsIncorporating UX into Your Projects
Incorporating UX into Your Projects
 
Under the Knife: Plastic Surgery for Classic Software
Under the Knife: Plastic Surgery for Classic SoftwareUnder the Knife: Plastic Surgery for Classic Software
Under the Knife: Plastic Surgery for Classic Software
 
The Journey Towards Continuous Deployment
The Journey Towards Continuous DeploymentThe Journey Towards Continuous Deployment
The Journey Towards Continuous Deployment
 

More from UXPA International

UXPA 2023: Start Strong - Lessons learned from associate programs to platform...
UXPA 2023: Start Strong - Lessons learned from associate programs to platform...UXPA 2023: Start Strong - Lessons learned from associate programs to platform...
UXPA 2023: Start Strong - Lessons learned from associate programs to platform...UXPA International
 
UXPA 2023: Disrupting Inaccessibility: Applying A11Y-Focused Discovery & Idea...
UXPA 2023: Disrupting Inaccessibility: Applying A11Y-Focused Discovery & Idea...UXPA 2023: Disrupting Inaccessibility: Applying A11Y-Focused Discovery & Idea...
UXPA 2023: Disrupting Inaccessibility: Applying A11Y-Focused Discovery & Idea...UXPA International
 
UXPA 2023 Poster: ESG & Sustainable UX
UXPA 2023 Poster: ESG & Sustainable UXUXPA 2023 Poster: ESG & Sustainable UX
UXPA 2023 Poster: ESG & Sustainable UXUXPA International
 
UXPA 2023 Poster: The Two Tracks of UX Under Agile: Tactical and Strategic
UXPA 2023 Poster: The Two Tracks of UX Under Agile: Tactical and StrategicUXPA 2023 Poster: The Two Tracks of UX Under Agile: Tactical and Strategic
UXPA 2023 Poster: The Two Tracks of UX Under Agile: Tactical and StrategicUXPA International
 
UXPA 2023: Data science and UX: Smarter together
UXPA 2023: Data science and UX: Smarter togetherUXPA 2023: Data science and UX: Smarter together
UXPA 2023: Data science and UX: Smarter togetherUXPA International
 
UXPA 2023: UX Fracking: Using Mixed Methods to Extract Hidden Insights
UXPA 2023: UX Fracking: Using Mixed Methods to Extract Hidden InsightsUXPA 2023: UX Fracking: Using Mixed Methods to Extract Hidden Insights
UXPA 2023: UX Fracking: Using Mixed Methods to Extract Hidden InsightsUXPA International
 
UXPA 2023 Poster: Are virtual spaces the future of video conferencing?
UXPA 2023 Poster: Are virtual spaces the future of video conferencing?UXPA 2023 Poster: Are virtual spaces the future of video conferencing?
UXPA 2023 Poster: Are virtual spaces the future of video conferencing?UXPA International
 
UXPA 2023: Learn how to get over personas by swiping right on user roles
UXPA 2023: Learn how to get over personas by swiping right on user rolesUXPA 2023: Learn how to get over personas by swiping right on user roles
UXPA 2023: Learn how to get over personas by swiping right on user rolesUXPA International
 
UXPA 2023 Poster: Pocket Research Guide - Empower your Solution and Foster Cu...
UXPA 2023 Poster: Pocket Research Guide - Empower your Solution and Foster Cu...UXPA 2023 Poster: Pocket Research Guide - Empower your Solution and Foster Cu...
UXPA 2023 Poster: Pocket Research Guide - Empower your Solution and Foster Cu...UXPA International
 
UXPA 2023: Experience Maps - A designer's framework for working in Agile team...
UXPA 2023: Experience Maps - A designer's framework for working in Agile team...UXPA 2023: Experience Maps - A designer's framework for working in Agile team...
UXPA 2023: Experience Maps - A designer's framework for working in Agile team...UXPA International
 
UXPA 2023 Poster: Atomic Research in Practice: Using a Feedback Repository to...
UXPA 2023 Poster: Atomic Research in Practice: Using a Feedback Repository to...UXPA 2023 Poster: Atomic Research in Practice: Using a Feedback Repository to...
UXPA 2023 Poster: Atomic Research in Practice: Using a Feedback Repository to...UXPA International
 
UXPA 2023 Poster: Leveraging Dial Testing To Measure Real-Time User Frustrati...
UXPA 2023 Poster: Leveraging Dial Testing To Measure Real-Time User Frustrati...UXPA 2023 Poster: Leveraging Dial Testing To Measure Real-Time User Frustrati...
UXPA 2023 Poster: Leveraging Dial Testing To Measure Real-Time User Frustrati...UXPA International
 
UXPA 2023: UX Enterprise Story: How to apply a UX process to a company withou...
UXPA 2023: UX Enterprise Story: How to apply a UX process to a company withou...UXPA 2023: UX Enterprise Story: How to apply a UX process to a company withou...
UXPA 2023: UX Enterprise Story: How to apply a UX process to a company withou...UXPA International
 
UXPA 2023: High-Fives over Zoom: Creating a Remote-First Creative Team
UXPA 2023: High-Fives over Zoom: Creating a Remote-First Creative TeamUXPA 2023: High-Fives over Zoom: Creating a Remote-First Creative Team
UXPA 2023: High-Fives over Zoom: Creating a Remote-First Creative TeamUXPA International
 
UXPA 2023: Behind the Bias: Dissecting human shortcuts for better research & ...
UXPA 2023: Behind the Bias: Dissecting human shortcuts for better research & ...UXPA 2023: Behind the Bias: Dissecting human shortcuts for better research & ...
UXPA 2023: Behind the Bias: Dissecting human shortcuts for better research & ...UXPA International
 
UXPA 2023 Poster: Improving the Internal and External User Experience of a Fe...
UXPA 2023 Poster: Improving the Internal and External User Experience of a Fe...UXPA 2023 Poster: Improving the Internal and External User Experience of a Fe...
UXPA 2023 Poster: Improving the Internal and External User Experience of a Fe...UXPA International
 
UXPA 2023 Poster: 5 Key Findings from Moderated Accessibility Testing with Sc...
UXPA 2023 Poster: 5 Key Findings from Moderated Accessibility Testing with Sc...UXPA 2023 Poster: 5 Key Findings from Moderated Accessibility Testing with Sc...
UXPA 2023 Poster: 5 Key Findings from Moderated Accessibility Testing with Sc...UXPA International
 
UXPA 2023: Lessons for new managers
UXPA 2023: Lessons for new managersUXPA 2023: Lessons for new managers
UXPA 2023: Lessons for new managersUXPA International
 
UXPA 2023: Redesigning An Automotive Feature from Gasoline to Electric Vehicl...
UXPA 2023: Redesigning An Automotive Feature from Gasoline to Electric Vehicl...UXPA 2023: Redesigning An Automotive Feature from Gasoline to Electric Vehicl...
UXPA 2023: Redesigning An Automotive Feature from Gasoline to Electric Vehicl...UXPA International
 

More from UXPA International (20)

UXPA 2023: Start Strong - Lessons learned from associate programs to platform...
UXPA 2023: Start Strong - Lessons learned from associate programs to platform...UXPA 2023: Start Strong - Lessons learned from associate programs to platform...
UXPA 2023: Start Strong - Lessons learned from associate programs to platform...
 
UXPA 2023: Disrupting Inaccessibility: Applying A11Y-Focused Discovery & Idea...
UXPA 2023: Disrupting Inaccessibility: Applying A11Y-Focused Discovery & Idea...UXPA 2023: Disrupting Inaccessibility: Applying A11Y-Focused Discovery & Idea...
UXPA 2023: Disrupting Inaccessibility: Applying A11Y-Focused Discovery & Idea...
 
UXPA 2023 Poster: ESG & Sustainable UX
UXPA 2023 Poster: ESG & Sustainable UXUXPA 2023 Poster: ESG & Sustainable UX
UXPA 2023 Poster: ESG & Sustainable UX
 
UXPA 2023 Poster: The Two Tracks of UX Under Agile: Tactical and Strategic
UXPA 2023 Poster: The Two Tracks of UX Under Agile: Tactical and StrategicUXPA 2023 Poster: The Two Tracks of UX Under Agile: Tactical and Strategic
UXPA 2023 Poster: The Two Tracks of UX Under Agile: Tactical and Strategic
 
UXPA 2023: Data science and UX: Smarter together
UXPA 2023: Data science and UX: Smarter togetherUXPA 2023: Data science and UX: Smarter together
UXPA 2023: Data science and UX: Smarter together
 
UXPA 2023: UX Fracking: Using Mixed Methods to Extract Hidden Insights
UXPA 2023: UX Fracking: Using Mixed Methods to Extract Hidden InsightsUXPA 2023: UX Fracking: Using Mixed Methods to Extract Hidden Insights
UXPA 2023: UX Fracking: Using Mixed Methods to Extract Hidden Insights
 
UXPA 2023 Poster: Are virtual spaces the future of video conferencing?
UXPA 2023 Poster: Are virtual spaces the future of video conferencing?UXPA 2023 Poster: Are virtual spaces the future of video conferencing?
UXPA 2023 Poster: Are virtual spaces the future of video conferencing?
 
UXPA 2023: Learn how to get over personas by swiping right on user roles
UXPA 2023: Learn how to get over personas by swiping right on user rolesUXPA 2023: Learn how to get over personas by swiping right on user roles
UXPA 2023: Learn how to get over personas by swiping right on user roles
 
UXPA 2023: F@#$ User Personas
UXPA 2023: F@#$ User PersonasUXPA 2023: F@#$ User Personas
UXPA 2023: F@#$ User Personas
 
UXPA 2023 Poster: Pocket Research Guide - Empower your Solution and Foster Cu...
UXPA 2023 Poster: Pocket Research Guide - Empower your Solution and Foster Cu...UXPA 2023 Poster: Pocket Research Guide - Empower your Solution and Foster Cu...
UXPA 2023 Poster: Pocket Research Guide - Empower your Solution and Foster Cu...
 
UXPA 2023: Experience Maps - A designer's framework for working in Agile team...
UXPA 2023: Experience Maps - A designer's framework for working in Agile team...UXPA 2023: Experience Maps - A designer's framework for working in Agile team...
UXPA 2023: Experience Maps - A designer's framework for working in Agile team...
 
UXPA 2023 Poster: Atomic Research in Practice: Using a Feedback Repository to...
UXPA 2023 Poster: Atomic Research in Practice: Using a Feedback Repository to...UXPA 2023 Poster: Atomic Research in Practice: Using a Feedback Repository to...
UXPA 2023 Poster: Atomic Research in Practice: Using a Feedback Repository to...
 
UXPA 2023 Poster: Leveraging Dial Testing To Measure Real-Time User Frustrati...
UXPA 2023 Poster: Leveraging Dial Testing To Measure Real-Time User Frustrati...UXPA 2023 Poster: Leveraging Dial Testing To Measure Real-Time User Frustrati...
UXPA 2023 Poster: Leveraging Dial Testing To Measure Real-Time User Frustrati...
 
UXPA 2023: UX Enterprise Story: How to apply a UX process to a company withou...
UXPA 2023: UX Enterprise Story: How to apply a UX process to a company withou...UXPA 2023: UX Enterprise Story: How to apply a UX process to a company withou...
UXPA 2023: UX Enterprise Story: How to apply a UX process to a company withou...
 
UXPA 2023: High-Fives over Zoom: Creating a Remote-First Creative Team
UXPA 2023: High-Fives over Zoom: Creating a Remote-First Creative TeamUXPA 2023: High-Fives over Zoom: Creating a Remote-First Creative Team
UXPA 2023: High-Fives over Zoom: Creating a Remote-First Creative Team
 
UXPA 2023: Behind the Bias: Dissecting human shortcuts for better research & ...
UXPA 2023: Behind the Bias: Dissecting human shortcuts for better research & ...UXPA 2023: Behind the Bias: Dissecting human shortcuts for better research & ...
UXPA 2023: Behind the Bias: Dissecting human shortcuts for better research & ...
 
UXPA 2023 Poster: Improving the Internal and External User Experience of a Fe...
UXPA 2023 Poster: Improving the Internal and External User Experience of a Fe...UXPA 2023 Poster: Improving the Internal and External User Experience of a Fe...
UXPA 2023 Poster: Improving the Internal and External User Experience of a Fe...
 
UXPA 2023 Poster: 5 Key Findings from Moderated Accessibility Testing with Sc...
UXPA 2023 Poster: 5 Key Findings from Moderated Accessibility Testing with Sc...UXPA 2023 Poster: 5 Key Findings from Moderated Accessibility Testing with Sc...
UXPA 2023 Poster: 5 Key Findings from Moderated Accessibility Testing with Sc...
 
UXPA 2023: Lessons for new managers
UXPA 2023: Lessons for new managersUXPA 2023: Lessons for new managers
UXPA 2023: Lessons for new managers
 
UXPA 2023: Redesigning An Automotive Feature from Gasoline to Electric Vehicl...
UXPA 2023: Redesigning An Automotive Feature from Gasoline to Electric Vehicl...UXPA 2023: Redesigning An Automotive Feature from Gasoline to Electric Vehicl...
UXPA 2023: Redesigning An Automotive Feature from Gasoline to Electric Vehicl...
 

Recently uploaded

Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 

Recently uploaded (20)

Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 

Where's Jarvis? The future of Voice Recognition and Natural Language User Interfaces.

Editor's Notes

  1. Voice User Interface Designer 10 years in the field English major, former coder; got interested in UX President of the Association for Voice Interaction Design Consultant for Versay Solutions 2 weeks in a row for conferences
  2. Jarvis: Audio and gestural Perfect recognition. No error recovery needed Great voice quality Connected to vast amounts of data Understands all the parts of the model: “Lose the landscape.” Context-sensitive. Aware of the space around him Sense of humor. “Am I to include the Belgian Waffle stands?” Takes initiative. “What is it you’re trying to achieve, sir?”
  3. Replicator: Good recognition No error recovery needed Good voice quality – understandable Connected to data – perhaps too much so? Context sensitive- but was this enough? A design failure (not a tech failure) Specifically around excessive disambiguation
  4. A Better Replicator Conversation
  5. “Speech to Text” ? Spoken Language – Machine readable format
  6. Not necessarily tied to speech recognition
  7. Also called voiceprints, biometrics, voice authentication, etc. Not going to discuss this one in a lot of detail today but it’s important that you understand the difference between these technologies. Recognizes a person, not necessarily what they are saying. You can have ASR without Voice Verification And vice versa
  8. Human voice talent Hundreds of hours of recording Digitized Phonemes: Concatenated speech synthesis
  9. Dynamic Speech Synthesis Many commercial products are available API-based Downloadable Quality varies If possible, record audio TTS has improved considerably, but is still noticeable High quality TTS may not be available in all situations If you have a lot of dynamic data TTS is useful You can mix recorded audio and TTS You may have to use TTS Voice Agent (Alexa, Cortana, etc.) API-based Some of them do let you mark up your TTS with SSML More phonemes = higher quality voice Also means a bigger download and install (if on device) Exceptions (addresses, names) can be iffy May require a lot of work to handle well St. James St. Saint James Street Punctuation Your data needs to be clean and ready to voice back Acronyms, incomplete sentences will not sound good It is possible to build a custom voice But it takes a lot of work!
  10. Speech Synthesis Markup Language XML based WC3 standard Not universally supported Tags which allow you produce a more natural quality output. Emphasis Break Voice Prosody Pitch
  11. World Knowledge: Concepts of the world around us, i.e. Tables have four legs, what is left and right, what is a car, etc. This is the level before language Semantics: The first level of language. Knowledge can be represented in structured meaningful elements. Example: semantics of a party invitation Syntax: The rules that govern putting words together to form meaningful units Lexicon: What words mean Morphology: How words change their form to perform differently in a language i.e. horse / horses Phonetics: Phonemes and how words are built Acoustics: What phonemes sound like and how to create them
  12. Speech is never stationary Coarticulation Noisy environments Accents Different speakers have voices with different acoustic qualities Goats Challenges vary depending on what you are going to recognize Spelling (short utterances) can be difficult even for humans Phonetic alphabet (Military)
  13. Humans can deduce meaning from context and unknown words “How can I help you?” I’m having a problem with my account. I’d like that one. No, not the green one, the red one. Time flies like an arrow. Fruit flies like a banana.
  14. All modern speech recognition is probabilistic GUI: Button clicked? true / false VUI: There is an 85% chance that button was clicked
  15. Three Dimensions of Speech Problems
  16. AUDREY: Davis, Biddulph, and Balashek - Bell Labs 1952 Analog Isolated digit recognition Pause between digits Speaker-dependent Speech recognition with vacuum tubes – How very steampunk. Her name was AUDREY. Let that sink in a minute. (Automatic Digit Recognizer)
  17. 1980’s: The Power of Statistics The recognition of connected speech becomes a search for the best path in a large network Problem of finding the probabilities Statistical Language Models Not all sequences of words are equally probable Rank all permissible sentences in terms of probability “Correct” grammar is not applicable Restricted by domain Hidden Markov Models (HMM) Unified probabilistic model for speech
  18. You’re Only As Good As What You’re Trained On Corpora Collection of speech used to train a recognizer Acoustic and/or Pronunciation Model Associates sounds with symbols and words. Created by a general speech corpora and a phonetic and orthographic transcription Statistical Language Model (SLM) A probability distribution over sequences of words Created by a domain-specific speech corpora and a tagged transcription to extract meaning
  19. Speech Agent: The “Person” who Distributed speech recognition Collection and compression of speech is on the device The language models are typically on the network Phone can be speaker-dependent Trains itself on your voice and on the acoustic environments you are in most often Many companies are providing APIs to use their speech recognition
  20. Alexa, Ask Capitol One What’s my current credit card balance?
  21. Observations to make: Represents the entirety of a VUI experience Placement of Spanish prompt would vary depending on type of call. Confirmation is variable Confirmation prompt is general
  22. What do you need it for? What kind of device will you be running it on? Connectivity? Can you use cloud based ASR? How much control do you need over the application / user interface?
  23. Jarvis: Audio and gestural Perfect recognition. No error recovery needed Great voice quality Connected to vast amounts of data Understands all the parts of the model: “Lose the landscape.” Context-sensitive. Aware of the space around him Sense of humor. “Am I to include the Belgian Waffle stands?” Takes initiative. “What is it you’re trying to achieve, sir?”