How Siri works?
Srivatsan Srinivasan
Chief Data Scientist
Data Science Blogger
On AI Use Cases
Chief Data Scientist, LinkedIn Published Author
I am not a data scientist, so please, no jargon. I have always wondered: how does Siri work?
Did you ever see a dog making things complicated? Let me explain.
Concept # 1
Let’s find out how Siri records your voice on an iPhone and converts it to a digital format.
Neuron Learning
Try this …
Hold your hand close to your mouth and say the word “Boom”. What did you feel?
When you speak, you create pressure waves in the air, and a microphone records sound by measuring these changes in air pressure. Your eardrum works on a similar principle.
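To make "convert to digital format" concrete, here is a minimal sketch of how a pressure wave could be turned into numbers: measure it many times per second (sampling) and round each measurement to an integer (quantization). The 16,000 samples per second, the 16-bit range, and the 200 Hz test tone are illustrative assumptions, not details of how Siri actually records audio.

```python
import math

SAMPLE_RATE = 16_000   # samples per second (assumed value)
BIT_DEPTH = 16         # bits per sample (assumed value)


def digitize(pressure, duration_s, sample_rate=SAMPLE_RATE, bit_depth=BIT_DEPTH):
    """Sample a continuous pressure function and quantize it to signed integers.

    `pressure` maps time in seconds to a value in [-1.0, 1.0].
    """
    max_level = 2 ** (bit_depth - 1) - 1           # 32767 for 16-bit audio
    n_samples = round(duration_s * sample_rate)
    samples = []
    for n in range(n_samples):
        t = n / sample_rate                        # time of the n-th measurement
        value = max(-1.0, min(1.0, pressure(t)))   # clip to the valid range
        samples.append(round(value * max_level))   # round to an integer level
    return samples


def tone(t, freq_hz=200.0):
    """A stand-in for a voice: a pure 200 Hz pressure wave."""
    return math.sin(2 * math.pi * freq_hz * t)


digital = digitize(tone, duration_s=0.01)
print(len(digital), digital[:5])
```

The resulting list of integers is the "digital format": the same wave, stored as numbers a computer can process.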
Show me the waves
Below is a typical sound wave. In this case, I was asking Siri for directions.
After this step, Siri converts your speech to text. This requires a lot of compute power, so your voice is sent to the cloud. That is why Siri doesn’t work without an internet connection.
Recap: let’s see how Siri converts speech to text
1. You say “Hello”.
2. Siri records your voice.
3. Your voice is sent over the internet.
4. The speech is converted to text.
Concept # 2
Machine Learning (ML) algorithms are trained on lots of voice samples, each paired with its accurate “text”. This is very similar to how we teach kids by showing them pictures: “This is a dog”.
Over 100,000 hours of audio is used for training, with labelled snippets such as “my name”, “how long is”, “what”, and “Columbus”. That is equal to a full-time employee working for over 50 years.
This voice labelling is done by humans and is then provided to the algorithms as ground truth.
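The 50-year figure can be checked with quick arithmetic. The 100,000 hours comes from the slide; the 40-hour week and 50 working weeks per year are assumptions about what "full time" means here.

```python
AUDIO_HOURS = 100_000   # from the slide
HOURS_PER_WEEK = 40     # assumption: a standard full-time week
WEEKS_PER_YEAR = 50     # assumption: about two weeks off per year

hours_per_year = HOURS_PER_WEEK * WEEKS_PER_YEAR   # 2,000 working hours
years_of_work = AUDIO_HOURS / hours_per_year

print(years_of_work)
```

This only counts the time needed to listen to the audio once at normal speed, so it is a lower bound on the labelling effort.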
Apple has been in the news for privacy concerns.
Source: TheGuardian.com
This is very important: to train an ML algorithm, you need data that has been correctly labelled by humans. I highly recommend rereading the last 3 pages if this is not clear.
Are you saying that during training, the algorithm predicts the text from voice samples, and these predictions are compared against the actual labels provided by human beings?
YES, you got it!
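One common way to compare a predicted text against the human label is to count how many words would have to be added, removed, or changed to turn one into the other. The sketch below does this with a word-level edit distance. The sample sentences are made up for illustration, and the slides do not say which comparison Siri's training uses.

```python
def edit_distance(ref, hyp):
    """Minimum number of insertions, deletions, and substitutions turning ref into hyp."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,           # drop a word from ref
                            curr[j - 1] + 1,       # add a word from hyp
                            prev[j - 1] + cost))   # keep or swap a word
        prev = curr
    return prev[-1]


def word_error_rate(label, prediction):
    """Fraction of label words the prediction got wrong (label must be non-empty)."""
    ref = label.lower().split()
    hyp = prediction.lower().split()
    return edit_distance(ref, hyp) / len(ref)


# Hypothetical (human label, algorithm prediction) pairs.
samples = [
    ("call wife", "call wife"),
    ("what is the weather in columbus", "what is the whether in columbus"),
]
for label, prediction in samples:
    print(label, "|", prediction, "|", word_error_rate(label, prediction))
```

A score of 0 means the prediction matches the label exactly; the second pair has one wrong word out of six.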
How does the algorithm learn to make good predictions?
That is similar to how I learn: when I do a good job, you give me feedback, and then I do more of that.
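The feedback idea can be shown with a toy example. The model below has a single adjustable number `w` and tries to predict `y` from `x`. After every round it measures how wrong it was and nudges `w` to be less wrong next time (a technique called gradient descent). The data, the starting guess, and the learning rate are all made up for illustration; real speech models adjust millions of numbers in the same spirit.

```python
# Toy data: the "right answers" follow y = 3 * x.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 6.0, 9.0, 12.0]

w = 0.0               # the model's single adjustable number, starting from a bad guess
learning_rate = 0.01  # how strongly each round of feedback changes w

for step in range(200):
    # Feedback: how wrong each prediction is, and in which direction.
    errors = [w * x - y for x, y in zip(xs, ys)]
    gradient = 2 * sum(e * x for e, x in zip(errors, xs)) / len(xs)
    # Nudge w in the direction that reduces the error.
    w -= learning_rate * gradient

print(round(w, 3))
```

After enough rounds of feedback, `w` settles very close to 3, the value that makes the predictions match the labels.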
Character-level predictions
The latest algorithms predict the characters in an audio file. The prediction is similar to how a non-English speaker would spell the word “Wife” if it were said very slowly: “WWIIIFFEE”. After that, rules such as removing duplicate characters are applied to get the most probable word.
WW_III_FF_EE → WIFE
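The "remove duplicate characters" rule can be sketched in a few lines: first merge each run of repeated characters into one, then drop the separator symbol. Treating `_` as a separator between characters is an assumption based on the slide's notation; the function name `collapse` is made up for this example.

```python
from itertools import groupby

BLANK = "_"  # assumed separator symbol, matching the slide's notation


def collapse(raw):
    """Merge runs of repeated characters, then remove the separator."""
    merged = (char for char, _run in groupby(raw))
    return "".join(char for char in merged if char != BLANK)


print(collapse("WW_III_FF_EE"))
print(collapse("HHEE_LL_LLOO"))
```

The separator is what lets a word keep a genuine double letter: in the second example, the `_` between the two runs of `L` stops them from being merged into a single `L`.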
Intent behind the text
Let’s assume a user tells Siri “Call Wife”. Based on what we have learned so far, we know Siri will convert this to the text “Call Wife”. How does Siri know what these words mean? That is covered in our last concept. You are almost there. Sit tight!
Concept # 3
Siri doesn’t try to figure out exactly what you said, but rather connects you to the most relevant services / apps based on “trigger words”.
“Call Wife”
After Siri has converted your speech to text, it identifies “Call” as a trigger word in “Call Wife” and, based on that, invokes the call-placing application.
Some trigger words..
Tell
Message
Text
Set Appointment
Book Meeting
Alarm
Wake me
Temperature
Umbrella
Weather
Given the large volume of voice samples, Siri has a good sense of the many different ways people ask for, say, “Weather”.
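A minimal sketch of trigger-word matching, using the words listed above plus “call”. The grouping of triggers into services and the service names (`phone`, `messages`, `calendar`, `clock`, `weather`) are assumptions made for illustration, not Siri's actual design.

```python
import re

# Trigger word or phrase -> hypothetical service name.
TRIGGERS = {
    "call": "phone",
    "tell": "messages",
    "message": "messages",
    "text": "messages",
    "set appointment": "calendar",
    "book meeting": "calendar",
    "alarm": "clock",
    "wake me": "clock",
    "temperature": "weather",
    "umbrella": "weather",
    "weather": "weather",
}


def find_service(text):
    """Return the service for the first trigger found in the text, or None."""
    words = re.findall(r"[a-z']+", text.lower())   # lowercase words, punctuation dropped
    padded = " " + " ".join(words) + " "           # spaces so only whole words match
    for trigger, service in TRIGGERS.items():
        if f" {trigger} " in padded:
            return service
    return None


print(find_service("Call Wife"))
print(find_service("Do I need an umbrella today?"))
print(find_service("Wake me up at 7"))
```

Several different phrasings lead to the same service, which is the point of the slide: "umbrella", "temperature", and "weather" all end up at the weather service.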
Back to our “Call Wife” example
Once Siri realizes it has to place a call, it gets the name of the person to be called from the text. In our case, if a contact named “Wife” is saved, Siri places the call and completes your request.
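The “Call Wife” steps can be sketched as: check for the trigger word, take the rest of the text as the contact name, and look that name up. The address book, the phone number, and the reply strings are all hypothetical, and what Siri does when the contact is missing is an assumption, since the slides do not say.

```python
CONTACTS = {"wife": "+1-555-0100"}  # hypothetical address book


def handle_call(text):
    """Handle a request of the form 'Call <name>'; return None for anything else."""
    words = text.split()
    if not words or words[0].lower() != "call":
        return None                            # not a call request
    name = " ".join(words[1:])                 # everything after the trigger word
    number = CONTACTS.get(name.lower())
    if number is None:
        return f"No contact named '{name}' found"   # assumed behaviour
    return f"Calling {name} at {number}"


print(handle_call("Call Wife"))
print(handle_call("Call Boss"))
```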
You have seen this default answer…
When Siri stumbles upon a request beyond its capability, it mostly invokes a web search.
This happens, for example, when you ask Siri to draw a lion.
OMG. I cannot believe how easy that was to understand. How can I continue to learn more?
Follow Amol Palekar on LinkedIn to get the next edition of Neuron Learning.
For email notifications, subscribe at NeuronTimes.com.
Resources
Natural Language Processing by National Research University Higher School of Economics on Coursera
Sequence Models by deeplearning.ai on Coursera
Adam Coates, Deep Speech at BAMMF, on YouTube: https://www.youtube.com/watch?v=hyZCH3xU42E&list=PLes5o2b5ie2XMM4SLb7drQjQEPnalgJLl
Adam Coates, Deep Learning for Speech Recognition, on YouTube: https://www.youtube.com/watch?v=g-sndkf7mCs&t=1980s
TheGuardian.com
These are my personal opinions and are for educational purposes only. Content has been simplified to convey core ideas to non-data scientists.
