Speech Recognition API
DMI Internal Tech Talk
Sokna Ly
Associate Software Engineer, iOS
Content
1. What is it?
2. Availability
3. Limitation
4. Privacy and Usability
5. Using Speech Recognition & Demo
What is it?
• New Framework for iOS 10
• Same technology as Siri and Dictation
• Fast: results arrive as your users speak
• Accurate
• Flexible: works with live and pre-recorded audio
What is it?
• Before Speech Recognition:
  • Required presenting the keyboard
  • Live audio input only
  • Couldn’t customize the language
Availability
• Over 50 languages
• Any device running iOS 10
• Usually requires an Internet connection (except for some languages and device models)
• Always requires user permission
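
The availability points above can be checked at runtime. The following is a minimal sketch, not code from the talk: the helper name `canRecognizeSpeech` and the "en-US" locale are illustrative assumptions.

```swift
import Foundation
import Speech

/// Hypothetical helper: returns true when a recognizer exists for the
/// given locale and is usable right now.
func canRecognizeSpeech(in locale: Locale) -> Bool {
    // The initializer is failable: it returns nil for unsupported locales.
    guard let recognizer = SFSpeechRecognizer(locale: locale) else {
        return false
    }
    // isAvailable can be false even for a supported locale,
    // for example when there is no network connection.
    return recognizer.isAvailable
}

// How many locales the framework reports as supported on this device.
print("Supported locales:", SFSpeechRecognizer.supportedLocales().count)

// Assumed example locale; substitute the language your app needs.
print("en-US usable now:", canRecognizeSpeech(in: Locale(identifier: "en-US")))
```

Availability is separate from user permission: a recognizer can be available while the app is still unauthorized.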
Limitation
• It’s free, but not unlimited
  • Per-device, per-day recognition limits
  • Per-app limits
  • Be prepared to handle failures
  • If you keep hitting the limits, talk to Apple
• Speech recognition isn’t cheap
  • Requires power and data
  • Maximum utterance duration: about one minute
Privacy and Usability
• Make sure users know they’re being recorded
  • Show something in the UI
• Some speech is not appropriate for recognition
  • Passwords
  • Sensitive speech
• Show recognition results to the user before acting on them
Using Speech Recognition
• Explain why in your Info.plist (NSSpeechRecognitionUsageDescription key)
• Request authorization using SFSpeechRecognizer.requestAuthorization
• Create a recognition request
  • Pre-recorded audio on disk: use SFSpeechURLRecognitionRequest
  • Live audio or audio in memory: use SFSpeechAudioBufferRecognitionRequest
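
The steps above can be sketched for the pre-recorded case as follows. This is a hedged illustration rather than the demo code from the talk: the function name `transcribeFile` and its completion signature are assumptions, and it presumes the Info.plist already contains an NSSpeechRecognitionUsageDescription entry.

```swift
import Foundation
import Speech

/// Hypothetical helper: transcribes an audio file on disk and passes the
/// text (or nil on any failure) to `completion`.
func transcribeFile(at url: URL, completion: @escaping (String?) -> Void) {
    // 1. Ask the user for permission. The handler may run on a background queue.
    SFSpeechRecognizer.requestAuthorization { status in
        guard status == .authorized else {
            completion(nil) // denied, restricted, or not yet determined
            return
        }

        // 2. Create a recognizer for the user's current locale and make
        //    sure it is usable right now.
        guard let recognizer = SFSpeechRecognizer(), recognizer.isAvailable else {
            completion(nil)
            return
        }

        // 3. Pre-recorded audio on disk: use a URL-based request.
        let request = SFSpeechURLRecognitionRequest(url: url)
        // Only deliver the final transcription, not partial results.
        request.shouldReportPartialResults = false

        // 4. Start recognition. Failures (limits, network, and so on)
        //    arrive through `error`, so handle them explicitly.
        recognizer.recognitionTask(with: request) { result, error in
            if let result = result, result.isFinal {
                completion(result.bestTranscription.formattedString)
            } else if error != nil {
                completion(nil)
            }
        }
    }
}
```

For live audio, the same flow would use SFSpeechAudioBufferRecognitionRequest and append audio buffers to it instead of passing a file URL. In either case, show the returned text to the user before acting on it.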
Demo
Thanks
