SlideShare a Scribd company logo
1 of 22
Download to read offline
iOS Dev Scout - May Meetup

                         Building A Siri-like Application
                        Subhransu Behera (Subh) - iOS Application Architect in SAP




                                                           Twitter : @subhransu
                                                           Blog : http://subhb.org
                                                           Email : subhransu.behera@sap.com

Wednesday, May 30, 12
About Me
                        •   iOS and Ruby Developer

                        •   Have passion for Mobile, Web and Smart TV Apps.

                        •   Started as a Linux Developer, fell in love with Ruby, happily
                            married to iOS

                        •   M.S. in Software Systems from BITS, Bachelors in Information
                            Technology from BPUT, Graduated from Founder Institute

                        •   Worked both in early stage startup and fortune 500
                            companies including one from MIT, Media-Labs. Currently
                            working SAP APJ Mobility CoE making enterprises run their
                            mobile Apps.
Wednesday, May 30, 12
When “Siri” Came !




Wednesday, May 30, 12
No Official Siri APIs Yet
          •      Needed to use it for an Electric Vehicle Application

          •      There are few Open Source SDK:

                        •   OpenEars: http://www.politepix.com/openears

                        •   iSpeech: https://www.ispeech.org/developers

                        •   Dragon Speech Kit (Nuance) : http://
                            dragonmobile.nuancemobiledeveloper.com/public/index.php?
                            task=home


Wednesday, May 30, 12
Why Nuance?
                •       Multiple language support (Supports speech recognition in 23
                        international languages)

                • They Make “Voice” really work.
                • Suggestion and Recommendation engine is pretty strong.
                • Have both Speech-to-Text and Text-to-Speech capabilities
                • Supports Multiple Platforms (iOS, Android, Windows Phone)
                • They are the guys behind Siri?
Wednesday, May 30, 12
Tiers & Pricing




Wednesday, May 30, 12
Free for 90 Days




Wednesday, May 30, 12
1. Signup and Download Dragon Mobile SDK from Nuance Developer Portal
                            http://dragonmobile.nuancemobiledeveloper.com




Wednesday, May 30, 12
Get Application Key from Email




Wednesday, May 30, 12
Class
         Class References

        SKRecognizer - Manages entire recognition process including recording
                and server communication

        SKRecognition - Contains recognition results returned from SKRecognizer

        SKVocalizer - Defines server based text to speech implementation

        SpeechKit - Core class providing setup and utility methods. Meant for Core
                 Networking, Recognition, Audio System components


Wednesday, May 30, 12
Protocols
          Protocol References

        SKRecognizerDelegate - Defines the messages sent to a delegate of the
                        SKRecognizer class

        SKVocalizerDelegate - Defines the methods sent by SKVocalizer class
                        during speech synthesis process

        SpeechKitDelegate - Defines the messages sent to a delegate object
                        registered as part of the call to setupWithID.



Wednesday, May 30, 12
How it Works




Wednesday, May 30, 12
Recognizing Speech




Wednesday, May 30, 12
Sample Speech Recognition App



Wednesday, May 30, 12
Step-by-step Guide

          • Open the file Classes/DMRecognizerViewController. Replace
               the SpeechKitApplicationKey[] value with your key.

          • In the setupWithID:host:port:useSSL:delegate:`` method,
               replace the ID parameter with your ID.

          • You should also verify that the host and port provided to that
               method match the host and port provided in the portal.

          • You can now run the application: connect your device and select
               Build and Run.




                                                                              Main Recognition View
Wednesday, May 30, 12
Main Recognizer View
                    This view has 4 sections
              •     Voice Interface contains a multi-purpose button for controlling the
                    recognition process and has a result field.

              •     Recognition Type allows for the selection between the two default
                    recognition models: search and dictation.

              •     Audio Level provides a basic VU meter display.

              •     Alternatives displays the complete list of recognition results.




Wednesday, May 30, 12
Recognition Logic
                    1.   On viewDidLoad of main view, Speech Kit            4.   On a successful completion, the method
                         framework is configured by calling                       recognizer:didFinishWithResults: returns with
                         setupWithID:host:port:useSSL:delegate                   an SKRecognition object that contains the list of
                         method.                                                 recognition results.

                    2.   On tap of "Record" button, a new SKRecognizer is   5.   The top result, retrieved with the firstResult
                         created with the                                        method, is listed in text box. The complete list of
                         initWithType:detection:language:delegate:               results, accessed through the results property, is
                         method.                                                 displayed in the large text field.

                    3.   When the recording finishes, the delegate method    6.   If an error occurs, the method
                         recognizerDidFinishRecording: is called, and the        recognizer:didFinishWithError:suggestion: is
                         button is updated.                                      called, and both the error and suggestion methods
                                                                                 are presented to the user.




Wednesday, May 30, 12
Recognition with Suggestions




Wednesday, May 30, 12
Let’s build our Own App



Wednesday, May 30, 12
Download Source-code from: https://github.com/subhb/FindNearBy-Siri




                         Nearby Restaurants               Nearby Stations

                        Speak “Restaurant” or “Station” to find nearby results

Wednesday, May 30, 12
Twitter Search App Using Nuance
                        http://www.youtube.com/watch?v=ZJILw2iEprE




Wednesday, May 30, 12
Thank You

                               Twitter : @subhransu
                               Blog : http://subhb.org
                               Email : subhransu.behera@sap.com

Wednesday, May 30, 12

More Related Content

Similar to Building a Siri Like Application on iOS

Software Security Assurance for DevOps - Hewlett Packard Enterprise + Black Duck
Software Security Assurance for DevOps - Hewlett Packard Enterprise + Black DuckSoftware Security Assurance for DevOps - Hewlett Packard Enterprise + Black Duck
Software Security Assurance for DevOps - Hewlett Packard Enterprise + Black Duck
Black Duck by Synopsys
 
Intro to Open Cloud Initiative
Intro to Open Cloud InitiativeIntro to Open Cloud Initiative
Intro to Open Cloud Initiative
John Mark Walker
 

Similar to Building a Siri Like Application on iOS (20)

voice recognition security system ppt
voice recognition security system pptvoice recognition security system ppt
voice recognition security system ppt
 
Office 365 Developer Bootcamp: Microsoft Teams
Office 365 Developer Bootcamp: Microsoft TeamsOffice 365 Developer Bootcamp: Microsoft Teams
Office 365 Developer Bootcamp: Microsoft Teams
 
Prototyping & User Testing
Prototyping & User TestingPrototyping & User Testing
Prototyping & User Testing
 
Software Security Assurance for DevOps - Hewlett Packard Enterprise + Black Duck
Software Security Assurance for DevOps - Hewlett Packard Enterprise + Black DuckSoftware Security Assurance for DevOps - Hewlett Packard Enterprise + Black Duck
Software Security Assurance for DevOps - Hewlett Packard Enterprise + Black Duck
 
Resume sb
Resume sbResume sb
Resume sb
 
IBM Bluemix OpenWhisk: Interconnect 2016, Las Vegas: CCD-1088: The Future of ...
IBM Bluemix OpenWhisk: Interconnect 2016, Las Vegas: CCD-1088: The Future of ...IBM Bluemix OpenWhisk: Interconnect 2016, Las Vegas: CCD-1088: The Future of ...
IBM Bluemix OpenWhisk: Interconnect 2016, Las Vegas: CCD-1088: The Future of ...
 
IBM Bluemix Openwhisk
IBM Bluemix OpenwhiskIBM Bluemix Openwhisk
IBM Bluemix Openwhisk
 
Open Source
Open SourceOpen Source
Open Source
 
Intro to Open source. Amit Bhayani
Intro to Open source. Amit BhayaniIntro to Open source. Amit Bhayani
Intro to Open source. Amit Bhayani
 
Conversational AI: What's New?
Conversational AI: What's New?Conversational AI: What's New?
Conversational AI: What's New?
 
Sundar_v5.9_Proj_Summary
Sundar_v5.9_Proj_SummarySundar_v5.9_Proj_Summary
Sundar_v5.9_Proj_Summary
 
VishalBendreQa
VishalBendreQaVishalBendreQa
VishalBendreQa
 
Mobile App vs Mobile Web Development
Mobile App vs Mobile Web DevelopmentMobile App vs Mobile Web Development
Mobile App vs Mobile Web Development
 
Expedia's Journey toward Site Resiliency
Expedia's Journey toward Site ResiliencyExpedia's Journey toward Site Resiliency
Expedia's Journey toward Site Resiliency
 
Open source in India
Open source in IndiaOpen source in India
Open source in India
 
Open Source vs Proprietary
Open Source vs ProprietaryOpen Source vs Proprietary
Open Source vs Proprietary
 
Open Source & What It Means For Self-Sovereign Identity (SSI)
Open Source & What It Means For Self-Sovereign Identity (SSI)Open Source & What It Means For Self-Sovereign Identity (SSI)
Open Source & What It Means For Self-Sovereign Identity (SSI)
 
VishalBendreQa
VishalBendreQaVishalBendreQa
VishalBendreQa
 
Intro to Open Cloud Initiative
Intro to Open Cloud InitiativeIntro to Open Cloud Initiative
Intro to Open Cloud Initiative
 
Scalable Plone hosting with Amazon EC2 for Rice University's Rhaptos open lea...
Scalable Plone hosting with Amazon EC2 for Rice University's Rhaptos open lea...Scalable Plone hosting with Amazon EC2 for Rice University's Rhaptos open lea...
Scalable Plone hosting with Amazon EC2 for Rice University's Rhaptos open lea...
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Recently uploaded (20)

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 

Building a Siri Like Application on iOS

  • 1. iOS Dev Scout - May Meetup Building A Siri-like Application Subhransu Behera (Subh) - iOS Application Architect in SAP Twitter : @subhransu Blog : http://subhb.org Email : subhransu.behera@sap.com Wednesday, May 30, 12
  • 2. About Me • iOS and Ruby Developer • Have passion for Mobile, Web and Smart TV Apps. • Started as a Linux Developer, fell in love with Ruby, happily married to iOS • M.S. in Software Systems from BITS, Bachelors in Information Technology from BPUT, Graduated from Founder Institute • Worked both in early stage startup and fortune 500 companies including one from MIT, Media-Labs. Currently working SAP APJ Mobility CoE making enterprises run their mobile Apps. Wednesday, May 30, 12
  • 3. When “Siri” Came ! Wednesday, May 30, 12
  • 4. No Official Siri APIs Yet • Needed to use it for an Electric Vehicle Application • There are few Open Source SDK: • OpenEars: http://www.politepix.com/openears • iSpeech: https://www.ispeech.org/developers • Dragon Speech Kit (Nuance) : http:// dragonmobile.nuancemobiledeveloper.com/public/index.php? task=home Wednesday, May 30, 12
  • 5. Why Nuance? • Multiple language support (Supports speech recognition in 23 international languages) • They Make “Voice” really work. • Suggestion and Recommendation engine is pretty strong. • Have both Speech-to-Text and Text-to-Speech capabilities • Supports Multiple Platforms (iOS, Android, Windows Phone) • They are the guys behind Siri? Wednesday, May 30, 12
  • 7. Free for 90 Days Wednesday, May 30, 12
  • 8. 1. Signup and Download Dragon Mobile SDK from Nuance Developer Portal http://dragonmobile.nuancemobiledeveloper.com Wednesday, May 30, 12
  • 9. Get Application Key from Email Wednesday, May 30, 12
  • 10. Class Class References SKRecognizer - Manages entire recognition process including recording and server communication SKRecognition - Contains recognition results returned from SKRecognizer SKVocalizer - Defines server based text to speech implementation SpeechKit - Core class providing setup and utility methods. Meant for Core Networking, Recognition, Audio System components Wednesday, May 30, 12
  • 11. Protocols Protocol References SKRecognizerDelegate - Defines the messages sent to a delegate of the SKRecognizer class SKVocalizerDelegate - Defines the methods sent by SKVocalizer class during speech synthesis process SpeechKitDelegate - Defines the messages sent to a delegate object registered as part of the call to setupWithID. Wednesday, May 30, 12
  • 14. Sample Speech Recognition App Wednesday, May 30, 12
  • 15. Step-by-step Guide • Open the file Classes/DMRecognizerViewController. Replace the SpeechKitApplicationKey[] value with your key. • In the setupWithID:host:port:useSSL:delegate:`` method, replace the ID parameter with your ID. • You should also verify that the host and port provided to that method match the host and port provided in the portal. • You can now run the application: connect your device and select Build and Run. Main Recognition View Wednesday, May 30, 12
  • 16. Main Recognizer View This view has 4 sections • Voice Interface contains a multi-purpose button for controlling the recognition process and has a result field. • Recognition Type allows for the selection between the two default recognition models: search and dictation. • Audio Level provides a basic VU meter display. • Alternatives displays the complete list of recognition results. Wednesday, May 30, 12
  • 17. Recognition Logic 1. On viewDidLoad of main view, Speech Kit 4. On a successful completion, the method framework is configured by calling recognizer:didFinishWithResults: returns with setupWithID:host:port:useSSL:delegate an SKRecognition object that contains the list of method. recognition results. 2. On tap of "Record" button, a new SKRecognizer is 5. The top result, retrieved with the firstResult created with the method, is listed in text box. The complete list of initWithType:detection:language:delegate: results, accessed through the results property, is method. displayed in the large text field. 3. When the recording finishes, the delegate method 6. If an error occurs, the method recognizerDidFinishRecording: is called, and the recognizer:didFinishWithError:suggestion: is button is updated. called, and both the error and suggestion methods are presented to the user. Wednesday, May 30, 12
  • 19. Let’s build our Own App Wednesday, May 30, 12
  • 20. Download Source-code from: https://github.com/subhb/FindNearBy-Siri Nearby Restaurants Nearby Stations Speak “Restaurant” or “Station” to find nearby results Wednesday, May 30, 12
  • 21. Twitter Search App Using Nuance http://www.youtube.com/watch?v=ZJILw2iEprE Wednesday, May 30, 12
  • 22. Thank You Twitter : @subhransu Blog : http://subhb.org Email : subhransu.behera@sap.com Wednesday, May 30, 12