Speech Recognition in VoiceXML

    • Speech Recognition in VoiceXML: A pre-lunch fun fest! (February 10, 2010, Mark J. Headd)
    • Agenda
      • Specifications (VXML, SRGS, SISR)
      • VoiceXML Properties for Speech
      • Grammars (Structure, Formats)
      • Examining Recognition Results
      • VXML 2.1 features
      • Eat
    • Specifications
      • VoiceXML 2.0 spec (minor additions in 2.1)
        • Section 3 ( User Input )
      • Speech Recognition Grammar spec
        • Defines required formats for grammars
        • XML, ABNF formats
      • Semantic Interpretation spec
        • Assigning values based on user input
    • Properties
      • Related to speech rec:
        • inputmodes (defaults to “dtmf voice”)
        • maxnbest (controls the number of results returned)
        • confidencelevel
        • sensitivity
        • bargein
        • bargeintype (speech vs. hotword)
        • recordutterance (capture out-of-grammar, or OoG, utterances?)
        • Timing properties ( Appendix D of 2.0 spec)
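
The properties listed above are set with the `<property>` element, at document, form, or field level. The following is a minimal sketch; the values are illustrative only, and the grammar file `cities.grxml` is a hypothetical name that does not come from the talk.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<vxml version="2.1" xmlns="http://www.w3.org/2001/vxml">
  <!-- Document-level settings; all values here are illustrative -->
  <property name="inputmodes" value="voice"/>      <!-- accept speech only, no DTMF -->
  <property name="maxnbest" value="3"/>            <!-- ask for up to 3 results -->
  <property name="confidencelevel" value="0.5"/>   <!-- results below this are rejected -->
  <property name="sensitivity" value="0.5"/>
  <property name="bargein" value="true"/>          <!-- caller may interrupt prompts -->
  <property name="bargeintype" value="speech"/>
  <property name="timeout" value="5s"/>            <!-- one of the timing properties -->

  <form id="main">
    <field name="city">
      <!-- A field-level property overrides the document-level value -->
      <property name="bargeintype" value="hotword"/>
      <prompt>Which city?</prompt>
      <!-- cities.grxml is a hypothetical external grammar -->
      <grammar src="cities.grxml" type="application/srgs+xml"/>
    </field>
  </form>
</vxml>
```

A property set closer to the input item takes precedence, so the field above uses hotword barge-in while the rest of the document uses speech barge-in.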
    • Grammars
      • Builtin ( Appendix P of 2.0 spec)
      • Inline vs. external
      • Formats
        • ABNF
        • XML
        • JSGF (Prophecy)
      • Semantic Interpretation
        • Filling slots with values
    • Grammars
      • Note the difference between recognition and interpretation.
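
The distinction between recognition and interpretation can be illustrated with a small inline grammar in the XML format. This is a sketch only: the field name, rule name, and vocabulary are made up for illustration, and it assumes a platform that accepts SISR 1.0 tag syntax (`tag-format="semantics/1.0"`). Some platforms expect an older tag syntax.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<vxml version="2.1" xmlns="http://www.w3.org/2001/vxml">
  <form id="order">
    <field name="drink">
      <prompt>Would you like coffee or tea?</prompt>
      <!-- Inline SRGS grammar in XML form; names and words are hypothetical -->
      <grammar xml:lang="en-US" version="1.0" root="drink" mode="voice"
               tag-format="semantics/1.0" type="application/srgs+xml">
        <rule id="drink" scope="public">
          <one-of>
            <!-- Two different utterances map to the same slot value -->
            <item>coffee <tag>out = "COFFEE";</tag></item>
            <item>java <tag>out = "COFFEE";</tag></item>
            <item>tea <tag>out = "TEA";</tag></item>
          </one-of>
        </rule>
      </grammar>
      <filled>
        <!-- drink holds the interpretation, not the words that were spoken -->
        <prompt>Ordering <value expr="drink"/>.</prompt>
      </filled>
    </field>
  </form>
</vxml>
```

If the caller says "java", the recognized utterance is "java" while the interpretation that fills the slot is "COFFEE".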
    • Examining Results
      • Field Shadow Variables
        • name$.utterance
        • name$.inputmode
        • name$.interpretation
        • name$.confidence
      • application.lastresult$ Array
        • Array of elements holding last recognition
        • Sorted by confidence, from highest to lowest
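
A sketch of how these results might be inspected after a field is filled. It assumes a field named `city` and a hypothetical external grammar `cities.grxml`; where the `<log>` output ends up depends on the platform.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<vxml version="2.1" xmlns="http://www.w3.org/2001/vxml">
  <!-- Request more than one result so the array can hold several entries -->
  <property name="maxnbest" value="3"/>
  <form id="ask">
    <field name="city">
      <prompt>Which city?</prompt>
      <!-- cities.grxml is a hypothetical external grammar -->
      <grammar src="cities.grxml" type="application/srgs+xml"/>
      <filled>
        <!-- Shadow variables for the field named "city" -->
        <log>utterance: <value expr="city$.utterance"/></log>
        <log>inputmode: <value expr="city$.inputmode"/></log>
        <log>confidence: <value expr="city$.confidence"/></log>

        <!-- Walk the n-best list, highest confidence first -->
        <var name="summary" expr="''"/>
        <script><![CDATA[
          for (var i = 0; i < application.lastresult$.length; i++) {
            var r = application.lastresult$[i];
            summary += i + ': ' + r.utterance + ' (' + r.confidence + ') ';
          }
        ]]></script>
        <log expr="summary"/>
      </filled>
    </field>
  </form>
</vxml>
```

The shadow variables describe the result that filled the field, while `application.lastresult$` exposes every candidate the recognizer returned, up to `maxnbest`.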
    • VXML 2.1 features
      • Dynamic referencing of grammars
        • <grammar srcexpr="'http://host/' + foo + '?bar=' + bar"/>
      • Recording utterance while attempting recognition.
        • Capture out-of-grammar (OoG) utterances?
        • Build library of audio for grammar tuning?
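
Both 2.1 features can be sketched in one document. The variable values and the upload URL below are hypothetical placeholders, and whether `application.lastresult$.recording` is populated on a nomatch may vary by platform, so treat this as an outline, not a tested recipe.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<vxml version="2.1" xmlns="http://www.w3.org/2001/vxml">
  <!-- Hypothetical values used to build the grammar URL at run time -->
  <var name="foo" expr="'grammars'"/>
  <var name="bar" expr="'cities'"/>

  <form id="ask">
    <!-- Record the caller's audio while recognition is attempted -->
    <property name="recordutterance" value="true"/>
    <field name="city">
      <prompt>Which city?</prompt>
      <!-- The grammar URI is evaluated each time the grammar is needed -->
      <grammar srcexpr="'http://host/' + foo + '?bar=' + bar"/>

      <nomatch>
        <!-- Keep the audio of the out-of-grammar utterance for tuning -->
        <var name="audio" expr="application.lastresult$.recording"/>
        <!-- Hypothetical endpoint; control passes to the document it returns -->
        <submit next="http://example.com/tuning/upload" method="post"
                enctype="multipart/form-data" namelist="audio"/>
      </nomatch>

      <filled>
        <!-- Recording details are also exposed as field shadow variables -->
        <log>duration: <value expr="city$.recordingduration"/></log>
      </filled>
    </field>
  </form>
</vxml>
```

Because `<submit>` transfers control, the hypothetical upload endpoint would need to return a VoiceXML document that resumes the dialog.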
    • Demo
      • Demo code available on GitHub
      • Running against Prophecy 8 on local server
      • Analog line through AudioCodes gateway
    • Grub
      • Time to eat.