Voice Recognition

2,814 views
2,665 views

Published on

Published in: Technology
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
2,814
On SlideShare
0
From Embeds
0
Number of Embeds
8
Actions
Shares
0
Downloads
197
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

Voice Recognition

  1. 1. Speech (voice) Recognition
  2. 2. Introduction <ul><li>What is Speech Recognition? </li></ul><ul><li>- Voice Recognition? </li></ul><ul><li>Where can it be used? </li></ul><ul><li>- Dictation </li></ul><ul><li>- System control/navigation </li></ul><ul><li>- Commercial/Industrial applications </li></ul><ul><li>- Hand held digital recorders </li></ul>
  3. 3. Contents: <ul><li>Continuous/Discrete </li></ul><ul><li>How does it work? </li></ul><ul><li>Recent improvements </li></ul><ul><li>Current software options </li></ul><ul><li>Future of SR </li></ul>
  4. 4. Continuous or Discrete? <ul><li>Continuous speech </li></ul><ul><li>- dictation </li></ul><ul><li>Discrete speech </li></ul><ul><li>- system controls </li></ul>
  5. 5. How does SR work? <ul><li>Recognition </li></ul><ul><li>Training </li></ul><ul><li>Correction </li></ul><ul><li>Command/Control </li></ul>
  6. 6. Recognition (1) Voice Input Analog to Digital Acoustic Model Language Model Display Speech Engine Feedback
  7. 7. Recognition (2) <ul><li>Acoustic Modeling </li></ul><ul><li>Spoken words: “I think there are…..” </li></ul><ul><li>Phonemes: ‘ ay th-in-nk-kd dh-eh-r aa-r’ </li></ul><ul><li>H.M.M.’s: 5 state representation </li></ul><ul><li>Speech Engine </li></ul>
  8. 8. Recognition (3) <ul><li>Language Modeling </li></ul><ul><li>Word context </li></ul><ul><li>Word frequency </li></ul><ul><li>Transition possibilities </li></ul>
  9. 9. Voice Training (1) <ul><li>Can be done by: </li></ul><ul><li>Predetermined text segments </li></ul><ul><li>Individual words </li></ul><ul><li>Compare new acoustic with old and combines </li></ul><ul><li>More training = better recognition </li></ul>
  10. 10. Voice Training (2) <ul><li>User specific Voice file </li></ul><ul><li>Voice qualities </li></ul><ul><li>Pronunciation </li></ul><ul><li>Patterns of word use </li></ul><ul><li>Preferred vocabulary </li></ul>
  11. 11. Making Corrections <ul><li>Move cursor by voice command </li></ul><ul><li>Memorize edit commands </li></ul><ul><li>List of possible alternatives </li></ul><ul><li>Make correction manually </li></ul>
  12. 12. Command/Control <ul><li>Desktop grid </li></ul><ul><li>Program or Link name/number </li></ul><ul><li>URL name </li></ul><ul><li>Memorized commands </li></ul>
  13. 13. Recent Improvements in SR <ul><li>Faster training ~10 min. </li></ul><ul><li>Better recognition ~95% </li></ul><ul><li>More compatible software </li></ul><ul><li>Better system control/command </li></ul>
  14. 14. Current Software Options for PC <ul><li>Dragon Systems – Naturally Speaking </li></ul><ul><li>Philips – FreeSpeech </li></ul><ul><li>IBM – ViaVoice </li></ul><ul><li>Lernout & Hauspie – Voice Xpress </li></ul>
  15. 15. How well do the work? Training Dictation Correct. App. Integrat. Command - Control Dragon Excellent Excellent Good Good Philips Fair Fair Good Good IBM Excellent Good Good Excellent L & H Good Good Good Good
  16. 16. Future of SR <ul><li>SUI – Speech-based User Interface </li></ul><ul><li>Improvements needed: </li></ul><ul><li>- Greater accuracy </li></ul><ul><li>- Greater system control/command </li></ul><ul><li>- More compatible software </li></ul>
  17. 17. Conclusion <ul><li>SR Uses </li></ul><ul><li>How does it work? </li></ul><ul><li>Current Software </li></ul><ul><li>Problems of SR </li></ul><ul><li>More SR coming soon…. </li></ul>
  18. 18. References <ul><li>1. Alwang, Greg. “Speech Recognition,” PC Magazine, December 1 1999 </li></ul><ul><li>2. Hauptmann, Alexander G. Jang, Photina Jaeyun. Carnegie Mellon University. “Learning to Recognize Speech by Watching Television,” IEEE Intelligent Systems, September/October 1999. </li></ul><ul><li>3. Miastkowski, Stan. “Latest Speech Software Gets You Up and Running Faster,” PC World, November 1999. </li></ul>

×