P54 Presentation at 2007 ITU Fully Networked Car Workshop

1,052 views
975 views

Published on

These are the slides from the presentation I gave at the 2007 ITU Fully Networked Car Workshop in Geneva, Switzerland.

Published in: Technology, Business
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
1,052
On SlideShare
0
From Embeds
0
Number of Embeds
31
Actions
Shares
0
Downloads
11
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

P54 Presentation at 2007 ITU Fully Networked Car Workshop

  1. 1. Using Speech to Interact with In-Car Devices in the Project54 System Andrew Kun University of New Hampshire
  2. 2. Outline <ul><li>Introduction </li></ul><ul><li>Speech user interface testing </li></ul><ul><li>Driving simulator studies </li></ul><ul><li>Conclusion </li></ul>
  3. 3. What is the problem?
  4. 4. The system in the car
  5. 5. Speech user interface <ul><li>Command and control interface </li></ul><ul><li>Microsoft speech recognition (SR) engine </li></ul><ul><li>Microsoft text-to-speech (TTS) engine </li></ul><ul><li>Directional microphone </li></ul><ul><li>Push-to-talk (PTT) button </li></ul><ul><li>Grammars </li></ul>
  6. 6. Outline <ul><li>Introduction </li></ul><ul><li>Speech user interface (SUI) testing </li></ul><ul><li>Driving simulator studies </li></ul><ul><li>Conclusion </li></ul>
  7. 7. Testing <ul><li>Officer volunteers: 27 </li></ul><ul><li>Corpus: just under 50,000 utterances </li></ul><ul><li>Utterances: while PTT is pressed </li></ul>
  8. 8. Speech user interface (SUI) performance <ul><li>Recognized: 85 % </li></ul><ul><li>Unrecognized: 4 % </li></ul><ul><li>Misrecognized: 11 % </li></ul>
  9. 9. Reasons for imperfect recognition <ul><li>SR engine error: 37 % </li></ul><ul><li>User error: 63 % </li></ul>
  10. 10. SR engine error example <ul><li>Utterance: 0 3 2 1 8 5 </li></ul><ul><li>-> Recognized 0 3 2 1 0 5 7 1 </li></ul><ul><li>-> Recognized OK </li></ul>
  11. 11. User errors <ul><li>Utterance not in any grammar: 54 % </li></ul><ul><li>Utterance in another grammar: 34 % </li></ul><ul><li>PTT (“Patrol screen” ): 12 % </li></ul>
  12. 12. Outline <ul><li>Introduction </li></ul><ul><li>Speech user interface testing </li></ul><ul><li>Driving simulator studies </li></ul><ul><li>Conclusion </li></ul>
  13. 13. Driving simulator
  14. 14. Driving simulator studies <ul><li>Multi-threaded dialogues </li></ul><ul><li>SUI and driving </li></ul>
  15. 15. Multi-threaded dialogues <ul><li>National Science Foundation grant </li></ul><ul><li>Goal: interact with multiple real-time devices using speech </li></ul><ul><li>Manual-visual task! </li></ul><ul><li>Human-human to human-computer interaction </li></ul>
  16. 16. SUI and driving – police radio
  17. 17. SUI and driving <ul><li>Two experiments (8 subjects) : </li></ul><ul><ul><li>Baseline + radio </li></ul></ul><ul><ul><li>Baseline + SUI </li></ul></ul><ul><li>Record: </li></ul><ul><ul><li>Lane position </li></ul></ul><ul><ul><li>Velocity </li></ul></ul><ul><ul><li>Steering wheel angle </li></ul></ul>
  18. 18. Lane position – radio
  19. 19. Lane position – SUI
  20. 20. Outline <ul><li>Introduction </li></ul><ul><li>Speech user interface testing </li></ul><ul><li>Driving simulator studies </li></ul><ul><li>Conclusion </li></ul>
  21. 21. Status <ul><li>February 2007: ≈900 cars on the road in USA ( New Hampshire , Massachusetts, California, Maryland) </li></ul><ul><li>Industry participation </li></ul>
  22. 22. Research and development directions <ul><li>SUI and driving </li></ul><ul><li>SUI performance improvements (SR training, …) </li></ul><ul><li>SUI performance relation to driving task difficulty, recognizer accuracy </li></ul><ul><li>Intelligent interaction (multi-threaded dialogues, natural language processing, …) </li></ul><ul><li>Non-speech work (handhelds, telematics, …) </li></ul><ul><li>Standards </li></ul>
  23. 23. Acknowledgement <ul><li>US Department of Justice </li></ul><ul><li>National Science Foundation </li></ul>
  24. 24. www.project54.unh.edu

×