More than Just Lines on a Map: Best Practices for U.S Bike Routes
Speech recognition in web
1. Speech Recognition in
web
Report
A small report of usefulness of Speech-Recognition in web
domain and it’s feasibility.
ganesh.sawant
4/23/2012
2. Speech Recognition in web
Apr. 23
What is Speech recognition?
In Computer Science, Speech recognition is the translation of spoken words into text. It is also
known as "automatic speech recognition", "ASR", "computer speech recognition", "speech to
text", or just "STT".
Speech Recognition is technology that can translate spoken words into text. Some SR systems
use "training" where an individual speaker reads sections of text into the SR system. These
systems analyze the person's specific voice and use it to fine tune the recognition of that
person's speech, resulting in more accurate transcription. Systems that do not use training are
called "Speaker Independent" systems. Systems that use training are called "Speaker
Dependent" systems.
2
3. Speech Recognition in web
Apr. 23
Speech recognition in Web
Speech recognition in web is achieved by implementing Speech Recognizing system
using powerful languages which have ability of interacting with operating system (such as
Java, .NET, pearl). Flash and Java Applet is used as front-end which takes user’s input in the
form of voice compare it with grammar base present in the system and returns the result.
Speech recognition in web is consisting of following elements.
1. An engine with various Grammar models or Back-End
2. Front-End which is of Java Applet or Flash
3. Scripting languages API for web programmers
Approaches:
1. HTML5 approach (webkit based approach which currently limited to Google chrome browser
and working fine and recognize dictionary based words correctly)
2. Using Flash, Java Applet and Javascript as front-end and conventional languages at back-end
(available APIs are speechAPI, WAMI, iSpeech) except iSpeech (which is premium API) other two
are experimental API and not working fine.
Usage:
1. Voice web search
2. Speech based games.
3. Speech based web catalog
Conclusion:
So, currently the best approach would be using Google chrome’s webkit based API which gives
optimum results as other APIs are in experimental state.
For detailed description of use cases, please visit the following link
http://css.dzone.com/articles/web-standard-speech
3