Speech recognition in web

•

1 like•364 views

Ganesh Sawant

This document explains about speech recognition in web, this is new trend in web

Technology

Speech Recognition in
web
Report
A small report of usefulness of Speech-Recognition in web
domain and it’s feasibility.
ganesh.sawant
4/23/2012

Speech Recognition in web
Apr. 23

What is Speech recognition?
In Computer Science, Speech recognition is the translation of spoken words into text. It is also
known as "automatic speech recognition", "ASR", "computer speech recognition", "speech to
text", or just "STT".
Speech Recognition is technology that can translate spoken words into text. Some SR systems
use "training" where an individual speaker reads sections of text into the SR system. These
systems analyze the person's specific voice and use it to fine tune the recognition of that
person's speech, resulting in more accurate transcription. Systems that do not use training are
called "Speaker Independent" systems. Systems that use training are called "Speaker
Dependent" systems.

2

Speech Recognition in web
Apr. 23

Speech recognition in Web
Speech recognition in web is achieved by implementing Speech Recognizing system
using powerful languages which have ability of interacting with operating system (such as
Java, .NET, pearl). Flash and Java Applet is used as front-end which takes user’s input in the
form of voice compare it with grammar base present in the system and returns the result.
Speech recognition in web is consisting of following elements.
1. An engine with various Grammar models or Back-End
2. Front-End which is of Java Applet or Flash
3. Scripting languages API for web programmers
Approaches:
1. HTML5 approach (webkit based approach which currently limited to Google chrome browser
and working fine and recognize dictionary based words correctly)
2. Using Flash, Java Applet and Javascript as front-end and conventional languages at back-end
(available APIs are speechAPI, WAMI, iSpeech) except iSpeech (which is premium API) other two
are experimental API and not working fine.

Usage:
1. Voice web search
2. Speech based games.
3. Speech based web catalog

Conclusion:
So, currently the best approach would be using Google chrome’s webkit based API which gives
optimum results as other APIs are in experimental state.

For detailed description of use cases, please visit the following link
http://css.dzone.com/articles/web-standard-speech

3

Recently uploaded

Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer

Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software

Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer

Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal

HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays

Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi

GenAI Risks & Security Meetup 01052024.pdflior mazor

Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays

Scaling API-first – The story of a global engineering organizationRadu Cotescu

Real Time Object Detection Using Open CVKhem

Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez

Partners Life - Insurer Innovation Award 2024The Digital Insurer

AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services

Recently uploaded (20)

Powerful Google developer tools for immediate impact! (2023-24 C)

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024

Artificial Intelligence Chap.5 : Uncertainty

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME

Tata AIG General Insurance Company - Insurer Innovation Award 2024

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf

HTML Injection Attacks: Impact and Mitigation Strategies

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe

Top 5 Benefits OF Using Muvi Live Paywall For Live Streams

GenAI Risks & Security Meetup 01052024.pdf

Data Cloud, More than a CDP by Matt Robison

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...

Scaling API-first – The story of a global engineering organization

Real Time Object Detection Using Open CV

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood

Partners Life - Insurer Innovation Award 2024

AWS Community Day CPH - Three problems of Terraform

How to Troubleshoot Apps for the Modern Connected Worker

Strategies for Landing an Oracle DBA Job as a Fresher

Featured

AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork

Skeleton Culture CodeSkeleton Technologies

PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley

Content Methodology: A Best Practices Report (Webinar)contently

How to Prepare For a Successful Job Search for 2024Albert Qian

Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)

Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal

5 Public speaking tips from TED - Visualized summarySpeakerHub

ChatGPT and the Future of Work - Clark Boyd Clark Boyd

Getting into the tech field. what next Tessa Mero

Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray

How to have difficult conversations Rajiv Jayarajah, MAppComm, ACC

Introduction to Data ScienceChristy Abraham Joy

Time Management & Productivity - Best PracticesVit Horky

The six step guide to practical project managementMindGenius

Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36

Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools

12 Ways to Increase Your Influence at WorkGetSmarter

ChatGPT webinar slidesAlireza Esmikhani

More than Just Lines on a Map: Best Practices for U.S Bike RoutesProject for Public Spaces & National Center for Biking and Walking

Featured (20)

AI Trends in Creative Operations 2024 by Artwork Flow.pdf

Skeleton Culture Code

PEPSICO Presentation to CAGNY Conference Feb 2024

Content Methodology: A Best Practices Report (Webinar)

How to Prepare For a Successful Job Search for 2024

Social Media Marketing Trends 2024 // The Global Indie Insights

Trends In Paid Search: Navigating The Digital Landscape In 2024

5 Public speaking tips from TED - Visualized summary

ChatGPT and the Future of Work - Clark Boyd

Getting into the tech field. what next

Google's Just Not That Into You: Understanding Core Updates & Search Intent

How to have difficult conversations

Introduction to Data Science

Time Management & Productivity - Best Practices

The six step guide to practical project management

Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...

Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...

12 Ways to Increase Your Influence at Work

ChatGPT webinar slides

More than Just Lines on a Map: Best Practices for U.S Bike Routes

Speech recognition in web

1. Speech Recognition in web Report A small report of usefulness of Speech-Recognition in web domain and it’s feasibility. ganesh.sawant 4/23/2012

2. Speech Recognition in web Apr. 23 What is Speech recognition? In Computer Science, Speech recognition is the translation of spoken words into text. It is also known as "automatic speech recognition", "ASR", "computer speech recognition", "speech to text", or just "STT". Speech Recognition is technology that can translate spoken words into text. Some SR systems use "training" where an individual speaker reads sections of text into the SR system. These systems analyze the person's specific voice and use it to fine tune the recognition of that person's speech, resulting in more accurate transcription. Systems that do not use training are called "Speaker Independent" systems. Systems that use training are called "Speaker Dependent" systems. 2

3. Speech Recognition in web Apr. 23 Speech recognition in Web Speech recognition in web is achieved by implementing Speech Recognizing system using powerful languages which have ability of interacting with operating system (such as Java, .NET, pearl). Flash and Java Applet is used as front-end which takes user’s input in the form of voice compare it with grammar base present in the system and returns the result. Speech recognition in web is consisting of following elements. 1. An engine with various Grammar models or Back-End 2. Front-End which is of Java Applet or Flash 3. Scripting languages API for web programmers Approaches: 1. HTML5 approach (webkit based approach which currently limited to Google chrome browser and working fine and recognize dictionary based words correctly) 2. Using Flash, Java Applet and Javascript as front-end and conventional languages at back-end (available APIs are speechAPI, WAMI, iSpeech) except iSpeech (which is premium API) other two are experimental API and not working fine. Usage: 1. Voice web search 2. Speech based games. 3. Speech based web catalog Conclusion: So, currently the best approach would be using Google chrome’s webkit based API which gives optimum results as other APIs are in experimental state. For detailed description of use cases, please visit the following link http://css.dzone.com/articles/web-standard-speech 3

Speech recognition in web

Recommended

Recommended

More Related Content

Recently uploaded

Recently uploaded (20)

Featured

Featured (20)

Speech recognition in web