SlideShare a Scribd company logo
1 of 5
Download to read offline
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 03 Issue: 02 | Feb-2016 www.irjet.net p-ISSN: 2395-0072
© 2016, IRJET | Impact Factor value: 4.45 | ISO 9001:2008 Certified Journal | Page 914
Automatic Subtitle Generation for Sound in Videos
Kanika Mishra1, Prerak Bhagat2, Prof. Atiya Kazi3
Kanika Mishra, Department of Information Technology, University of Mumbai,
Finolex Academy of Management & Technology, Ratnagiri, Maharashtra.
Prerak Bhagat, Department of Information Technology, University of Mumbai,
Finolex Academy of Management & Technology, Ratnagiri, Maharashtra.
Prof.Atiya Kazi, Department of Information Technology, University of Mumbai,
Finolex Academy of Management & Technology, Ratnagiri, Maharashtra.
---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract – The last two decades have seen a vast
growth in the amount of video content being generated
across various platforms and spanning several genres.
Apart from the convenient availability of cheap portable
media devices, the emergence of dedicated websites
tailored specifically for storage and sharing of videoshas
increased the value of audio - visual content in eyes of the
general populous.
However as with any technology, the emergence
of audio-visual content as the preferred method of
information sharing has brought along with it certain
hurdles. Prominent among those is the existence of
numerous accents and dialects of any particular
language. Another is the sheer number of languages that
the content is generated in. Also the hearing impaired
audience is unable to receive the audio partof thecontent
and thus, unable in most cases, to completely or
accurately comprehend the information being presented
to them. This situation can be rectified with aid of audio
transcripts and necessary translations. The software’s
currently available require for most part an active user
participation and also have a notable lack of accuracy.
The sheer quantity of audio - video information therefore
makes it impossible to provide transcriptsforallrelevant
data encapsulated in such formats. Hence,theneedfor an
automated concept is vividly felt.
This thesis report analyses a specific path to
automatically generating subtitles for videos using
Google's free API's. Due to lack of necessary resources,the
translation component has not been included in this
specific paper. We propose a three step model to realize
our process. The first stage will be extracting the audioto
be transcribed from the video file and then converting it
into a format compatible withoursecondstageif theneed
be. The second stage uses a speech recognition engine to
convert the information from audioformattotextformat.
The third and final stage will be the encoding of the
generated text into a subtitle format which will include
adding time frames and necessary pauses and
punctuation marks.
The results of the resulting works have been
encouraging though there exists a tremendous room for
improvements and adjustments.
Key Words: Audio Extraction, Speech Recognition, Subtitle
Generation.
1. INTRODUCTION
Video is one of the most popular multimedia used on PCs. So
it is essential to make information encompassed in form of
sound in videos comprehendible to people with auditory
problems. The most natural way lies in the use of subtitles.
However, manual subtitle creation is a tedious activity and
requires active participation of the user. Hence, the study of
automatic subtitle generation presents as a valid subject of
research.
2. LITERATURE SURVEY
This is overview of concept we got from some papers about
automatic subtitle generation for sound in videos. There are
many programming languages that can be used in the
creation of AutoSubGen. A brief study on internet suggests
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 03 Issue: 02 | Feb-2016 www.irjet.net p-ISSN: 2395-0072
© 2016, IRJET | Impact Factor value: 4.45 | ISO 9001:2008 Certified Journal | Page 915
focus on C/C++ and Java. We now analyze thepositivepoints
of the two languages based on the research of J.P. Lewis and
Ulrich Neumann [8]. On the one hand, C++ provides
remarkable advantages in speed, cross systems
performances, and well-tested packages. On the other hand,
Java offers an intuitive syntax, portability on multiple
Operating Systems and trustedlibraries. They areusedinthe
Sphinx speech recognition engine. Java provides the Java
Media Framework (JMF) API that allows developers to deal
with media tracks (audio and video) in an efficient way. In
Lewis’s articles, it appears Java performances are quite
similar to C/C++. Besides, in theory, Java performances
should be higher. The absence of pointers,theuseof efficient
garbage collector and the run-time compilation plays a key
role in Java and their extent shouldsoonpermittogobeyond
C++ performances.
The JMF API features facilitate audio extraction.We
present an overview of the JMF API and focus a bit deeperon
insightful functions in relation to audio extraction. It exists
an interesting tutorial [4] to start with the JMF and hence it
is used in the following parts. The official documentations
are also pertinent resource [4]. Furthermore, a complete
book describing JMF API [3] was published in 1998. JMF
gives applications the possible capability to output
multimedia content, capture audio and video through
microphone and camera, do real-time streaming of media
over the Internet, process media and store media into file.
3. METHODOLOGY
3.1 Audio Extraction
This module aims to output an audio file from a media file.
An activity diagram of the module is presented in Figure 1.It
takes as input a file URL and gives audio format of the
output. Next, it checks the file content and creates a list of
individual tracks composing the initial media file. An
exception is thrown if the file content is irrelevant. Once the
tracks are separated, the processor analyzes each track,
selects the first audio track found and discards the rest.
Finally, the audio track is written in a file applying the
default or wished format. In Audio Formats Discussions, it
was previously talked about the audio formats that Sphinx
accepts. At the moment, it principally supportsWAVorRAW
files. Hence, the task of obtaining the preferred format is
complicated. It often requires various treatments and
conversions to reach the mentioned purpose. Audio
extraction provides a way to extract audio from a media file
but does not offer conversion tools.Otherwise,itisadvisable
to use the facilities of VLC for the purpose of getting suitable
audio material. It specially uses the java media framework
(jmf) for audio extraction. Because JMF provides a platform-
neutral framework for handling multimedia [4].
Java Media Framework (jmf): JMF is used for handling
streaming media in Java programs. JMF enables Java
programs to -
• Present multimedia contents,
• Capture audio through microphone and video
content through camera,
• Do instant streaming of media over the Internet,
• Process media(such as changing media format,
adding special effects),
• Store media into a file.
Figure 1. Activity Diagram for Audio Extraction.
3.2 Speech Recognition
Some speech recognition systems make use of "source-
independent speech recognition" while some others use
"training" in which a single speaker reads sections of text
into the SR system. These system analyze the person's
specific voice and use it to find the recognition of that
individual’s speech, which results in more accurate
transcripts. Systems that do notmakeuseoftrainingmethod
is called as "speaker-independent" systems. Systems that
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 03 Issue: 02 | Feb-2016 www.irjet.net p-ISSN: 2395-0072
© 2016, IRJET | Impact Factor value: 4.45 | ISO 9001:2008 Certified Journal | Page 916
make use of such training is called as "speaker-dependent"
systems. Here in our Project we have used the Line playback
device, which lets the voice to the google transcriptor in the
chrome web app and the text is generated from it. Google
Transcriptor gets the voice from the audio file which is
provided by the user, through the LINE playback device and
starts converting that audio into thetextformatanddisplays
on the front end. We can create a file .txt or .rtf using that
text. Using different Sphinx systems [6], for nearly two
decades they have developed and proved their reliability
within several ASR systems where they were linked in. It is
therefore natural we opted for the Sphinx-4 decode. Sphinx-
4 has been entirely written in Java [7], making it totally
portable and able to provide a modular architecture, that
allows us to modify configurations with ease. The
architecture overview wasshowninthe backgroundsection.
We aim to procure advantage of Sphinx-4 features in order
to satisfy the needs of the Automatic Subtitle Generation
Speech recognition module [5]. Subtitle Generation is
expected to select the most suitable models considering the
audio and the parameters (category, length, etc.) given in
input and thereby generate a precise transcript of the audio
speech. The Figure 2 resumes the organization of thespeech
recognition module. Speech Recognition uses the Hidden
Markov Model (HMMs) as an algorithm.
Figure 2.Activity Diagram for Speech Recognition.
3.3 Subtitle Generation:
This module is expected to get a list of words and their
respective speech time-frames from the speech recognition
module and then produce a .srt subtitle file. To do so, the
module must look at the list of words and use silence (SIL)
spoken words as a boundary between for two consecutive
sentences. The Figure 3 offers a visual representation of the
steps to follow for the subtitle generation for sound in
videos. However, we face up to some limiting rule. Indeed, it
will not be able to define punctuation in our system since it
involves much more speech analysis and deeper design.
Figure 3. Activity Diagram for Subtitle Generation.
4. REQUIREMENTS OF PROPOSED SYSTEMS
The requirements of our proposed system is expressed
through Table I.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 03 Issue: 02 | Feb-2016 www.irjet.net p-ISSN: 2395-0072
© 2016, IRJET | Impact Factor value: 4.45 | ISO 9001:2008 Certified Journal | Page 917
TABLE I. REQUIREMENT SPECIFICATIONS
No Technology used Description
1. 1GB RAM Storing
2 Java Speech API It allows to incorporation
of speech technology into
user interfaces for Java
based applets and
applications. This API
defines a cross-platform
interface to help command
and control recognizers
and speech synthesizers.
Sun [12] hasalsodeveloped
the JSGF(Java Speech
Grammar Format) to
supply cross-platform
grammar of speech
recognizers
3 JDK For writing Java applets
and applications and run
them into java language.
4 Apache Tomcat To manage sessions as well
as applications across the
network
5 MySQL Database Storing audio and video
related text files
6 Net beans It is an IDE use to run the
whole project.
5. CONCLUSIONS
A complete system including therequiredmodulescouldnot
be realized since the audio conversion need moreresources.
VLC gave a suitable solution but a custom coded Java
component is expected in future projects so that portability
and installation of the system become uncomplicated.
Nonetheless, the expected output for each phase has been
reached. The audio extraction module supply a suitable
audio format that can be used by the intended speech
recognition module. Speech recognition produce a list of
recognized words and their corresponding time-frames in
the audio content, although the accuracy cannot be
guaranteed. The former list is then used by the intended
subtitle generation module to create standard subtitle file
that is readable by the most common media players
available.
As of Now subtitles are being generated for the specific
format of the Audio files, But In future this restriction of the
format will be removed. English Subtitles of the Regional
Languages will be generated, and the accuracy of the project
will be more precise. Offline content will be web-based in
coming future, and the User can access and generate
subtitles for his video in a feasible way. Overall it will be
Accurate, less time consuming, No language barrier, Web-
based.
REFERENCES
[1] Boris Guenebaut, “Automatic Subtitle
Generation for Sound in Videos”, Tapro 02,
University West, pp. 35, 2009.
[2]Zoubin Ghahramani,“An introduction to
hiddenmarkov models and Bayesian
networks”, in World Scientific PublishingCo.,
Inc. River Edge, NJ, USA, pp.30, 2001.
[3] Robert Gordon, Stephen Talley, and Rob
Gordon.Essential Jmf:Java Media Framework.
Prentice Hall PTR, pp. 17, 1998.
[4] Java tm media framework api guide, 1999.
URL:http://java.sun.com/javase/technolog/
desktp/mdia/jmf/2.1.1/guide/index.html
[5] Frederick Jelinek, Statistical Methods for
Speech Recognition. MIT Press, pp. 7, 1999.
[6] L. Besacier, C. Bergamini, D. Vaufreydaz, and
E. Castelli,“The effect of speech and audio
compression on speech recognition
performance”. In IEEE Multimedia Signal
Processing Workshop,
Cannes:France,2001.URL :http://hal.archives
ouvertes.fr/docs/00/32/61/65/PDF/Besacie
r01b.pdf.
[7] Sphinx-4 a speech recognizer written entirely
in the javatm programming language, pp. 5,
1999-2008. URL
http://cmusphinx.sourceforge.net/sphinx4/.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 03 Issue: 02 | Feb-2016 www.irjet.net p-ISSN: 2395-0072
© 2016, IRJET | Impact Factor value: 4.45 | ISO 9001:2008 Certified Journal | Page 918
[8] J.P. Lewis and Ulrich Neumann. Performance
of java v/s c++
URL:http://www.idiom.com/˜zilla/Comput
er/javaCbenchmark.html.

More Related Content

Similar to Automatic Subtitle Generation For Sound In Videos

Audio computing Image to Text Synthesizer - A Cutting-Edge Content Generator ...
Audio computing Image to Text Synthesizer - A Cutting-Edge Content Generator ...Audio computing Image to Text Synthesizer - A Cutting-Edge Content Generator ...
Audio computing Image to Text Synthesizer - A Cutting-Edge Content Generator ...IRJET Journal
 
Review on content based video lecture retrieval
Review on content based video lecture retrievalReview on content based video lecture retrieval
Review on content based video lecture retrievaleSAT Journals
 
Playing sound files in labview using audacity toolkit
Playing sound files in labview using audacity toolkitPlaying sound files in labview using audacity toolkit
Playing sound files in labview using audacity toolkiteSAT Journals
 
IRJET- Audio Genre Classification using Neural Networks
IRJET-  	  Audio Genre Classification using Neural NetworksIRJET-  	  Audio Genre Classification using Neural Networks
IRJET- Audio Genre Classification using Neural NetworksIRJET Journal
 
IRJET-Raspberry Pi based Reader for Blind People
IRJET-Raspberry Pi based Reader for Blind PeopleIRJET-Raspberry Pi based Reader for Blind People
IRJET-Raspberry Pi based Reader for Blind PeopleIRJET Journal
 
Pacify based video retrieval system
Pacify based video retrieval systemPacify based video retrieval system
Pacify based video retrieval systemeSAT Journals
 
IRJET- Transcription of Conferences
IRJET- Transcription of ConferencesIRJET- Transcription of Conferences
IRJET- Transcription of ConferencesIRJET Journal
 
Voice Recognition System for Automobile Safety.
Voice Recognition System for Automobile Safety.Voice Recognition System for Automobile Safety.
Voice Recognition System for Automobile Safety.IRJET Journal
 
Voice Based Email System For Visual Impaired Using AI
Voice Based Email System For Visual Impaired Using AIVoice Based Email System For Visual Impaired Using AI
Voice Based Email System For Visual Impaired Using AIIRJET Journal
 
VIDEO TO TEXT SUMMARIZER USING AI.pdf
VIDEO TO TEXT SUMMARIZER USING AI.pdfVIDEO TO TEXT SUMMARIZER USING AI.pdf
VIDEO TO TEXT SUMMARIZER USING AI.pdfFreeFire293813
 
IRJET - Automation in Python using Speech Recognition
IRJET -  	  Automation in Python using Speech RecognitionIRJET -  	  Automation in Python using Speech Recognition
IRJET - Automation in Python using Speech RecognitionIRJET Journal
 
A web based approach: Acronym Definition Extraction
A web based approach: Acronym Definition ExtractionA web based approach: Acronym Definition Extraction
A web based approach: Acronym Definition ExtractionIRJET Journal
 
Efficient Intralingual Text To Speech Web Podcasting And Recording
Efficient Intralingual Text To Speech Web Podcasting And RecordingEfficient Intralingual Text To Speech Web Podcasting And Recording
Efficient Intralingual Text To Speech Web Podcasting And RecordingIOSR Journals
 
IRJET-Raspberry Pi Based Reader for Blind People
IRJET-Raspberry Pi Based Reader for Blind PeopleIRJET-Raspberry Pi Based Reader for Blind People
IRJET-Raspberry Pi Based Reader for Blind PeopleIRJET Journal
 
Real-Time Video Copy Detection in Big Data
Real-Time Video Copy Detection in Big DataReal-Time Video Copy Detection in Big Data
Real-Time Video Copy Detection in Big DataIRJET Journal
 

Similar to Automatic Subtitle Generation For Sound In Videos (20)

Audio computing Image to Text Synthesizer - A Cutting-Edge Content Generator ...
Audio computing Image to Text Synthesizer - A Cutting-Edge Content Generator ...Audio computing Image to Text Synthesizer - A Cutting-Edge Content Generator ...
Audio computing Image to Text Synthesizer - A Cutting-Edge Content Generator ...
 
Review on content based video lecture retrieval
Review on content based video lecture retrievalReview on content based video lecture retrieval
Review on content based video lecture retrieval
 
Playing sound files in labview using audacity toolkit
Playing sound files in labview using audacity toolkitPlaying sound files in labview using audacity toolkit
Playing sound files in labview using audacity toolkit
 
Sub1577
Sub1577Sub1577
Sub1577
 
IRJET- Audio Genre Classification using Neural Networks
IRJET-  	  Audio Genre Classification using Neural NetworksIRJET-  	  Audio Genre Classification using Neural Networks
IRJET- Audio Genre Classification using Neural Networks
 
IRJET-Raspberry Pi based Reader for Blind People
IRJET-Raspberry Pi based Reader for Blind PeopleIRJET-Raspberry Pi based Reader for Blind People
IRJET-Raspberry Pi based Reader for Blind People
 
Pacify based video retrieval system
Pacify based video retrieval systemPacify based video retrieval system
Pacify based video retrieval system
 
IRJET- Transcription of Conferences
IRJET- Transcription of ConferencesIRJET- Transcription of Conferences
IRJET- Transcription of Conferences
 
Voice Recognition System for Automobile Safety.
Voice Recognition System for Automobile Safety.Voice Recognition System for Automobile Safety.
Voice Recognition System for Automobile Safety.
 
Video Summarization
Video SummarizationVideo Summarization
Video Summarization
 
Part 6
Part 6Part 6
Part 6
 
Voice Based Email System For Visual Impaired Using AI
Voice Based Email System For Visual Impaired Using AIVoice Based Email System For Visual Impaired Using AI
Voice Based Email System For Visual Impaired Using AI
 
VIDEO TO TEXT SUMMARIZER USING AI.pdf
VIDEO TO TEXT SUMMARIZER USING AI.pdfVIDEO TO TEXT SUMMARIZER USING AI.pdf
VIDEO TO TEXT SUMMARIZER USING AI.pdf
 
IRJET - Automation in Python using Speech Recognition
IRJET -  	  Automation in Python using Speech RecognitionIRJET -  	  Automation in Python using Speech Recognition
IRJET - Automation in Python using Speech Recognition
 
A web based approach: Acronym Definition Extraction
A web based approach: Acronym Definition ExtractionA web based approach: Acronym Definition Extraction
A web based approach: Acronym Definition Extraction
 
Efficient Intralingual Text To Speech Web Podcasting And Recording
Efficient Intralingual Text To Speech Web Podcasting And RecordingEfficient Intralingual Text To Speech Web Podcasting And Recording
Efficient Intralingual Text To Speech Web Podcasting And Recording
 
Sirio Orione and Pan
Sirio Orione and PanSirio Orione and Pan
Sirio Orione and Pan
 
IRJET-Raspberry Pi Based Reader for Blind People
IRJET-Raspberry Pi Based Reader for Blind PeopleIRJET-Raspberry Pi Based Reader for Blind People
IRJET-Raspberry Pi Based Reader for Blind People
 
Arneb
ArnebArneb
Arneb
 
Real-Time Video Copy Detection in Big Data
Real-Time Video Copy Detection in Big DataReal-Time Video Copy Detection in Big Data
Real-Time Video Copy Detection in Big Data
 

More from Asia Smith

PPT - Custom Term Paper PowerPoint Presentation, Free Dow
PPT - Custom Term Paper PowerPoint Presentation, Free DowPPT - Custom Term Paper PowerPoint Presentation, Free Dow
PPT - Custom Term Paper PowerPoint Presentation, Free DowAsia Smith
 
PPT - Expository Text PowerPoint Presentation, F
PPT - Expository Text PowerPoint Presentation, FPPT - Expository Text PowerPoint Presentation, F
PPT - Expository Text PowerPoint Presentation, FAsia Smith
 
Writing A Motivational Letter Outlet Discounts, Save 64
Writing A Motivational Letter Outlet Discounts, Save 64Writing A Motivational Letter Outlet Discounts, Save 64
Writing A Motivational Letter Outlet Discounts, Save 64Asia Smith
 
6 Ways To Write A Brief Description Of Yourself - Wi
6 Ways To Write A Brief Description Of Yourself - Wi6 Ways To Write A Brief Description Of Yourself - Wi
6 Ways To Write A Brief Description Of Yourself - WiAsia Smith
 
Ant Writing Royalty Free Stock Images - Image 158
Ant Writing Royalty Free Stock Images - Image 158Ant Writing Royalty Free Stock Images - Image 158
Ant Writing Royalty Free Stock Images - Image 158Asia Smith
 
Ice Cream Paper Teacher Created Resources, Superki
Ice Cream Paper Teacher Created Resources, SuperkiIce Cream Paper Teacher Created Resources, Superki
Ice Cream Paper Teacher Created Resources, SuperkiAsia Smith
 
Work Experience Essay - GCSE English - Marked B
Work Experience Essay - GCSE English - Marked BWork Experience Essay - GCSE English - Marked B
Work Experience Essay - GCSE English - Marked BAsia Smith
 
Formal Letter Format Useful Example And Writing Tips
Formal Letter Format Useful Example And Writing TipsFormal Letter Format Useful Example And Writing Tips
Formal Letter Format Useful Example And Writing TipsAsia Smith
 
The Importance Of Writing Cafeviena.Pe
The Importance Of Writing Cafeviena.PeThe Importance Of Writing Cafeviena.Pe
The Importance Of Writing Cafeviena.PeAsia Smith
 
The 9 Best College Essay Writing Services In 2023 R
The 9 Best College Essay Writing Services In 2023 RThe 9 Best College Essay Writing Services In 2023 R
The 9 Best College Essay Writing Services In 2023 RAsia Smith
 
Music Training Material - English For Research Paper Writing - St
Music Training Material - English For Research Paper Writing - StMusic Training Material - English For Research Paper Writing - St
Music Training Material - English For Research Paper Writing - StAsia Smith
 
Free Speech Examples For Students To Craft A Best S
Free Speech Examples For Students To Craft A Best SFree Speech Examples For Students To Craft A Best S
Free Speech Examples For Students To Craft A Best SAsia Smith
 
Sample Essay About Career Goals
Sample Essay About Career GoalsSample Essay About Career Goals
Sample Essay About Career GoalsAsia Smith
 
015 Editorial V Essay Thatsnotus
015 Editorial V Essay Thatsnotus015 Editorial V Essay Thatsnotus
015 Editorial V Essay ThatsnotusAsia Smith
 
Analytical Rubrics Essays - Writefiction
Analytical Rubrics Essays - WritefictionAnalytical Rubrics Essays - Writefiction
Analytical Rubrics Essays - WritefictionAsia Smith
 
Short Story Writing Process Chart Writing Short Storie
Short Story Writing Process Chart Writing Short StorieShort Story Writing Process Chart Writing Short Storie
Short Story Writing Process Chart Writing Short StorieAsia Smith
 
Writing Paper Fichrios Decorados, Papis De Escri
Writing Paper Fichrios Decorados, Papis De EscriWriting Paper Fichrios Decorados, Papis De Escri
Writing Paper Fichrios Decorados, Papis De EscriAsia Smith
 
Santa Letterhead Printable - Inspiration Made Simple
Santa Letterhead Printable - Inspiration Made SimpleSanta Letterhead Printable - Inspiration Made Simple
Santa Letterhead Printable - Inspiration Made SimpleAsia Smith
 
Scroll Templates, Writing Practic
Scroll Templates, Writing PracticScroll Templates, Writing Practic
Scroll Templates, Writing PracticAsia Smith
 
Synthesis Essay Help What Is A Synthesis P
Synthesis Essay Help What Is A Synthesis PSynthesis Essay Help What Is A Synthesis P
Synthesis Essay Help What Is A Synthesis PAsia Smith
 

More from Asia Smith (20)

PPT - Custom Term Paper PowerPoint Presentation, Free Dow
PPT - Custom Term Paper PowerPoint Presentation, Free DowPPT - Custom Term Paper PowerPoint Presentation, Free Dow
PPT - Custom Term Paper PowerPoint Presentation, Free Dow
 
PPT - Expository Text PowerPoint Presentation, F
PPT - Expository Text PowerPoint Presentation, FPPT - Expository Text PowerPoint Presentation, F
PPT - Expository Text PowerPoint Presentation, F
 
Writing A Motivational Letter Outlet Discounts, Save 64
Writing A Motivational Letter Outlet Discounts, Save 64Writing A Motivational Letter Outlet Discounts, Save 64
Writing A Motivational Letter Outlet Discounts, Save 64
 
6 Ways To Write A Brief Description Of Yourself - Wi
6 Ways To Write A Brief Description Of Yourself - Wi6 Ways To Write A Brief Description Of Yourself - Wi
6 Ways To Write A Brief Description Of Yourself - Wi
 
Ant Writing Royalty Free Stock Images - Image 158
Ant Writing Royalty Free Stock Images - Image 158Ant Writing Royalty Free Stock Images - Image 158
Ant Writing Royalty Free Stock Images - Image 158
 
Ice Cream Paper Teacher Created Resources, Superki
Ice Cream Paper Teacher Created Resources, SuperkiIce Cream Paper Teacher Created Resources, Superki
Ice Cream Paper Teacher Created Resources, Superki
 
Work Experience Essay - GCSE English - Marked B
Work Experience Essay - GCSE English - Marked BWork Experience Essay - GCSE English - Marked B
Work Experience Essay - GCSE English - Marked B
 
Formal Letter Format Useful Example And Writing Tips
Formal Letter Format Useful Example And Writing TipsFormal Letter Format Useful Example And Writing Tips
Formal Letter Format Useful Example And Writing Tips
 
The Importance Of Writing Cafeviena.Pe
The Importance Of Writing Cafeviena.PeThe Importance Of Writing Cafeviena.Pe
The Importance Of Writing Cafeviena.Pe
 
The 9 Best College Essay Writing Services In 2023 R
The 9 Best College Essay Writing Services In 2023 RThe 9 Best College Essay Writing Services In 2023 R
The 9 Best College Essay Writing Services In 2023 R
 
Music Training Material - English For Research Paper Writing - St
Music Training Material - English For Research Paper Writing - StMusic Training Material - English For Research Paper Writing - St
Music Training Material - English For Research Paper Writing - St
 
Free Speech Examples For Students To Craft A Best S
Free Speech Examples For Students To Craft A Best SFree Speech Examples For Students To Craft A Best S
Free Speech Examples For Students To Craft A Best S
 
Sample Essay About Career Goals
Sample Essay About Career GoalsSample Essay About Career Goals
Sample Essay About Career Goals
 
015 Editorial V Essay Thatsnotus
015 Editorial V Essay Thatsnotus015 Editorial V Essay Thatsnotus
015 Editorial V Essay Thatsnotus
 
Analytical Rubrics Essays - Writefiction
Analytical Rubrics Essays - WritefictionAnalytical Rubrics Essays - Writefiction
Analytical Rubrics Essays - Writefiction
 
Short Story Writing Process Chart Writing Short Storie
Short Story Writing Process Chart Writing Short StorieShort Story Writing Process Chart Writing Short Storie
Short Story Writing Process Chart Writing Short Storie
 
Writing Paper Fichrios Decorados, Papis De Escri
Writing Paper Fichrios Decorados, Papis De EscriWriting Paper Fichrios Decorados, Papis De Escri
Writing Paper Fichrios Decorados, Papis De Escri
 
Santa Letterhead Printable - Inspiration Made Simple
Santa Letterhead Printable - Inspiration Made SimpleSanta Letterhead Printable - Inspiration Made Simple
Santa Letterhead Printable - Inspiration Made Simple
 
Scroll Templates, Writing Practic
Scroll Templates, Writing PracticScroll Templates, Writing Practic
Scroll Templates, Writing Practic
 
Synthesis Essay Help What Is A Synthesis P
Synthesis Essay Help What Is A Synthesis PSynthesis Essay Help What Is A Synthesis P
Synthesis Essay Help What Is A Synthesis P
 

Recently uploaded

Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
CELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxCELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxJiesonDelaCerna
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentInMediaRes1
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfUjwalaBharambe
 
Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...jaredbarbolino94
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...M56BOOKSTORE PRODUCT/SERVICE
 
Capitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitolTechU
 

Recently uploaded (20)

Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
CELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxCELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptx
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media Component
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
 
Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
 
Capitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptx
 

Automatic Subtitle Generation For Sound In Videos

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 03 Issue: 02 | Feb-2016 www.irjet.net p-ISSN: 2395-0072 © 2016, IRJET | Impact Factor value: 4.45 | ISO 9001:2008 Certified Journal | Page 914 Automatic Subtitle Generation for Sound in Videos Kanika Mishra1, Prerak Bhagat2, Prof. Atiya Kazi3 Kanika Mishra, Department of Information Technology, University of Mumbai, Finolex Academy of Management & Technology, Ratnagiri, Maharashtra. Prerak Bhagat, Department of Information Technology, University of Mumbai, Finolex Academy of Management & Technology, Ratnagiri, Maharashtra. Prof.Atiya Kazi, Department of Information Technology, University of Mumbai, Finolex Academy of Management & Technology, Ratnagiri, Maharashtra. ---------------------------------------------------------------------***--------------------------------------------------------------------- Abstract – The last two decades have seen a vast growth in the amount of video content being generated across various platforms and spanning several genres. Apart from the convenient availability of cheap portable media devices, the emergence of dedicated websites tailored specifically for storage and sharing of videoshas increased the value of audio - visual content in eyes of the general populous. However as with any technology, the emergence of audio-visual content as the preferred method of information sharing has brought along with it certain hurdles. Prominent among those is the existence of numerous accents and dialects of any particular language. Another is the sheer number of languages that the content is generated in. Also the hearing impaired audience is unable to receive the audio partof thecontent and thus, unable in most cases, to completely or accurately comprehend the information being presented to them. This situation can be rectified with aid of audio transcripts and necessary translations. The software’s currently available require for most part an active user participation and also have a notable lack of accuracy. The sheer quantity of audio - video information therefore makes it impossible to provide transcriptsforallrelevant data encapsulated in such formats. Hence,theneedfor an automated concept is vividly felt. This thesis report analyses a specific path to automatically generating subtitles for videos using Google's free API's. Due to lack of necessary resources,the translation component has not been included in this specific paper. We propose a three step model to realize our process. The first stage will be extracting the audioto be transcribed from the video file and then converting it into a format compatible withoursecondstageif theneed be. The second stage uses a speech recognition engine to convert the information from audioformattotextformat. The third and final stage will be the encoding of the generated text into a subtitle format which will include adding time frames and necessary pauses and punctuation marks. The results of the resulting works have been encouraging though there exists a tremendous room for improvements and adjustments. Key Words: Audio Extraction, Speech Recognition, Subtitle Generation. 1. INTRODUCTION Video is one of the most popular multimedia used on PCs. So it is essential to make information encompassed in form of sound in videos comprehendible to people with auditory problems. The most natural way lies in the use of subtitles. However, manual subtitle creation is a tedious activity and requires active participation of the user. Hence, the study of automatic subtitle generation presents as a valid subject of research. 2. LITERATURE SURVEY This is overview of concept we got from some papers about automatic subtitle generation for sound in videos. There are many programming languages that can be used in the creation of AutoSubGen. A brief study on internet suggests
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 03 Issue: 02 | Feb-2016 www.irjet.net p-ISSN: 2395-0072 © 2016, IRJET | Impact Factor value: 4.45 | ISO 9001:2008 Certified Journal | Page 915 focus on C/C++ and Java. We now analyze thepositivepoints of the two languages based on the research of J.P. Lewis and Ulrich Neumann [8]. On the one hand, C++ provides remarkable advantages in speed, cross systems performances, and well-tested packages. On the other hand, Java offers an intuitive syntax, portability on multiple Operating Systems and trustedlibraries. They areusedinthe Sphinx speech recognition engine. Java provides the Java Media Framework (JMF) API that allows developers to deal with media tracks (audio and video) in an efficient way. In Lewis’s articles, it appears Java performances are quite similar to C/C++. Besides, in theory, Java performances should be higher. The absence of pointers,theuseof efficient garbage collector and the run-time compilation plays a key role in Java and their extent shouldsoonpermittogobeyond C++ performances. The JMF API features facilitate audio extraction.We present an overview of the JMF API and focus a bit deeperon insightful functions in relation to audio extraction. It exists an interesting tutorial [4] to start with the JMF and hence it is used in the following parts. The official documentations are also pertinent resource [4]. Furthermore, a complete book describing JMF API [3] was published in 1998. JMF gives applications the possible capability to output multimedia content, capture audio and video through microphone and camera, do real-time streaming of media over the Internet, process media and store media into file. 3. METHODOLOGY 3.1 Audio Extraction This module aims to output an audio file from a media file. An activity diagram of the module is presented in Figure 1.It takes as input a file URL and gives audio format of the output. Next, it checks the file content and creates a list of individual tracks composing the initial media file. An exception is thrown if the file content is irrelevant. Once the tracks are separated, the processor analyzes each track, selects the first audio track found and discards the rest. Finally, the audio track is written in a file applying the default or wished format. In Audio Formats Discussions, it was previously talked about the audio formats that Sphinx accepts. At the moment, it principally supportsWAVorRAW files. Hence, the task of obtaining the preferred format is complicated. It often requires various treatments and conversions to reach the mentioned purpose. Audio extraction provides a way to extract audio from a media file but does not offer conversion tools.Otherwise,itisadvisable to use the facilities of VLC for the purpose of getting suitable audio material. It specially uses the java media framework (jmf) for audio extraction. Because JMF provides a platform- neutral framework for handling multimedia [4]. Java Media Framework (jmf): JMF is used for handling streaming media in Java programs. JMF enables Java programs to - • Present multimedia contents, • Capture audio through microphone and video content through camera, • Do instant streaming of media over the Internet, • Process media(such as changing media format, adding special effects), • Store media into a file. Figure 1. Activity Diagram for Audio Extraction. 3.2 Speech Recognition Some speech recognition systems make use of "source- independent speech recognition" while some others use "training" in which a single speaker reads sections of text into the SR system. These system analyze the person's specific voice and use it to find the recognition of that individual’s speech, which results in more accurate transcripts. Systems that do notmakeuseoftrainingmethod is called as "speaker-independent" systems. Systems that
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 03 Issue: 02 | Feb-2016 www.irjet.net p-ISSN: 2395-0072 © 2016, IRJET | Impact Factor value: 4.45 | ISO 9001:2008 Certified Journal | Page 916 make use of such training is called as "speaker-dependent" systems. Here in our Project we have used the Line playback device, which lets the voice to the google transcriptor in the chrome web app and the text is generated from it. Google Transcriptor gets the voice from the audio file which is provided by the user, through the LINE playback device and starts converting that audio into thetextformatanddisplays on the front end. We can create a file .txt or .rtf using that text. Using different Sphinx systems [6], for nearly two decades they have developed and proved their reliability within several ASR systems where they were linked in. It is therefore natural we opted for the Sphinx-4 decode. Sphinx- 4 has been entirely written in Java [7], making it totally portable and able to provide a modular architecture, that allows us to modify configurations with ease. The architecture overview wasshowninthe backgroundsection. We aim to procure advantage of Sphinx-4 features in order to satisfy the needs of the Automatic Subtitle Generation Speech recognition module [5]. Subtitle Generation is expected to select the most suitable models considering the audio and the parameters (category, length, etc.) given in input and thereby generate a precise transcript of the audio speech. The Figure 2 resumes the organization of thespeech recognition module. Speech Recognition uses the Hidden Markov Model (HMMs) as an algorithm. Figure 2.Activity Diagram for Speech Recognition. 3.3 Subtitle Generation: This module is expected to get a list of words and their respective speech time-frames from the speech recognition module and then produce a .srt subtitle file. To do so, the module must look at the list of words and use silence (SIL) spoken words as a boundary between for two consecutive sentences. The Figure 3 offers a visual representation of the steps to follow for the subtitle generation for sound in videos. However, we face up to some limiting rule. Indeed, it will not be able to define punctuation in our system since it involves much more speech analysis and deeper design. Figure 3. Activity Diagram for Subtitle Generation. 4. REQUIREMENTS OF PROPOSED SYSTEMS The requirements of our proposed system is expressed through Table I.
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 03 Issue: 02 | Feb-2016 www.irjet.net p-ISSN: 2395-0072 © 2016, IRJET | Impact Factor value: 4.45 | ISO 9001:2008 Certified Journal | Page 917 TABLE I. REQUIREMENT SPECIFICATIONS No Technology used Description 1. 1GB RAM Storing 2 Java Speech API It allows to incorporation of speech technology into user interfaces for Java based applets and applications. This API defines a cross-platform interface to help command and control recognizers and speech synthesizers. Sun [12] hasalsodeveloped the JSGF(Java Speech Grammar Format) to supply cross-platform grammar of speech recognizers 3 JDK For writing Java applets and applications and run them into java language. 4 Apache Tomcat To manage sessions as well as applications across the network 5 MySQL Database Storing audio and video related text files 6 Net beans It is an IDE use to run the whole project. 5. CONCLUSIONS A complete system including therequiredmodulescouldnot be realized since the audio conversion need moreresources. VLC gave a suitable solution but a custom coded Java component is expected in future projects so that portability and installation of the system become uncomplicated. Nonetheless, the expected output for each phase has been reached. The audio extraction module supply a suitable audio format that can be used by the intended speech recognition module. Speech recognition produce a list of recognized words and their corresponding time-frames in the audio content, although the accuracy cannot be guaranteed. The former list is then used by the intended subtitle generation module to create standard subtitle file that is readable by the most common media players available. As of Now subtitles are being generated for the specific format of the Audio files, But In future this restriction of the format will be removed. English Subtitles of the Regional Languages will be generated, and the accuracy of the project will be more precise. Offline content will be web-based in coming future, and the User can access and generate subtitles for his video in a feasible way. Overall it will be Accurate, less time consuming, No language barrier, Web- based. REFERENCES [1] Boris Guenebaut, “Automatic Subtitle Generation for Sound in Videos”, Tapro 02, University West, pp. 35, 2009. [2]Zoubin Ghahramani,“An introduction to hiddenmarkov models and Bayesian networks”, in World Scientific PublishingCo., Inc. River Edge, NJ, USA, pp.30, 2001. [3] Robert Gordon, Stephen Talley, and Rob Gordon.Essential Jmf:Java Media Framework. Prentice Hall PTR, pp. 17, 1998. [4] Java tm media framework api guide, 1999. URL:http://java.sun.com/javase/technolog/ desktp/mdia/jmf/2.1.1/guide/index.html [5] Frederick Jelinek, Statistical Methods for Speech Recognition. MIT Press, pp. 7, 1999. [6] L. Besacier, C. Bergamini, D. Vaufreydaz, and E. Castelli,“The effect of speech and audio compression on speech recognition performance”. In IEEE Multimedia Signal Processing Workshop, Cannes:France,2001.URL :http://hal.archives ouvertes.fr/docs/00/32/61/65/PDF/Besacie r01b.pdf. [7] Sphinx-4 a speech recognizer written entirely in the javatm programming language, pp. 5, 1999-2008. URL http://cmusphinx.sourceforge.net/sphinx4/.
  • 5. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 03 Issue: 02 | Feb-2016 www.irjet.net p-ISSN: 2395-0072 © 2016, IRJET | Impact Factor value: 4.45 | ISO 9001:2008 Certified Journal | Page 918 [8] J.P. Lewis and Ulrich Neumann. Performance of java v/s c++ URL:http://www.idiom.com/˜zilla/Comput er/javaCbenchmark.html.