SlideShare a Scribd company logo
KNOWLEDGECUDDLE PUBLICATION
International Journal of Research in Advance Engineering (IJRAE) – Volume-1 Issue -1
1
www.knowledgecuddle.com
Animal Voice Morphing System
Siddhapara Milan v.
milansiddhapara@gmial.com
Gujarat Technological University,
Gujarat, India.
Abstract:This paper describe the morphing concept in which we convert the voice of any person into pre -analyzed or pre-
recorded voice of any animals.As the user generate a pre-established voice, his pitch, timbre, vibrato and articulation can
be modified to resemble those of a pre-recorded and pre-analyzed voice of animal. This technique is based on SMS. Thus
using this concept we can develop many funny application and we can used this type of application in mobile device,
personal computer etc. for enjoying the sometime of period.
Keywords: Voice morphing, Animal.
I. Introduction
In general morphing means “conversion of some stages into some another stages”. Similarly in voice morphing us
convert the voice generated from the any source into pre-analyzed or predict target voice in other word voice morphing means
the transition of one speech signal into another. Like image morphing, speech morphing aims to preserve the shared
characteristics of the starting and final signals, while generating a smooth transition between them. In voice morphing
processone speech signal should smoothly change into another, keeping the shared characteristics of the starting and ending
signals but smoothly changing the other properties. The major properties of concern as far as a speech signal is concerned are
its pitch and envelope information. These two reside in a convolved form in a speech signal. Hence some efficient method for
extracting each of these is necessary.The main goal of the developed voice morphing methods is the smooth transformation from
one sound to another, thus, the combination of two sounds to create a new sound with an intermediate timbre. Most of these methods
are based on the interpolation of sounds parameterizations resulting from analysis/synthesis techniques, such as the Short-timeFourier
Transform (STFT), Linear Predictive Coding (LPC) or Sinusoidal Models.
In this paper we focus on specific concept, we convert the voice of any person into predict animal voice to generate
some joyful environment. In this paper user convert the pitch, articulation and also able to control the degree of morphing to
achieve the desire objective. Morphing take place in following stages, first system understand the voice of source (person) and
find same sound Pieces from the target voice, then interpolate the selected attributes and produced the morphed voice In this
paper [1] the implantation is discussed.
KNOWLEDGECUDDLE PUBLICATION
International Journal of Research in Advance Engineering (IJRAE) – Volume-1 Issue -1
2
www.knowledgecuddle.com
II. Voice Morphing System
Fig.1 shows the general block diagram of voice imitator system. The underlying analysis technique is SMS.Also in
system recognition and alignment module was added for synchronizing the user’s voice with the target voice before the
morphing is done.Before we can morph a particular voice we have to supplyinformation about target information(predict animal
voice ) and voice to be morphed. The system requires the phonetic transcription of the lyrics, the melody as MIDI data, and the actual
recording to be used as the target audio data. Thus, a good impersonator of any animal that originally Voice has to be recorded. This
recording has to be analysed with SMS, segmented into “morphing units”, and each unit labelled with the appropriate note and
phonetic information of the song. This preparation stage is done semi-automatically, using a non-real time application developed for
this task. The first module of the running system includes the real-time analysis and the recognition/alignment steps. Each analysis
frame, with the appropriate parameterization, is associated with the phoneme of a specific moment of the voice and thus with a target
frame. The recognition/alignment algorithm is based on traditional speech recognition technology, that is, Hidden Markov Models
(HMM) that were adapted to the singing voice [2]
The source sound are segmented into number of frames and each frame are individual analyzed. The recognitionalignment
algorithm is based on hidden Markov models (HMM) [3] and when the source frame matched with target frame then we
morphed them and get the desired output.
III. Spectral Modeling Synthesis
We use the term SMS to describe our ongoing research on the development of techniques and software applications for the
analysis, transformation and synthesis of musical sounds based on spectral models [2].Information on SMS and public domain
software applications based on it can be found in our Web site (http://www.iua.upf.es/~sms).
IV. Analysis Using SMS
Analysis processes can be categorized as follows.
 Preprocessing or representation conversion: This involves processes like signal acquisition in discrete form and
windowing.
 Cepstral analysis or Pitch and Envelope analysis: This process will extract the pitch and formant information in the
speech signal.
 Morphing which includes Warping and interpolation.
KNOWLEDGECUDDLE PUBLICATION
International Journal of Research in Advance Engineering (IJRAE) – Volume-1 Issue -1
3
www.knowledgecuddle.com
 Signal re-estimation.
The traditional SMS analysis output is a collection of frequency and amplitude values that represent the partials of the
sound (sinusoidal component), and either filter coefficients with a gain value or spectral magnitudes and phases representing
the residual sound (non sinusoidal component) [4]
The major improvement is done SMS model is alignment and analysis. A major improvement to SMS has been the real-time
implementation of the whole analysis/synthesis process, with a processing latency of less than 30 milliseconds.
V. Phonetic recognition/ alignment
This part of the system is responsible for recognizing the voice that is being generated from the userthat a similar segment
can be chosen from the target information.
The huge amount of research is carry out in speech recognition.The recognition systems work well when tested in the well-
controlled.The recognition systems work reasonably well when tested in the well-controlled environment of the laboratory.
However, phoneme recognition rates decay miserably when the conditions are adverse.
A. SMorphing
Speech morphing can be achieved by transforming the signal.To prepare the signal for the transformation, it is split
into a number of 'frames' - sections of the waveform. The transformation is then applied to each frame of the signal. This
provides another way of viewing the signal information. The new representation (said to be in the frequency domain) describes
the average energy present at each frequency band. Further analysis enables two pieces of information to be obtained: pitch
information and the overall envelope of the sound. A key element in the morphing is the manipulation of the pitch information.
If two signals with different pitches were simply cross-faded it is highly likely that two separate sounds will be heard. This
occurs because the signal will have two distinct pitches causing the auditory system to perceive two different objects. A
successful morph must exhibit a smoothly changing pitch throughout. The pitch information of each sound is compared to
provide the best match between the two signals' pitches. To do this match, the signals are stretched and compressed so that
important sections of each signal match in time. The interpolation of the two sounds can then be performed which creates the
intermediate sounds in the morph. The final stage is then to convert the frames back into a normal waveform.
However, after the morphing has been performed, the legacy of the earlier analysis becomes apparent. The conversion of the
sound to a representation in which the pitch and spectral envelope can be separated loses some information. Therefore, this
information has to be re-estimated for the morphed sound. This process obtains an acoustic waveform, which can then be
stored or listened to.
Depending on the voice of person (user), a unit from the target is selected. Each frame from the user is morphed with
a different frame from the target, advancing sequentially in time. Then the user has the choice to interpolate the different
parameters extracted at the analysis stage, such as amplitude, fundamental frequency, spectral shape, residual signal, etc. In
general the amplitude will not be interpolated, thus always using the amplitude from the user and the unvoiced phonemes will
also not be morphed, thus always using the consonants from the user. This will give the user the feeling of being in control.In
most cases the durations of the user and target voice to be morphed will be different. If a given user’s voice is shorter than the
one from the target the system will simply skip the remaining part of the target voice and go directly to the articulation portion.
In the case when the user voiceisa longer than the one present in the target data the system enters in the loop mode. Each
voiced phoneme of the target has a loop point frame, marked in the preprocessing, non-real time stage. The system uses this
frame to loop-synthesis in case the user sings beyond that point in the phoneme. Once we reach this frame in the target, the rest
of the frames of the user will be interpolated with that same frame until the user ends the phoneme.
VI. CONCLUSION
KNOWLEDGECUDDLE PUBLICATION
International Journal of Research in Advance Engineering (IJRAE) – Volume-1 Issue -1
4
www.knowledgecuddle.com
The final purpose of the system is to make joyful voice from the user’s (source) voice. Means using this we developed the
many funny application but we need the some voice sample of target voice. For instance the voice of Karan is translated into
voice of cat. But Limitation is that we need a clear voice from the source so we can easily recognize or analyzed it. If there is
no clear voice of user then sample of that voice will converted into some another sound pieces of target animal. So if we have
clear voice then we easily convert into target vice and enjoy the sometime of period.
REFERENCES
[1]. Cano, P., A. Loscos, J. Bonada, M. de Boer, X. Serra.2000. “Singing Voice Impersonator Application for PC”.
Proceedings of the ICMC 2000.
[2]. Serra, X. 1997. “Musical Sound Modeling with Sinusoids plus Noise”. G. D. Poli and others (eds.), MusicalSignal
Processing, Swets & Zeitlinger Publishers, 1997.
[3]. Loscos, A., P. Cano, J. Bonada. 1999. “Low-Delay Singing Voice Alignment to text”. Proceedings ofthe ICMC 1999.
[4]. Serra, X. 1997. “Musical Sound Modeling with Sinusoids plus Noise”. G. D. Poli and others (eds.), Musical Signal
Processing, Swets & Zeitlinger Publishers, 1997.

More Related Content

What's hot

speech enhancement
speech enhancementspeech enhancement
speech enhancement
senthilrajvlsi
 
Environmental Sound detection Using MFCC technique
Environmental Sound detection Using MFCC techniqueEnvironmental Sound detection Using MFCC technique
Environmental Sound detection Using MFCC techniquePankaj Kumar
 
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
ijceronline
 
Human Emotion Recognition From Speech
Human Emotion Recognition From SpeechHuman Emotion Recognition From Speech
Human Emotion Recognition From Speech
IJERA Editor
 
Speaker recognition using MFCC
Speaker recognition using MFCCSpeaker recognition using MFCC
Speaker recognition using MFCCHira Shaukat
 
Wavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker RecognitionWavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker Recognition
CSCJournals
 
Comparison of Feature Extraction MFCC and LPC in Automatic Speech Recognition...
Comparison of Feature Extraction MFCC and LPC in Automatic Speech Recognition...Comparison of Feature Extraction MFCC and LPC in Automatic Speech Recognition...
Comparison of Feature Extraction MFCC and LPC in Automatic Speech Recognition...
TELKOMNIKA JOURNAL
 
Speech Signal Analysis
Speech Signal AnalysisSpeech Signal Analysis
Speech Signal Analysis
Pradeep Reddy Guvvala
 
An Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech featuresAn Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech features
Sivaranjan Goswami
 
Speaker recognition on matlab
Speaker recognition on matlabSpeaker recognition on matlab
Speaker recognition on matlab
Arcanjo Salazaku
 
IRJET- Pitch Detection Algorithms in Time Domain
IRJET- Pitch Detection Algorithms in Time DomainIRJET- Pitch Detection Algorithms in Time Domain
IRJET- Pitch Detection Algorithms in Time Domain
IRJET Journal
 
Isolated words recognition using mfcc, lpc and neural network
Isolated words recognition using mfcc, lpc and neural networkIsolated words recognition using mfcc, lpc and neural network
Isolated words recognition using mfcc, lpc and neural network
eSAT Journals
 
COLEA : A MATLAB Tool for Speech Analysis
COLEA : A MATLAB Tool for Speech AnalysisCOLEA : A MATLAB Tool for Speech Analysis
COLEA : A MATLAB Tool for Speech Analysis
Rushin Shah
 
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD Editor
 
Wavelet Based Feature Extraction for the Indonesian CV Syllables Sound
Wavelet Based Feature Extraction for the Indonesian CV Syllables SoundWavelet Based Feature Extraction for the Indonesian CV Syllables Sound
Wavelet Based Feature Extraction for the Indonesian CV Syllables Sound
TELKOMNIKA JOURNAL
 
44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognitionsunnysyed
 
Rain Technology
Rain TechnologyRain Technology
Rain Technology
amal390
 
Speech Enhancement Using A Minimum Mean Square Error Short Time Spectral Ampl...
Speech Enhancement Using A Minimum Mean Square Error Short Time Spectral Ampl...Speech Enhancement Using A Minimum Mean Square Error Short Time Spectral Ampl...
Speech Enhancement Using A Minimum Mean Square Error Short Time Spectral Ampl...guestfb80e22
 

What's hot (20)

speech enhancement
speech enhancementspeech enhancement
speech enhancement
 
Environmental Sound detection Using MFCC technique
Environmental Sound detection Using MFCC techniqueEnvironmental Sound detection Using MFCC technique
Environmental Sound detection Using MFCC technique
 
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
 
Human Emotion Recognition From Speech
Human Emotion Recognition From SpeechHuman Emotion Recognition From Speech
Human Emotion Recognition From Speech
 
Speaker recognition using MFCC
Speaker recognition using MFCCSpeaker recognition using MFCC
Speaker recognition using MFCC
 
Speaker recognition.
Speaker recognition.Speaker recognition.
Speaker recognition.
 
Wavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker RecognitionWavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker Recognition
 
Comparison of Feature Extraction MFCC and LPC in Automatic Speech Recognition...
Comparison of Feature Extraction MFCC and LPC in Automatic Speech Recognition...Comparison of Feature Extraction MFCC and LPC in Automatic Speech Recognition...
Comparison of Feature Extraction MFCC and LPC in Automatic Speech Recognition...
 
Speech Signal Analysis
Speech Signal AnalysisSpeech Signal Analysis
Speech Signal Analysis
 
An Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech featuresAn Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech features
 
Speaker recognition on matlab
Speaker recognition on matlabSpeaker recognition on matlab
Speaker recognition on matlab
 
IRJET- Pitch Detection Algorithms in Time Domain
IRJET- Pitch Detection Algorithms in Time DomainIRJET- Pitch Detection Algorithms in Time Domain
IRJET- Pitch Detection Algorithms in Time Domain
 
Isolated words recognition using mfcc, lpc and neural network
Isolated words recognition using mfcc, lpc and neural networkIsolated words recognition using mfcc, lpc and neural network
Isolated words recognition using mfcc, lpc and neural network
 
COLEA : A MATLAB Tool for Speech Analysis
COLEA : A MATLAB Tool for Speech AnalysisCOLEA : A MATLAB Tool for Speech Analysis
COLEA : A MATLAB Tool for Speech Analysis
 
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
 
50120140501002
5012014050100250120140501002
50120140501002
 
Wavelet Based Feature Extraction for the Indonesian CV Syllables Sound
Wavelet Based Feature Extraction for the Indonesian CV Syllables SoundWavelet Based Feature Extraction for the Indonesian CV Syllables Sound
Wavelet Based Feature Extraction for the Indonesian CV Syllables Sound
 
44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition
 
Rain Technology
Rain TechnologyRain Technology
Rain Technology
 
Speech Enhancement Using A Minimum Mean Square Error Short Time Spectral Ampl...
Speech Enhancement Using A Minimum Mean Square Error Short Time Spectral Ampl...Speech Enhancement Using A Minimum Mean Square Error Short Time Spectral Ampl...
Speech Enhancement Using A Minimum Mean Square Error Short Time Spectral Ampl...
 

Viewers also liked

Can You Qualify for Low Income Housing Tax Credits: A Guide for Land Developr...
Can You Qualify for Low Income Housing Tax Credits: A Guide for Land Developr...Can You Qualify for Low Income Housing Tax Credits: A Guide for Land Developr...
Can You Qualify for Low Income Housing Tax Credits: A Guide for Land Developr...
Curley & Rothman, LLC
 
Proyecto para Marqueting Digital
Proyecto para Marqueting DigitalProyecto para Marqueting Digital
Proyecto para Marqueting Digital
Raul Matamoros Hernandez
 
LINDA HARRISON RESUME 12B
LINDA HARRISON  RESUME  12BLINDA HARRISON  RESUME  12B
LINDA HARRISON RESUME 12BLinda Harrison
 
Resume_Vikash1056@gmail
Resume_Vikash1056@gmailResume_Vikash1056@gmail
Resume_Vikash1056@gmailvikash kumar
 
Due Diligence Checklist for Commercial Real Estate Developers
Due Diligence Checklist for Commercial Real Estate DevelopersDue Diligence Checklist for Commercial Real Estate Developers
Due Diligence Checklist for Commercial Real Estate Developers
Curley & Rothman, LLC
 
St 022 2005
St 022 2005St 022 2005
St 022 2005
brianthomes841
 
Cultura china
Cultura chinaCultura china
Cultura china
Leydi Súarez
 
Efficient String Matching Algorithm for Intrusion Detection
Efficient String Matching Algorithm for Intrusion DetectionEfficient String Matching Algorithm for Intrusion Detection
Efficient String Matching Algorithm for Intrusion Detection
editor1knowledgecuddle
 
Umfrage Flirten im Ulraub
Umfrage Flirten im UlraubUmfrage Flirten im Ulraub
Umfrage Flirten im Ulraub
LTUR_Presse
 
What are the right metrics to measure process efficiency
What are the right metrics to measure process efficiencyWhat are the right metrics to measure process efficiency
What are the right metrics to measure process efficiency
Alok Bararia
 
Catalogo2017
Catalogo2017Catalogo2017
Catalogo2017
Pakko Difratex
 
koenig-social-media-advertising-case-study
koenig-social-media-advertising-case-studykoenig-social-media-advertising-case-study
koenig-social-media-advertising-case-studyGarima Mathur
 
Education: Public’s Feedback
Education: Public’s FeedbackEducation: Public’s Feedback
Education: Public’s Feedback
Ipsos
 

Viewers also liked (14)

Can You Qualify for Low Income Housing Tax Credits: A Guide for Land Developr...
Can You Qualify for Low Income Housing Tax Credits: A Guide for Land Developr...Can You Qualify for Low Income Housing Tax Credits: A Guide for Land Developr...
Can You Qualify for Low Income Housing Tax Credits: A Guide for Land Developr...
 
Proyecto para Marqueting Digital
Proyecto para Marqueting DigitalProyecto para Marqueting Digital
Proyecto para Marqueting Digital
 
LINDA HARRISON RESUME 12B
LINDA HARRISON  RESUME  12BLINDA HARRISON  RESUME  12B
LINDA HARRISON RESUME 12B
 
Resume_Vikash1056@gmail
Resume_Vikash1056@gmailResume_Vikash1056@gmail
Resume_Vikash1056@gmail
 
Due Diligence Checklist for Commercial Real Estate Developers
Due Diligence Checklist for Commercial Real Estate DevelopersDue Diligence Checklist for Commercial Real Estate Developers
Due Diligence Checklist for Commercial Real Estate Developers
 
St 022 2005
St 022 2005St 022 2005
St 022 2005
 
Cultura china
Cultura chinaCultura china
Cultura china
 
Efficient String Matching Algorithm for Intrusion Detection
Efficient String Matching Algorithm for Intrusion DetectionEfficient String Matching Algorithm for Intrusion Detection
Efficient String Matching Algorithm for Intrusion Detection
 
Umfrage Flirten im Ulraub
Umfrage Flirten im UlraubUmfrage Flirten im Ulraub
Umfrage Flirten im Ulraub
 
研究论文
研究论文研究论文
研究论文
 
What are the right metrics to measure process efficiency
What are the right metrics to measure process efficiencyWhat are the right metrics to measure process efficiency
What are the right metrics to measure process efficiency
 
Catalogo2017
Catalogo2017Catalogo2017
Catalogo2017
 
koenig-social-media-advertising-case-study
koenig-social-media-advertising-case-studykoenig-social-media-advertising-case-study
koenig-social-media-advertising-case-study
 
Education: Public’s Feedback
Education: Public’s FeedbackEducation: Public’s Feedback
Education: Public’s Feedback
 

Similar to Animal Voice Morphing System

Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using Vocoder
IJTET Journal
 
Emotion Recognition Based On Audio Speech
Emotion Recognition Based On Audio SpeechEmotion Recognition Based On Audio Speech
Emotion Recognition Based On Audio Speech
IOSR Journals
 
Audio/Speech Signal Analysis for Depression
Audio/Speech Signal Analysis for DepressionAudio/Speech Signal Analysis for Depression
Audio/Speech Signal Analysis for Depression
ijsrd.com
 
Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features  Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features
ijsc
 
Survey On Speech Synthesis
Survey On Speech SynthesisSurvey On Speech Synthesis
Survey On Speech Synthesis
CSCJournals
 
Effect of Dynamic Time Warping on Alignment of Phrases and Phonemes
Effect of Dynamic Time Warping on Alignment of Phrases and PhonemesEffect of Dynamic Time Warping on Alignment of Phrases and Phonemes
Effect of Dynamic Time Warping on Alignment of Phrases and Phonemes
kevig
 
Voice Morphing
Voice MorphingVoice Morphing
Voice MorphingSayyed Z
 
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
CSCJournals
 
Effect of MFCC Based Features for Speech Signal Alignments
Effect of MFCC Based Features for Speech Signal AlignmentsEffect of MFCC Based Features for Speech Signal Alignments
Effect of MFCC Based Features for Speech Signal Alignments
kevig
 
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTSEFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
ijnlc
 
Kc3517481754
Kc3517481754Kc3517481754
Kc3517481754
IJERA Editor
 
Voice morphing-101113123852-phpapp01
Voice morphing-101113123852-phpapp01Voice morphing-101113123852-phpapp01
Voice morphing-101113123852-phpapp01
Rehan Ahmed
 
Voice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from LaryngectomyVoice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from Laryngectomy
International Journal of Science and Research (IJSR)
 
International journal of signal and image processing issues vol 2015 - no 1...
International journal of signal and image processing issues   vol 2015 - no 1...International journal of signal and image processing issues   vol 2015 - no 1...
International journal of signal and image processing issues vol 2015 - no 1...
sophiabelthome
 
De4201715719
De4201715719De4201715719
De4201715719
IJERA Editor
 
H42045359
H42045359H42045359
H42045359
IJERA Editor
 
Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...
IRJET Journal
 
Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...
IRJET Journal
 
Emotion Recognition Based on Speech Signals by Combining Empirical Mode Decom...
Emotion Recognition Based on Speech Signals by Combining Empirical Mode Decom...Emotion Recognition Based on Speech Signals by Combining Empirical Mode Decom...
Emotion Recognition Based on Speech Signals by Combining Empirical Mode Decom...
BIJIAM Journal
 

Similar to Animal Voice Morphing System (20)

Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using Vocoder
 
Emotion Recognition Based On Audio Speech
Emotion Recognition Based On Audio SpeechEmotion Recognition Based On Audio Speech
Emotion Recognition Based On Audio Speech
 
Audio/Speech Signal Analysis for Depression
Audio/Speech Signal Analysis for DepressionAudio/Speech Signal Analysis for Depression
Audio/Speech Signal Analysis for Depression
 
Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features  Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features
 
Survey On Speech Synthesis
Survey On Speech SynthesisSurvey On Speech Synthesis
Survey On Speech Synthesis
 
An Introduction To Speech Recognition
An Introduction To Speech RecognitionAn Introduction To Speech Recognition
An Introduction To Speech Recognition
 
Effect of Dynamic Time Warping on Alignment of Phrases and Phonemes
Effect of Dynamic Time Warping on Alignment of Phrases and PhonemesEffect of Dynamic Time Warping on Alignment of Phrases and Phonemes
Effect of Dynamic Time Warping on Alignment of Phrases and Phonemes
 
Voice Morphing
Voice MorphingVoice Morphing
Voice Morphing
 
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
 
Effect of MFCC Based Features for Speech Signal Alignments
Effect of MFCC Based Features for Speech Signal AlignmentsEffect of MFCC Based Features for Speech Signal Alignments
Effect of MFCC Based Features for Speech Signal Alignments
 
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTSEFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
 
Kc3517481754
Kc3517481754Kc3517481754
Kc3517481754
 
Voice morphing-101113123852-phpapp01
Voice morphing-101113123852-phpapp01Voice morphing-101113123852-phpapp01
Voice morphing-101113123852-phpapp01
 
Voice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from LaryngectomyVoice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from Laryngectomy
 
International journal of signal and image processing issues vol 2015 - no 1...
International journal of signal and image processing issues   vol 2015 - no 1...International journal of signal and image processing issues   vol 2015 - no 1...
International journal of signal and image processing issues vol 2015 - no 1...
 
De4201715719
De4201715719De4201715719
De4201715719
 
H42045359
H42045359H42045359
H42045359
 
Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...
 
Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...
 
Emotion Recognition Based on Speech Signals by Combining Empirical Mode Decom...
Emotion Recognition Based on Speech Signals by Combining Empirical Mode Decom...Emotion Recognition Based on Speech Signals by Combining Empirical Mode Decom...
Emotion Recognition Based on Speech Signals by Combining Empirical Mode Decom...
 

Recently uploaded

AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdfAKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
SamSarthak3
 
LIGA(E)11111111111111111111111111111111111111111.ppt
LIGA(E)11111111111111111111111111111111111111111.pptLIGA(E)11111111111111111111111111111111111111111.ppt
LIGA(E)11111111111111111111111111111111111111111.ppt
ssuser9bd3ba
 
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang,  ICLR 2024, MLILAB, KAIST AI.pdfJ.Yang,  ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
MLILAB
 
Gen AI Study Jams _ For the GDSC Leads in India.pdf
Gen AI Study Jams _ For the GDSC Leads in India.pdfGen AI Study Jams _ For the GDSC Leads in India.pdf
Gen AI Study Jams _ For the GDSC Leads in India.pdf
gdsczhcet
 
Standard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - NeometrixStandard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - Neometrix
Neometrix_Engineering_Pvt_Ltd
 
WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234
AafreenAbuthahir2
 
ethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.pptethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.ppt
Jayaprasanna4
 
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
AJAYKUMARPUND1
 
COLLEGE BUS MANAGEMENT SYSTEM PROJECT REPORT.pdf
COLLEGE BUS MANAGEMENT SYSTEM PROJECT REPORT.pdfCOLLEGE BUS MANAGEMENT SYSTEM PROJECT REPORT.pdf
COLLEGE BUS MANAGEMENT SYSTEM PROJECT REPORT.pdf
Kamal Acharya
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
Kamal Acharya
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
R&R Consult
 
ethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.pptethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.ppt
Jayaprasanna4
 
ASME IX(9) 2007 Full Version .pdf
ASME IX(9)  2007 Full Version       .pdfASME IX(9)  2007 Full Version       .pdf
ASME IX(9) 2007 Full Version .pdf
AhmedHussein950959
 
Vaccine management system project report documentation..pdf
Vaccine management system project report documentation..pdfVaccine management system project report documentation..pdf
Vaccine management system project report documentation..pdf
Kamal Acharya
 
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
Amil Baba Dawood bangali
 
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Dr.Costas Sachpazis
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
JoytuBarua2
 
road safety engineering r s e unit 3.pdf
road safety engineering  r s e unit 3.pdfroad safety engineering  r s e unit 3.pdf
road safety engineering r s e unit 3.pdf
VENKATESHvenky89705
 
power quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptxpower quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptx
ViniHema
 
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
obonagu
 

Recently uploaded (20)

AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdfAKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
 
LIGA(E)11111111111111111111111111111111111111111.ppt
LIGA(E)11111111111111111111111111111111111111111.pptLIGA(E)11111111111111111111111111111111111111111.ppt
LIGA(E)11111111111111111111111111111111111111111.ppt
 
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang,  ICLR 2024, MLILAB, KAIST AI.pdfJ.Yang,  ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
 
Gen AI Study Jams _ For the GDSC Leads in India.pdf
Gen AI Study Jams _ For the GDSC Leads in India.pdfGen AI Study Jams _ For the GDSC Leads in India.pdf
Gen AI Study Jams _ For the GDSC Leads in India.pdf
 
Standard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - NeometrixStandard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - Neometrix
 
WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234
 
ethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.pptethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.ppt
 
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
 
COLLEGE BUS MANAGEMENT SYSTEM PROJECT REPORT.pdf
COLLEGE BUS MANAGEMENT SYSTEM PROJECT REPORT.pdfCOLLEGE BUS MANAGEMENT SYSTEM PROJECT REPORT.pdf
COLLEGE BUS MANAGEMENT SYSTEM PROJECT REPORT.pdf
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
 
ethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.pptethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.ppt
 
ASME IX(9) 2007 Full Version .pdf
ASME IX(9)  2007 Full Version       .pdfASME IX(9)  2007 Full Version       .pdf
ASME IX(9) 2007 Full Version .pdf
 
Vaccine management system project report documentation..pdf
Vaccine management system project report documentation..pdfVaccine management system project report documentation..pdf
Vaccine management system project report documentation..pdf
 
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
 
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
 
road safety engineering r s e unit 3.pdf
road safety engineering  r s e unit 3.pdfroad safety engineering  r s e unit 3.pdf
road safety engineering r s e unit 3.pdf
 
power quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptxpower quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptx
 
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
 

Animal Voice Morphing System

  • 1. KNOWLEDGECUDDLE PUBLICATION International Journal of Research in Advance Engineering (IJRAE) – Volume-1 Issue -1 1 www.knowledgecuddle.com Animal Voice Morphing System Siddhapara Milan v. milansiddhapara@gmial.com Gujarat Technological University, Gujarat, India. Abstract:This paper describe the morphing concept in which we convert the voice of any person into pre -analyzed or pre- recorded voice of any animals.As the user generate a pre-established voice, his pitch, timbre, vibrato and articulation can be modified to resemble those of a pre-recorded and pre-analyzed voice of animal. This technique is based on SMS. Thus using this concept we can develop many funny application and we can used this type of application in mobile device, personal computer etc. for enjoying the sometime of period. Keywords: Voice morphing, Animal. I. Introduction In general morphing means “conversion of some stages into some another stages”. Similarly in voice morphing us convert the voice generated from the any source into pre-analyzed or predict target voice in other word voice morphing means the transition of one speech signal into another. Like image morphing, speech morphing aims to preserve the shared characteristics of the starting and final signals, while generating a smooth transition between them. In voice morphing processone speech signal should smoothly change into another, keeping the shared characteristics of the starting and ending signals but smoothly changing the other properties. The major properties of concern as far as a speech signal is concerned are its pitch and envelope information. These two reside in a convolved form in a speech signal. Hence some efficient method for extracting each of these is necessary.The main goal of the developed voice morphing methods is the smooth transformation from one sound to another, thus, the combination of two sounds to create a new sound with an intermediate timbre. Most of these methods are based on the interpolation of sounds parameterizations resulting from analysis/synthesis techniques, such as the Short-timeFourier Transform (STFT), Linear Predictive Coding (LPC) or Sinusoidal Models. In this paper we focus on specific concept, we convert the voice of any person into predict animal voice to generate some joyful environment. In this paper user convert the pitch, articulation and also able to control the degree of morphing to achieve the desire objective. Morphing take place in following stages, first system understand the voice of source (person) and find same sound Pieces from the target voice, then interpolate the selected attributes and produced the morphed voice In this paper [1] the implantation is discussed.
  • 2. KNOWLEDGECUDDLE PUBLICATION International Journal of Research in Advance Engineering (IJRAE) – Volume-1 Issue -1 2 www.knowledgecuddle.com II. Voice Morphing System Fig.1 shows the general block diagram of voice imitator system. The underlying analysis technique is SMS.Also in system recognition and alignment module was added for synchronizing the user’s voice with the target voice before the morphing is done.Before we can morph a particular voice we have to supplyinformation about target information(predict animal voice ) and voice to be morphed. The system requires the phonetic transcription of the lyrics, the melody as MIDI data, and the actual recording to be used as the target audio data. Thus, a good impersonator of any animal that originally Voice has to be recorded. This recording has to be analysed with SMS, segmented into “morphing units”, and each unit labelled with the appropriate note and phonetic information of the song. This preparation stage is done semi-automatically, using a non-real time application developed for this task. The first module of the running system includes the real-time analysis and the recognition/alignment steps. Each analysis frame, with the appropriate parameterization, is associated with the phoneme of a specific moment of the voice and thus with a target frame. The recognition/alignment algorithm is based on traditional speech recognition technology, that is, Hidden Markov Models (HMM) that were adapted to the singing voice [2] The source sound are segmented into number of frames and each frame are individual analyzed. The recognitionalignment algorithm is based on hidden Markov models (HMM) [3] and when the source frame matched with target frame then we morphed them and get the desired output. III. Spectral Modeling Synthesis We use the term SMS to describe our ongoing research on the development of techniques and software applications for the analysis, transformation and synthesis of musical sounds based on spectral models [2].Information on SMS and public domain software applications based on it can be found in our Web site (http://www.iua.upf.es/~sms). IV. Analysis Using SMS Analysis processes can be categorized as follows.  Preprocessing or representation conversion: This involves processes like signal acquisition in discrete form and windowing.  Cepstral analysis or Pitch and Envelope analysis: This process will extract the pitch and formant information in the speech signal.  Morphing which includes Warping and interpolation.
  • 3. KNOWLEDGECUDDLE PUBLICATION International Journal of Research in Advance Engineering (IJRAE) – Volume-1 Issue -1 3 www.knowledgecuddle.com  Signal re-estimation. The traditional SMS analysis output is a collection of frequency and amplitude values that represent the partials of the sound (sinusoidal component), and either filter coefficients with a gain value or spectral magnitudes and phases representing the residual sound (non sinusoidal component) [4] The major improvement is done SMS model is alignment and analysis. A major improvement to SMS has been the real-time implementation of the whole analysis/synthesis process, with a processing latency of less than 30 milliseconds. V. Phonetic recognition/ alignment This part of the system is responsible for recognizing the voice that is being generated from the userthat a similar segment can be chosen from the target information. The huge amount of research is carry out in speech recognition.The recognition systems work well when tested in the well- controlled.The recognition systems work reasonably well when tested in the well-controlled environment of the laboratory. However, phoneme recognition rates decay miserably when the conditions are adverse. A. SMorphing Speech morphing can be achieved by transforming the signal.To prepare the signal for the transformation, it is split into a number of 'frames' - sections of the waveform. The transformation is then applied to each frame of the signal. This provides another way of viewing the signal information. The new representation (said to be in the frequency domain) describes the average energy present at each frequency band. Further analysis enables two pieces of information to be obtained: pitch information and the overall envelope of the sound. A key element in the morphing is the manipulation of the pitch information. If two signals with different pitches were simply cross-faded it is highly likely that two separate sounds will be heard. This occurs because the signal will have two distinct pitches causing the auditory system to perceive two different objects. A successful morph must exhibit a smoothly changing pitch throughout. The pitch information of each sound is compared to provide the best match between the two signals' pitches. To do this match, the signals are stretched and compressed so that important sections of each signal match in time. The interpolation of the two sounds can then be performed which creates the intermediate sounds in the morph. The final stage is then to convert the frames back into a normal waveform. However, after the morphing has been performed, the legacy of the earlier analysis becomes apparent. The conversion of the sound to a representation in which the pitch and spectral envelope can be separated loses some information. Therefore, this information has to be re-estimated for the morphed sound. This process obtains an acoustic waveform, which can then be stored or listened to. Depending on the voice of person (user), a unit from the target is selected. Each frame from the user is morphed with a different frame from the target, advancing sequentially in time. Then the user has the choice to interpolate the different parameters extracted at the analysis stage, such as amplitude, fundamental frequency, spectral shape, residual signal, etc. In general the amplitude will not be interpolated, thus always using the amplitude from the user and the unvoiced phonemes will also not be morphed, thus always using the consonants from the user. This will give the user the feeling of being in control.In most cases the durations of the user and target voice to be morphed will be different. If a given user’s voice is shorter than the one from the target the system will simply skip the remaining part of the target voice and go directly to the articulation portion. In the case when the user voiceisa longer than the one present in the target data the system enters in the loop mode. Each voiced phoneme of the target has a loop point frame, marked in the preprocessing, non-real time stage. The system uses this frame to loop-synthesis in case the user sings beyond that point in the phoneme. Once we reach this frame in the target, the rest of the frames of the user will be interpolated with that same frame until the user ends the phoneme. VI. CONCLUSION
  • 4. KNOWLEDGECUDDLE PUBLICATION International Journal of Research in Advance Engineering (IJRAE) – Volume-1 Issue -1 4 www.knowledgecuddle.com The final purpose of the system is to make joyful voice from the user’s (source) voice. Means using this we developed the many funny application but we need the some voice sample of target voice. For instance the voice of Karan is translated into voice of cat. But Limitation is that we need a clear voice from the source so we can easily recognize or analyzed it. If there is no clear voice of user then sample of that voice will converted into some another sound pieces of target animal. So if we have clear voice then we easily convert into target vice and enjoy the sometime of period. REFERENCES [1]. Cano, P., A. Loscos, J. Bonada, M. de Boer, X. Serra.2000. “Singing Voice Impersonator Application for PC”. Proceedings of the ICMC 2000. [2]. Serra, X. 1997. “Musical Sound Modeling with Sinusoids plus Noise”. G. D. Poli and others (eds.), MusicalSignal Processing, Swets & Zeitlinger Publishers, 1997. [3]. Loscos, A., P. Cano, J. Bonada. 1999. “Low-Delay Singing Voice Alignment to text”. Proceedings ofthe ICMC 1999. [4]. Serra, X. 1997. “Musical Sound Modeling with Sinusoids plus Noise”. G. D. Poli and others (eds.), Musical Signal Processing, Swets & Zeitlinger Publishers, 1997.