SlideShare a Scribd company logo
1 of 4
Download to read offline
KNOWLEDGECUDDLE PUBLICATION
International Journal of Research in Advance Engineering (IJRAE) – Volume-1 Issue -1
1
www.knowledgecuddle.com
Animal Voice Morphing System
Siddhapara Milan v.
milansiddhapara@gmial.com
Gujarat Technological University,
Gujarat, India.
Abstract:This paper describe the morphing concept in which we convert the voice of any person into pre -analyzed or pre-
recorded voice of any animals.As the user generate a pre-established voice, his pitch, timbre, vibrato and articulation can
be modified to resemble those of a pre-recorded and pre-analyzed voice of animal. This technique is based on SMS. Thus
using this concept we can develop many funny application and we can used this type of application in mobile device,
personal computer etc. for enjoying the sometime of period.
Keywords: Voice morphing, Animal.
I. Introduction
In general morphing means “conversion of some stages into some another stages”. Similarly in voice morphing us
convert the voice generated from the any source into pre-analyzed or predict target voice in other word voice morphing means
the transition of one speech signal into another. Like image morphing, speech morphing aims to preserve the shared
characteristics of the starting and final signals, while generating a smooth transition between them. In voice morphing
processone speech signal should smoothly change into another, keeping the shared characteristics of the starting and ending
signals but smoothly changing the other properties. The major properties of concern as far as a speech signal is concerned are
its pitch and envelope information. These two reside in a convolved form in a speech signal. Hence some efficient method for
extracting each of these is necessary.The main goal of the developed voice morphing methods is the smooth transformation from
one sound to another, thus, the combination of two sounds to create a new sound with an intermediate timbre. Most of these methods
are based on the interpolation of sounds parameterizations resulting from analysis/synthesis techniques, such as the Short-timeFourier
Transform (STFT), Linear Predictive Coding (LPC) or Sinusoidal Models.
In this paper we focus on specific concept, we convert the voice of any person into predict animal voice to generate
some joyful environment. In this paper user convert the pitch, articulation and also able to control the degree of morphing to
achieve the desire objective. Morphing take place in following stages, first system understand the voice of source (person) and
find same sound Pieces from the target voice, then interpolate the selected attributes and produced the morphed voice In this
paper [1] the implantation is discussed.
KNOWLEDGECUDDLE PUBLICATION
International Journal of Research in Advance Engineering (IJRAE) – Volume-1 Issue -1
2
www.knowledgecuddle.com
II. Voice Morphing System
Fig.1 shows the general block diagram of voice imitator system. The underlying analysis technique is SMS.Also in
system recognition and alignment module was added for synchronizing the user’s voice with the target voice before the
morphing is done.Before we can morph a particular voice we have to supplyinformation about target information(predict animal
voice ) and voice to be morphed. The system requires the phonetic transcription of the lyrics, the melody as MIDI data, and the actual
recording to be used as the target audio data. Thus, a good impersonator of any animal that originally Voice has to be recorded. This
recording has to be analysed with SMS, segmented into “morphing units”, and each unit labelled with the appropriate note and
phonetic information of the song. This preparation stage is done semi-automatically, using a non-real time application developed for
this task. The first module of the running system includes the real-time analysis and the recognition/alignment steps. Each analysis
frame, with the appropriate parameterization, is associated with the phoneme of a specific moment of the voice and thus with a target
frame. The recognition/alignment algorithm is based on traditional speech recognition technology, that is, Hidden Markov Models
(HMM) that were adapted to the singing voice [2]
The source sound are segmented into number of frames and each frame are individual analyzed. The recognitionalignment
algorithm is based on hidden Markov models (HMM) [3] and when the source frame matched with target frame then we
morphed them and get the desired output.
III. Spectral Modeling Synthesis
We use the term SMS to describe our ongoing research on the development of techniques and software applications for the
analysis, transformation and synthesis of musical sounds based on spectral models [2].Information on SMS and public domain
software applications based on it can be found in our Web site (http://www.iua.upf.es/~sms).
IV. Analysis Using SMS
Analysis processes can be categorized as follows.
 Preprocessing or representation conversion: This involves processes like signal acquisition in discrete form and
windowing.
 Cepstral analysis or Pitch and Envelope analysis: This process will extract the pitch and formant information in the
speech signal.
 Morphing which includes Warping and interpolation.
KNOWLEDGECUDDLE PUBLICATION
International Journal of Research in Advance Engineering (IJRAE) – Volume-1 Issue -1
3
www.knowledgecuddle.com
 Signal re-estimation.
The traditional SMS analysis output is a collection of frequency and amplitude values that represent the partials of the
sound (sinusoidal component), and either filter coefficients with a gain value or spectral magnitudes and phases representing
the residual sound (non sinusoidal component) [4]
The major improvement is done SMS model is alignment and analysis. A major improvement to SMS has been the real-time
implementation of the whole analysis/synthesis process, with a processing latency of less than 30 milliseconds.
V. Phonetic recognition/ alignment
This part of the system is responsible for recognizing the voice that is being generated from the userthat a similar segment
can be chosen from the target information.
The huge amount of research is carry out in speech recognition.The recognition systems work well when tested in the well-
controlled.The recognition systems work reasonably well when tested in the well-controlled environment of the laboratory.
However, phoneme recognition rates decay miserably when the conditions are adverse.
A. SMorphing
Speech morphing can be achieved by transforming the signal.To prepare the signal for the transformation, it is split
into a number of 'frames' - sections of the waveform. The transformation is then applied to each frame of the signal. This
provides another way of viewing the signal information. The new representation (said to be in the frequency domain) describes
the average energy present at each frequency band. Further analysis enables two pieces of information to be obtained: pitch
information and the overall envelope of the sound. A key element in the morphing is the manipulation of the pitch information.
If two signals with different pitches were simply cross-faded it is highly likely that two separate sounds will be heard. This
occurs because the signal will have two distinct pitches causing the auditory system to perceive two different objects. A
successful morph must exhibit a smoothly changing pitch throughout. The pitch information of each sound is compared to
provide the best match between the two signals' pitches. To do this match, the signals are stretched and compressed so that
important sections of each signal match in time. The interpolation of the two sounds can then be performed which creates the
intermediate sounds in the morph. The final stage is then to convert the frames back into a normal waveform.
However, after the morphing has been performed, the legacy of the earlier analysis becomes apparent. The conversion of the
sound to a representation in which the pitch and spectral envelope can be separated loses some information. Therefore, this
information has to be re-estimated for the morphed sound. This process obtains an acoustic waveform, which can then be
stored or listened to.
Depending on the voice of person (user), a unit from the target is selected. Each frame from the user is morphed with
a different frame from the target, advancing sequentially in time. Then the user has the choice to interpolate the different
parameters extracted at the analysis stage, such as amplitude, fundamental frequency, spectral shape, residual signal, etc. In
general the amplitude will not be interpolated, thus always using the amplitude from the user and the unvoiced phonemes will
also not be morphed, thus always using the consonants from the user. This will give the user the feeling of being in control.In
most cases the durations of the user and target voice to be morphed will be different. If a given user’s voice is shorter than the
one from the target the system will simply skip the remaining part of the target voice and go directly to the articulation portion.
In the case when the user voiceisa longer than the one present in the target data the system enters in the loop mode. Each
voiced phoneme of the target has a loop point frame, marked in the preprocessing, non-real time stage. The system uses this
frame to loop-synthesis in case the user sings beyond that point in the phoneme. Once we reach this frame in the target, the rest
of the frames of the user will be interpolated with that same frame until the user ends the phoneme.
VI. CONCLUSION
KNOWLEDGECUDDLE PUBLICATION
International Journal of Research in Advance Engineering (IJRAE) – Volume-1 Issue -1
4
www.knowledgecuddle.com
The final purpose of the system is to make joyful voice from the user’s (source) voice. Means using this we developed the
many funny application but we need the some voice sample of target voice. For instance the voice of Karan is translated into
voice of cat. But Limitation is that we need a clear voice from the source so we can easily recognize or analyzed it. If there is
no clear voice of user then sample of that voice will converted into some another sound pieces of target animal. So if we have
clear voice then we easily convert into target vice and enjoy the sometime of period.
REFERENCES
[1]. Cano, P., A. Loscos, J. Bonada, M. de Boer, X. Serra.2000. “Singing Voice Impersonator Application for PC”.
Proceedings of the ICMC 2000.
[2]. Serra, X. 1997. “Musical Sound Modeling with Sinusoids plus Noise”. G. D. Poli and others (eds.), MusicalSignal
Processing, Swets & Zeitlinger Publishers, 1997.
[3]. Loscos, A., P. Cano, J. Bonada. 1999. “Low-Delay Singing Voice Alignment to text”. Proceedings ofthe ICMC 1999.
[4]. Serra, X. 1997. “Musical Sound Modeling with Sinusoids plus Noise”. G. D. Poli and others (eds.), Musical Signal
Processing, Swets & Zeitlinger Publishers, 1997.

More Related Content

What's hot

Environmental Sound detection Using MFCC technique
Environmental Sound detection Using MFCC techniqueEnvironmental Sound detection Using MFCC technique
Environmental Sound detection Using MFCC techniquePankaj Kumar
 
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...ijceronline
 
Human Emotion Recognition From Speech
Human Emotion Recognition From SpeechHuman Emotion Recognition From Speech
Human Emotion Recognition From SpeechIJERA Editor
 
Speaker recognition using MFCC
Speaker recognition using MFCCSpeaker recognition using MFCC
Speaker recognition using MFCCHira Shaukat
 
Wavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker RecognitionWavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker RecognitionCSCJournals
 
Comparison of Feature Extraction MFCC and LPC in Automatic Speech Recognition...
Comparison of Feature Extraction MFCC and LPC in Automatic Speech Recognition...Comparison of Feature Extraction MFCC and LPC in Automatic Speech Recognition...
Comparison of Feature Extraction MFCC and LPC in Automatic Speech Recognition...TELKOMNIKA JOURNAL
 
An Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech featuresAn Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech featuresSivaranjan Goswami
 
Speaker recognition on matlab
Speaker recognition on matlabSpeaker recognition on matlab
Speaker recognition on matlabArcanjo Salazaku
 
IRJET- Pitch Detection Algorithms in Time Domain
IRJET- Pitch Detection Algorithms in Time DomainIRJET- Pitch Detection Algorithms in Time Domain
IRJET- Pitch Detection Algorithms in Time DomainIRJET Journal
 
Isolated words recognition using mfcc, lpc and neural network
Isolated words recognition using mfcc, lpc and neural networkIsolated words recognition using mfcc, lpc and neural network
Isolated words recognition using mfcc, lpc and neural networkeSAT Journals
 
COLEA : A MATLAB Tool for Speech Analysis
COLEA : A MATLAB Tool for Speech AnalysisCOLEA : A MATLAB Tool for Speech Analysis
COLEA : A MATLAB Tool for Speech AnalysisRushin Shah
 
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD Editor
 
Wavelet Based Feature Extraction for the Indonesian CV Syllables Sound
Wavelet Based Feature Extraction for the Indonesian CV Syllables SoundWavelet Based Feature Extraction for the Indonesian CV Syllables Sound
Wavelet Based Feature Extraction for the Indonesian CV Syllables SoundTELKOMNIKA JOURNAL
 
44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognitionsunnysyed
 
Rain Technology
Rain TechnologyRain Technology
Rain Technologyamal390
 
Speech Enhancement Using A Minimum Mean Square Error Short Time Spectral Ampl...
Speech Enhancement Using A Minimum Mean Square Error Short Time Spectral Ampl...Speech Enhancement Using A Minimum Mean Square Error Short Time Spectral Ampl...
Speech Enhancement Using A Minimum Mean Square Error Short Time Spectral Ampl...guestfb80e22
 

What's hot (20)

speech enhancement
speech enhancementspeech enhancement
speech enhancement
 
Environmental Sound detection Using MFCC technique
Environmental Sound detection Using MFCC techniqueEnvironmental Sound detection Using MFCC technique
Environmental Sound detection Using MFCC technique
 
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
 
Human Emotion Recognition From Speech
Human Emotion Recognition From SpeechHuman Emotion Recognition From Speech
Human Emotion Recognition From Speech
 
Speaker recognition using MFCC
Speaker recognition using MFCCSpeaker recognition using MFCC
Speaker recognition using MFCC
 
Speaker recognition.
Speaker recognition.Speaker recognition.
Speaker recognition.
 
Wavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker RecognitionWavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker Recognition
 
Comparison of Feature Extraction MFCC and LPC in Automatic Speech Recognition...
Comparison of Feature Extraction MFCC and LPC in Automatic Speech Recognition...Comparison of Feature Extraction MFCC and LPC in Automatic Speech Recognition...
Comparison of Feature Extraction MFCC and LPC in Automatic Speech Recognition...
 
Speech Signal Analysis
Speech Signal AnalysisSpeech Signal Analysis
Speech Signal Analysis
 
An Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech featuresAn Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech features
 
Speaker recognition on matlab
Speaker recognition on matlabSpeaker recognition on matlab
Speaker recognition on matlab
 
IRJET- Pitch Detection Algorithms in Time Domain
IRJET- Pitch Detection Algorithms in Time DomainIRJET- Pitch Detection Algorithms in Time Domain
IRJET- Pitch Detection Algorithms in Time Domain
 
Isolated words recognition using mfcc, lpc and neural network
Isolated words recognition using mfcc, lpc and neural networkIsolated words recognition using mfcc, lpc and neural network
Isolated words recognition using mfcc, lpc and neural network
 
COLEA : A MATLAB Tool for Speech Analysis
COLEA : A MATLAB Tool for Speech AnalysisCOLEA : A MATLAB Tool for Speech Analysis
COLEA : A MATLAB Tool for Speech Analysis
 
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
 
50120140501002
5012014050100250120140501002
50120140501002
 
Wavelet Based Feature Extraction for the Indonesian CV Syllables Sound
Wavelet Based Feature Extraction for the Indonesian CV Syllables SoundWavelet Based Feature Extraction for the Indonesian CV Syllables Sound
Wavelet Based Feature Extraction for the Indonesian CV Syllables Sound
 
44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition
 
Rain Technology
Rain TechnologyRain Technology
Rain Technology
 
Speech Enhancement Using A Minimum Mean Square Error Short Time Spectral Ampl...
Speech Enhancement Using A Minimum Mean Square Error Short Time Spectral Ampl...Speech Enhancement Using A Minimum Mean Square Error Short Time Spectral Ampl...
Speech Enhancement Using A Minimum Mean Square Error Short Time Spectral Ampl...
 

Viewers also liked

Can You Qualify for Low Income Housing Tax Credits: A Guide for Land Developr...
Can You Qualify for Low Income Housing Tax Credits: A Guide for Land Developr...Can You Qualify for Low Income Housing Tax Credits: A Guide for Land Developr...
Can You Qualify for Low Income Housing Tax Credits: A Guide for Land Developr...Curley & Rothman, LLC
 
LINDA HARRISON RESUME 12B
LINDA HARRISON  RESUME  12BLINDA HARRISON  RESUME  12B
LINDA HARRISON RESUME 12BLinda Harrison
 
Resume_Vikash1056@gmail
Resume_Vikash1056@gmailResume_Vikash1056@gmail
Resume_Vikash1056@gmailvikash kumar
 
Due Diligence Checklist for Commercial Real Estate Developers
Due Diligence Checklist for Commercial Real Estate DevelopersDue Diligence Checklist for Commercial Real Estate Developers
Due Diligence Checklist for Commercial Real Estate DevelopersCurley & Rothman, LLC
 
Efficient String Matching Algorithm for Intrusion Detection
Efficient String Matching Algorithm for Intrusion DetectionEfficient String Matching Algorithm for Intrusion Detection
Efficient String Matching Algorithm for Intrusion Detectioneditor1knowledgecuddle
 
Umfrage Flirten im Ulraub
Umfrage Flirten im UlraubUmfrage Flirten im Ulraub
Umfrage Flirten im UlraubLTUR_Presse
 
What are the right metrics to measure process efficiency
What are the right metrics to measure process efficiencyWhat are the right metrics to measure process efficiency
What are the right metrics to measure process efficiencyAlok Bararia
 
koenig-social-media-advertising-case-study
koenig-social-media-advertising-case-studykoenig-social-media-advertising-case-study
koenig-social-media-advertising-case-studyGarima Mathur
 
Education: Public’s Feedback
Education: Public’s FeedbackEducation: Public’s Feedback
Education: Public’s FeedbackIpsos
 

Viewers also liked (14)

Can You Qualify for Low Income Housing Tax Credits: A Guide for Land Developr...
Can You Qualify for Low Income Housing Tax Credits: A Guide for Land Developr...Can You Qualify for Low Income Housing Tax Credits: A Guide for Land Developr...
Can You Qualify for Low Income Housing Tax Credits: A Guide for Land Developr...
 
Proyecto para Marqueting Digital
Proyecto para Marqueting DigitalProyecto para Marqueting Digital
Proyecto para Marqueting Digital
 
LINDA HARRISON RESUME 12B
LINDA HARRISON  RESUME  12BLINDA HARRISON  RESUME  12B
LINDA HARRISON RESUME 12B
 
Resume_Vikash1056@gmail
Resume_Vikash1056@gmailResume_Vikash1056@gmail
Resume_Vikash1056@gmail
 
Due Diligence Checklist for Commercial Real Estate Developers
Due Diligence Checklist for Commercial Real Estate DevelopersDue Diligence Checklist for Commercial Real Estate Developers
Due Diligence Checklist for Commercial Real Estate Developers
 
St 022 2005
St 022 2005St 022 2005
St 022 2005
 
Cultura china
Cultura chinaCultura china
Cultura china
 
Efficient String Matching Algorithm for Intrusion Detection
Efficient String Matching Algorithm for Intrusion DetectionEfficient String Matching Algorithm for Intrusion Detection
Efficient String Matching Algorithm for Intrusion Detection
 
Umfrage Flirten im Ulraub
Umfrage Flirten im UlraubUmfrage Flirten im Ulraub
Umfrage Flirten im Ulraub
 
研究论文
研究论文研究论文
研究论文
 
What are the right metrics to measure process efficiency
What are the right metrics to measure process efficiencyWhat are the right metrics to measure process efficiency
What are the right metrics to measure process efficiency
 
Catalogo2017
Catalogo2017Catalogo2017
Catalogo2017
 
koenig-social-media-advertising-case-study
koenig-social-media-advertising-case-studykoenig-social-media-advertising-case-study
koenig-social-media-advertising-case-study
 
Education: Public’s Feedback
Education: Public’s FeedbackEducation: Public’s Feedback
Education: Public’s Feedback
 

Similar to Animal Voice Morphing System

Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderIJTET Journal
 
Emotion Recognition Based On Audio Speech
Emotion Recognition Based On Audio SpeechEmotion Recognition Based On Audio Speech
Emotion Recognition Based On Audio SpeechIOSR Journals
 
Audio/Speech Signal Analysis for Depression
Audio/Speech Signal Analysis for DepressionAudio/Speech Signal Analysis for Depression
Audio/Speech Signal Analysis for Depressionijsrd.com
 
Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features  Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features ijsc
 
Survey On Speech Synthesis
Survey On Speech SynthesisSurvey On Speech Synthesis
Survey On Speech SynthesisCSCJournals
 
Effect of Dynamic Time Warping on Alignment of Phrases and Phonemes
Effect of Dynamic Time Warping on Alignment of Phrases and PhonemesEffect of Dynamic Time Warping on Alignment of Phrases and Phonemes
Effect of Dynamic Time Warping on Alignment of Phrases and Phonemeskevig
 
Voice Morphing
Voice MorphingVoice Morphing
Voice MorphingSayyed Z
 
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...CSCJournals
 
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTSEFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTSijnlc
 
Effect of MFCC Based Features for Speech Signal Alignments
Effect of MFCC Based Features for Speech Signal AlignmentsEffect of MFCC Based Features for Speech Signal Alignments
Effect of MFCC Based Features for Speech Signal Alignmentskevig
 
Voice morphing-101113123852-phpapp01
Voice morphing-101113123852-phpapp01Voice morphing-101113123852-phpapp01
Voice morphing-101113123852-phpapp01Rehan Ahmed
 
International journal of signal and image processing issues vol 2015 - no 1...
International journal of signal and image processing issues   vol 2015 - no 1...International journal of signal and image processing issues   vol 2015 - no 1...
International journal of signal and image processing issues vol 2015 - no 1...sophiabelthome
 
Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...IRJET Journal
 
Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...IRJET Journal
 
Silent sound interface
Silent sound interfaceSilent sound interface
Silent sound interfaceJeevitha Reddy
 

Similar to Animal Voice Morphing System (20)

Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using Vocoder
 
Emotion Recognition Based On Audio Speech
Emotion Recognition Based On Audio SpeechEmotion Recognition Based On Audio Speech
Emotion Recognition Based On Audio Speech
 
Audio/Speech Signal Analysis for Depression
Audio/Speech Signal Analysis for DepressionAudio/Speech Signal Analysis for Depression
Audio/Speech Signal Analysis for Depression
 
Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features  Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features
 
Survey On Speech Synthesis
Survey On Speech SynthesisSurvey On Speech Synthesis
Survey On Speech Synthesis
 
An Introduction To Speech Recognition
An Introduction To Speech RecognitionAn Introduction To Speech Recognition
An Introduction To Speech Recognition
 
Effect of Dynamic Time Warping on Alignment of Phrases and Phonemes
Effect of Dynamic Time Warping on Alignment of Phrases and PhonemesEffect of Dynamic Time Warping on Alignment of Phrases and Phonemes
Effect of Dynamic Time Warping on Alignment of Phrases and Phonemes
 
Voice Morphing
Voice MorphingVoice Morphing
Voice Morphing
 
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
 
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTSEFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
 
Effect of MFCC Based Features for Speech Signal Alignments
Effect of MFCC Based Features for Speech Signal AlignmentsEffect of MFCC Based Features for Speech Signal Alignments
Effect of MFCC Based Features for Speech Signal Alignments
 
Kc3517481754
Kc3517481754Kc3517481754
Kc3517481754
 
Voice morphing-101113123852-phpapp01
Voice morphing-101113123852-phpapp01Voice morphing-101113123852-phpapp01
Voice morphing-101113123852-phpapp01
 
Voice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from LaryngectomyVoice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from Laryngectomy
 
International journal of signal and image processing issues vol 2015 - no 1...
International journal of signal and image processing issues   vol 2015 - no 1...International journal of signal and image processing issues   vol 2015 - no 1...
International journal of signal and image processing issues vol 2015 - no 1...
 
De4201715719
De4201715719De4201715719
De4201715719
 
H42045359
H42045359H42045359
H42045359
 
Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...
 
Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...
 
Silent sound interface
Silent sound interfaceSilent sound interface
Silent sound interface
 

Recently uploaded

Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝soniya singh
 
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZTE
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSCAESB
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxwendy cai
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxJoão Esperancinha
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...RajaP95
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
Analog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog ConverterAnalog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog ConverterAbhinavSharma374939
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 

Recently uploaded (20)

Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
 
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentation
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
Analog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog ConverterAnalog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog Converter
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
 
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 

Animal Voice Morphing System

  • 1. KNOWLEDGECUDDLE PUBLICATION International Journal of Research in Advance Engineering (IJRAE) – Volume-1 Issue -1 1 www.knowledgecuddle.com Animal Voice Morphing System Siddhapara Milan v. milansiddhapara@gmial.com Gujarat Technological University, Gujarat, India. Abstract:This paper describe the morphing concept in which we convert the voice of any person into pre -analyzed or pre- recorded voice of any animals.As the user generate a pre-established voice, his pitch, timbre, vibrato and articulation can be modified to resemble those of a pre-recorded and pre-analyzed voice of animal. This technique is based on SMS. Thus using this concept we can develop many funny application and we can used this type of application in mobile device, personal computer etc. for enjoying the sometime of period. Keywords: Voice morphing, Animal. I. Introduction In general morphing means “conversion of some stages into some another stages”. Similarly in voice morphing us convert the voice generated from the any source into pre-analyzed or predict target voice in other word voice morphing means the transition of one speech signal into another. Like image morphing, speech morphing aims to preserve the shared characteristics of the starting and final signals, while generating a smooth transition between them. In voice morphing processone speech signal should smoothly change into another, keeping the shared characteristics of the starting and ending signals but smoothly changing the other properties. The major properties of concern as far as a speech signal is concerned are its pitch and envelope information. These two reside in a convolved form in a speech signal. Hence some efficient method for extracting each of these is necessary.The main goal of the developed voice morphing methods is the smooth transformation from one sound to another, thus, the combination of two sounds to create a new sound with an intermediate timbre. Most of these methods are based on the interpolation of sounds parameterizations resulting from analysis/synthesis techniques, such as the Short-timeFourier Transform (STFT), Linear Predictive Coding (LPC) or Sinusoidal Models. In this paper we focus on specific concept, we convert the voice of any person into predict animal voice to generate some joyful environment. In this paper user convert the pitch, articulation and also able to control the degree of morphing to achieve the desire objective. Morphing take place in following stages, first system understand the voice of source (person) and find same sound Pieces from the target voice, then interpolate the selected attributes and produced the morphed voice In this paper [1] the implantation is discussed.
  • 2. KNOWLEDGECUDDLE PUBLICATION International Journal of Research in Advance Engineering (IJRAE) – Volume-1 Issue -1 2 www.knowledgecuddle.com II. Voice Morphing System Fig.1 shows the general block diagram of voice imitator system. The underlying analysis technique is SMS.Also in system recognition and alignment module was added for synchronizing the user’s voice with the target voice before the morphing is done.Before we can morph a particular voice we have to supplyinformation about target information(predict animal voice ) and voice to be morphed. The system requires the phonetic transcription of the lyrics, the melody as MIDI data, and the actual recording to be used as the target audio data. Thus, a good impersonator of any animal that originally Voice has to be recorded. This recording has to be analysed with SMS, segmented into “morphing units”, and each unit labelled with the appropriate note and phonetic information of the song. This preparation stage is done semi-automatically, using a non-real time application developed for this task. The first module of the running system includes the real-time analysis and the recognition/alignment steps. Each analysis frame, with the appropriate parameterization, is associated with the phoneme of a specific moment of the voice and thus with a target frame. The recognition/alignment algorithm is based on traditional speech recognition technology, that is, Hidden Markov Models (HMM) that were adapted to the singing voice [2] The source sound are segmented into number of frames and each frame are individual analyzed. The recognitionalignment algorithm is based on hidden Markov models (HMM) [3] and when the source frame matched with target frame then we morphed them and get the desired output. III. Spectral Modeling Synthesis We use the term SMS to describe our ongoing research on the development of techniques and software applications for the analysis, transformation and synthesis of musical sounds based on spectral models [2].Information on SMS and public domain software applications based on it can be found in our Web site (http://www.iua.upf.es/~sms). IV. Analysis Using SMS Analysis processes can be categorized as follows.  Preprocessing or representation conversion: This involves processes like signal acquisition in discrete form and windowing.  Cepstral analysis or Pitch and Envelope analysis: This process will extract the pitch and formant information in the speech signal.  Morphing which includes Warping and interpolation.
  • 3. KNOWLEDGECUDDLE PUBLICATION International Journal of Research in Advance Engineering (IJRAE) – Volume-1 Issue -1 3 www.knowledgecuddle.com  Signal re-estimation. The traditional SMS analysis output is a collection of frequency and amplitude values that represent the partials of the sound (sinusoidal component), and either filter coefficients with a gain value or spectral magnitudes and phases representing the residual sound (non sinusoidal component) [4] The major improvement is done SMS model is alignment and analysis. A major improvement to SMS has been the real-time implementation of the whole analysis/synthesis process, with a processing latency of less than 30 milliseconds. V. Phonetic recognition/ alignment This part of the system is responsible for recognizing the voice that is being generated from the userthat a similar segment can be chosen from the target information. The huge amount of research is carry out in speech recognition.The recognition systems work well when tested in the well- controlled.The recognition systems work reasonably well when tested in the well-controlled environment of the laboratory. However, phoneme recognition rates decay miserably when the conditions are adverse. A. SMorphing Speech morphing can be achieved by transforming the signal.To prepare the signal for the transformation, it is split into a number of 'frames' - sections of the waveform. The transformation is then applied to each frame of the signal. This provides another way of viewing the signal information. The new representation (said to be in the frequency domain) describes the average energy present at each frequency band. Further analysis enables two pieces of information to be obtained: pitch information and the overall envelope of the sound. A key element in the morphing is the manipulation of the pitch information. If two signals with different pitches were simply cross-faded it is highly likely that two separate sounds will be heard. This occurs because the signal will have two distinct pitches causing the auditory system to perceive two different objects. A successful morph must exhibit a smoothly changing pitch throughout. The pitch information of each sound is compared to provide the best match between the two signals' pitches. To do this match, the signals are stretched and compressed so that important sections of each signal match in time. The interpolation of the two sounds can then be performed which creates the intermediate sounds in the morph. The final stage is then to convert the frames back into a normal waveform. However, after the morphing has been performed, the legacy of the earlier analysis becomes apparent. The conversion of the sound to a representation in which the pitch and spectral envelope can be separated loses some information. Therefore, this information has to be re-estimated for the morphed sound. This process obtains an acoustic waveform, which can then be stored or listened to. Depending on the voice of person (user), a unit from the target is selected. Each frame from the user is morphed with a different frame from the target, advancing sequentially in time. Then the user has the choice to interpolate the different parameters extracted at the analysis stage, such as amplitude, fundamental frequency, spectral shape, residual signal, etc. In general the amplitude will not be interpolated, thus always using the amplitude from the user and the unvoiced phonemes will also not be morphed, thus always using the consonants from the user. This will give the user the feeling of being in control.In most cases the durations of the user and target voice to be morphed will be different. If a given user’s voice is shorter than the one from the target the system will simply skip the remaining part of the target voice and go directly to the articulation portion. In the case when the user voiceisa longer than the one present in the target data the system enters in the loop mode. Each voiced phoneme of the target has a loop point frame, marked in the preprocessing, non-real time stage. The system uses this frame to loop-synthesis in case the user sings beyond that point in the phoneme. Once we reach this frame in the target, the rest of the frames of the user will be interpolated with that same frame until the user ends the phoneme. VI. CONCLUSION
  • 4. KNOWLEDGECUDDLE PUBLICATION International Journal of Research in Advance Engineering (IJRAE) – Volume-1 Issue -1 4 www.knowledgecuddle.com The final purpose of the system is to make joyful voice from the user’s (source) voice. Means using this we developed the many funny application but we need the some voice sample of target voice. For instance the voice of Karan is translated into voice of cat. But Limitation is that we need a clear voice from the source so we can easily recognize or analyzed it. If there is no clear voice of user then sample of that voice will converted into some another sound pieces of target animal. So if we have clear voice then we easily convert into target vice and enjoy the sometime of period. REFERENCES [1]. Cano, P., A. Loscos, J. Bonada, M. de Boer, X. Serra.2000. “Singing Voice Impersonator Application for PC”. Proceedings of the ICMC 2000. [2]. Serra, X. 1997. “Musical Sound Modeling with Sinusoids plus Noise”. G. D. Poli and others (eds.), MusicalSignal Processing, Swets & Zeitlinger Publishers, 1997. [3]. Loscos, A., P. Cano, J. Bonada. 1999. “Low-Delay Singing Voice Alignment to text”. Proceedings ofthe ICMC 1999. [4]. Serra, X. 1997. “Musical Sound Modeling with Sinusoids plus Noise”. G. D. Poli and others (eds.), Musical Signal Processing, Swets & Zeitlinger Publishers, 1997.