SlideShare a Scribd company logo
1 of 24
Voice morphing

              Presented
                  By
          H.Mohammed.Sabir
             09AT1A0461

              Supervised
                  By
             Shreedhar Sir
SEMINAR OUTLINES

What   It is?
Need   of Voice Morphing
Description      the Morphing.
Technical       details of Morphing.
Application       areas.
What is Voice Morphing ??


   Voice morphing is a technique for modifying a (source)
    speaker's speech to sound as if it were spoken by a
    different (target) speaker.


   In Simpler terms it is being able to change the speech of
    one speaker to that of another speaker.


   Technology developed at the Los Alamos National
    Laboratory in New Mexico, USA by George Papcun


   Applications for Voice Morphing range from recreational
    ones to security ones.
What it actually performs ?
   It is a technique to modify a source speaker's
    speech to sound as if it was spoken by a target
    speaker.
   Voice morphing enables speech patterns to be
    cloned
   And an accurate copy of a person's voice can
    be made that can wishes to say, anything in the
    voice of someone else.
Need of voice morphing

   Text To Speech (TTS)
   In public speech systems
   For special effects ( just like video or image morphing is
    done ).
   To diminish Ethnical barriers.
How to Morph Voice ??


   We need to effectively change the pitch from that of a male
    speaker to that of a female speaker. If we reminisce the
    excitation signal has information about the speaker.

   We find the LPC coefficients for the Source and Target Signals
    and using these coefficients we are going to interpolate
    between the two Signals.

   We get the New LPC (linear predictive coding) coefficients
    using the formula

         new lpc coeff = [const*(lpc source) + (1-const)(lpc
    target)]

   0 <= const <= 1

                                                                     …
How to Morph Speech ?? (contd…)



    The pitch of a female speaker will be close to twice that of
     the male speaker. In our example the pitch of the male
     speaker is 141Hz and that of the female speaker is 210Hz.


    So we need to develop some time stretching algorithm so
     that we can implement pitch shifting. We obtain the residue
     of the source signal and stretch it according to the value of
     the const. The const indicates what is the position of morphed
     signal in between the source and target.


    For example if const = 0.2 then the morphed signal will be
     closer in pitch to the source signal and a value of 0.8 for const
     will result in a pitch that is closer to the target signal.
How do we shift the Pitch ??
   We break the residue signal into small windows and introduce fade in
    and fade out for each block. We recombine everything to form the pitch
    shifted signal. Based on the alpha we can time stretch the residue
    according to our requirements.




    How do we Morph finally ??

•   We now have the pitch shifted residue signal and the new
      LPC coefficients. We should resample the pitch shifted
      signal so that it is played at a faster rate. [Remember
      when we pitch shift then the residue will last longer]. If
      we inverse filter the resampled pitch shifted residue then
      we can effect morphing.
Block Diagram
Time Domain Plots of Source and Target featuring the Pitch
Matching and Warping

   DTW(Dynamic Time Warping)


    - Dynamic Time Warping (DTW) is used to
    find the best match between the pitch of
    the two sounds.
Signal Re-Estimation

   Loss during Signal re-estimation


    -Due to signals being transformation into the
    cepstral domain, a magnitude function is
    used. This results in a loss of phase
    information in the representation of the
    data.
Limitations
 
Lots   of normalizing problems.
Some     applications require extensive sound libraries.
Different   languages require different phonetics.
It   is very seldom complete.
Advantages

   Allows speech model to be duplicated and an exact
    copy of a person’s voice.


   Powerful combat zone weapon.
Disadvantages

   Use to pull out the useful information.


   It hides the actual identity of the user.
Conclusion
   The approach we have adopted separates the sounds into two
    forms:

    - Spectral   envelope information
    - Pitch and voicing information.
   Dynamic Time Warping
    - Aligns the sounds with respect to their pitches.
   Signal re-estimation algorithm.
    - Frames are converted back into a time domain
    waveform.
Application Areas
   Fake telephone conversations as evidence in courts of
    law.


   Powerful battlefield weapon.

    - Provide
            fake orders to the enemy's troops,
    appearing to come from their own
    commanders.
Future Scope
   Extending the functionality of tool.
    - Create a powerful and flexible morphing
    tool.

   Increased user interaction.
    - Graphical User Interface could be
    designed and integrated to make the
    package more ‘user-friendly’.
BIBLIOGRAPHY:
• Ye, H. and S. Young (2003). "Perceptually Weighted Linear
  Transformations for Voice Conversion". Eurospeech 2003,
  Geneva. 
• Ye, H. and S. Young (2004). "High Quality Voice Morphing".
  Int Conference Acoustics Speech and Signal Processing,
  Montreal, Canada. 
• High quality Voice Morphing Hui Yeand Steve Young.
• Quality-enhanced Voice Morphing
Thank you!!!
Questions??

More Related Content

What's hot

Voice Morphing
Voice MorphingVoice Morphing
Voice MorphingSayyed Z
 
Linear Predictive Coding
Linear Predictive CodingLinear Predictive Coding
Linear Predictive CodingSrishti Kakade
 
Speech Recognition
Speech Recognition Speech Recognition
Speech Recognition Goa App
 
Speech Recognition by Iqbal
Speech Recognition by IqbalSpeech Recognition by Iqbal
Speech Recognition by IqbalIqbal
 
Speech Recognition System By Matlab
Speech Recognition System By MatlabSpeech Recognition System By Matlab
Speech Recognition System By MatlabAnkit Gujrati
 
SILENT SOUND TECHNOLOGY
SILENT SOUND TECHNOLOGYSILENT SOUND TECHNOLOGY
SILENT SOUND TECHNOLOGYNagma Parween
 
speech processing and recognition basic in data mining
speech processing and recognition basic in  data miningspeech processing and recognition basic in  data mining
speech processing and recognition basic in data miningJimit Rupani
 
Silent Sound Technology
Silent Sound TechnologySilent Sound Technology
Silent Sound TechnologyMoumita132
 
Silent sound technology
Silent sound technologySilent sound technology
Silent sound technologyMaria Dominica
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition TechnologySeminar Links
 
silent sound technology
silent sound technologysilent sound technology
silent sound technologykamesh0007
 
Speaker recognition using MFCC
Speaker recognition using MFCCSpeaker recognition using MFCC
Speaker recognition using MFCCHira Shaukat
 
Gujarati Text-to-Speech Presentation
Gujarati Text-to-Speech PresentationGujarati Text-to-Speech Presentation
Gujarati Text-to-Speech Presentationsamyakbhuta
 
Digital Video Editing
Digital Video EditingDigital Video Editing
Digital Video Editingmroe
 

What's hot (20)

Voice Morphing
Voice MorphingVoice Morphing
Voice Morphing
 
Linear Predictive Coding
Linear Predictive CodingLinear Predictive Coding
Linear Predictive Coding
 
Speech Recognition
Speech Recognition Speech Recognition
Speech Recognition
 
SPEECH CODING
SPEECH CODINGSPEECH CODING
SPEECH CODING
 
Silent sound technology
Silent sound technologySilent sound technology
Silent sound technology
 
Speech Recognition by Iqbal
Speech Recognition by IqbalSpeech Recognition by Iqbal
Speech Recognition by Iqbal
 
Speech Recognition System By Matlab
Speech Recognition System By MatlabSpeech Recognition System By Matlab
Speech Recognition System By Matlab
 
Linear Predictive Coding
Linear Predictive CodingLinear Predictive Coding
Linear Predictive Coding
 
SILENT SOUND TECHNOLOGY
SILENT SOUND TECHNOLOGYSILENT SOUND TECHNOLOGY
SILENT SOUND TECHNOLOGY
 
speech processing and recognition basic in data mining
speech processing and recognition basic in  data miningspeech processing and recognition basic in  data mining
speech processing and recognition basic in data mining
 
Silent Sound Technology
Silent Sound TechnologySilent Sound Technology
Silent Sound Technology
 
Silent sound technology
Silent sound technologySilent sound technology
Silent sound technology
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
silent sound technology
silent sound technologysilent sound technology
silent sound technology
 
Voice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from LaryngectomyVoice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from Laryngectomy
 
Smart note taker
Smart note takerSmart note taker
Smart note taker
 
Speaker recognition using MFCC
Speaker recognition using MFCCSpeaker recognition using MFCC
Speaker recognition using MFCC
 
Speech encoding techniques
Speech encoding techniquesSpeech encoding techniques
Speech encoding techniques
 
Gujarati Text-to-Speech Presentation
Gujarati Text-to-Speech PresentationGujarati Text-to-Speech Presentation
Gujarati Text-to-Speech Presentation
 
Digital Video Editing
Digital Video EditingDigital Video Editing
Digital Video Editing
 

Viewers also liked

Viewers also liked (16)

FINAL REVIEW
FINAL REVIEWFINAL REVIEW
FINAL REVIEW
 
Vlsi technology-dinesh
Vlsi technology-dineshVlsi technology-dinesh
Vlsi technology-dinesh
 
brain chip technology
brain chip technologybrain chip technology
brain chip technology
 
VLSI
VLSI VLSI
VLSI
 
VLSI Training presentation
VLSI Training presentationVLSI Training presentation
VLSI Training presentation
 
Brain chips ppt
Brain chips pptBrain chips ppt
Brain chips ppt
 
Brain chips ppt
Brain chips pptBrain chips ppt
Brain chips ppt
 
Brain chips
Brain chipsBrain chips
Brain chips
 
Vlsi design and fabrication ppt
Vlsi design and fabrication  pptVlsi design and fabrication  ppt
Vlsi design and fabrication ppt
 
Chip morphing
Chip morphingChip morphing
Chip morphing
 
Basics Of VLSI
Basics Of VLSIBasics Of VLSI
Basics Of VLSI
 
Build Features, Not Apps
Build Features, Not AppsBuild Features, Not Apps
Build Features, Not Apps
 
All In One Olathe - revised 10-24-11
All In One Olathe - revised 10-24-11All In One Olathe - revised 10-24-11
All In One Olathe - revised 10-24-11
 
Fractal robotics
Fractal  roboticsFractal  robotics
Fractal robotics
 
March 3 2004 for the ai cie
March 3 2004 for the ai cieMarch 3 2004 for the ai cie
March 3 2004 for the ai cie
 
Airborn internet
Airborn internetAirborn internet
Airborn internet
 

Similar to Voice morphing-101113123852-phpapp01

voice-morphing-101113123852-phpapp011-151211104638.pdf
voice-morphing-101113123852-phpapp011-151211104638.pdfvoice-morphing-101113123852-phpapp011-151211104638.pdf
voice-morphing-101113123852-phpapp011-151211104638.pdfDeepthiDeepu668278
 
44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognitionsunnysyed
 
How speech reorganization works
How speech reorganization worksHow speech reorganization works
How speech reorganization worksMuhammad Taqi
 
Linear predictive coding documentation
Linear predictive coding  documentationLinear predictive coding  documentation
Linear predictive coding documentationchakravarthy Gopi
 
Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderIJTET Journal
 
An Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech featuresAn Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech featuresSivaranjan Goswami
 
High Quality Arabic Concatenative Speech Synthesis
High Quality Arabic Concatenative Speech SynthesisHigh Quality Arabic Concatenative Speech Synthesis
High Quality Arabic Concatenative Speech Synthesissipij
 
Speech and Language Processing
Speech and Language ProcessingSpeech and Language Processing
Speech and Language ProcessingVikalp Mahendra
 
Speech compression using loosy predictive coding (lpc)
Speech compression using loosy predictive coding (lpc)Speech compression using loosy predictive coding (lpc)
Speech compression using loosy predictive coding (lpc)Harshal Ladhe
 
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...IRJET Journal
 

Similar to Voice morphing-101113123852-phpapp01 (20)

An Introduction To Speech Recognition
An Introduction To Speech RecognitionAn Introduction To Speech Recognition
An Introduction To Speech Recognition
 
voice-morphing-101113123852-phpapp011-151211104638.pdf
voice-morphing-101113123852-phpapp011-151211104638.pdfvoice-morphing-101113123852-phpapp011-151211104638.pdf
voice-morphing-101113123852-phpapp011-151211104638.pdf
 
Animal Voice Morphing System
Animal Voice Morphing SystemAnimal Voice Morphing System
Animal Voice Morphing System
 
G010424248
G010424248G010424248
G010424248
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition
 
How speech reorganization works
How speech reorganization worksHow speech reorganization works
How speech reorganization works
 
Linear predictive coding documentation
Linear predictive coding  documentationLinear predictive coding  documentation
Linear predictive coding documentation
 
Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using Vocoder
 
An Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech featuresAn Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech features
 
High Quality Arabic Concatenative Speech Synthesis
High Quality Arabic Concatenative Speech SynthesisHigh Quality Arabic Concatenative Speech Synthesis
High Quality Arabic Concatenative Speech Synthesis
 
voice morphing.pptx
voice morphing.pptxvoice morphing.pptx
voice morphing.pptx
 
Speech and Language Processing
Speech and Language ProcessingSpeech and Language Processing
Speech and Language Processing
 
Speech Recognition System
Speech Recognition SystemSpeech Recognition System
Speech Recognition System
 
Automatic Speech Recognion
Automatic Speech RecognionAutomatic Speech Recognion
Automatic Speech Recognion
 
Speech compression using loosy predictive coding (lpc)
Speech compression using loosy predictive coding (lpc)Speech compression using loosy predictive coding (lpc)
Speech compression using loosy predictive coding (lpc)
 
50120140501002
5012014050100250120140501002
50120140501002
 
B45010811
B45010811B45010811
B45010811
 
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
 
A Case Study on DSP (Speech Processing)
A Case Study on DSP (Speech Processing)A Case Study on DSP (Speech Processing)
A Case Study on DSP (Speech Processing)
 

Recently uploaded

Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...M56BOOKSTORE PRODUCT/SERVICE
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 

Recently uploaded (20)

Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 

Voice morphing-101113123852-phpapp01

  • 1. Voice morphing Presented By H.Mohammed.Sabir 09AT1A0461 Supervised By Shreedhar Sir
  • 2. SEMINAR OUTLINES What It is? Need of Voice Morphing Description the Morphing. Technical details of Morphing. Application areas.
  • 3. What is Voice Morphing ??  Voice morphing is a technique for modifying a (source) speaker's speech to sound as if it were spoken by a different (target) speaker.  In Simpler terms it is being able to change the speech of one speaker to that of another speaker.  Technology developed at the Los Alamos National Laboratory in New Mexico, USA by George Papcun  Applications for Voice Morphing range from recreational ones to security ones.
  • 4. What it actually performs ?  It is a technique to modify a source speaker's speech to sound as if it was spoken by a target speaker.  Voice morphing enables speech patterns to be cloned  And an accurate copy of a person's voice can be made that can wishes to say, anything in the voice of someone else.
  • 5. Need of voice morphing  Text To Speech (TTS)  In public speech systems  For special effects ( just like video or image morphing is done ).  To diminish Ethnical barriers.
  • 6. How to Morph Voice ??  We need to effectively change the pitch from that of a male speaker to that of a female speaker. If we reminisce the excitation signal has information about the speaker.  We find the LPC coefficients for the Source and Target Signals and using these coefficients we are going to interpolate between the two Signals.  We get the New LPC (linear predictive coding) coefficients using the formula new lpc coeff = [const*(lpc source) + (1-const)(lpc target)]  0 <= const <= 1 …
  • 7. How to Morph Speech ?? (contd…)  The pitch of a female speaker will be close to twice that of the male speaker. In our example the pitch of the male speaker is 141Hz and that of the female speaker is 210Hz.  So we need to develop some time stretching algorithm so that we can implement pitch shifting. We obtain the residue of the source signal and stretch it according to the value of the const. The const indicates what is the position of morphed signal in between the source and target.  For example if const = 0.2 then the morphed signal will be closer in pitch to the source signal and a value of 0.8 for const will result in a pitch that is closer to the target signal.
  • 8. How do we shift the Pitch ??  We break the residue signal into small windows and introduce fade in and fade out for each block. We recombine everything to form the pitch shifted signal. Based on the alpha we can time stretch the residue according to our requirements. How do we Morph finally ?? • We now have the pitch shifted residue signal and the new LPC coefficients. We should resample the pitch shifted signal so that it is played at a faster rate. [Remember when we pitch shift then the residue will last longer]. If we inverse filter the resampled pitch shifted residue then we can effect morphing.
  • 10. Time Domain Plots of Source and Target featuring the Pitch
  • 11.
  • 12. Matching and Warping  DTW(Dynamic Time Warping) - Dynamic Time Warping (DTW) is used to find the best match between the pitch of the two sounds.
  • 13.
  • 14.
  • 15. Signal Re-Estimation  Loss during Signal re-estimation -Due to signals being transformation into the cepstral domain, a magnitude function is used. This results in a loss of phase information in the representation of the data.
  • 16. Limitations   Lots of normalizing problems. Some applications require extensive sound libraries. Different languages require different phonetics. It is very seldom complete.
  • 17. Advantages  Allows speech model to be duplicated and an exact copy of a person’s voice.  Powerful combat zone weapon.
  • 18. Disadvantages  Use to pull out the useful information.  It hides the actual identity of the user.
  • 19. Conclusion  The approach we have adopted separates the sounds into two forms: - Spectral envelope information - Pitch and voicing information.  Dynamic Time Warping - Aligns the sounds with respect to their pitches.  Signal re-estimation algorithm. - Frames are converted back into a time domain waveform.
  • 20. Application Areas  Fake telephone conversations as evidence in courts of law.  Powerful battlefield weapon. - Provide fake orders to the enemy's troops, appearing to come from their own commanders.
  • 21. Future Scope  Extending the functionality of tool. - Create a powerful and flexible morphing tool.  Increased user interaction. - Graphical User Interface could be designed and integrated to make the package more ‘user-friendly’.
  • 22. BIBLIOGRAPHY: • Ye, H. and S. Young (2003). "Perceptually Weighted Linear Transformations for Voice Conversion". Eurospeech 2003, Geneva.  • Ye, H. and S. Young (2004). "High Quality Voice Morphing". Int Conference Acoustics Speech and Signal Processing, Montreal, Canada.  • High quality Voice Morphing Hui Yeand Steve Young. • Quality-enhanced Voice Morphing

Editor's Notes

  1. MAHATMA GANDHI MISSION ENGINEERING COLLEGE,NOIDA
  2. MAHATMA GANDHI MISSION ENGINEERING COLLEGE,NOIDA