SlideShare a Scribd company logo
1 of 24
Voice morphing

              Presented
                  By
          H.Mohammed.Sabir
             09AT1A0461

              Supervised
                  By
             Shreedhar Sir
SEMINAR OUTLINES

What   It is?
Need   of Voice Morphing
Description      the Morphing.
Technical       details of Morphing.
Application       areas.
What is Voice Morphing ??


   Voice morphing is a technique for modifying a (source)
    speaker's speech to sound as if it were spoken by a
    different (target) speaker.


   In Simpler terms it is being able to change the speech of
    one speaker to that of another speaker.


   Technology developed at the Los Alamos National
    Laboratory in New Mexico, USA by George Papcun


   Applications for Voice Morphing range from recreational
    ones to security ones.
What it actually performs ?
   It is a technique to modify a source speaker's
    speech to sound as if it was spoken by a target
    speaker.
   Voice morphing enables speech patterns to be
    cloned
   And an accurate copy of a person's voice can
    be made that can wishes to say, anything in the
    voice of someone else.
Need of voice morphing

   Text To Speech (TTS)
   In public speech systems
   For special effects ( just like video or image morphing is
    done ).
   To diminish Ethnical barriers.
How to Morph Voice ??


   We need to effectively change the pitch from that of a male
    speaker to that of a female speaker. If we reminisce the
    excitation signal has information about the speaker.

   We find the LPC coefficients for the Source and Target Signals
    and using these coefficients we are going to interpolate
    between the two Signals.

   We get the New LPC (linear predictive coding) coefficients
    using the formula

         new lpc coeff = [const*(lpc source) + (1-const)(lpc
    target)]

   0 <= const <= 1

                                                                     …
How to Morph Speech ?? (contd…)



    The pitch of a female speaker will be close to twice that of
     the male speaker. In our example the pitch of the male
     speaker is 141Hz and that of the female speaker is 210Hz.


    So we need to develop some time stretching algorithm so
     that we can implement pitch shifting. We obtain the residue
     of the source signal and stretch it according to the value of
     the const. The const indicates what is the position of morphed
     signal in between the source and target.


    For example if const = 0.2 then the morphed signal will be
     closer in pitch to the source signal and a value of 0.8 for const
     will result in a pitch that is closer to the target signal.
How do we shift the Pitch ??
   We break the residue signal into small windows and introduce fade in
    and fade out for each block. We recombine everything to form the pitch
    shifted signal. Based on the alpha we can time stretch the residue
    according to our requirements.




    How do we Morph finally ??

•   We now have the pitch shifted residue signal and the new
      LPC coefficients. We should resample the pitch shifted
      signal so that it is played at a faster rate. [Remember
      when we pitch shift then the residue will last longer]. If
      we inverse filter the resampled pitch shifted residue then
      we can effect morphing.
Block Diagram
Time Domain Plots of Source and Target featuring the Pitch
Matching and Warping

   DTW(Dynamic Time Warping)


    - Dynamic Time Warping (DTW) is used to
    find the best match between the pitch of
    the two sounds.
Signal Re-Estimation

   Loss during Signal re-estimation


    -Due to signals being transformation into the
    cepstral domain, a magnitude function is
    used. This results in a loss of phase
    information in the representation of the
    data.
Limitations
 
Lots   of normalizing problems.
Some     applications require extensive sound libraries.
Different   languages require different phonetics.
It   is very seldom complete.
Advantages

   Allows speech model to be duplicated and an exact
    copy of a person’s voice.


   Powerful combat zone weapon.
Disadvantages

   Use to pull out the useful information.


   It hides the actual identity of the user.
Conclusion
   The approach we have adopted separates the sounds into two
    forms:

    - Spectral   envelope information
    - Pitch and voicing information.
   Dynamic Time Warping
    - Aligns the sounds with respect to their pitches.
   Signal re-estimation algorithm.
    - Frames are converted back into a time domain
    waveform.
Application Areas
   Fake telephone conversations as evidence in courts of
    law.


   Powerful battlefield weapon.

    - Provide
            fake orders to the enemy's troops,
    appearing to come from their own
    commanders.
Future Scope
   Extending the functionality of tool.
    - Create a powerful and flexible morphing
    tool.

   Increased user interaction.
    - Graphical User Interface could be
    designed and integrated to make the
    package more ‘user-friendly’.
BIBLIOGRAPHY:
• Ye, H. and S. Young (2003). "Perceptually Weighted Linear
  Transformations for Voice Conversion". Eurospeech 2003,
  Geneva. 
• Ye, H. and S. Young (2004). "High Quality Voice Morphing".
  Int Conference Acoustics Speech and Signal Processing,
  Montreal, Canada. 
• High quality Voice Morphing Hui Yeand Steve Young.
• Quality-enhanced Voice Morphing
Thank you!!!
Questions??

More Related Content

What's hot

Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition system
Alok Tiwari
 
Voice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from LaryngectomyVoice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from Laryngectomy
International Journal of Science and Research (IJSR)
 
Speech recognition system seminar
Speech recognition system seminarSpeech recognition system seminar
Speech recognition system seminar
Diptimaya Sarangi
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
Hugo Moreno
 
Smart note taker technology
Smart note taker technologySmart note taker technology
Smart note taker technology
Patel Neel
 
Silent Sound Technology
Silent Sound TechnologySilent Sound Technology
Silent Sound Technology
Moumita132
 

What's hot (20)

Voice morphing-
Voice morphing-Voice morphing-
Voice morphing-
 
Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition system
 
Voice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from LaryngectomyVoice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from Laryngectomy
 
Speech Recognition System
Speech Recognition SystemSpeech Recognition System
Speech Recognition System
 
Subband Coding
Subband CodingSubband Coding
Subband Coding
 
Speech Recognition System By Matlab
Speech Recognition System By MatlabSpeech Recognition System By Matlab
Speech Recognition System By Matlab
 
speech processing and recognition basic in data mining
speech processing and recognition basic in  data miningspeech processing and recognition basic in  data mining
speech processing and recognition basic in data mining
 
Speech recognition system seminar
Speech recognition system seminarSpeech recognition system seminar
Speech recognition system seminar
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Silentsound documentation
Silentsound documentationSilentsound documentation
Silentsound documentation
 
Speech recognition An overview
Speech recognition An overviewSpeech recognition An overview
Speech recognition An overview
 
Automatic Speech Recognition
Automatic Speech RecognitionAutomatic Speech Recognition
Automatic Speech Recognition
 
Speech Recognition by Iqbal
Speech Recognition by IqbalSpeech Recognition by Iqbal
Speech Recognition by Iqbal
 
Linear Predictive Coding
Linear Predictive CodingLinear Predictive Coding
Linear Predictive Coding
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Smart note taker technology
Smart note taker technologySmart note taker technology
Smart note taker technology
 
A seminar report on speech recognition technology
A seminar report on speech recognition technologyA seminar report on speech recognition technology
A seminar report on speech recognition technology
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
Silent Sound Technology
Silent Sound TechnologySilent Sound Technology
Silent Sound Technology
 

Viewers also liked

VLSI Training presentation
VLSI Training presentationVLSI Training presentation
VLSI Training presentation
Daola Khungur
 
Vlsi design and fabrication ppt
Vlsi design and fabrication  pptVlsi design and fabrication  ppt
Vlsi design and fabrication ppt
Manjushree Mashal
 
All In One Olathe - revised 10-24-11
All In One Olathe - revised 10-24-11All In One Olathe - revised 10-24-11
All In One Olathe - revised 10-24-11
Jeff Vanderpool
 
March 3 2004 for the ai cie
March 3 2004 for the ai cieMarch 3 2004 for the ai cie
March 3 2004 for the ai cie
Shailesh Dubey
 

Viewers also liked (16)

FINAL REVIEW
FINAL REVIEWFINAL REVIEW
FINAL REVIEW
 
Vlsi technology-dinesh
Vlsi technology-dineshVlsi technology-dinesh
Vlsi technology-dinesh
 
brain chip technology
brain chip technologybrain chip technology
brain chip technology
 
VLSI
VLSI VLSI
VLSI
 
VLSI Training presentation
VLSI Training presentationVLSI Training presentation
VLSI Training presentation
 
Brain chips ppt
Brain chips pptBrain chips ppt
Brain chips ppt
 
Brain chips ppt
Brain chips pptBrain chips ppt
Brain chips ppt
 
Brain chips
Brain chipsBrain chips
Brain chips
 
Vlsi design and fabrication ppt
Vlsi design and fabrication  pptVlsi design and fabrication  ppt
Vlsi design and fabrication ppt
 
Chip morphing
Chip morphingChip morphing
Chip morphing
 
Basics Of VLSI
Basics Of VLSIBasics Of VLSI
Basics Of VLSI
 
Build Features, Not Apps
Build Features, Not AppsBuild Features, Not Apps
Build Features, Not Apps
 
All In One Olathe - revised 10-24-11
All In One Olathe - revised 10-24-11All In One Olathe - revised 10-24-11
All In One Olathe - revised 10-24-11
 
Fractal robotics
Fractal  roboticsFractal  robotics
Fractal robotics
 
March 3 2004 for the ai cie
March 3 2004 for the ai cieMarch 3 2004 for the ai cie
March 3 2004 for the ai cie
 
Airborn internet
Airborn internetAirborn internet
Airborn internet
 

Similar to Voice morphing-101113123852-phpapp01

44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition
sunnysyed
 
How speech reorganization works
How speech reorganization worksHow speech reorganization works
How speech reorganization works
Muhammad Taqi
 
Linear predictive coding documentation
Linear predictive coding  documentationLinear predictive coding  documentation
Linear predictive coding documentation
chakravarthy Gopi
 
Speech compression using loosy predictive coding (lpc)
Speech compression using loosy predictive coding (lpc)Speech compression using loosy predictive coding (lpc)
Speech compression using loosy predictive coding (lpc)
Harshal Ladhe
 

Similar to Voice morphing-101113123852-phpapp01 (20)

An Introduction To Speech Recognition
An Introduction To Speech RecognitionAn Introduction To Speech Recognition
An Introduction To Speech Recognition
 
voice-morphing-101113123852-phpapp011-151211104638.pdf
voice-morphing-101113123852-phpapp011-151211104638.pdfvoice-morphing-101113123852-phpapp011-151211104638.pdf
voice-morphing-101113123852-phpapp011-151211104638.pdf
 
Animal Voice Morphing System
Animal Voice Morphing SystemAnimal Voice Morphing System
Animal Voice Morphing System
 
G010424248
G010424248G010424248
G010424248
 
Linear Predictive Coding
Linear Predictive CodingLinear Predictive Coding
Linear Predictive Coding
 
44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition
 
How speech reorganization works
How speech reorganization worksHow speech reorganization works
How speech reorganization works
 
Linear predictive coding documentation
Linear predictive coding  documentationLinear predictive coding  documentation
Linear predictive coding documentation
 
Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using Vocoder
 
An Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech featuresAn Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech features
 
High Quality Arabic Concatenative Speech Synthesis
High Quality Arabic Concatenative Speech SynthesisHigh Quality Arabic Concatenative Speech Synthesis
High Quality Arabic Concatenative Speech Synthesis
 
voice morphing.pptx
voice morphing.pptxvoice morphing.pptx
voice morphing.pptx
 
Speech and Language Processing
Speech and Language ProcessingSpeech and Language Processing
Speech and Language Processing
 
Automatic Speech Recognion
Automatic Speech RecognionAutomatic Speech Recognion
Automatic Speech Recognion
 
Speech compression using loosy predictive coding (lpc)
Speech compression using loosy predictive coding (lpc)Speech compression using loosy predictive coding (lpc)
Speech compression using loosy predictive coding (lpc)
 
50120140501002
5012014050100250120140501002
50120140501002
 
B45010811
B45010811B45010811
B45010811
 
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
 
A Case Study on DSP (Speech Processing)
A Case Study on DSP (Speech Processing)A Case Study on DSP (Speech Processing)
A Case Study on DSP (Speech Processing)
 
IRJET- Pitch Detection Algorithms in Time Domain
IRJET- Pitch Detection Algorithms in Time DomainIRJET- Pitch Detection Algorithms in Time Domain
IRJET- Pitch Detection Algorithms in Time Domain
 

Recently uploaded

1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 

Recently uploaded (20)

Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 

Voice morphing-101113123852-phpapp01

  • 1. Voice morphing Presented By H.Mohammed.Sabir 09AT1A0461 Supervised By Shreedhar Sir
  • 2. SEMINAR OUTLINES What It is? Need of Voice Morphing Description the Morphing. Technical details of Morphing. Application areas.
  • 3. What is Voice Morphing ??  Voice morphing is a technique for modifying a (source) speaker's speech to sound as if it were spoken by a different (target) speaker.  In Simpler terms it is being able to change the speech of one speaker to that of another speaker.  Technology developed at the Los Alamos National Laboratory in New Mexico, USA by George Papcun  Applications for Voice Morphing range from recreational ones to security ones.
  • 4. What it actually performs ?  It is a technique to modify a source speaker's speech to sound as if it was spoken by a target speaker.  Voice morphing enables speech patterns to be cloned  And an accurate copy of a person's voice can be made that can wishes to say, anything in the voice of someone else.
  • 5. Need of voice morphing  Text To Speech (TTS)  In public speech systems  For special effects ( just like video or image morphing is done ).  To diminish Ethnical barriers.
  • 6. How to Morph Voice ??  We need to effectively change the pitch from that of a male speaker to that of a female speaker. If we reminisce the excitation signal has information about the speaker.  We find the LPC coefficients for the Source and Target Signals and using these coefficients we are going to interpolate between the two Signals.  We get the New LPC (linear predictive coding) coefficients using the formula new lpc coeff = [const*(lpc source) + (1-const)(lpc target)]  0 <= const <= 1 …
  • 7. How to Morph Speech ?? (contd…)  The pitch of a female speaker will be close to twice that of the male speaker. In our example the pitch of the male speaker is 141Hz and that of the female speaker is 210Hz.  So we need to develop some time stretching algorithm so that we can implement pitch shifting. We obtain the residue of the source signal and stretch it according to the value of the const. The const indicates what is the position of morphed signal in between the source and target.  For example if const = 0.2 then the morphed signal will be closer in pitch to the source signal and a value of 0.8 for const will result in a pitch that is closer to the target signal.
  • 8. How do we shift the Pitch ??  We break the residue signal into small windows and introduce fade in and fade out for each block. We recombine everything to form the pitch shifted signal. Based on the alpha we can time stretch the residue according to our requirements. How do we Morph finally ?? • We now have the pitch shifted residue signal and the new LPC coefficients. We should resample the pitch shifted signal so that it is played at a faster rate. [Remember when we pitch shift then the residue will last longer]. If we inverse filter the resampled pitch shifted residue then we can effect morphing.
  • 10. Time Domain Plots of Source and Target featuring the Pitch
  • 11.
  • 12. Matching and Warping  DTW(Dynamic Time Warping) - Dynamic Time Warping (DTW) is used to find the best match between the pitch of the two sounds.
  • 13.
  • 14.
  • 15. Signal Re-Estimation  Loss during Signal re-estimation -Due to signals being transformation into the cepstral domain, a magnitude function is used. This results in a loss of phase information in the representation of the data.
  • 16. Limitations   Lots of normalizing problems. Some applications require extensive sound libraries. Different languages require different phonetics. It is very seldom complete.
  • 17. Advantages  Allows speech model to be duplicated and an exact copy of a person’s voice.  Powerful combat zone weapon.
  • 18. Disadvantages  Use to pull out the useful information.  It hides the actual identity of the user.
  • 19. Conclusion  The approach we have adopted separates the sounds into two forms: - Spectral envelope information - Pitch and voicing information.  Dynamic Time Warping - Aligns the sounds with respect to their pitches.  Signal re-estimation algorithm. - Frames are converted back into a time domain waveform.
  • 20. Application Areas  Fake telephone conversations as evidence in courts of law.  Powerful battlefield weapon. - Provide fake orders to the enemy's troops, appearing to come from their own commanders.
  • 21. Future Scope  Extending the functionality of tool. - Create a powerful and flexible morphing tool.  Increased user interaction. - Graphical User Interface could be designed and integrated to make the package more ‘user-friendly’.
  • 22. BIBLIOGRAPHY: • Ye, H. and S. Young (2003). "Perceptually Weighted Linear Transformations for Voice Conversion". Eurospeech 2003, Geneva.  • Ye, H. and S. Young (2004). "High Quality Voice Morphing". Int Conference Acoustics Speech and Signal Processing, Montreal, Canada.  • High quality Voice Morphing Hui Yeand Steve Young. • Quality-enhanced Voice Morphing

Editor's Notes

  1. MAHATMA GANDHI MISSION ENGINEERING COLLEGE,NOIDA
  2. MAHATMA GANDHI MISSION ENGINEERING COLLEGE,NOIDA