Technical Seminar presentation
                 on

           Voice morphing

                       Presented
                           By
                     R.Naresh Babu


                       Supervised
                           By
                  Mrs.G.Sandhya Madam
1         VLITS
SEMINAR OUTLINE


    • What is it?
    • Need for voice morphing
    • Description of morphing
    • Technical details of morphing
    • Application areas


2
Voice Morphing

    • A transition phenomenon: one voice is
      gradually changed into another.

    • Technology developed at the Los Alamos National
      Laboratory in New Mexico, USA, by George Papcun.

3
What does it actually
           do?
    • It is a technique that modifies a source
      speaker's speech so that it sounds as if it
      were spoken by a target speaker.
    • Voice morphing enables speech
      patterns to be cloned
      – An accurate copy of a person's voice
        can be made, so that anything one wishes
        can be said in the voice of someone else.

4
Need for voice morphing

    • Text-to-speech (TTS) systems
    • Public speech systems
    • Special effects (just as video
      or image morphing is done)
    • Diminishing ethnic barriers


5
Voice Morphing Process

    • Preprocessing or representation
      conversion.
    • Pitch and envelope analysis.
    • Morphing, which includes warping and
      interpolation.
    • Signal re-estimation.
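
The interpolation part of the morphing step can be sketched as below. This is only an illustration under stated assumptions: the frames are taken to be already time-aligned feature vectors of equal length, the blend is linear, and the names `interpolate_frames` and `morph_sequence` are hypothetical, not taken from the presentation.

```python
def interpolate_frames(source, target, alpha):
    """Blend two equal-length feature frames; alpha=0 gives source, alpha=1 target."""
    if len(source) != len(target):
        raise ValueError("frames must have the same length")
    return [(1.0 - alpha) * s + alpha * t for s, t in zip(source, target)]


def morph_sequence(source_frames, target_frames):
    """Move gradually from the source voice to the target voice across the utterance."""
    steps = len(source_frames)
    morphed = []
    for i, (s, t) in enumerate(zip(source_frames, target_frames)):
        # Assumption: the morph factor rises linearly from 0 to 1 over the frames.
        alpha = i / (steps - 1) if steps > 1 else 1.0
        morphed.append(interpolate_frames(s, t, alpha))
    return morphed


# Toy two-value "feature frames" standing in for real envelope or pitch data.
source_frames = [[1.0, 2.0], [1.0, 2.0], [1.0, 2.0]]
target_frames = [[3.0, 6.0], [3.0, 6.0], [3.0, 6.0]]
morphed = morph_sequence(source_frames, target_frames)
```

The first output frame equals the source, the last equals the target, and the middle frame lies halfway between them.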


6
Block Diagram




7
Pre-Processing

    • Involves acquiring the signal in
      discrete (sampled) form and
      windowing it into frames.
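
A minimal sketch of the windowing step, assuming a Hamming window, a frame length of 256 samples, and a hop of 128 samples. None of these values come from the slides, and the synthetic sine wave merely stands in for acquired speech.

```python
import math


def hamming(n):
    """Return an n-point Hamming window."""
    if n == 1:
        return [1.0]
    return [0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]


def frame_signal(samples, frame_len, hop):
    """Split samples into overlapping frames and apply the window to each."""
    window = hamming(frame_len)
    frames = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = samples[start:start + frame_len]
        frames.append([s * w for s, w in zip(frame, window)])
    return frames


# Assumption: a 100 Hz sine sampled at 8 kHz stands in for the acquired speech.
sample_rate = 8000
signal = [math.sin(2 * math.pi * 100 * n / sample_rate) for n in range(800)]
frames = frame_signal(signal, frame_len=256, hop=128)
```

Each frame tapers toward zero at its edges, which reduces spectral leakage when the frame is later transformed.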




8
Pitch And Envelope
              Analysis
    • This process extracts the pitch and
      formant information from the speech
      signal.
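
One common way to estimate pitch is to look for the strongest peak in a frame's autocorrelation. The sketch below uses that method as an assumption; the slides do not say which pitch extractor was used, and the function name and the 60 to 400 Hz search range are hypothetical. It covers pitch only, not formant extraction.

```python
import math


def estimate_pitch(frame, sample_rate, fmin=60.0, fmax=400.0):
    """Estimate the fundamental frequency (Hz) from the autocorrelation peak."""
    min_lag = int(sample_rate / fmax)
    max_lag = min(int(sample_rate / fmin), len(frame) - 1)
    best_lag, best_score = 0, 0.0
    for lag in range(min_lag, max_lag + 1):
        # Unnormalised autocorrelation of the frame at this lag.
        score = sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    # A frame with no positive peak (for example silence) is reported as 0 Hz.
    return sample_rate / best_lag if best_lag else 0.0


# Assumption: a clean 200 Hz tone stands in for a voiced speech frame.
sample_rate = 8000
frame = [math.sin(2 * math.pi * 200 * n / sample_rate) for n in range(400)]
pitch = estimate_pitch(frame, sample_rate)
```

Real speech is noisier than this test tone, so practical systems usually add voicing detection and smoothing across frames.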




9
Conversion




10
Matching and Warping

     • DTW (Dynamic Time Warping)

      - DTW is used to find the best match
      between the pitch of the two sounds.
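
A textbook DTW sketch applied to two pitch contours. The absolute difference used as the local distance, the function name `dtw`, and the sample contours are assumptions for illustration only.

```python
def dtw(a, b):
    """Return (total cost, alignment path) between sequences a and b."""
    n, m = len(a), len(b)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    # Backtrack from the end to recover which frames were matched.
    path, i, j = [], n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        steps = [(cost[i - 1][j - 1], i - 1, j - 1),
                 (cost[i - 1][j], i - 1, j),
                 (cost[i][j - 1], i, j - 1)]
        _, i, j = min(steps)
    path.reverse()
    return cost[n][m], path


# Hypothetical pitch contours in Hz; the target holds its first pitch longer.
source_pitch = [100.0, 110.0, 120.0, 130.0]
target_pitch = [100.0, 100.0, 110.0, 120.0, 130.0]
total, path = dtw(source_pitch, target_pitch)
```

Here the first source frame is matched to the first two target frames, so the contours align with zero total cost even though their lengths differ.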




11
Signal Re-Estimation


     • Loss during signal re-estimation

       - Because the signals are transformed into the
       cepstral domain, a magnitude function is used.
       This results in a loss of phase information in
       the representation of the data.
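
The phase loss can be illustrated with the real cepstrum, defined as the inverse transform of the log magnitude spectrum. In the sketch below, two different frames (one a circular shift of the other) have the same magnitude spectrum and therefore the same cepstrum, so the cepstrum alone cannot tell them apart. The naive DFT, the sample values, and the function names are assumptions chosen to keep the example self-contained.

```python
import cmath
import math


def dft(x):
    """Naive O(n^2) discrete Fourier transform, adequate for short frames."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def real_cepstrum(x, floor=1e-12):
    """Inverse DFT of the log magnitude spectrum; phase is dropped by abs()."""
    log_mag = [math.log(max(abs(c), floor)) for c in dft(x)]
    n = len(x)
    return [sum(log_mag[k] * cmath.exp(2j * math.pi * k * t / n)
                for k in range(n)).real / n
            for t in range(n)]


frame = [0.0, 1.0, 0.5, -0.3, 0.2, 0.0, -0.1, 0.4]
shifted = frame[3:] + frame[:3]  # circular shift: same magnitudes, different phases

cep_a = real_cepstrum(frame)
cep_b = real_cepstrum(shifted)
```

Because `cep_a` and `cep_b` agree to within rounding error, the original waveform cannot be recovered from the cepstrum without estimating the phase again.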

12
Summarized Block
        Diagram




13
Limitations
      
     • Many normalization problems.
     • Some applications require extensive
       sound libraries.
     • Different languages require different
       phonetics.
     • It is very seldom complete.

14
Advantages

     • Allows a speech model to be duplicated
       and an exact copy of a person’s voice to be made.


     • Powerful combat zone weapon.



15
Disadvantages

     • Can be used to extract useful
       information.


     • It hides the actual identity of the
       user.


16
Conclusion
     • The approach we have adopted
       separates the sounds into two forms:
       - Spectral envelope information
       - Pitch and voicing information.
     • Dynamic Time Warping
       - Aligns the sounds with respect to their
       pitches.
     • Signal re-estimation algorithm
       - Frames are converted back into a time-domain
       waveform.
17
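
Converting frames back into a single waveform is often done by overlap-add. The sketch below assumes the frames are already time-domain samples and that the hop size is known. It does not attempt the phase recovery that a full re-estimation algorithm would need, and the name `overlap_add` is hypothetical.

```python
def overlap_add(frames, hop):
    """Sum frames back into one waveform, each offset by `hop` samples."""
    if not frames:
        return []
    frame_len = len(frames[0])
    out = [0.0] * (hop * (len(frames) - 1) + frame_len)
    for index, frame in enumerate(frames):
        start = index * hop
        for k, value in enumerate(frame):
            # Overlapping regions accumulate contributions from adjacent frames.
            out[start + k] += value
    return out


# Toy constant frames make the overlap regions easy to see.
frames = [[1.0, 1.0, 1.0, 1.0], [2.0, 2.0, 2.0, 2.0], [3.0, 3.0, 3.0, 3.0]]
waveform = overlap_add(frames, hop=2)
```

With a hop of 2 and frames of length 4, every pair of neighbouring frames shares two samples, which is where the summed values appear.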
Application Areas
     • Fake telephone conversations as
       evidence in courts of law.


     • Powerful battlefield weapon.
       - Provide fake orders to the enemy's troops,
       appearing to come from their own commanders.




18
Future Scope
     • Extending the functionality of the tool.
       - Create a powerful and flexible morphing
       tool.


     • Increased user interaction.
       - A graphical user interface could be designed
       and integrated to make the package more
       ‘user-friendly’.


19
Thank you!!!

20
Questions??

21

Editor's Notes

  1. MAHATMA GANDHI MISSION ENGINEERING COLLEGE,NOIDA