B. P. Mandal College of Engineering, Madhepura
(Dept. of Science & Technology, Govt. of Bihar)
Major Project
Speech Dubbing Software
Presented By:
 Pushkar Kumar
 Kapil Kumar
 Pranav Ravi
 Sudama Manjhi
Project Guide:
Mr. Raj Kumar
Assistant Professor
Dept. of CSE
@copyright by Team Ciphers-BPMCE Madhepura
Contents
• About Project
• What is Dubbing?
• Block diagram of Project
• Project Dubbing Algorithm I & II
• Divide / Conquer Logic of Recognition
• Synchronization of Audio
• Project Flow Chart Diagram & Working Methodology I & II
• Used Language, Tools & Environment Description for Project Implementation
• Project Dashboard Screen
• Future Scope
• Conclusion
About Project
This project is based on machine learning and natural language processing.
We developed GUI-based software that automatically converts an
English-language .mp4 video into a Hindi-language .mp4 video.
[Diagram: .mp4 (English) → Speech Dubbing Software → .mp4 (Hindi)]
What is Dubbing?
Dubbing is a post-production process in which the original-language recording
is replaced with audio in a different language, which is then mixed with the rest
of the media's audio so that it sounds as natural as possible.
[Diagram: Original-language audio → Swapping → Desired-language audio → Mixing → Desired-language audio with file (.mp4/.mkv)]
Block diagram of Project
Project Dubbing Algorithm -I
Step 1: Take the input video file from the desktop into the program for dubbing.
Step 2: Extract the audio from the input video file and store it in the
directory as generatedSpeech.wav.
Step 3: Load generatedSpeech.wav from the directory and divide the audio
into multiple parts.
Step 4: Recognise the text of each audio clip and store it in the
directory as generatedSpeech.txt.
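Step 3 can be sketched as follows. This is a minimal illustration using only Python's standard `wave` module, not the project's actual code: the function name `split_wav`, the `clip_NNN.wav` naming scheme, and the default clip length of 60 seconds are assumptions made for the example.

```python
import wave
from pathlib import Path


def split_wav(src_path, out_dir, chunk_seconds=60):
    """Split a WAV file into consecutive clips of at most chunk_seconds each.

    Hypothetical helper for Step 3; returns the clip paths in playback order.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    clip_paths = []
    with wave.open(str(src_path), "rb") as src:
        params = src.getparams()
        frames_per_clip = src.getframerate() * chunk_seconds
        index = 0
        while True:
            frames = src.readframes(frames_per_clip)
            if not frames:  # end of the source audio
                break
            clip_path = out_dir / f"clip_{index:03d}.wav"
            with wave.open(str(clip_path), "wb") as dst:
                # Same channels, sample width and rate as the source;
                # the frame count in the header is fixed up on close.
                dst.setparams(params)
                dst.writeframes(frames)
            clip_paths.append(clip_path)
            index += 1
    return clip_paths
```

Each returned clip could then be passed to whichever speech recogniser is used in Step 4, and the recognised text appended to generatedSpeech.txt in clip order.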
Project Dubbing Algorithm -II
Step 5: Load generatedSpeech.txt from the directory, translate it from
English to Hindi, and store the result in a variable named translatedHindi.
Step 6: Generate and synchronize the audio from translatedHindi and store
it in the directory as translatedHindiVoice.mp3.
Step 7: Overlay translatedHindiVoice.mp3 on the input video file and store
the result in the directory as convertedOutput.mp4.
Step 8: Remove the intermediate files that are no longer needed:
generatedSpeech.wav, generatedSpeech.txt, translatedHindiVoice.mp3.
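Steps 7 and 8 could look roughly like this. The slides do not name the tool used to combine audio and video, so the use of the ffmpeg command line here is an assumption, as are the helper names `build_mux_command` and `remove_intermediates`. The sketch only builds the command; running it (for example with `subprocess.run`) requires ffmpeg to be installed.

```python
from pathlib import Path

# Intermediate files named in the steps above (Step 8 removes them).
INTERMEDIATE_FILES = (
    "generatedSpeech.wav",
    "generatedSpeech.txt",
    "translatedHindiVoice.mp3",
)


def build_mux_command(video_in, dubbed_audio, video_out):
    """Build an ffmpeg command that keeps the video stream of video_in
    and replaces its audio with dubbed_audio (assumed tool, Step 7)."""
    return [
        "ffmpeg", "-y",
        "-i", str(video_in),       # input 0: original video
        "-i", str(dubbed_audio),   # input 1: generated Hindi audio
        "-map", "0:v:0",           # take video from input 0
        "-map", "1:a:0",           # take audio from input 1
        "-c:v", "copy",            # do not re-encode the video
        str(video_out),
    ]


def remove_intermediates(work_dir, names=INTERMEDIATE_FILES):
    """Delete intermediate files that exist in work_dir; return their names."""
    removed = []
    for name in names:
        path = Path(work_dir) / name
        if path.exists():
            path.unlink()
            removed.append(name)
    return removed
```

Keeping the cleanup list in one constant means the final output, convertedOutput.mp4, is never touched by Step 8.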
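Divide / Conquer Logic of Recognition
[Diagram: the audio file (.wav), total duration 17 min, is divided into 60-second frames (17 frames in total); each frame is passed to speech recognition, and the recognised text is collected in generatedSpeech.txt]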
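The arithmetic behind this slide (a 17-minute recording cut into 60-second frames gives 17 frames) can be written down as a short sketch. The names `chunk_spans` and `transcribe_by_parts` are illustrative, and `recognise` stands in for whichever speech recogniser is actually used; the slides do not specify one.

```python
import math


def chunk_spans(total_seconds, chunk_seconds=60):
    """Divide: return (start, end) second offsets for each frame."""
    count = math.ceil(total_seconds / chunk_seconds)
    return [
        (i * chunk_seconds, min((i + 1) * chunk_seconds, total_seconds))
        for i in range(count)
    ]


def transcribe_by_parts(spans, recognise):
    """Conquer: recognise each frame separately, then join the text
    in the original order. `recognise` is a caller-supplied function
    taking (start, end) and returning the text for that frame."""
    return " ".join(recognise(start, end) for start, end in spans)


# 17 minutes of audio in 60-second frames
spans = chunk_spans(17 * 60)
print(len(spans))  # 17
```

Recognising short frames one at a time keeps each request small, at the cost of possibly cutting a word at a frame boundary.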
Synchronization of Audio
[Diagram: the original audio is 5 min long; the generated audio may be longer (7.30 min) or shorter (3.5 min), and is adjusted to approximately 5 min to match the original]
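One way to express the adjustment shown on this slide is as a playback-speed factor. This is a sketch under assumptions, not the project's method: the slides do not say how the generated audio is stretched or compressed, and the example durations below read "7.30 min" as 7 min 30 s, which is a guess.

```python
def tempo_factor(generated_seconds, original_seconds):
    """Speed factor that makes the generated audio last as long as the
    original: above 1 means speed up, below 1 means slow down."""
    if generated_seconds <= 0 or original_seconds <= 0:
        raise ValueError("durations must be positive")
    return generated_seconds / original_seconds


def stretched_duration(generated_seconds, factor):
    """Duration of the generated audio after playing it at `factor` speed."""
    return generated_seconds / factor


original = 5 * 60                       # 5 min original audio
print(tempo_factor(7.5 * 60, original))  # longer generated audio: 1.5
print(tempo_factor(3.5 * 60, original))  # shorter generated audio: about 0.7
```

The resulting factor could be handed to any time-stretching tool; large factors make speech sound unnatural, which is presumably why the slide targets approximately, rather than exactly, 5 min.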
Project Flow Chart Diagram & Working Methodology-I
Project Flow Chart Diagram & Working Methodology-II
Used Language, Tools & Environment Description for Project Implementation
PyCharm: An integrated development environment (IDE) used in computer
programming, specifically for the Python language. It is developed by the Czech
company JetBrains.
Qt Designer: The Qt tool for designing and building graphical user interfaces
(GUIs) with Qt Widgets.
Python: A powerful general-purpose programming language used in web
development, data science, software prototyping, and so on. Its simple,
easy-to-use syntax makes Python an excellent language for beginners
learning to program.
Project Dashboard Screen
Menu Bar
Tool Bar
Choose Button: selects the input video file
Save Button: saves the output file
Quit Button: exits the software
Future Scope
When we tested the software on a video file, we achieved 80% accuracy. In
future, accuracy above 80% can be reached by improving the synchronization of
the generated audio and the audio recognition.
Future work will be devoted to better adapting machine translation to the style used in
dubbing and to improving the quality of prosodic alignment, by generating more accurate
sentence segmentation and by introducing more flexible synchronization.
In the coming days, we plan to build a device for foreign tourists whose native
language differs from the language spoken at the place they are visiting. The device
will translate that language into the tourist's native language live and
automatically.
Now Moving to the Project Running Part…