SlideShare a Scribd company logo
OCR BASED IMAGE TEXT TO SPEECH CONVERSION
Presentation on
of
Bachelor Of Engineering
in
Computer Science and Engineering(Data Science)
By
N. Niranjan Reddy P. Indraja
M. Bharadwaja T. Raju Reddy
Under the esteemed guidance
of
T.Anusha , Assistant Professor
TKR COLLEGE OF ENGINEERING & TECHNOLOGY(AUTONOMOUS)
Meerpet,near LB Nagar,Hyderabad,Telangana,India
2022-2023
Introduction
 In the recent era, visual text in natural or manmade scenes might carry very important and useful
information. Therefore, the scientists have started to digitize these images, extract and interpret the
data by using specific techniques, and then perform text-to-speech synthesis (TTS).
 Optical character recognition is employed to recognize and extract the words and finally the extracted
text is converted to appropriate speech using text-to-speech synthesizer.
 Text-to-speech (TTS) conversion is the process of converting written text into spoken words using
computer software.
 This technology is used to create synthetic voices that can read text aloud, making it accessible to
individuals who are visually impaired, have reading difficulties, or prefer to listen to information
rather than read it.
 The process involves analyzing the text and applying natural language processing algorithms to
determine pronunciation, intonation, and emphasis.
 The resulting audio output can be customized by adjusting the speed, pitch, and other parameters to
match the user's preferences
ABSTARCT
 In the current world, there is a great increase in the utilization of digital technology and various methods are
available for the people to capture images.
 Such images may contain important textual data that the user may need to edit or store digitally.
 Manual entry of data is time taking and may contain errors.
 There are millions of blind people in the world who are visually impaired.
 Disability to read has a large impact on the life of visually impaired people.
 The Proposed system is cost-efficient and helps the visually impaired person to hear the text.
 The main idea of this project is optical Character recognition which is used to convert text character into the audio
signal.
BASE PAPER
TITLE : OCR Based Image Text to Speech Conversion Using MATLAB
DETAILS : Received , June 14,2018, accepted June 15,2018, date of publication March 10, 2019.
 The main idea of this project is optical Character recognition which is used to convert text character into the audio
signal.
 The text is preprocessed and then used for recognition by segmenting each character.
 Segmentation is followed by extraction of the letter and resizing of the file containing the text.
 This Text file is then converted into the audio signal.
 LINK: OCR Based Image Text to Speech Conversion Using MATLAB | IEEE Conference Publication | IEEE
Xplore
Reference - I
TITLE : Image text to speech conversion in the desired language by translating with Raspberry
Pi
DETAILS : Received December 15, 2016, accepted December 16, 2016, date of publication May 8, 2017, date of
current version may19, 2017
 This paper is based on a prototype which helps user to hear the contents of the text images in the desired language.
 It involves extraction of text from the image and converting the text to translated speech in the user desired language.
 This is done with Raspberry Pi and a camera module by using the concepts of Tesseract OCR [optical character
recognition] engine, Google Speech API [application program interface] which is the Text to speech engine and the
Microsoft translator.
 LINK : Image text to speech conversion in the desired language by translating with Raspberry Pi | IEEE Conference
Publication | IEEE Xplore
Reference - II
TITLE : Image to Text Conversion Using Tesseract
DETAILS : Received 19 Feb 2019, accepted 23 feb 2019, date of publication 03 March 2019, date of
current version 18 March 2019.
 Textual information is available in many resources such as documents, newspapers, faxes, printed information, written
notes, etc.
 Many people simply scan the document to store the data in the computers.
 When a document is scanned with a scanner, it is stored in the form of images.
 But these images are not editable and it is very difficult to find what the user requires as they will have to go through the
whole image, reading each line and word to determine if it is relevant to their need.
 LINK : IRJET-V6I299.pdf
Reference - III
TITLE : Detecting text based image with optical character recognition for English
translation and speech using Android
DETAILS : Received 15 December 2023, accepted 16 December 2023, date of publication 30 April 2016,
date of current version 7 May 2016.
 In this study, an Android application is developed by integrating Tesseract OCR engine, Bing translator and phones'
built-in speech out technology.
 Final deliverable is tested by various type of target end user from a different language background and concluded that
the application benefits many users.
 LINK :Detecting text based image with optical character recognition for English translation and
speech using Android | IEEE Conference Publication | IEEE Xplore
CONCLUSION
 This project will extract text from image or video using tesseract OCR. With tesseract OCR , users can
extract text from images with efficient in-line and character pattern recognition of OCR engine. The
extracted text will be displayed to the user in editable format.
 Further pyttsx3 is used to convert text to audio format with different accents and male and female
voices. Tkinter , a python module is used to create a graphical user interface for the project which is
fast, efficient and easy to use.
OCR.pptx

More Related Content

Similar to OCR.pptx

IRJET- Voice based Billing System
IRJET-  	  Voice based Billing SystemIRJET-  	  Voice based Billing System
IRJET- Voice based Billing System
IRJET Journal
 
IRJET- Voice Assisted Text Reading and Google Home Smart Socket Control Syste...
IRJET- Voice Assisted Text Reading and Google Home Smart Socket Control Syste...IRJET- Voice Assisted Text Reading and Google Home Smart Socket Control Syste...
IRJET- Voice Assisted Text Reading and Google Home Smart Socket Control Syste...
IRJET Journal
 
5.smart multilingual sign boards
5.smart multilingual sign boards5.smart multilingual sign boards
5.smart multilingual sign boards
EditorJST
 
Speech To Speech Translation
Speech To Speech TranslationSpeech To Speech Translation
Speech To Speech Translation
IRJET Journal
 
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET Journal
 
Colorful Modern Group Project Creative Presentation.pdf
Colorful Modern Group Project Creative Presentation.pdfColorful Modern Group Project Creative Presentation.pdf
Colorful Modern Group Project Creative Presentation.pdf
ImmanImman6
 
Text Detection and Recognition with Speech Output for Visually Challenged Per...
Text Detection and Recognition with Speech Output for Visually Challenged Per...Text Detection and Recognition with Speech Output for Visually Challenged Per...
Text Detection and Recognition with Speech Output for Visually Challenged Per...
IJERA Editor
 
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IRJET Journal
 
IRJET - Speech Recognition using Android
IRJET -  	  Speech Recognition using AndroidIRJET -  	  Speech Recognition using Android
IRJET - Speech Recognition using Android
IRJET Journal
 
The Impact of Mobile Native IOS OCR Application
The Impact of Mobile Native IOS OCR ApplicationThe Impact of Mobile Native IOS OCR Application
The Impact of Mobile Native IOS OCR Application
websubmissions
 
IRJET- Survey Paper: Image Reader for Blind Person
IRJET- Survey Paper: Image Reader for Blind PersonIRJET- Survey Paper: Image Reader for Blind Person
IRJET- Survey Paper: Image Reader for Blind Person
IRJET Journal
 
IRJET- Voice to Code Editor using Speech Recognition
IRJET- Voice to Code Editor using Speech RecognitionIRJET- Voice to Code Editor using Speech Recognition
IRJET- Voice to Code Editor using Speech Recognition
IRJET Journal
 
Smart Assistant for Blind Humans using Rashberry PI
Smart Assistant for Blind Humans using Rashberry PISmart Assistant for Blind Humans using Rashberry PI
Smart Assistant for Blind Humans using Rashberry PI
ijtsrd
 
DEVELOPMENT OF TOOL TO PROMOTE WEB ACCESSIBILITY FOR DEAF
DEVELOPMENT OF TOOL TO PROMOTE WEB ACCESSIBILITY FOR DEAFDEVELOPMENT OF TOOL TO PROMOTE WEB ACCESSIBILITY FOR DEAF
DEVELOPMENT OF TOOL TO PROMOTE WEB ACCESSIBILITY FOR DEAF
csandit
 
OPTICAL CHARACTER RECOGNITION IN HEALTHCARE
OPTICAL CHARACTER RECOGNITION IN HEALTHCAREOPTICAL CHARACTER RECOGNITION IN HEALTHCARE
OPTICAL CHARACTER RECOGNITION IN HEALTHCARE
IRJET Journal
 
IRJET-Raspberry Pi based Reader for Blind People
IRJET-Raspberry Pi based Reader for Blind PeopleIRJET-Raspberry Pi based Reader for Blind People
IRJET-Raspberry Pi based Reader for Blind People
IRJET Journal
 
IRJET - Optical Character Recognition and Translation
IRJET -  	  Optical Character Recognition and TranslationIRJET -  	  Optical Character Recognition and Translation
IRJET - Optical Character Recognition and Translation
IRJET Journal
 
IRJET - A Review on Text Recognition for Visually Blind People
IRJET - A Review on Text Recognition for Visually Blind PeopleIRJET - A Review on Text Recognition for Visually Blind People
IRJET - A Review on Text Recognition for Visually Blind People
IRJET Journal
 
Handwritten Text Recognition and Translation with Audio
Handwritten Text Recognition and Translation with AudioHandwritten Text Recognition and Translation with Audio
Handwritten Text Recognition and Translation with Audio
IRJET Journal
 
Mobile camera based text detection and translation
Mobile camera based text detection and translationMobile camera based text detection and translation
Mobile camera based text detection and translation
Vivek Bharadwaj
 

Similar to OCR.pptx (20)

IRJET- Voice based Billing System
IRJET-  	  Voice based Billing SystemIRJET-  	  Voice based Billing System
IRJET- Voice based Billing System
 
IRJET- Voice Assisted Text Reading and Google Home Smart Socket Control Syste...
IRJET- Voice Assisted Text Reading and Google Home Smart Socket Control Syste...IRJET- Voice Assisted Text Reading and Google Home Smart Socket Control Syste...
IRJET- Voice Assisted Text Reading and Google Home Smart Socket Control Syste...
 
5.smart multilingual sign boards
5.smart multilingual sign boards5.smart multilingual sign boards
5.smart multilingual sign boards
 
Speech To Speech Translation
Speech To Speech TranslationSpeech To Speech Translation
Speech To Speech Translation
 
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
 
Colorful Modern Group Project Creative Presentation.pdf
Colorful Modern Group Project Creative Presentation.pdfColorful Modern Group Project Creative Presentation.pdf
Colorful Modern Group Project Creative Presentation.pdf
 
Text Detection and Recognition with Speech Output for Visually Challenged Per...
Text Detection and Recognition with Speech Output for Visually Challenged Per...Text Detection and Recognition with Speech Output for Visually Challenged Per...
Text Detection and Recognition with Speech Output for Visually Challenged Per...
 
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
 
IRJET - Speech Recognition using Android
IRJET -  	  Speech Recognition using AndroidIRJET -  	  Speech Recognition using Android
IRJET - Speech Recognition using Android
 
The Impact of Mobile Native IOS OCR Application
The Impact of Mobile Native IOS OCR ApplicationThe Impact of Mobile Native IOS OCR Application
The Impact of Mobile Native IOS OCR Application
 
IRJET- Survey Paper: Image Reader for Blind Person
IRJET- Survey Paper: Image Reader for Blind PersonIRJET- Survey Paper: Image Reader for Blind Person
IRJET- Survey Paper: Image Reader for Blind Person
 
IRJET- Voice to Code Editor using Speech Recognition
IRJET- Voice to Code Editor using Speech RecognitionIRJET- Voice to Code Editor using Speech Recognition
IRJET- Voice to Code Editor using Speech Recognition
 
Smart Assistant for Blind Humans using Rashberry PI
Smart Assistant for Blind Humans using Rashberry PISmart Assistant for Blind Humans using Rashberry PI
Smart Assistant for Blind Humans using Rashberry PI
 
DEVELOPMENT OF TOOL TO PROMOTE WEB ACCESSIBILITY FOR DEAF
DEVELOPMENT OF TOOL TO PROMOTE WEB ACCESSIBILITY FOR DEAFDEVELOPMENT OF TOOL TO PROMOTE WEB ACCESSIBILITY FOR DEAF
DEVELOPMENT OF TOOL TO PROMOTE WEB ACCESSIBILITY FOR DEAF
 
OPTICAL CHARACTER RECOGNITION IN HEALTHCARE
OPTICAL CHARACTER RECOGNITION IN HEALTHCAREOPTICAL CHARACTER RECOGNITION IN HEALTHCARE
OPTICAL CHARACTER RECOGNITION IN HEALTHCARE
 
IRJET-Raspberry Pi based Reader for Blind People
IRJET-Raspberry Pi based Reader for Blind PeopleIRJET-Raspberry Pi based Reader for Blind People
IRJET-Raspberry Pi based Reader for Blind People
 
IRJET - Optical Character Recognition and Translation
IRJET -  	  Optical Character Recognition and TranslationIRJET -  	  Optical Character Recognition and Translation
IRJET - Optical Character Recognition and Translation
 
IRJET - A Review on Text Recognition for Visually Blind People
IRJET - A Review on Text Recognition for Visually Blind PeopleIRJET - A Review on Text Recognition for Visually Blind People
IRJET - A Review on Text Recognition for Visually Blind People
 
Handwritten Text Recognition and Translation with Audio
Handwritten Text Recognition and Translation with AudioHandwritten Text Recognition and Translation with Audio
Handwritten Text Recognition and Translation with Audio
 
Mobile camera based text detection and translation
Mobile camera based text detection and translationMobile camera based text detection and translation
Mobile camera based text detection and translation
 

Recently uploaded

C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptxC1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
mulvey2
 
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
PECB
 
Film vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movieFilm vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movie
Nicholas Montgomery
 
How to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP ModuleHow to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP Module
Celine George
 
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
GeorgeMilliken2
 
Smart-Money for SMC traders good time and ICT
Smart-Money for SMC traders good time and ICTSmart-Money for SMC traders good time and ICT
Smart-Money for SMC traders good time and ICT
simonomuemu
 
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective UpskillingYour Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Excellence Foundation for South Sudan
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
WaniBasim
 
Digital Artefact 1 - Tiny Home Environmental Design
Digital Artefact 1 - Tiny Home Environmental DesignDigital Artefact 1 - Tiny Home Environmental Design
Digital Artefact 1 - Tiny Home Environmental Design
amberjdewit93
 
The History of Stoke Newington Street Names
The History of Stoke Newington Street NamesThe History of Stoke Newington Street Names
The History of Stoke Newington Street Names
History of Stoke Newington
 
Cognitive Development Adolescence Psychology
Cognitive Development Adolescence PsychologyCognitive Development Adolescence Psychology
Cognitive Development Adolescence Psychology
paigestewart1632
 
Walmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdfWalmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdf
TechSoup
 
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama UniversityNatural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
Akanksha trivedi rama nursing college kanpur.
 
Community pharmacy- Social and preventive pharmacy UNIT 5
Community pharmacy- Social and preventive pharmacy UNIT 5Community pharmacy- Social and preventive pharmacy UNIT 5
Community pharmacy- Social and preventive pharmacy UNIT 5
sayalidalavi006
 
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdfANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
Priyankaranawat4
 
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdfবাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
eBook.com.bd (প্রয়োজনীয় বাংলা বই)
 
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
National Information Standards Organization (NISO)
 
MARY JANE WILSON, A “BOA MÃE” .
MARY JANE WILSON, A “BOA MÃE”           .MARY JANE WILSON, A “BOA MÃE”           .
MARY JANE WILSON, A “BOA MÃE” .
Colégio Santa Teresinha
 
PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.
Dr. Shivangi Singh Parihar
 
How to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRMHow to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRM
Celine George
 

Recently uploaded (20)

C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptxC1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
 
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
 
Film vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movieFilm vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movie
 
How to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP ModuleHow to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP Module
 
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
 
Smart-Money for SMC traders good time and ICT
Smart-Money for SMC traders good time and ICTSmart-Money for SMC traders good time and ICT
Smart-Money for SMC traders good time and ICT
 
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective UpskillingYour Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective Upskilling
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
 
Digital Artefact 1 - Tiny Home Environmental Design
Digital Artefact 1 - Tiny Home Environmental DesignDigital Artefact 1 - Tiny Home Environmental Design
Digital Artefact 1 - Tiny Home Environmental Design
 
The History of Stoke Newington Street Names
The History of Stoke Newington Street NamesThe History of Stoke Newington Street Names
The History of Stoke Newington Street Names
 
Cognitive Development Adolescence Psychology
Cognitive Development Adolescence PsychologyCognitive Development Adolescence Psychology
Cognitive Development Adolescence Psychology
 
Walmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdfWalmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdf
 
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama UniversityNatural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
 
Community pharmacy- Social and preventive pharmacy UNIT 5
Community pharmacy- Social and preventive pharmacy UNIT 5Community pharmacy- Social and preventive pharmacy UNIT 5
Community pharmacy- Social and preventive pharmacy UNIT 5
 
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdfANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
 
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdfবাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
 
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
 
MARY JANE WILSON, A “BOA MÃE” .
MARY JANE WILSON, A “BOA MÃE”           .MARY JANE WILSON, A “BOA MÃE”           .
MARY JANE WILSON, A “BOA MÃE” .
 
PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.
 
How to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRMHow to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRM
 

OCR.pptx

  • 1. OCR BASED IMAGE TEXT TO SPEECH CONVERSION Presentation on of Bachelor Of Engineering in Computer Science and Engineering(Data Science) By N. Niranjan Reddy P. Indraja M. Bharadwaja T. Raju Reddy Under the esteemed guidance of T.Anusha , Assistant Professor TKR COLLEGE OF ENGINEERING & TECHNOLOGY(AUTONOMOUS) Meerpet,near LB Nagar,Hyderabad,Telangana,India 2022-2023
  • 2. Introduction  In the recent era, visual text in natural or manmade scenes might carry very important and useful information. Therefore, the scientists have started to digitize these images, extract and interpret the data by using specific techniques, and then perform text-to-speech synthesis (TTS).  Optical character recognition is employed to recognize and extract the words and finally the extracted text is converted to appropriate speech using text-to-speech synthesizer.  Text-to-speech (TTS) conversion is the process of converting written text into spoken words using computer software.  This technology is used to create synthetic voices that can read text aloud, making it accessible to individuals who are visually impaired, have reading difficulties, or prefer to listen to information rather than read it.  The process involves analyzing the text and applying natural language processing algorithms to determine pronunciation, intonation, and emphasis.  The resulting audio output can be customized by adjusting the speed, pitch, and other parameters to match the user's preferences
  • 3. ABSTARCT  In the current world, there is a great increase in the utilization of digital technology and various methods are available for the people to capture images.  Such images may contain important textual data that the user may need to edit or store digitally.  Manual entry of data is time taking and may contain errors.  There are millions of blind people in the world who are visually impaired.  Disability to read has a large impact on the life of visually impaired people.  The Proposed system is cost-efficient and helps the visually impaired person to hear the text.  The main idea of this project is optical Character recognition which is used to convert text character into the audio signal.
  • 4. BASE PAPER TITLE : OCR Based Image Text to Speech Conversion Using MATLAB DETAILS : Received , June 14,2018, accepted June 15,2018, date of publication March 10, 2019.  The main idea of this project is optical Character recognition which is used to convert text character into the audio signal.  The text is preprocessed and then used for recognition by segmenting each character.  Segmentation is followed by extraction of the letter and resizing of the file containing the text.  This Text file is then converted into the audio signal.  LINK: OCR Based Image Text to Speech Conversion Using MATLAB | IEEE Conference Publication | IEEE Xplore
  • 5. Reference - I TITLE : Image text to speech conversion in the desired language by translating with Raspberry Pi DETAILS : Received December 15, 2016, accepted December 16, 2016, date of publication May 8, 2017, date of current version may19, 2017  This paper is based on a prototype which helps user to hear the contents of the text images in the desired language.  It involves extraction of text from the image and converting the text to translated speech in the user desired language.  This is done with Raspberry Pi and a camera module by using the concepts of Tesseract OCR [optical character recognition] engine, Google Speech API [application program interface] which is the Text to speech engine and the Microsoft translator.  LINK : Image text to speech conversion in the desired language by translating with Raspberry Pi | IEEE Conference Publication | IEEE Xplore
  • 6. Reference - II TITLE : Image to Text Conversion Using Tesseract DETAILS : Received 19 Feb 2019, accepted 23 feb 2019, date of publication 03 March 2019, date of current version 18 March 2019.  Textual information is available in many resources such as documents, newspapers, faxes, printed information, written notes, etc.  Many people simply scan the document to store the data in the computers.  When a document is scanned with a scanner, it is stored in the form of images.  But these images are not editable and it is very difficult to find what the user requires as they will have to go through the whole image, reading each line and word to determine if it is relevant to their need.  LINK : IRJET-V6I299.pdf
  • 7. Reference - III TITLE : Detecting text based image with optical character recognition for English translation and speech using Android DETAILS : Received 15 December 2023, accepted 16 December 2023, date of publication 30 April 2016, date of current version 7 May 2016.  In this study, an Android application is developed by integrating Tesseract OCR engine, Bing translator and phones' built-in speech out technology.  Final deliverable is tested by various type of target end user from a different language background and concluded that the application benefits many users.  LINK :Detecting text based image with optical character recognition for English translation and speech using Android | IEEE Conference Publication | IEEE Xplore
  • 8. CONCLUSION  This project will extract text from image or video using tesseract OCR. With tesseract OCR , users can extract text from images with efficient in-line and character pattern recognition of OCR engine. The extracted text will be displayed to the user in editable format.  Further pyttsx3 is used to convert text to audio format with different accents and male and female voices. Tkinter , a python module is used to create a graphical user interface for the project which is fast, efficient and easy to use.

Editor's Notes

  1. Insert a map of your country.
  2. Insert a picture of one of the geographic features of your country.
  3. Insert a picture illustrating a season in your country.
  4. Insert a picture of an animal and or plant found in your country.
  5. Insert a picture of an animal and or plant found in your country.
  6. Insert a picture of an animal and or plant found in your country.
  7. Insert a picture of one of the geographic features of your country.