SlideShare a Scribd company logo
1 of 9
OCR BASED IMAGE TEXT TO SPEECH CONVERSION
Presentation on
of
Bachelor Of Engineering
in
Computer Science and Engineering(Data Science)
By
N. Niranjan Reddy P. Indraja
M. Bharadwaja T. Raju Reddy
Under the esteemed guidance
of
T.Anusha , Assistant Professor
TKR COLLEGE OF ENGINEERING & TECHNOLOGY(AUTONOMOUS)
Meerpet,near LB Nagar,Hyderabad,Telangana,India
2022-2023
Introduction
๏ƒ˜ In the recent era, visual text in natural or manmade scenes might carry very important and useful
information. Therefore, the scientists have started to digitize these images, extract and interpret the
data by using specific techniques, and then perform text-to-speech synthesis (TTS).
๏ƒ˜ Optical character recognition is employed to recognize and extract the words and finally the extracted
text is converted to appropriate speech using text-to-speech synthesizer.
๏ƒ˜ Text-to-speech (TTS) conversion is the process of converting written text into spoken words using
computer software.
๏ƒ˜ This technology is used to create synthetic voices that can read text aloud, making it accessible to
individuals who are visually impaired, have reading difficulties, or prefer to listen to information
rather than read it.
๏ƒ˜ The process involves analyzing the text and applying natural language processing algorithms to
determine pronunciation, intonation, and emphasis.
๏ƒ˜ The resulting audio output can be customized by adjusting the speed, pitch, and other parameters to
match the user's preferences
ABSTARCT
๏ƒ˜ In the current world, there is a great increase in the utilization of digital technology and various methods are
available for the people to capture images.
๏ƒ˜ Such images may contain important textual data that the user may need to edit or store digitally.
๏ƒ˜ Manual entry of data is time taking and may contain errors.
๏ƒ˜ There are millions of blind people in the world who are visually impaired.
๏ƒ˜ Disability to read has a large impact on the life of visually impaired people.
๏ƒ˜ The Proposed system is cost-efficient and helps the visually impaired person to hear the text.
๏ƒ˜ The main idea of this project is optical Character recognition which is used to convert text character into the audio
signal.
BASE PAPER
TITLE : OCR Based Image Text to Speech Conversion Using MATLAB
DETAILS : Received , June 14,2018, accepted June 15,2018, date of publication March 10, 2019.
๏ƒ˜ The main idea of this project is optical Character recognition which is used to convert text character into the audio
signal.
๏ƒ˜ The text is preprocessed and then used for recognition by segmenting each character.
๏ƒ˜ Segmentation is followed by extraction of the letter and resizing of the file containing the text.
๏ƒ˜ This Text file is then converted into the audio signal.
๏ƒ˜ LINK: OCR Based Image Text to Speech Conversion Using MATLAB | IEEE Conference Publication | IEEE
Xplore
Reference - I
TITLE : Image text to speech conversion in the desired language by translating with Raspberry
Pi
DETAILS : Received December 15, 2016, accepted December 16, 2016, date of publication May 8, 2017, date of
current version may19, 2017
๏ƒ˜ This paper is based on a prototype which helps user to hear the contents of the text images in the desired language.
๏ƒ˜ It involves extraction of text from the image and converting the text to translated speech in the user desired language.
๏ƒ˜ This is done with Raspberry Pi and a camera module by using the concepts of Tesseract OCR [optical character
recognition] engine, Google Speech API [application program interface] which is the Text to speech engine and the
Microsoft translator.
๏ƒ˜ LINK : Image text to speech conversion in the desired language by translating with Raspberry Pi | IEEE Conference
Publication | IEEE Xplore
Reference - II
TITLE : Image to Text Conversion Using Tesseract
DETAILS : Received 19 Feb 2019, accepted 23 feb 2019, date of publication 03 March 2019, date of
current version 18 March 2019.
๏ƒ˜ Textual information is available in many resources such as documents, newspapers, faxes, printed information, written
notes, etc.
๏ƒ˜ Many people simply scan the document to store the data in the computers.
๏ƒ˜ When a document is scanned with a scanner, it is stored in the form of images.
๏ƒ˜ But these images are not editable and it is very difficult to find what the user requires as they will have to go through the
whole image, reading each line and word to determine if it is relevant to their need.
๏ƒ˜ LINK : IRJET-V6I299.pdf
Reference - III
TITLE : Detecting text based image with optical character recognition for English
translation and speech using Android
DETAILS : Received 15 December 2023, accepted 16 December 2023, date of publication 30 April 2016,
date of current version 7 May 2016.
๏ƒ˜ In this study, an Android application is developed by integrating Tesseract OCR engine, Bing translator and phones'
built-in speech out technology.
๏ƒ˜ Final deliverable is tested by various type of target end user from a different language background and concluded that
the application benefits many users.
๏ƒ˜ LINK :Detecting text based image with optical character recognition for English translation and
speech using Android | IEEE Conference Publication | IEEE Xplore
CONCLUSION
๏ƒ˜ This project will extract text from image or video using tesseract OCR. With tesseract OCR , users can
extract text from images with efficient in-line and character pattern recognition of OCR engine. The
extracted text will be displayed to the user in editable format.
๏ƒ˜ Further pyttsx3 is used to convert text to audio format with different accents and male and female
voices. Tkinter , a python module is used to create a graphical user interface for the project which is
fast, efficient and easy to use.
OCR.pptx

More Related Content

Similar to OCR.pptx

IRJET- Voice based Billing System
IRJET-  	  Voice based Billing SystemIRJET-  	  Voice based Billing System
IRJET- Voice based Billing SystemIRJET Journal
ย 
IRJET- Voice Assisted Text Reading and Google Home Smart Socket Control Syste...
IRJET- Voice Assisted Text Reading and Google Home Smart Socket Control Syste...IRJET- Voice Assisted Text Reading and Google Home Smart Socket Control Syste...
IRJET- Voice Assisted Text Reading and Google Home Smart Socket Control Syste...IRJET Journal
ย 
5.smart multilingual sign boards
5.smart multilingual sign boards5.smart multilingual sign boards
5.smart multilingual sign boardsEditorJST
ย 
Speech To Speech Translation
Speech To Speech TranslationSpeech To Speech Translation
Speech To Speech TranslationIRJET Journal
ย 
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...IRJET Journal
ย 
Colorful Modern Group Project Creative Presentation.pdf
Colorful Modern Group Project Creative Presentation.pdfColorful Modern Group Project Creative Presentation.pdf
Colorful Modern Group Project Creative Presentation.pdfImmanImman6
ย 
Text Detection and Recognition with Speech Output for Visually Challenged Per...
Text Detection and Recognition with Speech Output for Visually Challenged Per...Text Detection and Recognition with Speech Output for Visually Challenged Per...
Text Detection and Recognition with Speech Output for Visually Challenged Per...IJERA Editor
ย 
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)IRJET Journal
ย 
IRJET - Speech Recognition using Android
IRJET -  	  Speech Recognition using AndroidIRJET -  	  Speech Recognition using Android
IRJET - Speech Recognition using AndroidIRJET Journal
ย 
The Impact of Mobile Native IOS OCR Application
The Impact of Mobile Native IOS OCR ApplicationThe Impact of Mobile Native IOS OCR Application
The Impact of Mobile Native IOS OCR Applicationwebsubmissions
ย 
IRJET- Survey Paper: Image Reader for Blind Person
IRJET- Survey Paper: Image Reader for Blind PersonIRJET- Survey Paper: Image Reader for Blind Person
IRJET- Survey Paper: Image Reader for Blind PersonIRJET Journal
ย 
IRJET- Voice to Code Editor using Speech Recognition
IRJET- Voice to Code Editor using Speech RecognitionIRJET- Voice to Code Editor using Speech Recognition
IRJET- Voice to Code Editor using Speech RecognitionIRJET Journal
ย 
Smart Assistant for Blind Humans using Rashberry PI
Smart Assistant for Blind Humans using Rashberry PISmart Assistant for Blind Humans using Rashberry PI
Smart Assistant for Blind Humans using Rashberry PIijtsrd
ย 
DEVELOPMENT OF TOOL TO PROMOTE WEB ACCESSIBILITY FOR DEAF
DEVELOPMENT OF TOOL TO PROMOTE WEB ACCESSIBILITY FOR DEAFDEVELOPMENT OF TOOL TO PROMOTE WEB ACCESSIBILITY FOR DEAF
DEVELOPMENT OF TOOL TO PROMOTE WEB ACCESSIBILITY FOR DEAFcsandit
ย 
OPTICAL CHARACTER RECOGNITION IN HEALTHCARE
OPTICAL CHARACTER RECOGNITION IN HEALTHCAREOPTICAL CHARACTER RECOGNITION IN HEALTHCARE
OPTICAL CHARACTER RECOGNITION IN HEALTHCAREIRJET Journal
ย 
IRJET-Raspberry Pi based Reader for Blind People
IRJET-Raspberry Pi based Reader for Blind PeopleIRJET-Raspberry Pi based Reader for Blind People
IRJET-Raspberry Pi based Reader for Blind PeopleIRJET Journal
ย 
IRJET - Optical Character Recognition and Translation
IRJET -  	  Optical Character Recognition and TranslationIRJET -  	  Optical Character Recognition and Translation
IRJET - Optical Character Recognition and TranslationIRJET Journal
ย 
IRJET - A Review on Text Recognition for Visually Blind People
IRJET - A Review on Text Recognition for Visually Blind PeopleIRJET - A Review on Text Recognition for Visually Blind People
IRJET - A Review on Text Recognition for Visually Blind PeopleIRJET Journal
ย 
Handwritten Text Recognition and Translation with Audio
Handwritten Text Recognition and Translation with AudioHandwritten Text Recognition and Translation with Audio
Handwritten Text Recognition and Translation with AudioIRJET Journal
ย 
Mobile camera based text detection and translation
Mobile camera based text detection and translationMobile camera based text detection and translation
Mobile camera based text detection and translationVivek Bharadwaj
ย 

Similar to OCR.pptx (20)

IRJET- Voice based Billing System
IRJET-  	  Voice based Billing SystemIRJET-  	  Voice based Billing System
IRJET- Voice based Billing System
ย 
IRJET- Voice Assisted Text Reading and Google Home Smart Socket Control Syste...
IRJET- Voice Assisted Text Reading and Google Home Smart Socket Control Syste...IRJET- Voice Assisted Text Reading and Google Home Smart Socket Control Syste...
IRJET- Voice Assisted Text Reading and Google Home Smart Socket Control Syste...
ย 
5.smart multilingual sign boards
5.smart multilingual sign boards5.smart multilingual sign boards
5.smart multilingual sign boards
ย 
Speech To Speech Translation
Speech To Speech TranslationSpeech To Speech Translation
Speech To Speech Translation
ย 
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
ย 
Colorful Modern Group Project Creative Presentation.pdf
Colorful Modern Group Project Creative Presentation.pdfColorful Modern Group Project Creative Presentation.pdf
Colorful Modern Group Project Creative Presentation.pdf
ย 
Text Detection and Recognition with Speech Output for Visually Challenged Per...
Text Detection and Recognition with Speech Output for Visually Challenged Per...Text Detection and Recognition with Speech Output for Visually Challenged Per...
Text Detection and Recognition with Speech Output for Visually Challenged Per...
ย 
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
ย 
IRJET - Speech Recognition using Android
IRJET -  	  Speech Recognition using AndroidIRJET -  	  Speech Recognition using Android
IRJET - Speech Recognition using Android
ย 
The Impact of Mobile Native IOS OCR Application
The Impact of Mobile Native IOS OCR ApplicationThe Impact of Mobile Native IOS OCR Application
The Impact of Mobile Native IOS OCR Application
ย 
IRJET- Survey Paper: Image Reader for Blind Person
IRJET- Survey Paper: Image Reader for Blind PersonIRJET- Survey Paper: Image Reader for Blind Person
IRJET- Survey Paper: Image Reader for Blind Person
ย 
IRJET- Voice to Code Editor using Speech Recognition
IRJET- Voice to Code Editor using Speech RecognitionIRJET- Voice to Code Editor using Speech Recognition
IRJET- Voice to Code Editor using Speech Recognition
ย 
Smart Assistant for Blind Humans using Rashberry PI
Smart Assistant for Blind Humans using Rashberry PISmart Assistant for Blind Humans using Rashberry PI
Smart Assistant for Blind Humans using Rashberry PI
ย 
DEVELOPMENT OF TOOL TO PROMOTE WEB ACCESSIBILITY FOR DEAF
DEVELOPMENT OF TOOL TO PROMOTE WEB ACCESSIBILITY FOR DEAFDEVELOPMENT OF TOOL TO PROMOTE WEB ACCESSIBILITY FOR DEAF
DEVELOPMENT OF TOOL TO PROMOTE WEB ACCESSIBILITY FOR DEAF
ย 
OPTICAL CHARACTER RECOGNITION IN HEALTHCARE
OPTICAL CHARACTER RECOGNITION IN HEALTHCAREOPTICAL CHARACTER RECOGNITION IN HEALTHCARE
OPTICAL CHARACTER RECOGNITION IN HEALTHCARE
ย 
IRJET-Raspberry Pi based Reader for Blind People
IRJET-Raspberry Pi based Reader for Blind PeopleIRJET-Raspberry Pi based Reader for Blind People
IRJET-Raspberry Pi based Reader for Blind People
ย 
IRJET - Optical Character Recognition and Translation
IRJET -  	  Optical Character Recognition and TranslationIRJET -  	  Optical Character Recognition and Translation
IRJET - Optical Character Recognition and Translation
ย 
IRJET - A Review on Text Recognition for Visually Blind People
IRJET - A Review on Text Recognition for Visually Blind PeopleIRJET - A Review on Text Recognition for Visually Blind People
IRJET - A Review on Text Recognition for Visually Blind People
ย 
Handwritten Text Recognition and Translation with Audio
Handwritten Text Recognition and Translation with AudioHandwritten Text Recognition and Translation with Audio
Handwritten Text Recognition and Translation with Audio
ย 
Mobile camera based text detection and translation
Mobile camera based text detection and translationMobile camera based text detection and translation
Mobile camera based text detection and translation
ย 

Recently uploaded

Play hard learn harder: The Serious Business of Play
Play hard learn harder:  The Serious Business of PlayPlay hard learn harder:  The Serious Business of Play
Play hard learn harder: The Serious Business of PlayPooky Knightsmith
ย 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Pooja Bhuva
ย 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024Elizabeth Walsh
ย 
dusjagr & nano talk on open tools for agriculture research and learning
dusjagr & nano talk on open tools for agriculture research and learningdusjagr & nano talk on open tools for agriculture research and learning
dusjagr & nano talk on open tools for agriculture research and learningMarc Dusseiller Dusjagr
ย 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxPooja Bhuva
ย 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jisc
ย 
OSCM Unit 2_Operations Processes & Systems
OSCM Unit 2_Operations Processes & SystemsOSCM Unit 2_Operations Processes & Systems
OSCM Unit 2_Operations Processes & SystemsSandeep D Chaudhary
ย 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfDr Vijay Vishwakarma
ย 
Tatlong Kwento ni Lola basyang-1.pdf arts
Tatlong Kwento ni Lola basyang-1.pdf artsTatlong Kwento ni Lola basyang-1.pdf arts
Tatlong Kwento ni Lola basyang-1.pdf artsNbelano25
ย 
Simple, Complex, and Compound Sentences Exercises.pdf
Simple, Complex, and Compound Sentences Exercises.pdfSimple, Complex, and Compound Sentences Exercises.pdf
Simple, Complex, and Compound Sentences Exercises.pdfstareducators107
ย 
Model Attribute _rec_name in the Odoo 17
Model Attribute _rec_name in the Odoo 17Model Attribute _rec_name in the Odoo 17
Model Attribute _rec_name in the Odoo 17Celine George
ย 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxannathomasp01
ย 
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdfUGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdfNirmal Dwivedi
ย 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Pooja Bhuva
ย 
21st_Century_Skills_Framework_Final_Presentation_2.pptx
21st_Century_Skills_Framework_Final_Presentation_2.pptx21st_Century_Skills_Framework_Final_Presentation_2.pptx
21st_Century_Skills_Framework_Final_Presentation_2.pptxJoelynRubio1
ย 
How to Add a Tool Tip to a Field in Odoo 17
How to Add a Tool Tip to a Field in Odoo 17How to Add a Tool Tip to a Field in Odoo 17
How to Add a Tool Tip to a Field in Odoo 17Celine George
ย 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxmarlenawright1
ย 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and ModificationsMJDuyan
ย 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
ย 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxPooja Bhuva
ย 

Recently uploaded (20)

Play hard learn harder: The Serious Business of Play
Play hard learn harder:  The Serious Business of PlayPlay hard learn harder:  The Serious Business of Play
Play hard learn harder: The Serious Business of Play
ย 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
ย 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
ย 
dusjagr & nano talk on open tools for agriculture research and learning
dusjagr & nano talk on open tools for agriculture research and learningdusjagr & nano talk on open tools for agriculture research and learning
dusjagr & nano talk on open tools for agriculture research and learning
ย 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
ย 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
ย 
OSCM Unit 2_Operations Processes & Systems
OSCM Unit 2_Operations Processes & SystemsOSCM Unit 2_Operations Processes & Systems
OSCM Unit 2_Operations Processes & Systems
ย 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
ย 
Tatlong Kwento ni Lola basyang-1.pdf arts
Tatlong Kwento ni Lola basyang-1.pdf artsTatlong Kwento ni Lola basyang-1.pdf arts
Tatlong Kwento ni Lola basyang-1.pdf arts
ย 
Simple, Complex, and Compound Sentences Exercises.pdf
Simple, Complex, and Compound Sentences Exercises.pdfSimple, Complex, and Compound Sentences Exercises.pdf
Simple, Complex, and Compound Sentences Exercises.pdf
ย 
Model Attribute _rec_name in the Odoo 17
Model Attribute _rec_name in the Odoo 17Model Attribute _rec_name in the Odoo 17
Model Attribute _rec_name in the Odoo 17
ย 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
ย 
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdfUGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
ย 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
ย 
21st_Century_Skills_Framework_Final_Presentation_2.pptx
21st_Century_Skills_Framework_Final_Presentation_2.pptx21st_Century_Skills_Framework_Final_Presentation_2.pptx
21st_Century_Skills_Framework_Final_Presentation_2.pptx
ย 
How to Add a Tool Tip to a Field in Odoo 17
How to Add a Tool Tip to a Field in Odoo 17How to Add a Tool Tip to a Field in Odoo 17
How to Add a Tool Tip to a Field in Odoo 17
ย 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
ย 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
ย 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
ย 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
ย 

OCR.pptx

  • 1. OCR BASED IMAGE TEXT TO SPEECH CONVERSION Presentation on of Bachelor Of Engineering in Computer Science and Engineering(Data Science) By N. Niranjan Reddy P. Indraja M. Bharadwaja T. Raju Reddy Under the esteemed guidance of T.Anusha , Assistant Professor TKR COLLEGE OF ENGINEERING & TECHNOLOGY(AUTONOMOUS) Meerpet,near LB Nagar,Hyderabad,Telangana,India 2022-2023
  • 2. Introduction ๏ƒ˜ In the recent era, visual text in natural or manmade scenes might carry very important and useful information. Therefore, the scientists have started to digitize these images, extract and interpret the data by using specific techniques, and then perform text-to-speech synthesis (TTS). ๏ƒ˜ Optical character recognition is employed to recognize and extract the words and finally the extracted text is converted to appropriate speech using text-to-speech synthesizer. ๏ƒ˜ Text-to-speech (TTS) conversion is the process of converting written text into spoken words using computer software. ๏ƒ˜ This technology is used to create synthetic voices that can read text aloud, making it accessible to individuals who are visually impaired, have reading difficulties, or prefer to listen to information rather than read it. ๏ƒ˜ The process involves analyzing the text and applying natural language processing algorithms to determine pronunciation, intonation, and emphasis. ๏ƒ˜ The resulting audio output can be customized by adjusting the speed, pitch, and other parameters to match the user's preferences
  • 3. ABSTARCT ๏ƒ˜ In the current world, there is a great increase in the utilization of digital technology and various methods are available for the people to capture images. ๏ƒ˜ Such images may contain important textual data that the user may need to edit or store digitally. ๏ƒ˜ Manual entry of data is time taking and may contain errors. ๏ƒ˜ There are millions of blind people in the world who are visually impaired. ๏ƒ˜ Disability to read has a large impact on the life of visually impaired people. ๏ƒ˜ The Proposed system is cost-efficient and helps the visually impaired person to hear the text. ๏ƒ˜ The main idea of this project is optical Character recognition which is used to convert text character into the audio signal.
  • 4. BASE PAPER TITLE : OCR Based Image Text to Speech Conversion Using MATLAB DETAILS : Received , June 14,2018, accepted June 15,2018, date of publication March 10, 2019. ๏ƒ˜ The main idea of this project is optical Character recognition which is used to convert text character into the audio signal. ๏ƒ˜ The text is preprocessed and then used for recognition by segmenting each character. ๏ƒ˜ Segmentation is followed by extraction of the letter and resizing of the file containing the text. ๏ƒ˜ This Text file is then converted into the audio signal. ๏ƒ˜ LINK: OCR Based Image Text to Speech Conversion Using MATLAB | IEEE Conference Publication | IEEE Xplore
  • 5. Reference - I TITLE : Image text to speech conversion in the desired language by translating with Raspberry Pi DETAILS : Received December 15, 2016, accepted December 16, 2016, date of publication May 8, 2017, date of current version may19, 2017 ๏ƒ˜ This paper is based on a prototype which helps user to hear the contents of the text images in the desired language. ๏ƒ˜ It involves extraction of text from the image and converting the text to translated speech in the user desired language. ๏ƒ˜ This is done with Raspberry Pi and a camera module by using the concepts of Tesseract OCR [optical character recognition] engine, Google Speech API [application program interface] which is the Text to speech engine and the Microsoft translator. ๏ƒ˜ LINK : Image text to speech conversion in the desired language by translating with Raspberry Pi | IEEE Conference Publication | IEEE Xplore
  • 6. Reference - II TITLE : Image to Text Conversion Using Tesseract DETAILS : Received 19 Feb 2019, accepted 23 feb 2019, date of publication 03 March 2019, date of current version 18 March 2019. ๏ƒ˜ Textual information is available in many resources such as documents, newspapers, faxes, printed information, written notes, etc. ๏ƒ˜ Many people simply scan the document to store the data in the computers. ๏ƒ˜ When a document is scanned with a scanner, it is stored in the form of images. ๏ƒ˜ But these images are not editable and it is very difficult to find what the user requires as they will have to go through the whole image, reading each line and word to determine if it is relevant to their need. ๏ƒ˜ LINK : IRJET-V6I299.pdf
  • 7. Reference - III TITLE : Detecting text based image with optical character recognition for English translation and speech using Android DETAILS : Received 15 December 2023, accepted 16 December 2023, date of publication 30 April 2016, date of current version 7 May 2016. ๏ƒ˜ In this study, an Android application is developed by integrating Tesseract OCR engine, Bing translator and phones' built-in speech out technology. ๏ƒ˜ Final deliverable is tested by various type of target end user from a different language background and concluded that the application benefits many users. ๏ƒ˜ LINK :Detecting text based image with optical character recognition for English translation and speech using Android | IEEE Conference Publication | IEEE Xplore
  • 8. CONCLUSION ๏ƒ˜ This project will extract text from image or video using tesseract OCR. With tesseract OCR , users can extract text from images with efficient in-line and character pattern recognition of OCR engine. The extracted text will be displayed to the user in editable format. ๏ƒ˜ Further pyttsx3 is used to convert text to audio format with different accents and male and female voices. Tkinter , a python module is used to create a graphical user interface for the project which is fast, efficient and easy to use.

Editor's Notes

  1. Insert a map of your country.
  2. Insert a picture of one of the geographic features of your country.
  3. Insert a picture illustrating a season in your country.
  4. Insert a picture of an animal and or plant found in your country.
  5. Insert a picture of an animal and or plant found in your country.
  6. Insert a picture of an animal and or plant found in your country.
  7. Insert a picture of one of the geographic features of your country.