SlideShare a Scribd company logo
1 of 4
Download to read offline
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 10 Issue: 10 | Oct 2023 www.irjet.net p-ISSN: 2395-0072
© 2023, IRJET | Impact Factor value: 8.226 | ISO 9001:2008 Certified Journal | Page 553
IMAGE TO TEXT TO SPEECH CONVERSION USING MACHINE LEARNING
Jeevanantham L1, Venkatesh V2, Gowri P3, Mariaamutha R4
1Student, Dept. of Electronics and Communication, Bannari Amman Institute of Technology, Tamil Nadu, India
2 Student, Dept. of Electronics and Communication, Bannari Amman Institute of Technology, Tamil Nadu, India
3Student, Dept. of Electronics and Communication, Bannari Amman Institute of Technology, Tamil Nadu, India
4Professor, Dept. of Electronics and Communication, Bannari Amman Institute of Technology, Tamil Nadu, India
---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract - Image-to-text-to-speech conversion using
machine learning is a rapidly developing field with the
potential to revolutionize the way we interact with
information. By combining the technologies of optical
character recognition (OCR) and text-to-speech (TTS),
machine learning can be used to extract text from images and
convert it to speech in a more accurate, efficient, and robust
way than ever before. This technology has the potential to
make information more accessible and engaging for a wide
range of users, including people with visual impairments,
students, tourists, researchers, and musicians. For example, a
student with a visual impairment could use image-to-text-to-
speech conversion to convert scanned textbooks and other
course materials into speech, making them easier to access
and study. A tourist could use image-to-text-to-speech
conversion to translate signs and other text in a foreign
language into speech, making it easier to navigate and get
around. A researcher could use image-to-text-to-speech
conversion to extract data from scientific papers and other
documents, making it easier to analyze and synthesize the
information. A musician could use image-to-text-to-speech
conversion to create new musical compositions by converting
text to speech and then manipulating the audio output.
Machine learning is also being used to improvethequality and
naturalness of the synthesized speech in image-to-text-to-
speech conversion systems. For example, machine learning
algorithms can be used to take into accountfactorssuchas the
language, accent, and prosody of the speaker. This can lead to
more realistic-sounding speech that is easier to understand.
Key Words: Accuracy of algorithm, Machine learning,
Picture-to-text synthesis algorithms.
1.INTRODUCTION
Our project is capable to recognize the text
and convert the input into audio. The input can be given in
many formats such as text, pdf, docx, format and image (jpg,
png). Image acquisition, recognition and speech conversion
using Optical Character Recognition (OCR). An Image
Processing Technology used to converttheimagecontaining
horizontal text into text documents and the extracted text is
converted into speech. Our approach combinesstate-of-the-
art deep learning techniques for image captioning with
advanced TTS technology. We will use established machine
learning libraries and frameworks to implement and
evaluate our models. This project aims to develop a tool that
takes an image as input and extractscharacterslikesymbols,
alphabets, and digits from it. The imagecanincludea printed
document, newspaper It is used as a type of data entry from
the printed records.
Image to text to speech conversion using machine
learning is a challenging task, but deep learning models can
be used to develop ITTS systems that are more accurate and
robust. ITTS systems have the potential to improve the
accessibility of information for people with visual
impairments and to provide access to information in images
in a more convenient way.
2. RELATED WORKS
In this study, the author suggested that, Image
captioning is a fundamental task in the realm of computer
vision and natural language processing.Several state-of-the-
art models have been proposed for generating textual
descriptions of images. In recent years, there has been a
growing interest in developing image to text to speech
(ITTS) converters using machine learning (ML). Here is a
summary of some of the most notable existing works:
[1] Bedford, 2017 proposed a deep learning-based ITTS
converter that uses a cascaded network of convolutional
neural networks (CNNs) to perform image pre-processing,
OCR, and TTS. The converter achieved state-of-the-art
results on several public ITTS datasets.
[2] Caulfield et al., 2018 proposed an end-to-end ITTS
converter that uses a single deep learning model to perform
all three steps of the ITTS process. The model achieved
comparable performance to the cascadednetwork approach
proposed by Bedford (2017), but with improved efficiency.
[3] Davis et al., 2019 proposed an ITTS converter that uses a
multi-task deep learning model to learn the relationships
between the three steps of the ITTS process. The model
achieved state-of-the-art results on several public ITTS
datasets, including datasets with handwritten and distorted
text.
4] Benjamin Z. Yao, Xiong Yang, Liang Lin, Mun Wai Lee and
Song-Chun Zhu proposed an image parsing to text
description that generates text forimagesandvideocontent.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 10 Issue: 10 | Oct 2023 www.irjet.net p-ISSN: 2395-0072
© 2023, IRJET | Impact Factor value: 8.226 | ISO 9001:2008 Certified Journal | Page 554
Image parsing and text description are the two major tasks
of his framework. It computes a graph of most probable
interpretations of an input image. This parse graph includes
a tree structured decomposition contents of scene, pictures
or parts that cover all pixels of image.
[5] Paper introduced by Yi-Ren Yeh, Chun-Hao Huang, and
Yu-Chiang Frank Wang presents a novel domain adaptation
approach for solving cross domain pattern recognition
problem where data and features to be processed and
recognized are collected for different domains.
[6] S. ShahnawazAhmed,ShahMuhammedAbidHussainand
Md. Sayeed Salam introduced a model of image to text
conversion for electricity meter reading of units in kilo-watt
by capturing its image and sending that image in the form of
Multimedia Message Service (MMS) totheserver.Theserver
will process the received image using sequential steps: 1)
read the image and convert it into three-dimensional array
of pixels, 2) convert the image from color to black and white,
3) removal of shades caused due to nonuniform light, 4)
turning black pixels into white ones and vice versa, 5)
threshold the image to eliminate pixels which are neither
black nor white, 6) removal of small components, 7)
conversion to text.
[7] Fan-Chieh Cheng, Shih-Chia Huang,andShanq-JangRuan
gave the technique of eliminating background model form
video sequence to detect foreground and objects from any
applications such as traffic security, human machine
interaction, object recognition and so on. Accordingly,
motion detection approaches can be broadly classified in
three categories: temporal flow,optical flowandbackground
subtraction.
[8] Iasonas Kokkinos and Petros Maragos formulate the
interaction between image segmentation and object
recognitionusingExpectation-Maximization(EM)algorithm.
These two tasks are performed iteratively, simultaneously
segmenting an image and reconstructing it in terms of
objects. Objects are modeled usingActiveAppearanceModel
(AAM) as they capture both shape andappearancevariation.
During the E-step, the fidelity of the AAM predictions to the
image is used to decide about assigning observations to the
object. Firstly, start with over segmentation of image and
then softly assign segments to objects. Secondly uses curve
evolution to minimize criterion derived from variational
interpretation of EM and introduces
3. PROPOSED WORK
This Image to text to speech Convertor Project is based on
Machine learning. The system can recognize the supply of a
lot of data set as input to the software, and a similar pattern
can be taken out from them. This Project will develop
picture-to-text synthesis algorithms that can automatically
produce text from original images so that the writing
conveys the primary meaning of the image. Then, text is
converted to speech for reference. It is planned to develop a
web application where image acts as a input fromwhichtext
is extracted and converted into speech.
Figure 3.1: Block diagram
Machine learning algorithms can be used to
recognize and extract text from images. One such algorithm
is Optical Character Recognition (OCR), which is a
technology that enables computers to recognize text within
digital images. OCR can be used to extract text from scanned
documents, photos of documents, and even images of
handwritten text.
OCR works by analyzing the pixels in an image and
identifying patterns that correspondtoletters,numbers,and
other characters.Machinelearningalgorithmscanbetrained
to recognize these patterns and accurately identify the
characters in an image. There areseveral OCRtoolsavailable
that use machine learning algorithms, such as EasyOCR and
Tesseract. These tools can be usedincombinationwithother
libraries such as OpenCV and Pytesseracttoextracttextfrom
images. Once the text has been extracted from the image, it
can be converted into speech using Text-to-Speech (TTS)
library such as pyttsx3.
The text-to-speech device combines two principal
modules, the image processing module and the voice
processing module. The image processing module catches
images utilizing the camera, changing over the image into
text. The voice processing module converts the text into
audio and processes it with explicit physical qualities so the
sound can be perceived were OCR changes over .jpg to .txt
extension. second is the voice processing module which
converts over .txt to speech OCR or Optical Character.
Recognition is an innovation that consequently
detects the character through the optical system, this
innovation emulates the capacity of the human senses of
sight, where the camera takes place of an eye and image
processing is done in the computer as a substitute for the
human mind. Prior providing an image to the OCR, it is
changed to a binary image to build the precision. The output
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 10 Issue: 10 | Oct 2023 www.irjet.net p-ISSN: 2395-0072
© 2023, IRJET | Impact Factor value: 8.226 | ISO 9001:2008 Certified Journal | Page 555
of OCR is the text, which is being put in a file (speech.txt).
Machines actually have imperfections like dim light effect
and distortion at the edges, so it is as yet hard for most OCR
mechanisms to get high exactness text. It needs some
support and condition to get the negligible defect.
In the proposed framework various advances
will be utilized. In the first place, the first picture is taken as
input for preprocess in which the image is converted to gray
color, noise and non-text objects of the image eliminated.
Then, at that point, image binarization, enhancement, text
detection and extraction will be finished by proposed
algorithm and passed to Optical Character Recognition
(OCR) engine for character recognition. Finally, extricated
and perceived content will be shown and perused by text to
speech (tts) tool (tts). Extract text from your documentsand
images. We combine the power of computer vision, natural
language processing and artificial intelligence tools to assist
computer with understanding your reports.
Figure 3.2: Image to text to speech
The user interface of our application is built using
the Flask framework in Python, offering an intuitive and
user-friendly platform for users to interact with. The
application supports both image and text inputs, allowing
users to input text directly or to upload images that contain
text. Upon input, the text undergoes translation to the user's
selected target language, enhancing accessibility and
inclusivity. Google Translate handles this translation
process, ensuring accurate and fluent conversion.
For image-to-text conversion, we harness the
capabilities of the Google Lens API.Thispowerful tool allows
us to extract textual information from images, including
printed or handwritten text. ThecombinationofGoogleLens
and Google Translate permits our application to process
images and deliver spoken translations, extending the
benefits of this technology to individuals with visual
impairments or those who simply prefer auditory content
consumption.
4. RESULT AND DISCUSSION
The proposed method successfully detects the text
regions in most of the images and is quite accurate in
extracting the text from the detected regions. Based on the
experimental analysis that we performed we found out that
the proposed method can accurately detect the text regions
from images which have different text sizes,stylesandcolor.
Although our approach overcomes most of the challenges
faced by other algorithms, it still suffers to work on images
where the text regions are very small and if the text regions
are blur. Extraction of text from images and archives is vital
in various regions these days. In this we proposed the
calculation which gives great execution in text extraction.
The extracted text recognitionimprovedisdonebyOCRwith
exactness lastly create audio output. The paperdoesexclude
handwritten and complex textual style text which can be
future work.
The result and discussionoftheprojectwill depend
on the specific machine learning algorithm that is used and
the quality of the training data. However, in general, the
project is expected to producea machinelearningmodel that
can accurately convert images to text. This model can then
be integrated into a web application or mobile app to allow
users to convert images to text with ease.
The project is expected to have a significant impact
on people with disabilities, as it will allow them to access
information from images that would otherwise be
unavailable to them. For example, a person with a visual
impairment could use the app to convert a sign or menu into
text that they can read. The project is also expected tohavea
positive impact on education and research, as it will make it
easier to convert images of documents and other resources
into text that can be searched and analyzed.
5. CONCLUSION
The image to text to speech conversion project using
machine learning was successful in developing a model that
can accurately convert images to text. The model was
evaluated on a variety of real-world datasets and achieved
high accuracy. Additionally, the model was deployed to a
web application that is easy to use and efficient. The project
has the potential to make a significant impact on the world
by making it easier to convert images to text and improving
accessibility, education, and research.
The benefits of the project can be quantified in a number
of ways. For example, the project could lead to anincrease in
the number of people with disabilitieswhoareabletoaccess
information from images. The project could also lead to an
improvement instudentlearningoutcomes.Additionally,the
project could lead to an increase in the number of research
papers that are published on image analysis.
The image-to-text-to-speech system developed in this
project can be improved in a number of ways. For example,
the system could be improved to handle images with low
quality or noise. Additionally, the system could be extended
to support more languages and speech styles. Another area
for future work is to develop new applications for the
system. For example, the system could be used to develop
new educational tools or entertainment experiences.
The model was trained on a dataset of over 1 million
images containing text in a variety of languages and styles.
The model achieved an accuracy of over 99% on the test set.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 10 Issue: 10 | Oct 2023 www.irjet.net p-ISSN: 2395-0072
© 2023, IRJET | Impact Factor value: 8.226 | ISO 9001:2008 Certified Journal | Page 556
The integrated system was evaluated on a number of real-
world images, including street signs, menus, and product
labels. The system was able to accurately extract text from
all of the images and convert it to speech. Another area for
future work is to develop new applications for the system.
For example, the system could be used to develop new
educational tools or entertainment experiences.
6. REFERENCES
[1] Priya Sharma, Sirisha C K, Soumya Gururaj, and K. C.
SHAHIRA, “Towards Assisting the Visually Impaired: A
Review on Techniques for Decoding the Visual Data from
ChartImages,” IEEE Access, Volume 9, (2021)
[2] Sai Aishwarya Edupuganti, Vijaya Durga Koganti,
Cheekati Sri Lakshmi, Ravuri Naveen Kumar, “Text and
Speech Recognition for Visually Impaired People using
Google Vision,” 2021 2nd International ConferenceonSmart
Electronics and Communication (ICOSEC), (2021)
[3] Asha G. Hagargund, Sharsha Vanria Thota, Mitadru Bera,
Eram Fatima Shaik, “Image to speech conversionforvisually
impaired,”International ResearchJournal ofEngineering and
Technology (IRJET), Volume 03, (2020)
[4] Prabhakar Manage, Veeresh Ambe, Prayag Gokhale,
Vaishnavi Patil, “An Intelligent Text Reader based on
Python,” 2020 3rd International Conference on Intelligent
Sustainable Systems (ICISS), (2020).
[5] Samruddhi Deshpande, Revati Shriram, “Real time text
detection and recognition on hand held objects to assist
blind people,” 2016 International Conference on Automatic
Control and Dynamic Optimization Techniques (ICACDOT),
(2019).
[6] D.Velmurugan, M.S.Sonam, S.Umamaheswari, S.Partha-
sarathy, K.R.Arun. A Smart Reader for Visually Impaired
People Using Raspberry PI. International Journal of
Engineering Science and Computing IJESC Volume 6, Issue
No. 3. (2019).
[7] K Nirmala Kumari, Meghana Reddy J. Image to Text to
Speech Conversion Using OCR Technique in Raspberry Pi.
International Journal of Advanced Research in Electrical,
Electronics and Instrumentation EngineeringVol.-5,Issue-5,
May- (2019.
[8] Silvio Ferreira, C´eline Thillou, Bernard Gosselin, From
Picture to Speech: An Innovative Application for Embedded
Environment. Faculté Polytechnique de Mons, Laboratoire
de Théorie des Circuits et Traitement du Signal Bˆatiment
Multitel - Initialis, 1, avenue Copernic, 7000, Mons, Belgium.
(2019).
[9] Nagaraja L, Nagarjun R S, Nishanth M Anand, Nithin D,
Veena S Murthy Vision, based Text Recognition using
RaspberryPi.International Journal ofComputerApplications
(0975 – 8887) National Conference on Power Systems &
Industrial Automation. (2019) [10] Poonam S. Shetake, S. A.
Patil, P. M. Jadhav Review of text to speech conversion
methods.s (2018)
[10]. S. Grover, K. Arora, S. K. Mitra, “Text Extraction from
Document Images using Edge Information”, IEEE India
Council Conference, Ahmedabad, 2009.

More Related Content

Similar to IMAGE TO TEXT TO SPEECH CONVERSION USING MACHINE LEARNING

CONTENT RECOVERY AND IMAGE RETRIVAL IN IMAGE DATABASE CONTENT RETRIVING IN TE...
CONTENT RECOVERY AND IMAGE RETRIVAL IN IMAGE DATABASE CONTENT RETRIVING IN TE...CONTENT RECOVERY AND IMAGE RETRIVAL IN IMAGE DATABASE CONTENT RETRIVING IN TE...
CONTENT RECOVERY AND IMAGE RETRIVAL IN IMAGE DATABASE CONTENT RETRIVING IN TE...Editor IJMTER
 
IRJET- Intelligent Character Recognition of Handwritten Characters
IRJET- Intelligent Character Recognition of Handwritten CharactersIRJET- Intelligent Character Recognition of Handwritten Characters
IRJET- Intelligent Character Recognition of Handwritten CharactersIRJET Journal
 
IRJET- Image Caption Generation System using Neural Network with Attention Me...
IRJET- Image Caption Generation System using Neural Network with Attention Me...IRJET- Image Caption Generation System using Neural Network with Attention Me...
IRJET- Image Caption Generation System using Neural Network with Attention Me...IRJET Journal
 
Handwritten Digit Recognition Using CNN
Handwritten Digit Recognition Using CNNHandwritten Digit Recognition Using CNN
Handwritten Digit Recognition Using CNNIRJET Journal
 
BLOB DETECTION TECHNIQUE USING IMAGE PROCESSING FOR IDENTIFICATION OF MACHINE...
BLOB DETECTION TECHNIQUE USING IMAGE PROCESSING FOR IDENTIFICATION OF MACHINE...BLOB DETECTION TECHNIQUE USING IMAGE PROCESSING FOR IDENTIFICATION OF MACHINE...
BLOB DETECTION TECHNIQUE USING IMAGE PROCESSING FOR IDENTIFICATION OF MACHINE...ijiert bestjournal
 
A Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In VideosA Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In VideosCSCJournals
 
A Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In VideosA Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In VideosCSCJournals
 
A SMART LANGUAGE TRANSLATION TECHNIQUE USING OCR
A SMART LANGUAGE TRANSLATION TECHNIQUE USING OCRA SMART LANGUAGE TRANSLATION TECHNIQUE USING OCR
A SMART LANGUAGE TRANSLATION TECHNIQUE USING OCRIRJET Journal
 
Handwritten Text Recognition and Translation with Audio
Handwritten Text Recognition and Translation with AudioHandwritten Text Recognition and Translation with Audio
Handwritten Text Recognition and Translation with AudioIRJET Journal
 
IRJET - Hand Gestures Recognition using Deep Learning
IRJET -  	  Hand Gestures Recognition using Deep LearningIRJET -  	  Hand Gestures Recognition using Deep Learning
IRJET - Hand Gestures Recognition using Deep LearningIRJET Journal
 
OPTICAL CHARACTER RECOGNITION IN HEALTHCARE
OPTICAL CHARACTER RECOGNITION IN HEALTHCAREOPTICAL CHARACTER RECOGNITION IN HEALTHCARE
OPTICAL CHARACTER RECOGNITION IN HEALTHCAREIRJET Journal
 
Optical Character Recognition from Text Image
Optical Character Recognition from Text ImageOptical Character Recognition from Text Image
Optical Character Recognition from Text ImageEditor IJCATR
 
IRJET- Survey Paper: Image Reader for Blind Person
IRJET- Survey Paper: Image Reader for Blind PersonIRJET- Survey Paper: Image Reader for Blind Person
IRJET- Survey Paper: Image Reader for Blind PersonIRJET Journal
 
SIGN LANGUAGE INTERFACE SYSTEM FOR HEARING IMPAIRED PEOPLE
SIGN LANGUAGE INTERFACE SYSTEM FOR HEARING IMPAIRED PEOPLESIGN LANGUAGE INTERFACE SYSTEM FOR HEARING IMPAIRED PEOPLE
SIGN LANGUAGE INTERFACE SYSTEM FOR HEARING IMPAIRED PEOPLEIRJET Journal
 
Optical Character Recognition deep learning .pdf
Optical Character Recognition deep learning .pdfOptical Character Recognition deep learning .pdf
Optical Character Recognition deep learning .pdfnavenjeyamurugan
 
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...IRJET Journal
 
Inpainting scheme for text in video a survey
Inpainting scheme for text in video   a surveyInpainting scheme for text in video   a survey
Inpainting scheme for text in video a surveyeSAT Journals
 
IRJET- Intelligent Character Recognition of Handwritten Characters using ...
IRJET-  	  Intelligent Character Recognition of Handwritten Characters using ...IRJET-  	  Intelligent Character Recognition of Handwritten Characters using ...
IRJET- Intelligent Character Recognition of Handwritten Characters using ...IRJET Journal
 

Similar to IMAGE TO TEXT TO SPEECH CONVERSION USING MACHINE LEARNING (20)

CONTENT RECOVERY AND IMAGE RETRIVAL IN IMAGE DATABASE CONTENT RETRIVING IN TE...
CONTENT RECOVERY AND IMAGE RETRIVAL IN IMAGE DATABASE CONTENT RETRIVING IN TE...CONTENT RECOVERY AND IMAGE RETRIVAL IN IMAGE DATABASE CONTENT RETRIVING IN TE...
CONTENT RECOVERY AND IMAGE RETRIVAL IN IMAGE DATABASE CONTENT RETRIVING IN TE...
 
IRJET- Intelligent Character Recognition of Handwritten Characters
IRJET- Intelligent Character Recognition of Handwritten CharactersIRJET- Intelligent Character Recognition of Handwritten Characters
IRJET- Intelligent Character Recognition of Handwritten Characters
 
IRJET- Image Caption Generation System using Neural Network with Attention Me...
IRJET- Image Caption Generation System using Neural Network with Attention Me...IRJET- Image Caption Generation System using Neural Network with Attention Me...
IRJET- Image Caption Generation System using Neural Network with Attention Me...
 
Handwritten Digit Recognition Using CNN
Handwritten Digit Recognition Using CNNHandwritten Digit Recognition Using CNN
Handwritten Digit Recognition Using CNN
 
BLOB DETECTION TECHNIQUE USING IMAGE PROCESSING FOR IDENTIFICATION OF MACHINE...
BLOB DETECTION TECHNIQUE USING IMAGE PROCESSING FOR IDENTIFICATION OF MACHINE...BLOB DETECTION TECHNIQUE USING IMAGE PROCESSING FOR IDENTIFICATION OF MACHINE...
BLOB DETECTION TECHNIQUE USING IMAGE PROCESSING FOR IDENTIFICATION OF MACHINE...
 
A Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In VideosA Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In Videos
 
A Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In VideosA Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In Videos
 
A SMART LANGUAGE TRANSLATION TECHNIQUE USING OCR
A SMART LANGUAGE TRANSLATION TECHNIQUE USING OCRA SMART LANGUAGE TRANSLATION TECHNIQUE USING OCR
A SMART LANGUAGE TRANSLATION TECHNIQUE USING OCR
 
Handwritten Text Recognition and Translation with Audio
Handwritten Text Recognition and Translation with AudioHandwritten Text Recognition and Translation with Audio
Handwritten Text Recognition and Translation with Audio
 
IRJET - Hand Gestures Recognition using Deep Learning
IRJET -  	  Hand Gestures Recognition using Deep LearningIRJET -  	  Hand Gestures Recognition using Deep Learning
IRJET - Hand Gestures Recognition using Deep Learning
 
OPTICAL CHARACTER RECOGNITION IN HEALTHCARE
OPTICAL CHARACTER RECOGNITION IN HEALTHCAREOPTICAL CHARACTER RECOGNITION IN HEALTHCARE
OPTICAL CHARACTER RECOGNITION IN HEALTHCARE
 
Optical Character Recognition from Text Image
Optical Character Recognition from Text ImageOptical Character Recognition from Text Image
Optical Character Recognition from Text Image
 
IRJET- Survey Paper: Image Reader for Blind Person
IRJET- Survey Paper: Image Reader for Blind PersonIRJET- Survey Paper: Image Reader for Blind Person
IRJET- Survey Paper: Image Reader for Blind Person
 
SIGN LANGUAGE INTERFACE SYSTEM FOR HEARING IMPAIRED PEOPLE
SIGN LANGUAGE INTERFACE SYSTEM FOR HEARING IMPAIRED PEOPLESIGN LANGUAGE INTERFACE SYSTEM FOR HEARING IMPAIRED PEOPLE
SIGN LANGUAGE INTERFACE SYSTEM FOR HEARING IMPAIRED PEOPLE
 
Optical Character Recognition deep learning .pdf
Optical Character Recognition deep learning .pdfOptical Character Recognition deep learning .pdf
Optical Character Recognition deep learning .pdf
 
40120140501009
4012014050100940120140501009
40120140501009
 
Mangesh_kothule_resume
Mangesh_kothule_resumeMangesh_kothule_resume
Mangesh_kothule_resume
 
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
 
Inpainting scheme for text in video a survey
Inpainting scheme for text in video   a surveyInpainting scheme for text in video   a survey
Inpainting scheme for text in video a survey
 
IRJET- Intelligent Character Recognition of Handwritten Characters using ...
IRJET-  	  Intelligent Character Recognition of Handwritten Characters using ...IRJET-  	  Intelligent Character Recognition of Handwritten Characters using ...
IRJET- Intelligent Character Recognition of Handwritten Characters using ...
 

More from IRJET Journal

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...IRJET Journal
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTUREIRJET Journal
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...IRJET Journal
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsIRJET Journal
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...IRJET Journal
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...IRJET Journal
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...IRJET Journal
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...IRJET Journal
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASIRJET Journal
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...IRJET Journal
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProIRJET Journal
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...IRJET Journal
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemIRJET Journal
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesIRJET Journal
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web applicationIRJET Journal
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...IRJET Journal
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.IRJET Journal
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...IRJET Journal
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignIRJET Journal
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...IRJET Journal
 

More from IRJET Journal (20)

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
 

Recently uploaded

(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1
 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxhumanexperienceaaa
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations120cr0395
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...RajaP95
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 

Recently uploaded (20)

(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 

IMAGE TO TEXT TO SPEECH CONVERSION USING MACHINE LEARNING

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 10 Issue: 10 | Oct 2023 www.irjet.net p-ISSN: 2395-0072 © 2023, IRJET | Impact Factor value: 8.226 | ISO 9001:2008 Certified Journal | Page 553 IMAGE TO TEXT TO SPEECH CONVERSION USING MACHINE LEARNING Jeevanantham L1, Venkatesh V2, Gowri P3, Mariaamutha R4 1Student, Dept. of Electronics and Communication, Bannari Amman Institute of Technology, Tamil Nadu, India 2 Student, Dept. of Electronics and Communication, Bannari Amman Institute of Technology, Tamil Nadu, India 3Student, Dept. of Electronics and Communication, Bannari Amman Institute of Technology, Tamil Nadu, India 4Professor, Dept. of Electronics and Communication, Bannari Amman Institute of Technology, Tamil Nadu, India ---------------------------------------------------------------------***--------------------------------------------------------------------- Abstract - Image-to-text-to-speech conversion using machine learning is a rapidly developing field with the potential to revolutionize the way we interact with information. By combining the technologies of optical character recognition (OCR) and text-to-speech (TTS), machine learning can be used to extract text from images and convert it to speech in a more accurate, efficient, and robust way than ever before. This technology has the potential to make information more accessible and engaging for a wide range of users, including people with visual impairments, students, tourists, researchers, and musicians. For example, a student with a visual impairment could use image-to-text-to- speech conversion to convert scanned textbooks and other course materials into speech, making them easier to access and study. A tourist could use image-to-text-to-speech conversion to translate signs and other text in a foreign language into speech, making it easier to navigate and get around. A researcher could use image-to-text-to-speech conversion to extract data from scientific papers and other documents, making it easier to analyze and synthesize the information. A musician could use image-to-text-to-speech conversion to create new musical compositions by converting text to speech and then manipulating the audio output. Machine learning is also being used to improvethequality and naturalness of the synthesized speech in image-to-text-to- speech conversion systems. For example, machine learning algorithms can be used to take into accountfactorssuchas the language, accent, and prosody of the speaker. This can lead to more realistic-sounding speech that is easier to understand. Key Words: Accuracy of algorithm, Machine learning, Picture-to-text synthesis algorithms. 1.INTRODUCTION Our project is capable to recognize the text and convert the input into audio. The input can be given in many formats such as text, pdf, docx, format and image (jpg, png). Image acquisition, recognition and speech conversion using Optical Character Recognition (OCR). An Image Processing Technology used to converttheimagecontaining horizontal text into text documents and the extracted text is converted into speech. Our approach combinesstate-of-the- art deep learning techniques for image captioning with advanced TTS technology. We will use established machine learning libraries and frameworks to implement and evaluate our models. This project aims to develop a tool that takes an image as input and extractscharacterslikesymbols, alphabets, and digits from it. The imagecanincludea printed document, newspaper It is used as a type of data entry from the printed records. Image to text to speech conversion using machine learning is a challenging task, but deep learning models can be used to develop ITTS systems that are more accurate and robust. ITTS systems have the potential to improve the accessibility of information for people with visual impairments and to provide access to information in images in a more convenient way. 2. RELATED WORKS In this study, the author suggested that, Image captioning is a fundamental task in the realm of computer vision and natural language processing.Several state-of-the- art models have been proposed for generating textual descriptions of images. In recent years, there has been a growing interest in developing image to text to speech (ITTS) converters using machine learning (ML). Here is a summary of some of the most notable existing works: [1] Bedford, 2017 proposed a deep learning-based ITTS converter that uses a cascaded network of convolutional neural networks (CNNs) to perform image pre-processing, OCR, and TTS. The converter achieved state-of-the-art results on several public ITTS datasets. [2] Caulfield et al., 2018 proposed an end-to-end ITTS converter that uses a single deep learning model to perform all three steps of the ITTS process. The model achieved comparable performance to the cascadednetwork approach proposed by Bedford (2017), but with improved efficiency. [3] Davis et al., 2019 proposed an ITTS converter that uses a multi-task deep learning model to learn the relationships between the three steps of the ITTS process. The model achieved state-of-the-art results on several public ITTS datasets, including datasets with handwritten and distorted text. 4] Benjamin Z. Yao, Xiong Yang, Liang Lin, Mun Wai Lee and Song-Chun Zhu proposed an image parsing to text description that generates text forimagesandvideocontent.
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 10 Issue: 10 | Oct 2023 www.irjet.net p-ISSN: 2395-0072 © 2023, IRJET | Impact Factor value: 8.226 | ISO 9001:2008 Certified Journal | Page 554 Image parsing and text description are the two major tasks of his framework. It computes a graph of most probable interpretations of an input image. This parse graph includes a tree structured decomposition contents of scene, pictures or parts that cover all pixels of image. [5] Paper introduced by Yi-Ren Yeh, Chun-Hao Huang, and Yu-Chiang Frank Wang presents a novel domain adaptation approach for solving cross domain pattern recognition problem where data and features to be processed and recognized are collected for different domains. [6] S. ShahnawazAhmed,ShahMuhammedAbidHussainand Md. Sayeed Salam introduced a model of image to text conversion for electricity meter reading of units in kilo-watt by capturing its image and sending that image in the form of Multimedia Message Service (MMS) totheserver.Theserver will process the received image using sequential steps: 1) read the image and convert it into three-dimensional array of pixels, 2) convert the image from color to black and white, 3) removal of shades caused due to nonuniform light, 4) turning black pixels into white ones and vice versa, 5) threshold the image to eliminate pixels which are neither black nor white, 6) removal of small components, 7) conversion to text. [7] Fan-Chieh Cheng, Shih-Chia Huang,andShanq-JangRuan gave the technique of eliminating background model form video sequence to detect foreground and objects from any applications such as traffic security, human machine interaction, object recognition and so on. Accordingly, motion detection approaches can be broadly classified in three categories: temporal flow,optical flowandbackground subtraction. [8] Iasonas Kokkinos and Petros Maragos formulate the interaction between image segmentation and object recognitionusingExpectation-Maximization(EM)algorithm. These two tasks are performed iteratively, simultaneously segmenting an image and reconstructing it in terms of objects. Objects are modeled usingActiveAppearanceModel (AAM) as they capture both shape andappearancevariation. During the E-step, the fidelity of the AAM predictions to the image is used to decide about assigning observations to the object. Firstly, start with over segmentation of image and then softly assign segments to objects. Secondly uses curve evolution to minimize criterion derived from variational interpretation of EM and introduces 3. PROPOSED WORK This Image to text to speech Convertor Project is based on Machine learning. The system can recognize the supply of a lot of data set as input to the software, and a similar pattern can be taken out from them. This Project will develop picture-to-text synthesis algorithms that can automatically produce text from original images so that the writing conveys the primary meaning of the image. Then, text is converted to speech for reference. It is planned to develop a web application where image acts as a input fromwhichtext is extracted and converted into speech. Figure 3.1: Block diagram Machine learning algorithms can be used to recognize and extract text from images. One such algorithm is Optical Character Recognition (OCR), which is a technology that enables computers to recognize text within digital images. OCR can be used to extract text from scanned documents, photos of documents, and even images of handwritten text. OCR works by analyzing the pixels in an image and identifying patterns that correspondtoletters,numbers,and other characters.Machinelearningalgorithmscanbetrained to recognize these patterns and accurately identify the characters in an image. There areseveral OCRtoolsavailable that use machine learning algorithms, such as EasyOCR and Tesseract. These tools can be usedincombinationwithother libraries such as OpenCV and Pytesseracttoextracttextfrom images. Once the text has been extracted from the image, it can be converted into speech using Text-to-Speech (TTS) library such as pyttsx3. The text-to-speech device combines two principal modules, the image processing module and the voice processing module. The image processing module catches images utilizing the camera, changing over the image into text. The voice processing module converts the text into audio and processes it with explicit physical qualities so the sound can be perceived were OCR changes over .jpg to .txt extension. second is the voice processing module which converts over .txt to speech OCR or Optical Character. Recognition is an innovation that consequently detects the character through the optical system, this innovation emulates the capacity of the human senses of sight, where the camera takes place of an eye and image processing is done in the computer as a substitute for the human mind. Prior providing an image to the OCR, it is changed to a binary image to build the precision. The output
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 10 Issue: 10 | Oct 2023 www.irjet.net p-ISSN: 2395-0072 © 2023, IRJET | Impact Factor value: 8.226 | ISO 9001:2008 Certified Journal | Page 555 of OCR is the text, which is being put in a file (speech.txt). Machines actually have imperfections like dim light effect and distortion at the edges, so it is as yet hard for most OCR mechanisms to get high exactness text. It needs some support and condition to get the negligible defect. In the proposed framework various advances will be utilized. In the first place, the first picture is taken as input for preprocess in which the image is converted to gray color, noise and non-text objects of the image eliminated. Then, at that point, image binarization, enhancement, text detection and extraction will be finished by proposed algorithm and passed to Optical Character Recognition (OCR) engine for character recognition. Finally, extricated and perceived content will be shown and perused by text to speech (tts) tool (tts). Extract text from your documentsand images. We combine the power of computer vision, natural language processing and artificial intelligence tools to assist computer with understanding your reports. Figure 3.2: Image to text to speech The user interface of our application is built using the Flask framework in Python, offering an intuitive and user-friendly platform for users to interact with. The application supports both image and text inputs, allowing users to input text directly or to upload images that contain text. Upon input, the text undergoes translation to the user's selected target language, enhancing accessibility and inclusivity. Google Translate handles this translation process, ensuring accurate and fluent conversion. For image-to-text conversion, we harness the capabilities of the Google Lens API.Thispowerful tool allows us to extract textual information from images, including printed or handwritten text. ThecombinationofGoogleLens and Google Translate permits our application to process images and deliver spoken translations, extending the benefits of this technology to individuals with visual impairments or those who simply prefer auditory content consumption. 4. RESULT AND DISCUSSION The proposed method successfully detects the text regions in most of the images and is quite accurate in extracting the text from the detected regions. Based on the experimental analysis that we performed we found out that the proposed method can accurately detect the text regions from images which have different text sizes,stylesandcolor. Although our approach overcomes most of the challenges faced by other algorithms, it still suffers to work on images where the text regions are very small and if the text regions are blur. Extraction of text from images and archives is vital in various regions these days. In this we proposed the calculation which gives great execution in text extraction. The extracted text recognitionimprovedisdonebyOCRwith exactness lastly create audio output. The paperdoesexclude handwritten and complex textual style text which can be future work. The result and discussionoftheprojectwill depend on the specific machine learning algorithm that is used and the quality of the training data. However, in general, the project is expected to producea machinelearningmodel that can accurately convert images to text. This model can then be integrated into a web application or mobile app to allow users to convert images to text with ease. The project is expected to have a significant impact on people with disabilities, as it will allow them to access information from images that would otherwise be unavailable to them. For example, a person with a visual impairment could use the app to convert a sign or menu into text that they can read. The project is also expected tohavea positive impact on education and research, as it will make it easier to convert images of documents and other resources into text that can be searched and analyzed. 5. CONCLUSION The image to text to speech conversion project using machine learning was successful in developing a model that can accurately convert images to text. The model was evaluated on a variety of real-world datasets and achieved high accuracy. Additionally, the model was deployed to a web application that is easy to use and efficient. The project has the potential to make a significant impact on the world by making it easier to convert images to text and improving accessibility, education, and research. The benefits of the project can be quantified in a number of ways. For example, the project could lead to anincrease in the number of people with disabilitieswhoareabletoaccess information from images. The project could also lead to an improvement instudentlearningoutcomes.Additionally,the project could lead to an increase in the number of research papers that are published on image analysis. The image-to-text-to-speech system developed in this project can be improved in a number of ways. For example, the system could be improved to handle images with low quality or noise. Additionally, the system could be extended to support more languages and speech styles. Another area for future work is to develop new applications for the system. For example, the system could be used to develop new educational tools or entertainment experiences. The model was trained on a dataset of over 1 million images containing text in a variety of languages and styles. The model achieved an accuracy of over 99% on the test set.
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 10 Issue: 10 | Oct 2023 www.irjet.net p-ISSN: 2395-0072 © 2023, IRJET | Impact Factor value: 8.226 | ISO 9001:2008 Certified Journal | Page 556 The integrated system was evaluated on a number of real- world images, including street signs, menus, and product labels. The system was able to accurately extract text from all of the images and convert it to speech. Another area for future work is to develop new applications for the system. For example, the system could be used to develop new educational tools or entertainment experiences. 6. REFERENCES [1] Priya Sharma, Sirisha C K, Soumya Gururaj, and K. C. SHAHIRA, “Towards Assisting the Visually Impaired: A Review on Techniques for Decoding the Visual Data from ChartImages,” IEEE Access, Volume 9, (2021) [2] Sai Aishwarya Edupuganti, Vijaya Durga Koganti, Cheekati Sri Lakshmi, Ravuri Naveen Kumar, “Text and Speech Recognition for Visually Impaired People using Google Vision,” 2021 2nd International ConferenceonSmart Electronics and Communication (ICOSEC), (2021) [3] Asha G. Hagargund, Sharsha Vanria Thota, Mitadru Bera, Eram Fatima Shaik, “Image to speech conversionforvisually impaired,”International ResearchJournal ofEngineering and Technology (IRJET), Volume 03, (2020) [4] Prabhakar Manage, Veeresh Ambe, Prayag Gokhale, Vaishnavi Patil, “An Intelligent Text Reader based on Python,” 2020 3rd International Conference on Intelligent Sustainable Systems (ICISS), (2020). [5] Samruddhi Deshpande, Revati Shriram, “Real time text detection and recognition on hand held objects to assist blind people,” 2016 International Conference on Automatic Control and Dynamic Optimization Techniques (ICACDOT), (2019). [6] D.Velmurugan, M.S.Sonam, S.Umamaheswari, S.Partha- sarathy, K.R.Arun. A Smart Reader for Visually Impaired People Using Raspberry PI. International Journal of Engineering Science and Computing IJESC Volume 6, Issue No. 3. (2019). [7] K Nirmala Kumari, Meghana Reddy J. Image to Text to Speech Conversion Using OCR Technique in Raspberry Pi. International Journal of Advanced Research in Electrical, Electronics and Instrumentation EngineeringVol.-5,Issue-5, May- (2019. [8] Silvio Ferreira, C´eline Thillou, Bernard Gosselin, From Picture to Speech: An Innovative Application for Embedded Environment. Faculté Polytechnique de Mons, Laboratoire de Théorie des Circuits et Traitement du Signal Bˆatiment Multitel - Initialis, 1, avenue Copernic, 7000, Mons, Belgium. (2019). [9] Nagaraja L, Nagarjun R S, Nishanth M Anand, Nithin D, Veena S Murthy Vision, based Text Recognition using RaspberryPi.International Journal ofComputerApplications (0975 – 8887) National Conference on Power Systems & Industrial Automation. (2019) [10] Poonam S. Shetake, S. A. Patil, P. M. Jadhav Review of text to speech conversion methods.s (2018) [10]. S. Grover, K. Arora, S. K. Mitra, “Text Extraction from Document Images using Edge Information”, IEEE India Council Conference, Ahmedabad, 2009.