SlideShare a Scribd company logo
Image to Text Convertor
BY,
DHIRAJ RAJ
MANVENDRA PRIYADARSHI
Agenda :
 Abstract
 AIM
 Technology Used
 Procedure
 Algo I
 Algo II
 Algo III
 Algo IV (Part 1 & 2)
 Algo V (Part 1 & 2)
 Advantage
 Limitations
 Conclusion
Abstract :
 Image to text converter is a type of application that can be used to translate images of any format to the
text format. This application helps one to convert the texts in image files into editable text files.
 It has some pre-requisite conditions saying first that the text captured should be aligned horizontally
straight. Then the text in the image to be converted contains only A, B, C, and D of pre-defined fonts or
human written fonts. The image should be captured in a way that the pixel of any of the text should not be
present at the coordinate (0, 0). And also the image captured should have texts with intensity of dark color
and the background with intensity of light color.
 This program basically uses five specific algorithms where the first algorithm deals with converting the text
pixels and the background pixels other than text into opposite ranges of RGB so that the text pixels could
be identified with ease.
 In the second algorithm the image obtained previously is horizontally searched for all the portion of text (in
black) area and the dimension of each sentence is found. An array of BufferedImage type is used to store
the separated images containing each sentence. The dimension for the portion of image is defined to that
array, which is separated by using predefined method drawImage().
 The third algorithm deals with extracting each word from each of these sentences into specific images. The
words are separated using drawImage() and stored in an array of BufferedImage type.
Abstract Contd.
 The forth algorithm has two parts. The first part deals with extracting each letters from each of the
words which contain letters of predefined uppercase texts format and the second part deals with
extracting each of the letters from each of the words which contain letters of joint lowercase texts
format.
 The last algorithm deals with extracting each letters from each the words and convert each letter into
specific images. The obtained image of letter is then converted into a size with 100x100 pixels using
predefined method drawImage() for changing the pixels of the image. The image is matched with
predefined strips of co-ordinate. If the image matches every strips condition for letters (particularly
for A, B, C & D) then it gets validated for that letter. And, we display the corresponding letter as an
output.
Aim :
 To build an application to covert in to editable text from image(with
standard text/human handwritting).
Technology Used :
 Language : Java
 IDE : NetBeans
Procedure :
 Step 1 : Firstly, we have change the color of background to be white and
the color of text to be black.
 Step 2 : Now, we separate every sentence from the given segment.
 Step 3 : Then, we split each sentence into words.
 Step 4 : Each word will then split into letters.
 Step 5 : Now, we convert the obtained letter into 100x100 pixels.
 Step 6 : Then, we match the letter with predefined strips of co-ordinate
and validate the letter to be specified one.
 Step 7 : Finally, we display the corresponding letter as an output.
Algo I :
 To change the color of image, we have used predefined class ‘Color’ which
is available in java.awt package.
 Color c1 = new Color(255, 255, 255); // for White
 Color c2 = new Color(0, 0, 0); // for Black
Input : Output :
Algo II :
 Now, we separate each sentence from the given segment.
 We start searching horizontally, all the portion of text (in black) area and
count it separately for every horizontal line and store it into an array.
 Then we look for that line which has white portion and the previous line
should have some text portion and store the co-ordinate of that line into
an array.
 Then we also look for that line which has white portion and the next line
should have some text portion and store the co-ordinate of that line into
the same array.
 Now, we have the co-ordinates of image from which we need to separate
the image.
Algo II continues….
 We have created an array of BufferedImage type to store the separated images.
 BufferedImage imgs[ ] = new BufferedImage[size];
 Then we defined the dimension for the portion of image to that array, which is need to
be separated.
 We used predefined method drawImage() for separating the image.
Output :Input :
Algo III :
 Now, we split each word from the sentence.
 We start searching vertically, all the portion of text (in black) area and count it
separately for every vertical line and store it into an array.
 Then we look for that line which has white portion and the increment the
counter by one until we find a line which has text portion onto it and store
value of counter into an array and the co-ordinate of that line into another
array and use ‘continue’ keyword to skip that iteration and execute next
iteration. Also, assign zero to counter so that it calculate next gap.
 Then we find the maximum value from the counter and store the co-ordinate
of the corresponding line into an array .
 Now, we have the co-ordinates of image from which we need to separate the
image.
Algo III continues….
 Again, we have created an array of BufferedImage type to store the separated images.
 BufferedImage imgs[ ] = new BufferedImage[size];
 Then we defined the dimension for the portion of image to that array, which is need to
be separated.
 We used predefined method drawImage() for separating the image.
Input : Output :
Algo IV (Part 1 : Font Text)
 Now, we split each letter (font text) from the word.
 We start searching vertically, all the portion of text (in black) area and
count it separately for every vertical line and store it into an array.
 Then we look for that line which has white portion and the previous line
should have some text portion and we shift the value to adjust the gap
then store the co-ordinate of that line into an array.
 Now, we have the co-ordinates of image from which we need to separate
the image.
Algo IV (Part 1 : Font Text) continues….
 Again, we have created an array of BufferedImage type to store the separated images.
 BufferedImage imgs[ ] = new BufferedImage[size];
 Then we defined the dimension for the portion of image to that array, which is need to
be separated.
 We used predefined method drawImage() for separating the image.
Input : Output :
Algo IV (Part 2 : Hand written Text)
 Now, we split each letter (hand written text) from the word.
 We start searching vertically, all the portion of text (in black) area and
count it separately for every vertical line and store it into an array.
 Then we look for that line which has minimum portion of text and store
the co-ordinate of that line into an array.
 We find the line which is next to the stored co-ordinate of minimum
portion of text and if it is more than all the minimum portions stored in the
array then we shift the value to adjust the gap then store the co-ordinate
of that line into another array.
 Now, we have the co-ordinates of image from which we need to separate
the image.
Algo IV (Part 2 : Hand written Text)
continues….
 Again, we have created an array of BufferedImage type to store the separated images.
 BufferedImage imgs[ ] = new BufferedImage[size];
 Then we defined the dimension for the portion of image to that array, which is need to
be separated.
 We used predefined method drawImage() for separating the image.
Input : Output :
Algo V (Part 1) :
 We convert the obtained image of letter into 100x100 pixels.
 For this purpose we convert the size of image into 100x100 pixels.
 We used predefined method drawImage() for changing the pixels of the
image.
Input : Output :
Algo V (Part 2) :
 We have defined some strips condition for letters (particularly for A, B, C &
D).
 We match the image with predefined strips of co-ordinate.
 If the image matches every strips condition then it get validated for that
letter.
 And, we display the corresponding letter as an output.
Input : Output :
ABCD
Advantage :
 Image to text converter utility helps in format portability and compatibility
that serves the purpose of using conversion from one format to another. In
the present scenario, interchangeable formats are more in demand and
software developers around the world need utilities that can convert files
from one format to another easily and without too much hassle. This is
where the ‘Image To Text Converter’ utility comes into play and the
benefits of using the same are required. Further, many of the media
houses use the converted files to store and retrieve data whenever they
need. This helps in files restoring of image files at one's convenience
making life easier for everyone in the process.
Limitations :
 The first co-ordinate (0,0) of the image should not be the portion of text.
 The handwritten text extracting process is successful for few letters yet.
 The joining portion of the hand written text should not have more
thickness.
Conclusion :
 By this project we can come to the conclusion that we can convert image’s texts into
editable text.
References :
 http://alvinalexander.com/blog/post/java/getting-rgb-values-for-each-pixel-in-image-
using-java-bufferedi
 http://alvinalexander.com/java/java-image-how-to-crop-image-in-java
 http://kalanir.blogspot.in/2010/02/how-to-split-image-into-chunks-java.html
 http://www.codejava.net/java-se/graphics/how-to-resize-images-in-java
Thank You

More Related Content

What's hot

Machine learning in image processing
Machine learning in image processingMachine learning in image processing
Machine learning in image processing
Data Science Thailand
 
Intro to scan conversion
Intro to scan conversionIntro to scan conversion
Intro to scan conversionMohd Arif
 
Bit plane slicing
Bit plane slicingBit plane slicing
Bit plane slicing
Asad Ali
 
Attributes of Output Primitives
Attributes of Output PrimitivesAttributes of Output Primitives
Attributes of Output Primitives
Renita Santhmayora
 
Facial Expression Recognition System using Deep Convolutional Neural Networks.
Facial Expression Recognition  System using Deep Convolutional Neural Networks.Facial Expression Recognition  System using Deep Convolutional Neural Networks.
Facial Expression Recognition System using Deep Convolutional Neural Networks.
Sandeep Wakchaure
 
Polygon filling algorithm
Polygon filling algorithmPolygon filling algorithm
Polygon filling algorithm
Aparna Joshi
 
Animation in Computer Graphics
Animation in Computer GraphicsAnimation in Computer Graphics
Animation in Computer Graphics
RinkuNahar
 
Graphics software and standards
Graphics software and standardsGraphics software and standards
Graphics software and standards
Mani Kanth
 
Overview of the graphics system
Overview of the graphics systemOverview of the graphics system
Overview of the graphics system
Kamal Acharya
 
Object detection presentation
Object detection presentationObject detection presentation
Object detection presentation
AshwinBicholiya
 
Detecting Fake News Through NLP
Detecting Fake News Through NLPDetecting Fake News Through NLP
Detecting Fake News Through NLP
Sakha Global
 
Image Enhancement in Spatial Domain
Image Enhancement in Spatial DomainImage Enhancement in Spatial Domain
Image Enhancement in Spatial Domain
DEEPASHRI HK
 
Image recognition
Image recognitionImage recognition
Image recognition
Aseed Usmani
 
sutherland- Hodgeman Polygon clipping
sutherland- Hodgeman Polygon clippingsutherland- Hodgeman Polygon clipping
sutherland- Hodgeman Polygon clipping
Arvind Kumar
 
Image Representation & Descriptors
Image Representation & DescriptorsImage Representation & Descriptors
Image Representation & Descriptors
PundrikPatel
 
Image Enhancement in Spatial Domain
Image Enhancement in Spatial DomainImage Enhancement in Spatial Domain
Image Enhancement in Spatial Domain
A B Shinde
 
Point processing
Point processingPoint processing
Point processing
panupriyaa7
 
Data Redundacy
Data RedundacyData Redundacy
Data Redundacy
Poonam Seth
 

What's hot (20)

Machine learning in image processing
Machine learning in image processingMachine learning in image processing
Machine learning in image processing
 
Intro to scan conversion
Intro to scan conversionIntro to scan conversion
Intro to scan conversion
 
Bit plane slicing
Bit plane slicingBit plane slicing
Bit plane slicing
 
Attributes of Output Primitives
Attributes of Output PrimitivesAttributes of Output Primitives
Attributes of Output Primitives
 
Facial Expression Recognition System using Deep Convolutional Neural Networks.
Facial Expression Recognition  System using Deep Convolutional Neural Networks.Facial Expression Recognition  System using Deep Convolutional Neural Networks.
Facial Expression Recognition System using Deep Convolutional Neural Networks.
 
Polygon filling algorithm
Polygon filling algorithmPolygon filling algorithm
Polygon filling algorithm
 
Animation in Computer Graphics
Animation in Computer GraphicsAnimation in Computer Graphics
Animation in Computer Graphics
 
Curve clipping
Curve clippingCurve clipping
Curve clipping
 
Graphics software and standards
Graphics software and standardsGraphics software and standards
Graphics software and standards
 
Overview of the graphics system
Overview of the graphics systemOverview of the graphics system
Overview of the graphics system
 
Object detection presentation
Object detection presentationObject detection presentation
Object detection presentation
 
Detecting Fake News Through NLP
Detecting Fake News Through NLPDetecting Fake News Through NLP
Detecting Fake News Through NLP
 
Image Enhancement in Spatial Domain
Image Enhancement in Spatial DomainImage Enhancement in Spatial Domain
Image Enhancement in Spatial Domain
 
Image recognition
Image recognitionImage recognition
Image recognition
 
Image processing ppt
Image processing pptImage processing ppt
Image processing ppt
 
sutherland- Hodgeman Polygon clipping
sutherland- Hodgeman Polygon clippingsutherland- Hodgeman Polygon clipping
sutherland- Hodgeman Polygon clipping
 
Image Representation & Descriptors
Image Representation & DescriptorsImage Representation & Descriptors
Image Representation & Descriptors
 
Image Enhancement in Spatial Domain
Image Enhancement in Spatial DomainImage Enhancement in Spatial Domain
Image Enhancement in Spatial Domain
 
Point processing
Point processingPoint processing
Point processing
 
Data Redundacy
Data RedundacyData Redundacy
Data Redundacy
 

Viewers also liked

Image Converter
Image ConverterImage Converter
Image Converter
Navin Kumar
 
Word Detection & Translation from image on an android device
Word Detection & Translation from image on an android deviceWord Detection & Translation from image on an android device
Word Detection & Translation from image on an android deviceRitwik Kumar
 
Image recognition technology (Medical Presentation)
Image recognition technology (Medical Presentation)Image recognition technology (Medical Presentation)
Image recognition technology (Medical Presentation)
saravanan guru
 
Imago OCR: Open-source toolkit for chemical structure image recognition
Imago OCR: Open-source toolkit for chemical structure image recognitionImago OCR: Open-source toolkit for chemical structure image recognition
Imago OCR: Open-source toolkit for chemical structure image recognition
Mikhail Rybalkin
 
Mobile camera based text detection and translation
Mobile camera based text detection and translationMobile camera based text detection and translation
Mobile camera based text detection and translationVivek Bharadwaj
 
Motion capture
Motion captureMotion capture
Motion capturenooCnoo
 
DM8168 Dual SuperHD image capture using DaVinci
DM8168 Dual SuperHD image capture using DaVinciDM8168 Dual SuperHD image capture using DaVinci
DM8168 Dual SuperHD image capture using DaVinci
Sundance Multiprocessor Technology Ltd.
 
Motion
MotionMotion
Motion
희영 곽
 
Golang 으로 vision api 적용하기
Golang 으로 vision api 적용하기Golang 으로 vision api 적용하기
Golang 으로 vision api 적용하기
동철 박
 
Text & Image: Residents of a dysFUNctional HOME
Text & Image: Residents of a dysFUNctional HOMEText & Image: Residents of a dysFUNctional HOME
Text & Image: Residents of a dysFUNctional HOME
S. Song
 
Text and Image based Digital Humanities: providing access to textual heritage...
Text and Image based Digital Humanities: providing access to textual heritage...Text and Image based Digital Humanities: providing access to textual heritage...
Text and Image based Digital Humanities: providing access to textual heritage...
Edward Vanhoutte
 
Process for Big Data Analysis
Process for Big Data AnalysisProcess for Big Data Analysis
Process for Big Data Analysis
Myunggoon Choi
 
구글Fin
구글Fin구글Fin
구글Fin
zerk87
 
Introduction to OpenCV (with Java)
Introduction to OpenCV (with Java)Introduction to OpenCV (with Java)
Introduction to OpenCV (with Java)
Luigi De Russis
 
[코세나, kosena] 빅데이터 기반의 End-to-End APM과 비정형 데이터 분석 자료입니다.
[코세나, kosena] 빅데이터 기반의 End-to-End APM과 비정형 데이터 분석 자료입니다.[코세나, kosena] 빅데이터 기반의 End-to-End APM과 비정형 데이터 분석 자료입니다.
[코세나, kosena] 빅데이터 기반의 End-to-End APM과 비정형 데이터 분석 자료입니다.
kosena
 
[DL輪読会]Image-to-Image Translation with Conditional Adversarial Networks
[DL輪読会]Image-to-Image Translation with Conditional Adversarial Networks[DL輪読会]Image-to-Image Translation with Conditional Adversarial Networks
[DL輪読会]Image-to-Image Translation with Conditional Adversarial Networks
Deep Learning JP
 
How to make Successful Open APIs for Startups (2012)
How to make Successful Open APIs for Startups (2012)How to make Successful Open APIs for Startups (2012)
How to make Successful Open APIs for Startups (2012)Channy Yun
 

Viewers also liked (20)

Image Converter
Image ConverterImage Converter
Image Converter
 
Word Detection & Translation from image on an android device
Word Detection & Translation from image on an android deviceWord Detection & Translation from image on an android device
Word Detection & Translation from image on an android device
 
Ocr revision
Ocr revisionOcr revision
Ocr revision
 
Image recognition technology (Medical Presentation)
Image recognition technology (Medical Presentation)Image recognition technology (Medical Presentation)
Image recognition technology (Medical Presentation)
 
Integrating Text and Image
Integrating Text and ImageIntegrating Text and Image
Integrating Text and Image
 
Imago OCR: Open-source toolkit for chemical structure image recognition
Imago OCR: Open-source toolkit for chemical structure image recognitionImago OCR: Open-source toolkit for chemical structure image recognition
Imago OCR: Open-source toolkit for chemical structure image recognition
 
Mobile camera based text detection and translation
Mobile camera based text detection and translationMobile camera based text detection and translation
Mobile camera based text detection and translation
 
Motion capture
Motion captureMotion capture
Motion capture
 
DM8168 Dual SuperHD image capture using DaVinci
DM8168 Dual SuperHD image capture using DaVinciDM8168 Dual SuperHD image capture using DaVinci
DM8168 Dual SuperHD image capture using DaVinci
 
Motion
MotionMotion
Motion
 
Golang 으로 vision api 적용하기
Golang 으로 vision api 적용하기Golang 으로 vision api 적용하기
Golang 으로 vision api 적용하기
 
MMT image & graphics
MMT image & graphicsMMT image & graphics
MMT image & graphics
 
Text & Image: Residents of a dysFUNctional HOME
Text & Image: Residents of a dysFUNctional HOMEText & Image: Residents of a dysFUNctional HOME
Text & Image: Residents of a dysFUNctional HOME
 
Text and Image based Digital Humanities: providing access to textual heritage...
Text and Image based Digital Humanities: providing access to textual heritage...Text and Image based Digital Humanities: providing access to textual heritage...
Text and Image based Digital Humanities: providing access to textual heritage...
 
Process for Big Data Analysis
Process for Big Data AnalysisProcess for Big Data Analysis
Process for Big Data Analysis
 
구글Fin
구글Fin구글Fin
구글Fin
 
Introduction to OpenCV (with Java)
Introduction to OpenCV (with Java)Introduction to OpenCV (with Java)
Introduction to OpenCV (with Java)
 
[코세나, kosena] 빅데이터 기반의 End-to-End APM과 비정형 데이터 분석 자료입니다.
[코세나, kosena] 빅데이터 기반의 End-to-End APM과 비정형 데이터 분석 자료입니다.[코세나, kosena] 빅데이터 기반의 End-to-End APM과 비정형 데이터 분석 자료입니다.
[코세나, kosena] 빅데이터 기반의 End-to-End APM과 비정형 데이터 분석 자료입니다.
 
[DL輪読会]Image-to-Image Translation with Conditional Adversarial Networks
[DL輪読会]Image-to-Image Translation with Conditional Adversarial Networks[DL輪読会]Image-to-Image Translation with Conditional Adversarial Networks
[DL輪読会]Image-to-Image Translation with Conditional Adversarial Networks
 
How to make Successful Open APIs for Startups (2012)
How to make Successful Open APIs for Startups (2012)How to make Successful Open APIs for Startups (2012)
How to make Successful Open APIs for Startups (2012)
 

Similar to Image to text Converter

Image to Text Converter
Image to Text ConverterImage to Text Converter
Image to Text Converter
MANVENDRA PRIYADARSHI
 
19BCS1815_PresentationAutomatic Number Plate Recognition(ANPR)P.pptx
19BCS1815_PresentationAutomatic Number Plate Recognition(ANPR)P.pptx19BCS1815_PresentationAutomatic Number Plate Recognition(ANPR)P.pptx
19BCS1815_PresentationAutomatic Number Plate Recognition(ANPR)P.pptx
SamridhGarg
 
Image processing with matlab
Image processing with matlabImage processing with matlab
Image processing with matlab
Aman Gupta
 
We are restricted from importing cv2 numpy stats and other.pdf
We are restricted from importing cv2 numpy stats and other.pdfWe are restricted from importing cv2 numpy stats and other.pdf
We are restricted from importing cv2 numpy stats and other.pdf
DARSHANACHARYA13
 
Presen_Segmentation
Presen_SegmentationPresen_Segmentation
Presen_SegmentationVikas Goyal
 
Optical Character Recognition
Optical Character RecognitionOptical Character Recognition
Optical Character Recognition
Nitin Vishwari
 
IRJET- 3D Vision System using Calibrated Stereo Camera
IRJET- 3D Vision System using Calibrated Stereo CameraIRJET- 3D Vision System using Calibrated Stereo Camera
IRJET- 3D Vision System using Calibrated Stereo Camera
IRJET Journal
 
Using the Ceasar Cipher encryption algorithm, you take each characte.pdf
Using the Ceasar Cipher encryption algorithm, you take each characte.pdfUsing the Ceasar Cipher encryption algorithm, you take each characte.pdf
Using the Ceasar Cipher encryption algorithm, you take each characte.pdf
amirthagiftsmadurai
 
search engine for images
search engine for imagessearch engine for images
search engine for images
Anjani
 
Image processing
Image processingImage processing
Image processingmaheshpene
 
OCR for Gujarati Numeral using Neural Network
OCR for Gujarati Numeral using Neural NetworkOCR for Gujarati Numeral using Neural Network
OCR for Gujarati Numeral using Neural Network
ijsrd.com
 
Writeup advanced lane_lines_project
Writeup advanced lane_lines_projectWriteup advanced lane_lines_project
Writeup advanced lane_lines_project
Manish Jauhari
 
Image processing using matlab
Image processing using matlabImage processing using matlab
Image processing using matlab
dedik dafiyanto
 
A Mat Lab built software application for similar image retrieval
A Mat Lab built software application for similar image retrievalA Mat Lab built software application for similar image retrieval
A Mat Lab built software application for similar image retrieval
IOSR Journals
 
Matlab dip
Matlab dipMatlab dip
Matlab dip
Jeevan Reddy
 
Programming in matlab lesson5
Programming in matlab lesson5Programming in matlab lesson5
Programming in matlab lesson5
najmah17
 
Please make the complete program, distinguishing between each class .pdf
Please make the complete program, distinguishing between each class .pdfPlease make the complete program, distinguishing between each class .pdf
Please make the complete program, distinguishing between each class .pdf
faxteldelhi
 
//STEIM Workshop: A Vernacular of File Formats
//STEIM Workshop: A Vernacular of File Formats//STEIM Workshop: A Vernacular of File Formats
//STEIM Workshop: A Vernacular of File Formats
Rosa ɯǝukɯɐn
 
LSDI 2.pptx
LSDI 2.pptxLSDI 2.pptx
LSDI 2.pptx
HisokaFreecs
 

Similar to Image to text Converter (20)

Image to Text Converter
Image to Text ConverterImage to Text Converter
Image to Text Converter
 
19BCS1815_PresentationAutomatic Number Plate Recognition(ANPR)P.pptx
19BCS1815_PresentationAutomatic Number Plate Recognition(ANPR)P.pptx19BCS1815_PresentationAutomatic Number Plate Recognition(ANPR)P.pptx
19BCS1815_PresentationAutomatic Number Plate Recognition(ANPR)P.pptx
 
Image processing with matlab
Image processing with matlabImage processing with matlab
Image processing with matlab
 
We are restricted from importing cv2 numpy stats and other.pdf
We are restricted from importing cv2 numpy stats and other.pdfWe are restricted from importing cv2 numpy stats and other.pdf
We are restricted from importing cv2 numpy stats and other.pdf
 
Presen_Segmentation
Presen_SegmentationPresen_Segmentation
Presen_Segmentation
 
Optical Character Recognition
Optical Character RecognitionOptical Character Recognition
Optical Character Recognition
 
IRJET- 3D Vision System using Calibrated Stereo Camera
IRJET- 3D Vision System using Calibrated Stereo CameraIRJET- 3D Vision System using Calibrated Stereo Camera
IRJET- 3D Vision System using Calibrated Stereo Camera
 
Using the Ceasar Cipher encryption algorithm, you take each characte.pdf
Using the Ceasar Cipher encryption algorithm, you take each characte.pdfUsing the Ceasar Cipher encryption algorithm, you take each characte.pdf
Using the Ceasar Cipher encryption algorithm, you take each characte.pdf
 
search engine for images
search engine for imagessearch engine for images
search engine for images
 
Image processing
Image processingImage processing
Image processing
 
OCR for Gujarati Numeral using Neural Network
OCR for Gujarati Numeral using Neural NetworkOCR for Gujarati Numeral using Neural Network
OCR for Gujarati Numeral using Neural Network
 
Writeup advanced lane_lines_project
Writeup advanced lane_lines_projectWriteup advanced lane_lines_project
Writeup advanced lane_lines_project
 
Image processing using matlab
Image processing using matlabImage processing using matlab
Image processing using matlab
 
A Mat Lab built software application for similar image retrieval
A Mat Lab built software application for similar image retrievalA Mat Lab built software application for similar image retrieval
A Mat Lab built software application for similar image retrieval
 
Matlab dip
Matlab dipMatlab dip
Matlab dip
 
Programming in matlab lesson5
Programming in matlab lesson5Programming in matlab lesson5
Programming in matlab lesson5
 
MATLAB
MATLABMATLAB
MATLAB
 
Please make the complete program, distinguishing between each class .pdf
Please make the complete program, distinguishing between each class .pdfPlease make the complete program, distinguishing between each class .pdf
Please make the complete program, distinguishing between each class .pdf
 
//STEIM Workshop: A Vernacular of File Formats
//STEIM Workshop: A Vernacular of File Formats//STEIM Workshop: A Vernacular of File Formats
//STEIM Workshop: A Vernacular of File Formats
 
LSDI 2.pptx
LSDI 2.pptxLSDI 2.pptx
LSDI 2.pptx
 

Recently uploaded

Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
AJAYKUMARPUND1
 
ethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.pptethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.ppt
Jayaprasanna4
 
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdfWater Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation & Control
 
Architectural Portfolio Sean Lockwood
Architectural Portfolio Sean LockwoodArchitectural Portfolio Sean Lockwood
Architectural Portfolio Sean Lockwood
seandesed
 
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
H.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdfH.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdf
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
MLILAB
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
bakpo1
 
The Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfThe Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdf
Pipe Restoration Solutions
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
Kerry Sado
 
Runway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptxRunway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptx
SupreethSP4
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
R&R Consult
 
block diagram and signal flow graph representation
block diagram and signal flow graph representationblock diagram and signal flow graph representation
block diagram and signal flow graph representation
Divya Somashekar
 
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang,  ICLR 2024, MLILAB, KAIST AI.pdfJ.Yang,  ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
MLILAB
 
WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234
AafreenAbuthahir2
 
Gen AI Study Jams _ For the GDSC Leads in India.pdf
Gen AI Study Jams _ For the GDSC Leads in India.pdfGen AI Study Jams _ For the GDSC Leads in India.pdf
Gen AI Study Jams _ For the GDSC Leads in India.pdf
gdsczhcet
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
zwunae
 
space technology lecture notes on satellite
space technology lecture notes on satellitespace technology lecture notes on satellite
space technology lecture notes on satellite
ongomchris
 
Railway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdfRailway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdf
TeeVichai
 
Investor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptxInvestor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptx
AmarGB2
 
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
thanhdowork
 
road safety engineering r s e unit 3.pdf
road safety engineering  r s e unit 3.pdfroad safety engineering  r s e unit 3.pdf
road safety engineering r s e unit 3.pdf
VENKATESHvenky89705
 

Recently uploaded (20)

Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
 
ethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.pptethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.ppt
 
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdfWater Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdf
 
Architectural Portfolio Sean Lockwood
Architectural Portfolio Sean LockwoodArchitectural Portfolio Sean Lockwood
Architectural Portfolio Sean Lockwood
 
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
H.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdfH.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdf
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
 
The Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfThe Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdf
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
 
Runway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptxRunway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptx
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
 
block diagram and signal flow graph representation
block diagram and signal flow graph representationblock diagram and signal flow graph representation
block diagram and signal flow graph representation
 
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang,  ICLR 2024, MLILAB, KAIST AI.pdfJ.Yang,  ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
 
WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234
 
Gen AI Study Jams _ For the GDSC Leads in India.pdf
Gen AI Study Jams _ For the GDSC Leads in India.pdfGen AI Study Jams _ For the GDSC Leads in India.pdf
Gen AI Study Jams _ For the GDSC Leads in India.pdf
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
 
space technology lecture notes on satellite
space technology lecture notes on satellitespace technology lecture notes on satellite
space technology lecture notes on satellite
 
Railway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdfRailway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdf
 
Investor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptxInvestor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptx
 
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
 
road safety engineering r s e unit 3.pdf
road safety engineering  r s e unit 3.pdfroad safety engineering  r s e unit 3.pdf
road safety engineering r s e unit 3.pdf
 

Image to text Converter

  • 1. Image to Text Convertor BY, DHIRAJ RAJ MANVENDRA PRIYADARSHI
  • 2. Agenda :  Abstract  AIM  Technology Used  Procedure  Algo I  Algo II  Algo III  Algo IV (Part 1 & 2)  Algo V (Part 1 & 2)  Advantage  Limitations  Conclusion
  • 3. Abstract :  Image to text converter is a type of application that can be used to translate images of any format to the text format. This application helps one to convert the texts in image files into editable text files.  It has some pre-requisite conditions saying first that the text captured should be aligned horizontally straight. Then the text in the image to be converted contains only A, B, C, and D of pre-defined fonts or human written fonts. The image should be captured in a way that the pixel of any of the text should not be present at the coordinate (0, 0). And also the image captured should have texts with intensity of dark color and the background with intensity of light color.  This program basically uses five specific algorithms where the first algorithm deals with converting the text pixels and the background pixels other than text into opposite ranges of RGB so that the text pixels could be identified with ease.  In the second algorithm the image obtained previously is horizontally searched for all the portion of text (in black) area and the dimension of each sentence is found. An array of BufferedImage type is used to store the separated images containing each sentence. The dimension for the portion of image is defined to that array, which is separated by using predefined method drawImage().  The third algorithm deals with extracting each word from each of these sentences into specific images. The words are separated using drawImage() and stored in an array of BufferedImage type.
  • 4. Abstract Contd.  The forth algorithm has two parts. The first part deals with extracting each letters from each of the words which contain letters of predefined uppercase texts format and the second part deals with extracting each of the letters from each of the words which contain letters of joint lowercase texts format.  The last algorithm deals with extracting each letters from each the words and convert each letter into specific images. The obtained image of letter is then converted into a size with 100x100 pixels using predefined method drawImage() for changing the pixels of the image. The image is matched with predefined strips of co-ordinate. If the image matches every strips condition for letters (particularly for A, B, C & D) then it gets validated for that letter. And, we display the corresponding letter as an output.
  • 5. Aim :  To build an application to covert in to editable text from image(with standard text/human handwritting).
  • 6. Technology Used :  Language : Java  IDE : NetBeans
  • 7. Procedure :  Step 1 : Firstly, we have change the color of background to be white and the color of text to be black.  Step 2 : Now, we separate every sentence from the given segment.  Step 3 : Then, we split each sentence into words.  Step 4 : Each word will then split into letters.  Step 5 : Now, we convert the obtained letter into 100x100 pixels.  Step 6 : Then, we match the letter with predefined strips of co-ordinate and validate the letter to be specified one.  Step 7 : Finally, we display the corresponding letter as an output.
  • 8. Algo I :  To change the color of image, we have used predefined class ‘Color’ which is available in java.awt package.  Color c1 = new Color(255, 255, 255); // for White  Color c2 = new Color(0, 0, 0); // for Black Input : Output :
  • 9. Algo II :  Now, we separate each sentence from the given segment.  We start searching horizontally, all the portion of text (in black) area and count it separately for every horizontal line and store it into an array.  Then we look for that line which has white portion and the previous line should have some text portion and store the co-ordinate of that line into an array.  Then we also look for that line which has white portion and the next line should have some text portion and store the co-ordinate of that line into the same array.  Now, we have the co-ordinates of image from which we need to separate the image.
  • 10. Algo II continues….  We have created an array of BufferedImage type to store the separated images.  BufferedImage imgs[ ] = new BufferedImage[size];  Then we defined the dimension for the portion of image to that array, which is need to be separated.  We used predefined method drawImage() for separating the image. Output :Input :
  • 11. Algo III :  Now, we split each word from the sentence.  We start searching vertically, all the portion of text (in black) area and count it separately for every vertical line and store it into an array.  Then we look for that line which has white portion and the increment the counter by one until we find a line which has text portion onto it and store value of counter into an array and the co-ordinate of that line into another array and use ‘continue’ keyword to skip that iteration and execute next iteration. Also, assign zero to counter so that it calculate next gap.  Then we find the maximum value from the counter and store the co-ordinate of the corresponding line into an array .  Now, we have the co-ordinates of image from which we need to separate the image.
  • 12. Algo III continues….  Again, we have created an array of BufferedImage type to store the separated images.  BufferedImage imgs[ ] = new BufferedImage[size];  Then we defined the dimension for the portion of image to that array, which is need to be separated.  We used predefined method drawImage() for separating the image. Input : Output :
  • 13. Algo IV (Part 1 : Font Text)  Now, we split each letter (font text) from the word.  We start searching vertically, all the portion of text (in black) area and count it separately for every vertical line and store it into an array.  Then we look for that line which has white portion and the previous line should have some text portion and we shift the value to adjust the gap then store the co-ordinate of that line into an array.  Now, we have the co-ordinates of image from which we need to separate the image.
  • 14. Algo IV (Part 1 : Font Text) continues….  Again, we have created an array of BufferedImage type to store the separated images.  BufferedImage imgs[ ] = new BufferedImage[size];  Then we defined the dimension for the portion of image to that array, which is need to be separated.  We used predefined method drawImage() for separating the image. Input : Output :
  • 15. Algo IV (Part 2 : Hand written Text)  Now, we split each letter (hand written text) from the word.  We start searching vertically, all the portion of text (in black) area and count it separately for every vertical line and store it into an array.  Then we look for that line which has minimum portion of text and store the co-ordinate of that line into an array.  We find the line which is next to the stored co-ordinate of minimum portion of text and if it is more than all the minimum portions stored in the array then we shift the value to adjust the gap then store the co-ordinate of that line into another array.  Now, we have the co-ordinates of image from which we need to separate the image.
  • 16. Algo IV (Part 2 : Hand written Text) continues….  Again, we have created an array of BufferedImage type to store the separated images.  BufferedImage imgs[ ] = new BufferedImage[size];  Then we defined the dimension for the portion of image to that array, which is need to be separated.  We used predefined method drawImage() for separating the image. Input : Output :
  • 17. Algo V (Part 1) :  We convert the obtained image of letter into 100x100 pixels.  For this purpose we convert the size of image into 100x100 pixels.  We used predefined method drawImage() for changing the pixels of the image. Input : Output :
  • 18. Algo V (Part 2) :  We have defined some strips condition for letters (particularly for A, B, C & D).  We match the image with predefined strips of co-ordinate.  If the image matches every strips condition then it get validated for that letter.  And, we display the corresponding letter as an output. Input : Output : ABCD
  • 19. Advantage :  Image to text converter utility helps in format portability and compatibility that serves the purpose of using conversion from one format to another. In the present scenario, interchangeable formats are more in demand and software developers around the world need utilities that can convert files from one format to another easily and without too much hassle. This is where the ‘Image To Text Converter’ utility comes into play and the benefits of using the same are required. Further, many of the media houses use the converted files to store and retrieve data whenever they need. This helps in files restoring of image files at one's convenience making life easier for everyone in the process.
  • 20. Limitations :  The first co-ordinate (0,0) of the image should not be the portion of text.  The handwritten text extracting process is successful for few letters yet.  The joining portion of the hand written text should not have more thickness.
  • 21. Conclusion :  By this project we can come to the conclusion that we can convert image’s texts into editable text.
  • 22. References :  http://alvinalexander.com/blog/post/java/getting-rgb-values-for-each-pixel-in-image- using-java-bufferedi  http://alvinalexander.com/java/java-image-how-to-crop-image-in-java  http://kalanir.blogspot.in/2010/02/how-to-split-image-into-chunks-java.html  http://www.codejava.net/java-se/graphics/how-to-resize-images-in-java