SlideShare a Scribd company logo
1 of 6
Download to read offline
IOSR Journal of Computer Engineering (IOSR-JCE)
e-ISSN: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 3, Ver. II (May – Jun. 2015), PP 28-33
www.iosrjournals.org
DOI: 10.9790/0661-17322833 www.iosrjournals.org 28 | Page
A Review of Optical Character Recognition System for
Recognition of Printed Text
Rajas Kiran Jambekar1
1
(Department of Computer Science, NMIMS University, India)
Abstract: As we move ahead in technology advancements, from simple data processing, to intelligent
computing, one area of research undergoing advancement, is the system of reading text characters on an image.
Optical Character Recognition (OCR) system is used for converting text characters on images into computer
editable text characters. It includes steps such as Image acquisition, pre-processing of the image, segmentation
of lines and characters, recognition of characters, and finally application. The image acquisition step
determines the method for obtaining the image. Pre- processing of image includes enhancement of the image to
make it suitable for recognition. Segmentation is the extraction of the character part of the image. Recognition
is the comparison of the sample image with the template image. This paper focuses on the OCR system and
includes information regarding the various operations that may be performed on the image for the recognition
of characters.
Keywords: Image acquisition, OCR, Pre-Processing, Recognition, Segmentation, Text Characters.
I. Introduction
Optical Character Recognition System is a relatively new field in the world of technology. Although
research has been present on the topic since a few years, the topic has growing interest due to advancements in
the processing capabilities of computers. OCR system has plenty applications in the field such as car number
plate identification, processing cheques in banks, searching text in scanned documents, handwriting recognition,
etc. [1].
Character Recognition involves conversion of text characters, on an image, into a form of text
characters that can be edited by computers. The type of characters that are recognized by the system may
include the basic alphanumeric characters including both uppercase and lowercase characters. The system may
also be extended to recognize special characters and symbols.
The image used for the character recognition may contain printed text or handwritten text. This paper
aims at providing a system which may be used for the recognition of printed text. The recognition of
handwritten text is a difficult process due to the variations in the writing style in different individuals. Thus, the
accuracy of recognition of handwritten text is relatively less than the recognition of printed text [1].
II. Methodology
The entire process of Character Recognition may be divided into five main stages:
Fig. 1: OCR System Flowchart
Image Acquisition
Earlier, flat-bed scanners were used for obtaining clear images of standard quality and suitable for
recognition. Using a scanner has benefits such as low noise levels, virtually no blurring and low text skewing.
A Review of Optical Character Recognition System for Recognition of Printed Text
DOI: 10.9790/0661-17322833 www.iosrjournals.org 29 | Page
However, the working of the flat-bed scanners is slow and it requires physical connection with computers which
limits the process.
However, due to the expansive growth of technology, we now have high grade digital cameras on
smaller devices such as mobile phones. The image quality obtained using mobile phones is good enough for
processing of the image. The only issues while capturing images using a digital camera are shadows, blurring
and text-skewing. However, up to a certain limit, these issues may be managed using image enhancement
techniques discussed in the next section.
Pre-Processing
Pre-processing of the image includes modifications to the image to make it suitable for recognition. A
typical OCR system may use the following techniques for image enhancement:
a. RGB to Gray-Scale Conversion
The image acquired for recognition is usually a colour image, i.e. the pixel value consists of the combination of
the three colour components, red, green and blue. We first convert this colour image into a standard gray-scale
image. Colour images are represented using three matrices while gray-scale images are represented using a
single matrix.
Detection of text on a colour image is more difficult than on a gray-scale image. Thus, the first step in pre-
processing involves conversion of colour image to gray-scale image.
b. Skew Correction
Camera captured images may suffer from skew and perspective distortion [2]. As discussed above, this
effect is due to improper image capturing technique. The horizontal text axis may suffer rotation at some
degrees. Such an effect may be reduced by rotation of the image at a certain degree. The calculation method for
rotation of the image has been described in the article [3].
Fig 2: Skewed image [3]
c. Binarization
Binarization is the process where a threshold is selected for conversion of pixel values into 0s and 1s.
The black pixels are represented by a 0 and the white pixels are represented by 1. The threshold value may be
selected using various methods. A simple way to determine the threshold value would be to find the median
value of the maximum and minimum intensity values in the image.
It would be represented as:
Threshold value = (Imax + Imin)/2
Fig 3: Binarized image
d. Noise Reduction
Noise is unwanted pixels present in an image. They may be in the form of Salt and Pepper noise or
Gaussian noise. We use the low pass filter for filtering the Gaussian noise from the image. Since the presence of
salt and pepper noise is not as high as the presence of the Gaussian noise, we do not filter it.
A Review of Optical Character Recognition System for Recognition of Printed Text
DOI: 10.9790/0661-17322833 www.iosrjournals.org 30 | Page
Fig 4: Noisy Image
e. Thinning
Thinning is the process of reducing the width of the foreground pixels. While thinning, it is necessary
to maintain the form of the characters on the image. Thinning is done on the basis of neighborhood of a pixel.
e.g. If a line on an image is of 3 pixel width, the thinning function will change the border pixels of the line and
the output image will consist of line of one pixel width [4].
Segmentation
Once the image is enhanced, it is passed to the segmentation module where each character is separated
from the other. The Image at this point may be divided into two types of regions, background region and
foreground region [4].
This module is thus responsible for the separation of the foreground region from the background
region. The foreground region is a collection of text characters placed on the same line as well as on different
lines. The segmentation process works in two steps:
1. Line Segmentation
2. Character Segmentation
Line segmentation is the separation of the different lines of characters present in the image. Each line is
defined by a minimum vertical gap between the characters present on a line and on the line above and below it.
This gap can be used for the detection and separation of different lines of characters.
Character Segmentation is the separation of characters present in the same line. Once the lines are
separated, each character is extracted from the line. There is a constant horizontal gap between characters which
is used for the separation of characters. Thus, images corresponding to individual characters are extracted which
are fed to the recognition module.
Recognition
This is the final step of the process of recognition of characters. There are various methods available
for comparison of images, such as image correlation, feature extraction and comparison, chain code comparison,
artificial neural networks, etc. Image correlation method is the direct correlation function used on the sample
image and the template image. Feature extraction is a process of analyzing the sample image and deriving
specific features from the image. The comparison of these features with the features of template images is used
for recognizing the character. Chain codes are code sequences generated for each image based on the
neighborhood of pixels. The longest common sub-sequence is found between the code of sample image and the
template images which decides the output character.
Each of these techniques requires the use of stored template images used for comparison. The output
after the comparison is an editable text character which is added to a buffer. When all the characters of a word
are filled in the buffer, the string is passed to the application for further processing. However, this can also be
done after the recognition of each line of text or after the recognition of all the lines of text in the image.
Application
The output obtained from the recognition module is in the form of editable text characters i.e. ASCII
characters. These characters are initially stored in a buffer from which they may be stored in a file or further
processed to derive certain output such as text to speech conversion, business card to contact conversion, etc.
III. Proposed System
The described system has modules which can be implemented using a variety of algorithms. This paper
presents the following algorithms for implementing some of the modules.
RGB to Gray-Scale Conversion
Conversion of image in to gray-scale can be done using the following formula. The new pixel value is
computed for each pixel.
A Review of Optical Character Recognition System for Recognition of Printed Text
DOI: 10.9790/0661-17322833 www.iosrjournals.org 31 | Page
f(x,y) = 0.299 x r(x,y) + 0.587 x g(x,y) + 0.114 x b(x,y)
Fig 5: Colour Image
Fig. 6: Gray-Scale Image
Skew-Correction
This paper describes a slightly modified method for skew correction. The text regions in the referred
article [3] de-skews each text region separately by repeated calculation of skew angle. The process requires
repeated calculation of skew angle and image rotation for each extracted text region. This makes the algorithm
more computer intensive. A slight modification to the skew correction system is thus proposed.
The text present in the image has more tendency to suffer from the same degree of skew throughout the
image. The system scans the text region in the image from the top and the bottom and determines the skew angle
twice. Then, taking average of both the skew angles, the image is rotated by the calculated degree.
Since the process of rotation of the image is performed only once for the entire image, it proves to be
more efficient.
Noise Reduction
Given below in the figure is the low pass filter mask. It is basically used for removal of Gaussian noise
from the image. The image suffers from slight blurring effect, thus, the selection of the size of the mask is
crucial.
Fig 7: Noise Reduction Mask
Applying the mask on a noisy image provides a noise reduced result with a slight blurring effect.
A Review of Optical Character Recognition System for Recognition of Printed Text
DOI: 10.9790/0661-17322833 www.iosrjournals.org 32 | Page
Fig 8: Noise Reduced Image
Binarization
With varying images, using averaging method to determine threshold value may not produce the best
results. Having referred to different methods for thresholding, we have selected the Ostu‟s Method for
determining the threshold value. Ostu‟s method for thresholding is based on formation of two distinct classes of
pixels, one which contains background pixel intensities and the other containing foreground pixel intensities [5].
Segmentation
The segmentation process may be performed using methods such as blob colouring, peak-to-valley
method, etc. [6]. To separate individual characters present in the foreground region, the process of segmentation
is divided into two steps:
a. Marking
Marking is the process where the boundaries around a character are marked and stored, before it is
segmented. For each line of text in the image, the beginning and the ending rows and columns of the pixels
corresponding to each character in the text line are marked.
We have analyzed various methods which involve character segmentation without dedicated marking
of boundaries around the image. The method of marking the boundaries has a positive effect of improving the
performance of the system by performing similar task continuously for each text line. The process of marking is
described below.
Line marking is responsible for marking the upper and lower boundaries of each line of text present in
the image. This can be done by by first marking the top boundary of the text on the image by checking pixel
values along the rows starting from the top of the image. If a foreground pixel is found, the pixel position is
marked and added to the topForeground list. Similarly, for marking the bottom boundary of the line, the pixel
values are checked along each row. If no foreground pixel is found along a row, the row above the current row
is marked and it's position is added to the bottomForeground list.
Using this method, two lists are obtained. One indicating the top boundary of the lines and the other
indicating the bottom boundary of the lines. If an image contains „n‟ text lines, the length of both lists is „n‟.
The process of character marking is similar to that of line marking. The upper and lower boundaries for
each line of text in the image have been marked. Now, the start and the end boundary column for each character
along a line are marked. The process involves vertical scanning of each text line from left to right and checking
for foreground and background pixels.
Similar to line marking, we scan the image from top to bottom along all columns starting from the top
of the line up to the bottom of the line. If a foreground pixel is found in a column, the pixel position is marked
and added to the leftForeground list. Continuing the scanning along the columns, if a column is found without
any foreground pixel, it indicates the end of the character and thus we mark the pixel position and add it to the
rightForeground list.
Thus, we obtain the boundaries for each line and character along the lines.
b. Extraction
The system now extracts each character at a time by using the four boundaries and store it as a separate
image „sample‟. The sample has boundaries that exactly match the first and last pixel along each direction.
Thus, there is need to perform the additional step of boundary selection [6].
However, before the sample can be passed to the recognition module, the size of the sample must be
adjusted with respect to the size of the template images. This is necessary since the recognition module can only
compare images of the same size. The image is thus resized to the size of the template images. We maintain
templates of a small size to ensure that the resizing of sample always decrease the dimensions thus, preventing
blurring of image. However, the size must also not be so small that fine details of the sample are lost. Thus, a
size of no less than 20px is selected for template.
A Review of Optical Character Recognition System for Recognition of Printed Text
DOI: 10.9790/0661-17322833 www.iosrjournals.org 33 | Page
The above algorithm is used for only a single line of text at a time. Once the characters in a line have
been marked, the character marking system pauses. The list of leftForeground and rightForeground values along
with the top and bottom boundary of current line is passed to the extraction module.
The system only restarts when all the characters in the current line have been processed by the
following modules. When the system restarts, the next value of top and bottom boundary are selected and the
corresponding characters on the line are marked. This process is repeated for each line in the text region.
Recognition
We have selected image correlation for recognition of sample image with the template image. The
sample image is compared to each of the template images using the correlation function. The function returns a
scalar value with a value ranging between 0 and 1. Higher the value indicates better correlation between the
sample and the template images. Thus, we determine the character on the basis of the best value of correlation.
The recognized character is added to an output buffer string.
IV. Conclusion And Future Scope
We have reviewed the process of recognition of characters present on an image. This paper presents a
system which may be implemented as is or can be further enhanced. The current system is good enough for
recognition of simple characters and numbers along with the detection of white spaces. However, it may be
further extended to support text characters with a variety of fonts and special symbols. The use of neural
networks and adaptive learning for recognition may enhance the recognition capabilities of the system
extensively. The possible applications for this OCR system may be in the form of license plate recognition,
business card to phone contact conversion, document image to editable document conversion.
References
[1]. Disha Bhattacharjee, Deepti Tripathi, Rubi Debnath, Vivek Hanumante, Sahadev Roy. “A Novel Approach for Character
Recognition”, International Journal of Engineering Trends and Technology (IJETT) – Volume 10 Number 6 - Apr 2014
[2]. Ayatullah Faruk Mollah, Nabamita Majumder, Subhadip Basu and Mita Nasipuri. “Design of an Optical Character Recognition
System for Camera Based Handheld Devices”, IJCSI International Journal of Computer Science Issues, Vol. 8, Issue 4, No 1, July
2011
[3]. A. F. Mollah, S. Basu, N. Das, R. Sarkar, M. Nasipuri, M. Kundu, “Text/Graphics Separation and Skew Correction of Text Regions
of Business Card Images for Mobile Devices”, Journal of Computing, Vol. 2, Issue 2, February 2010, ISSN 2151-9617
[4]. Ravi Kumar, Anurag Anand, Nikunj Sharma, “Recognition of English Characters by Codes Generated Using Neighbour
Identification”. International Journal of Application or Innovation in Engineering and Management (IJAIEM) – Vol. 2, Issue 4,
April 2013, ISSN 2319-4847
[5]. N. Otsu, "A threshold selection method from gray level histogram”, IEEE Transactions in Systems, Man, and Cybernetics, Vol. 9,
pp. 62 – 66
[6]. Ghugardare, Rakhi P., Sandip P. Narote, P. Mukherji, and Prathamesh M. Kulkarni. "Optical character recognition system for seven
segment display images of measuring instruments." In TENCON 2009-2009 IEEE Region 10 Conference, pp. 1-6. IEEE, 2009.

More Related Content

What's hot

Optical Character Recognition( OCR )
Optical Character Recognition( OCR )Optical Character Recognition( OCR )
Optical Character Recognition( OCR )Karan Panjwani
 
OCR Presentation (Optical Character Recognition)
OCR Presentation (Optical Character Recognition)OCR Presentation (Optical Character Recognition)
OCR Presentation (Optical Character Recognition)Neeraj Neupane
 
OCR vs. Urjanet
OCR vs. UrjanetOCR vs. Urjanet
OCR vs. UrjanetUrjanet
 
Optical Character Reader - Project Report BTech
Optical Character Reader - Project Report BTechOptical Character Reader - Project Report BTech
Optical Character Reader - Project Report BTechKushagraChadha1
 
Optical character recognition IEEE Paper Study
Optical character recognition IEEE Paper StudyOptical character recognition IEEE Paper Study
Optical character recognition IEEE Paper StudyEr. Ashish Pandey
 
Handwritten digit recognition using image processing
Handwritten digit recognition using image processing Handwritten digit recognition using image processing
Handwritten digit recognition using image processing anita maharjan
 
Optical Character Recognition System for Urdu (Naskh Font)Using Pattern Match...
Optical Character Recognition System for Urdu (Naskh Font)Using Pattern Match...Optical Character Recognition System for Urdu (Naskh Font)Using Pattern Match...
Optical Character Recognition System for Urdu (Naskh Font)Using Pattern Match...CSCJournals
 
Optical Character Recognition (OCR) System
Optical Character Recognition (OCR) SystemOptical Character Recognition (OCR) System
Optical Character Recognition (OCR) Systemiosrjce
 
Project report of OCR Recognition
Project report of OCR RecognitionProject report of OCR Recognition
Project report of OCR RecognitionBharat Kalia
 
Design and implementation of optical character recognition using template mat...
Design and implementation of optical character recognition using template mat...Design and implementation of optical character recognition using template mat...
Design and implementation of optical character recognition using template mat...eSAT Journals
 
IRJET- Intelligent Character Recognition of Handwritten Characters using ...
IRJET-  	  Intelligent Character Recognition of Handwritten Characters using ...IRJET-  	  Intelligent Character Recognition of Handwritten Characters using ...
IRJET- Intelligent Character Recognition of Handwritten Characters using ...IRJET Journal
 
Offline Omni Font Arabic Optical Text Recognition System using Prolog Classif...
Offline Omni Font Arabic Optical Text Recognition System using Prolog Classif...Offline Omni Font Arabic Optical Text Recognition System using Prolog Classif...
Offline Omni Font Arabic Optical Text Recognition System using Prolog Classif...GBM
 
Handwritten character recognition using artificial neural network
Handwritten character recognition using artificial neural networkHandwritten character recognition using artificial neural network
Handwritten character recognition using artificial neural networkHarshana Madusanka Jayamaha
 
IRJET- Image to Text Conversion using Tesseract
IRJET-  	  Image to Text Conversion using TesseractIRJET-  	  Image to Text Conversion using Tesseract
IRJET- Image to Text Conversion using TesseractIRJET Journal
 
Off-line English Character Recognition: A Comparative Survey
Off-line English Character Recognition: A Comparative SurveyOff-line English Character Recognition: A Comparative Survey
Off-line English Character Recognition: A Comparative Surveyidescitation
 
DevanagiriOCR on CELL BROADBAND ENGINE
DevanagiriOCR on CELL BROADBAND ENGINEDevanagiriOCR on CELL BROADBAND ENGINE
DevanagiriOCR on CELL BROADBAND ENGINEPridhvi Kodamasimham
 

What's hot (20)

Optical Character Recognition( OCR )
Optical Character Recognition( OCR )Optical Character Recognition( OCR )
Optical Character Recognition( OCR )
 
OCR2
OCR2OCR2
OCR2
 
OCR Presentation (Optical Character Recognition)
OCR Presentation (Optical Character Recognition)OCR Presentation (Optical Character Recognition)
OCR Presentation (Optical Character Recognition)
 
OCR vs. Urjanet
OCR vs. UrjanetOCR vs. Urjanet
OCR vs. Urjanet
 
Optical Character Reader - Project Report BTech
Optical Character Reader - Project Report BTechOptical Character Reader - Project Report BTech
Optical Character Reader - Project Report BTech
 
PDF OCR
PDF OCRPDF OCR
PDF OCR
 
Optical character recognition IEEE Paper Study
Optical character recognition IEEE Paper StudyOptical character recognition IEEE Paper Study
Optical character recognition IEEE Paper Study
 
ocr
ocrocr
ocr
 
Handwritten digit recognition using image processing
Handwritten digit recognition using image processing Handwritten digit recognition using image processing
Handwritten digit recognition using image processing
 
Optical Character Recognition System for Urdu (Naskh Font)Using Pattern Match...
Optical Character Recognition System for Urdu (Naskh Font)Using Pattern Match...Optical Character Recognition System for Urdu (Naskh Font)Using Pattern Match...
Optical Character Recognition System for Urdu (Naskh Font)Using Pattern Match...
 
Optical Character Recognition (OCR) System
Optical Character Recognition (OCR) SystemOptical Character Recognition (OCR) System
Optical Character Recognition (OCR) System
 
Project report of OCR Recognition
Project report of OCR RecognitionProject report of OCR Recognition
Project report of OCR Recognition
 
Design and implementation of optical character recognition using template mat...
Design and implementation of optical character recognition using template mat...Design and implementation of optical character recognition using template mat...
Design and implementation of optical character recognition using template mat...
 
Handwritten Character Recognition
Handwritten Character RecognitionHandwritten Character Recognition
Handwritten Character Recognition
 
IRJET- Intelligent Character Recognition of Handwritten Characters using ...
IRJET-  	  Intelligent Character Recognition of Handwritten Characters using ...IRJET-  	  Intelligent Character Recognition of Handwritten Characters using ...
IRJET- Intelligent Character Recognition of Handwritten Characters using ...
 
Offline Omni Font Arabic Optical Text Recognition System using Prolog Classif...
Offline Omni Font Arabic Optical Text Recognition System using Prolog Classif...Offline Omni Font Arabic Optical Text Recognition System using Prolog Classif...
Offline Omni Font Arabic Optical Text Recognition System using Prolog Classif...
 
Handwritten character recognition using artificial neural network
Handwritten character recognition using artificial neural networkHandwritten character recognition using artificial neural network
Handwritten character recognition using artificial neural network
 
IRJET- Image to Text Conversion using Tesseract
IRJET-  	  Image to Text Conversion using TesseractIRJET-  	  Image to Text Conversion using Tesseract
IRJET- Image to Text Conversion using Tesseract
 
Off-line English Character Recognition: A Comparative Survey
Off-line English Character Recognition: A Comparative SurveyOff-line English Character Recognition: A Comparative Survey
Off-line English Character Recognition: A Comparative Survey
 
DevanagiriOCR on CELL BROADBAND ENGINE
DevanagiriOCR on CELL BROADBAND ENGINEDevanagiriOCR on CELL BROADBAND ENGINE
DevanagiriOCR on CELL BROADBAND ENGINE
 

Similar to A Review of Optical Character Recognition System for Recognition of Printed Text

Opticalcharacter recognition
Opticalcharacter recognition Opticalcharacter recognition
Opticalcharacter recognition Shobhit Saxena
 
Handwritten Character Recognition: A Comprehensive Review on Geometrical Anal...
Handwritten Character Recognition: A Comprehensive Review on Geometrical Anal...Handwritten Character Recognition: A Comprehensive Review on Geometrical Anal...
Handwritten Character Recognition: A Comprehensive Review on Geometrical Anal...iosrjce
 
IRJET- Real-Time Text Reader for English Language
IRJET- Real-Time Text Reader for English LanguageIRJET- Real-Time Text Reader for English Language
IRJET- Real-Time Text Reader for English LanguageIRJET Journal
 
Handwriting_Recognition_using_KNN_classificatiob_algorithm_ijariie6729 (1).pdf
Handwriting_Recognition_using_KNN_classificatiob_algorithm_ijariie6729 (1).pdfHandwriting_Recognition_using_KNN_classificatiob_algorithm_ijariie6729 (1).pdf
Handwriting_Recognition_using_KNN_classificatiob_algorithm_ijariie6729 (1).pdfSachin414679
 
IRJET- Optical Character Recognition using Image Processing
IRJET-  	  Optical Character Recognition using Image ProcessingIRJET-  	  Optical Character Recognition using Image Processing
IRJET- Optical Character Recognition using Image ProcessingIRJET Journal
 
Text Extraction and Recognition Using Median Filter
Text Extraction and Recognition Using Median FilterText Extraction and Recognition Using Median Filter
Text Extraction and Recognition Using Median FilterIRJET Journal
 
A Survey Paper on Character Recognition
A Survey Paper on Character RecognitionA Survey Paper on Character Recognition
A Survey Paper on Character Recognitionijsrd.com
 
IRJET - An Enhanced Approach for Extraction of Text from an Image using Fuzzy...
IRJET - An Enhanced Approach for Extraction of Text from an Image using Fuzzy...IRJET - An Enhanced Approach for Extraction of Text from an Image using Fuzzy...
IRJET - An Enhanced Approach for Extraction of Text from an Image using Fuzzy...IRJET Journal
 
OCR for Gujarati Numeral using Neural Network
OCR for Gujarati Numeral using Neural NetworkOCR for Gujarati Numeral using Neural Network
OCR for Gujarati Numeral using Neural Networkijsrd.com
 
Automatic Image Annotation Using CMRM with Scene Information
Automatic Image Annotation Using CMRM with Scene InformationAutomatic Image Annotation Using CMRM with Scene Information
Automatic Image Annotation Using CMRM with Scene InformationTELKOMNIKA JOURNAL
 
Automatic License Plate Detection in Foggy Condition using Enhanced OTSU Tech...
Automatic License Plate Detection in Foggy Condition using Enhanced OTSU Tech...Automatic License Plate Detection in Foggy Condition using Enhanced OTSU Tech...
Automatic License Plate Detection in Foggy Condition using Enhanced OTSU Tech...IRJET Journal
 
IDENTIFICATION OF IMAGE SPAM BY USING LOW LEVEL & METADATA FEATURES
IDENTIFICATION OF IMAGE SPAM BY USING LOW LEVEL & METADATA FEATURES IDENTIFICATION OF IMAGE SPAM BY USING LOW LEVEL & METADATA FEATURES
IDENTIFICATION OF IMAGE SPAM BY USING LOW LEVEL & METADATA FEATURES IJNSA Journal
 
Text Extraction System by Eliminating Non-Text Regions
Text Extraction System by Eliminating Non-Text RegionsText Extraction System by Eliminating Non-Text Regions
Text Extraction System by Eliminating Non-Text RegionsIJCSIS Research Publications
 
14 offline signature verification based on euclidean distance using support v...
14 offline signature verification based on euclidean distance using support v...14 offline signature verification based on euclidean distance using support v...
14 offline signature verification based on euclidean distance using support v...INFOGAIN PUBLICATION
 
Feature Extraction and Feature Selection using Textual Analysis
Feature Extraction and Feature Selection using Textual AnalysisFeature Extraction and Feature Selection using Textual Analysis
Feature Extraction and Feature Selection using Textual Analysisvivatechijri
 

Similar to A Review of Optical Character Recognition System for Recognition of Printed Text (20)

Opticalcharacter recognition
Opticalcharacter recognition Opticalcharacter recognition
Opticalcharacter recognition
 
L017248388
L017248388L017248388
L017248388
 
Handwritten Character Recognition: A Comprehensive Review on Geometrical Anal...
Handwritten Character Recognition: A Comprehensive Review on Geometrical Anal...Handwritten Character Recognition: A Comprehensive Review on Geometrical Anal...
Handwritten Character Recognition: A Comprehensive Review on Geometrical Anal...
 
IRJET- Real-Time Text Reader for English Language
IRJET- Real-Time Text Reader for English LanguageIRJET- Real-Time Text Reader for English Language
IRJET- Real-Time Text Reader for English Language
 
Handwriting_Recognition_using_KNN_classificatiob_algorithm_ijariie6729 (1).pdf
Handwriting_Recognition_using_KNN_classificatiob_algorithm_ijariie6729 (1).pdfHandwriting_Recognition_using_KNN_classificatiob_algorithm_ijariie6729 (1).pdf
Handwriting_Recognition_using_KNN_classificatiob_algorithm_ijariie6729 (1).pdf
 
IRJET- Optical Character Recognition using Image Processing
IRJET-  	  Optical Character Recognition using Image ProcessingIRJET-  	  Optical Character Recognition using Image Processing
IRJET- Optical Character Recognition using Image Processing
 
Text Extraction and Recognition Using Median Filter
Text Extraction and Recognition Using Median FilterText Extraction and Recognition Using Median Filter
Text Extraction and Recognition Using Median Filter
 
A Survey Paper on Character Recognition
A Survey Paper on Character RecognitionA Survey Paper on Character Recognition
A Survey Paper on Character Recognition
 
IRJET - An Enhanced Approach for Extraction of Text from an Image using Fuzzy...
IRJET - An Enhanced Approach for Extraction of Text from an Image using Fuzzy...IRJET - An Enhanced Approach for Extraction of Text from an Image using Fuzzy...
IRJET - An Enhanced Approach for Extraction of Text from an Image using Fuzzy...
 
OCR for Gujarati Numeral using Neural Network
OCR for Gujarati Numeral using Neural NetworkOCR for Gujarati Numeral using Neural Network
OCR for Gujarati Numeral using Neural Network
 
Assignment-1-NF.docx
Assignment-1-NF.docxAssignment-1-NF.docx
Assignment-1-NF.docx
 
Automatic Image Annotation Using CMRM with Scene Information
Automatic Image Annotation Using CMRM with Scene InformationAutomatic Image Annotation Using CMRM with Scene Information
Automatic Image Annotation Using CMRM with Scene Information
 
C04741319
C04741319C04741319
C04741319
 
Automatic License Plate Detection in Foggy Condition using Enhanced OTSU Tech...
Automatic License Plate Detection in Foggy Condition using Enhanced OTSU Tech...Automatic License Plate Detection in Foggy Condition using Enhanced OTSU Tech...
Automatic License Plate Detection in Foggy Condition using Enhanced OTSU Tech...
 
IDENTIFICATION OF IMAGE SPAM BY USING LOW LEVEL & METADATA FEATURES
IDENTIFICATION OF IMAGE SPAM BY USING LOW LEVEL & METADATA FEATURES IDENTIFICATION OF IMAGE SPAM BY USING LOW LEVEL & METADATA FEATURES
IDENTIFICATION OF IMAGE SPAM BY USING LOW LEVEL & METADATA FEATURES
 
E1803012329
E1803012329E1803012329
E1803012329
 
B018110915
B018110915B018110915
B018110915
 
Text Extraction System by Eliminating Non-Text Regions
Text Extraction System by Eliminating Non-Text RegionsText Extraction System by Eliminating Non-Text Regions
Text Extraction System by Eliminating Non-Text Regions
 
14 offline signature verification based on euclidean distance using support v...
14 offline signature verification based on euclidean distance using support v...14 offline signature verification based on euclidean distance using support v...
14 offline signature verification based on euclidean distance using support v...
 
Feature Extraction and Feature Selection using Textual Analysis
Feature Extraction and Feature Selection using Textual AnalysisFeature Extraction and Feature Selection using Textual Analysis
Feature Extraction and Feature Selection using Textual Analysis
 

More from iosrjce

An Examination of Effectuation Dimension as Financing Practice of Small and M...
An Examination of Effectuation Dimension as Financing Practice of Small and M...An Examination of Effectuation Dimension as Financing Practice of Small and M...
An Examination of Effectuation Dimension as Financing Practice of Small and M...iosrjce
 
Does Goods and Services Tax (GST) Leads to Indian Economic Development?
Does Goods and Services Tax (GST) Leads to Indian Economic Development?Does Goods and Services Tax (GST) Leads to Indian Economic Development?
Does Goods and Services Tax (GST) Leads to Indian Economic Development?iosrjce
 
Childhood Factors that influence success in later life
Childhood Factors that influence success in later lifeChildhood Factors that influence success in later life
Childhood Factors that influence success in later lifeiosrjce
 
Emotional Intelligence and Work Performance Relationship: A Study on Sales Pe...
Emotional Intelligence and Work Performance Relationship: A Study on Sales Pe...Emotional Intelligence and Work Performance Relationship: A Study on Sales Pe...
Emotional Intelligence and Work Performance Relationship: A Study on Sales Pe...iosrjce
 
Customer’s Acceptance of Internet Banking in Dubai
Customer’s Acceptance of Internet Banking in DubaiCustomer’s Acceptance of Internet Banking in Dubai
Customer’s Acceptance of Internet Banking in Dubaiiosrjce
 
A Study of Employee Satisfaction relating to Job Security & Working Hours amo...
A Study of Employee Satisfaction relating to Job Security & Working Hours amo...A Study of Employee Satisfaction relating to Job Security & Working Hours amo...
A Study of Employee Satisfaction relating to Job Security & Working Hours amo...iosrjce
 
Consumer Perspectives on Brand Preference: A Choice Based Model Approach
Consumer Perspectives on Brand Preference: A Choice Based Model ApproachConsumer Perspectives on Brand Preference: A Choice Based Model Approach
Consumer Perspectives on Brand Preference: A Choice Based Model Approachiosrjce
 
Student`S Approach towards Social Network Sites
Student`S Approach towards Social Network SitesStudent`S Approach towards Social Network Sites
Student`S Approach towards Social Network Sitesiosrjce
 
Broadcast Management in Nigeria: The systems approach as an imperative
Broadcast Management in Nigeria: The systems approach as an imperativeBroadcast Management in Nigeria: The systems approach as an imperative
Broadcast Management in Nigeria: The systems approach as an imperativeiosrjce
 
A Study on Retailer’s Perception on Soya Products with Special Reference to T...
A Study on Retailer’s Perception on Soya Products with Special Reference to T...A Study on Retailer’s Perception on Soya Products with Special Reference to T...
A Study on Retailer’s Perception on Soya Products with Special Reference to T...iosrjce
 
A Study Factors Influence on Organisation Citizenship Behaviour in Corporate ...
A Study Factors Influence on Organisation Citizenship Behaviour in Corporate ...A Study Factors Influence on Organisation Citizenship Behaviour in Corporate ...
A Study Factors Influence on Organisation Citizenship Behaviour in Corporate ...iosrjce
 
Consumers’ Behaviour on Sony Xperia: A Case Study on Bangladesh
Consumers’ Behaviour on Sony Xperia: A Case Study on BangladeshConsumers’ Behaviour on Sony Xperia: A Case Study on Bangladesh
Consumers’ Behaviour on Sony Xperia: A Case Study on Bangladeshiosrjce
 
Design of a Balanced Scorecard on Nonprofit Organizations (Study on Yayasan P...
Design of a Balanced Scorecard on Nonprofit Organizations (Study on Yayasan P...Design of a Balanced Scorecard on Nonprofit Organizations (Study on Yayasan P...
Design of a Balanced Scorecard on Nonprofit Organizations (Study on Yayasan P...iosrjce
 
Public Sector Reforms and Outsourcing Services in Nigeria: An Empirical Evalu...
Public Sector Reforms and Outsourcing Services in Nigeria: An Empirical Evalu...Public Sector Reforms and Outsourcing Services in Nigeria: An Empirical Evalu...
Public Sector Reforms and Outsourcing Services in Nigeria: An Empirical Evalu...iosrjce
 
Media Innovations and its Impact on Brand awareness & Consideration
Media Innovations and its Impact on Brand awareness & ConsiderationMedia Innovations and its Impact on Brand awareness & Consideration
Media Innovations and its Impact on Brand awareness & Considerationiosrjce
 
Customer experience in supermarkets and hypermarkets – A comparative study
Customer experience in supermarkets and hypermarkets – A comparative studyCustomer experience in supermarkets and hypermarkets – A comparative study
Customer experience in supermarkets and hypermarkets – A comparative studyiosrjce
 
Social Media and Small Businesses: A Combinational Strategic Approach under t...
Social Media and Small Businesses: A Combinational Strategic Approach under t...Social Media and Small Businesses: A Combinational Strategic Approach under t...
Social Media and Small Businesses: A Combinational Strategic Approach under t...iosrjce
 
Secretarial Performance and the Gender Question (A Study of Selected Tertiary...
Secretarial Performance and the Gender Question (A Study of Selected Tertiary...Secretarial Performance and the Gender Question (A Study of Selected Tertiary...
Secretarial Performance and the Gender Question (A Study of Selected Tertiary...iosrjce
 
Implementation of Quality Management principles at Zimbabwe Open University (...
Implementation of Quality Management principles at Zimbabwe Open University (...Implementation of Quality Management principles at Zimbabwe Open University (...
Implementation of Quality Management principles at Zimbabwe Open University (...iosrjce
 
Organizational Conflicts Management In Selected Organizaions In Lagos State, ...
Organizational Conflicts Management In Selected Organizaions In Lagos State, ...Organizational Conflicts Management In Selected Organizaions In Lagos State, ...
Organizational Conflicts Management In Selected Organizaions In Lagos State, ...iosrjce
 

More from iosrjce (20)

An Examination of Effectuation Dimension as Financing Practice of Small and M...
An Examination of Effectuation Dimension as Financing Practice of Small and M...An Examination of Effectuation Dimension as Financing Practice of Small and M...
An Examination of Effectuation Dimension as Financing Practice of Small and M...
 
Does Goods and Services Tax (GST) Leads to Indian Economic Development?
Does Goods and Services Tax (GST) Leads to Indian Economic Development?Does Goods and Services Tax (GST) Leads to Indian Economic Development?
Does Goods and Services Tax (GST) Leads to Indian Economic Development?
 
Childhood Factors that influence success in later life
Childhood Factors that influence success in later lifeChildhood Factors that influence success in later life
Childhood Factors that influence success in later life
 
Emotional Intelligence and Work Performance Relationship: A Study on Sales Pe...
Emotional Intelligence and Work Performance Relationship: A Study on Sales Pe...Emotional Intelligence and Work Performance Relationship: A Study on Sales Pe...
Emotional Intelligence and Work Performance Relationship: A Study on Sales Pe...
 
Customer’s Acceptance of Internet Banking in Dubai
Customer’s Acceptance of Internet Banking in DubaiCustomer’s Acceptance of Internet Banking in Dubai
Customer’s Acceptance of Internet Banking in Dubai
 
A Study of Employee Satisfaction relating to Job Security & Working Hours amo...
A Study of Employee Satisfaction relating to Job Security & Working Hours amo...A Study of Employee Satisfaction relating to Job Security & Working Hours amo...
A Study of Employee Satisfaction relating to Job Security & Working Hours amo...
 
Consumer Perspectives on Brand Preference: A Choice Based Model Approach
Consumer Perspectives on Brand Preference: A Choice Based Model ApproachConsumer Perspectives on Brand Preference: A Choice Based Model Approach
Consumer Perspectives on Brand Preference: A Choice Based Model Approach
 
Student`S Approach towards Social Network Sites
Student`S Approach towards Social Network SitesStudent`S Approach towards Social Network Sites
Student`S Approach towards Social Network Sites
 
Broadcast Management in Nigeria: The systems approach as an imperative
Broadcast Management in Nigeria: The systems approach as an imperativeBroadcast Management in Nigeria: The systems approach as an imperative
Broadcast Management in Nigeria: The systems approach as an imperative
 
A Study on Retailer’s Perception on Soya Products with Special Reference to T...
A Study on Retailer’s Perception on Soya Products with Special Reference to T...A Study on Retailer’s Perception on Soya Products with Special Reference to T...
A Study on Retailer’s Perception on Soya Products with Special Reference to T...
 
A Study Factors Influence on Organisation Citizenship Behaviour in Corporate ...
A Study Factors Influence on Organisation Citizenship Behaviour in Corporate ...A Study Factors Influence on Organisation Citizenship Behaviour in Corporate ...
A Study Factors Influence on Organisation Citizenship Behaviour in Corporate ...
 
Consumers’ Behaviour on Sony Xperia: A Case Study on Bangladesh
Consumers’ Behaviour on Sony Xperia: A Case Study on BangladeshConsumers’ Behaviour on Sony Xperia: A Case Study on Bangladesh
Consumers’ Behaviour on Sony Xperia: A Case Study on Bangladesh
 
Design of a Balanced Scorecard on Nonprofit Organizations (Study on Yayasan P...
Design of a Balanced Scorecard on Nonprofit Organizations (Study on Yayasan P...Design of a Balanced Scorecard on Nonprofit Organizations (Study on Yayasan P...
Design of a Balanced Scorecard on Nonprofit Organizations (Study on Yayasan P...
 
Public Sector Reforms and Outsourcing Services in Nigeria: An Empirical Evalu...
Public Sector Reforms and Outsourcing Services in Nigeria: An Empirical Evalu...Public Sector Reforms and Outsourcing Services in Nigeria: An Empirical Evalu...
Public Sector Reforms and Outsourcing Services in Nigeria: An Empirical Evalu...
 
Media Innovations and its Impact on Brand awareness & Consideration
Media Innovations and its Impact on Brand awareness & ConsiderationMedia Innovations and its Impact on Brand awareness & Consideration
Media Innovations and its Impact on Brand awareness & Consideration
 
Customer experience in supermarkets and hypermarkets – A comparative study
Customer experience in supermarkets and hypermarkets – A comparative studyCustomer experience in supermarkets and hypermarkets – A comparative study
Customer experience in supermarkets and hypermarkets – A comparative study
 
Social Media and Small Businesses: A Combinational Strategic Approach under t...
Social Media and Small Businesses: A Combinational Strategic Approach under t...Social Media and Small Businesses: A Combinational Strategic Approach under t...
Social Media and Small Businesses: A Combinational Strategic Approach under t...
 
Secretarial Performance and the Gender Question (A Study of Selected Tertiary...
Secretarial Performance and the Gender Question (A Study of Selected Tertiary...Secretarial Performance and the Gender Question (A Study of Selected Tertiary...
Secretarial Performance and the Gender Question (A Study of Selected Tertiary...
 
Implementation of Quality Management principles at Zimbabwe Open University (...
Implementation of Quality Management principles at Zimbabwe Open University (...Implementation of Quality Management principles at Zimbabwe Open University (...
Implementation of Quality Management principles at Zimbabwe Open University (...
 
Organizational Conflicts Management In Selected Organizaions In Lagos State, ...
Organizational Conflicts Management In Selected Organizaions In Lagos State, ...Organizational Conflicts Management In Selected Organizaions In Lagos State, ...
Organizational Conflicts Management In Selected Organizaions In Lagos State, ...
 

Recently uploaded

MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxJoão Esperancinha
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...ranjana rawat
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
 

Recently uploaded (20)

MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
 

A Review of Optical Character Recognition System for Recognition of Printed Text

  • 1. IOSR Journal of Computer Engineering (IOSR-JCE) e-ISSN: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 3, Ver. II (May – Jun. 2015), PP 28-33 www.iosrjournals.org DOI: 10.9790/0661-17322833 www.iosrjournals.org 28 | Page A Review of Optical Character Recognition System for Recognition of Printed Text Rajas Kiran Jambekar1 1 (Department of Computer Science, NMIMS University, India) Abstract: As we move ahead in technology advancements, from simple data processing, to intelligent computing, one area of research undergoing advancement, is the system of reading text characters on an image. Optical Character Recognition (OCR) system is used for converting text characters on images into computer editable text characters. It includes steps such as Image acquisition, pre-processing of the image, segmentation of lines and characters, recognition of characters, and finally application. The image acquisition step determines the method for obtaining the image. Pre- processing of image includes enhancement of the image to make it suitable for recognition. Segmentation is the extraction of the character part of the image. Recognition is the comparison of the sample image with the template image. This paper focuses on the OCR system and includes information regarding the various operations that may be performed on the image for the recognition of characters. Keywords: Image acquisition, OCR, Pre-Processing, Recognition, Segmentation, Text Characters. I. Introduction Optical Character Recognition System is a relatively new field in the world of technology. Although research has been present on the topic since a few years, the topic has growing interest due to advancements in the processing capabilities of computers. OCR system has plenty applications in the field such as car number plate identification, processing cheques in banks, searching text in scanned documents, handwriting recognition, etc. [1]. Character Recognition involves conversion of text characters, on an image, into a form of text characters that can be edited by computers. The type of characters that are recognized by the system may include the basic alphanumeric characters including both uppercase and lowercase characters. The system may also be extended to recognize special characters and symbols. The image used for the character recognition may contain printed text or handwritten text. This paper aims at providing a system which may be used for the recognition of printed text. The recognition of handwritten text is a difficult process due to the variations in the writing style in different individuals. Thus, the accuracy of recognition of handwritten text is relatively less than the recognition of printed text [1]. II. Methodology The entire process of Character Recognition may be divided into five main stages: Fig. 1: OCR System Flowchart Image Acquisition Earlier, flat-bed scanners were used for obtaining clear images of standard quality and suitable for recognition. Using a scanner has benefits such as low noise levels, virtually no blurring and low text skewing.
  • 2. A Review of Optical Character Recognition System for Recognition of Printed Text DOI: 10.9790/0661-17322833 www.iosrjournals.org 29 | Page However, the working of the flat-bed scanners is slow and it requires physical connection with computers which limits the process. However, due to the expansive growth of technology, we now have high grade digital cameras on smaller devices such as mobile phones. The image quality obtained using mobile phones is good enough for processing of the image. The only issues while capturing images using a digital camera are shadows, blurring and text-skewing. However, up to a certain limit, these issues may be managed using image enhancement techniques discussed in the next section. Pre-Processing Pre-processing of the image includes modifications to the image to make it suitable for recognition. A typical OCR system may use the following techniques for image enhancement: a. RGB to Gray-Scale Conversion The image acquired for recognition is usually a colour image, i.e. the pixel value consists of the combination of the three colour components, red, green and blue. We first convert this colour image into a standard gray-scale image. Colour images are represented using three matrices while gray-scale images are represented using a single matrix. Detection of text on a colour image is more difficult than on a gray-scale image. Thus, the first step in pre- processing involves conversion of colour image to gray-scale image. b. Skew Correction Camera captured images may suffer from skew and perspective distortion [2]. As discussed above, this effect is due to improper image capturing technique. The horizontal text axis may suffer rotation at some degrees. Such an effect may be reduced by rotation of the image at a certain degree. The calculation method for rotation of the image has been described in the article [3]. Fig 2: Skewed image [3] c. Binarization Binarization is the process where a threshold is selected for conversion of pixel values into 0s and 1s. The black pixels are represented by a 0 and the white pixels are represented by 1. The threshold value may be selected using various methods. A simple way to determine the threshold value would be to find the median value of the maximum and minimum intensity values in the image. It would be represented as: Threshold value = (Imax + Imin)/2 Fig 3: Binarized image d. Noise Reduction Noise is unwanted pixels present in an image. They may be in the form of Salt and Pepper noise or Gaussian noise. We use the low pass filter for filtering the Gaussian noise from the image. Since the presence of salt and pepper noise is not as high as the presence of the Gaussian noise, we do not filter it.
  • 3. A Review of Optical Character Recognition System for Recognition of Printed Text DOI: 10.9790/0661-17322833 www.iosrjournals.org 30 | Page Fig 4: Noisy Image e. Thinning Thinning is the process of reducing the width of the foreground pixels. While thinning, it is necessary to maintain the form of the characters on the image. Thinning is done on the basis of neighborhood of a pixel. e.g. If a line on an image is of 3 pixel width, the thinning function will change the border pixels of the line and the output image will consist of line of one pixel width [4]. Segmentation Once the image is enhanced, it is passed to the segmentation module where each character is separated from the other. The Image at this point may be divided into two types of regions, background region and foreground region [4]. This module is thus responsible for the separation of the foreground region from the background region. The foreground region is a collection of text characters placed on the same line as well as on different lines. The segmentation process works in two steps: 1. Line Segmentation 2. Character Segmentation Line segmentation is the separation of the different lines of characters present in the image. Each line is defined by a minimum vertical gap between the characters present on a line and on the line above and below it. This gap can be used for the detection and separation of different lines of characters. Character Segmentation is the separation of characters present in the same line. Once the lines are separated, each character is extracted from the line. There is a constant horizontal gap between characters which is used for the separation of characters. Thus, images corresponding to individual characters are extracted which are fed to the recognition module. Recognition This is the final step of the process of recognition of characters. There are various methods available for comparison of images, such as image correlation, feature extraction and comparison, chain code comparison, artificial neural networks, etc. Image correlation method is the direct correlation function used on the sample image and the template image. Feature extraction is a process of analyzing the sample image and deriving specific features from the image. The comparison of these features with the features of template images is used for recognizing the character. Chain codes are code sequences generated for each image based on the neighborhood of pixels. The longest common sub-sequence is found between the code of sample image and the template images which decides the output character. Each of these techniques requires the use of stored template images used for comparison. The output after the comparison is an editable text character which is added to a buffer. When all the characters of a word are filled in the buffer, the string is passed to the application for further processing. However, this can also be done after the recognition of each line of text or after the recognition of all the lines of text in the image. Application The output obtained from the recognition module is in the form of editable text characters i.e. ASCII characters. These characters are initially stored in a buffer from which they may be stored in a file or further processed to derive certain output such as text to speech conversion, business card to contact conversion, etc. III. Proposed System The described system has modules which can be implemented using a variety of algorithms. This paper presents the following algorithms for implementing some of the modules. RGB to Gray-Scale Conversion Conversion of image in to gray-scale can be done using the following formula. The new pixel value is computed for each pixel.
  • 4. A Review of Optical Character Recognition System for Recognition of Printed Text DOI: 10.9790/0661-17322833 www.iosrjournals.org 31 | Page f(x,y) = 0.299 x r(x,y) + 0.587 x g(x,y) + 0.114 x b(x,y) Fig 5: Colour Image Fig. 6: Gray-Scale Image Skew-Correction This paper describes a slightly modified method for skew correction. The text regions in the referred article [3] de-skews each text region separately by repeated calculation of skew angle. The process requires repeated calculation of skew angle and image rotation for each extracted text region. This makes the algorithm more computer intensive. A slight modification to the skew correction system is thus proposed. The text present in the image has more tendency to suffer from the same degree of skew throughout the image. The system scans the text region in the image from the top and the bottom and determines the skew angle twice. Then, taking average of both the skew angles, the image is rotated by the calculated degree. Since the process of rotation of the image is performed only once for the entire image, it proves to be more efficient. Noise Reduction Given below in the figure is the low pass filter mask. It is basically used for removal of Gaussian noise from the image. The image suffers from slight blurring effect, thus, the selection of the size of the mask is crucial. Fig 7: Noise Reduction Mask Applying the mask on a noisy image provides a noise reduced result with a slight blurring effect.
  • 5. A Review of Optical Character Recognition System for Recognition of Printed Text DOI: 10.9790/0661-17322833 www.iosrjournals.org 32 | Page Fig 8: Noise Reduced Image Binarization With varying images, using averaging method to determine threshold value may not produce the best results. Having referred to different methods for thresholding, we have selected the Ostu‟s Method for determining the threshold value. Ostu‟s method for thresholding is based on formation of two distinct classes of pixels, one which contains background pixel intensities and the other containing foreground pixel intensities [5]. Segmentation The segmentation process may be performed using methods such as blob colouring, peak-to-valley method, etc. [6]. To separate individual characters present in the foreground region, the process of segmentation is divided into two steps: a. Marking Marking is the process where the boundaries around a character are marked and stored, before it is segmented. For each line of text in the image, the beginning and the ending rows and columns of the pixels corresponding to each character in the text line are marked. We have analyzed various methods which involve character segmentation without dedicated marking of boundaries around the image. The method of marking the boundaries has a positive effect of improving the performance of the system by performing similar task continuously for each text line. The process of marking is described below. Line marking is responsible for marking the upper and lower boundaries of each line of text present in the image. This can be done by by first marking the top boundary of the text on the image by checking pixel values along the rows starting from the top of the image. If a foreground pixel is found, the pixel position is marked and added to the topForeground list. Similarly, for marking the bottom boundary of the line, the pixel values are checked along each row. If no foreground pixel is found along a row, the row above the current row is marked and it's position is added to the bottomForeground list. Using this method, two lists are obtained. One indicating the top boundary of the lines and the other indicating the bottom boundary of the lines. If an image contains „n‟ text lines, the length of both lists is „n‟. The process of character marking is similar to that of line marking. The upper and lower boundaries for each line of text in the image have been marked. Now, the start and the end boundary column for each character along a line are marked. The process involves vertical scanning of each text line from left to right and checking for foreground and background pixels. Similar to line marking, we scan the image from top to bottom along all columns starting from the top of the line up to the bottom of the line. If a foreground pixel is found in a column, the pixel position is marked and added to the leftForeground list. Continuing the scanning along the columns, if a column is found without any foreground pixel, it indicates the end of the character and thus we mark the pixel position and add it to the rightForeground list. Thus, we obtain the boundaries for each line and character along the lines. b. Extraction The system now extracts each character at a time by using the four boundaries and store it as a separate image „sample‟. The sample has boundaries that exactly match the first and last pixel along each direction. Thus, there is need to perform the additional step of boundary selection [6]. However, before the sample can be passed to the recognition module, the size of the sample must be adjusted with respect to the size of the template images. This is necessary since the recognition module can only compare images of the same size. The image is thus resized to the size of the template images. We maintain templates of a small size to ensure that the resizing of sample always decrease the dimensions thus, preventing blurring of image. However, the size must also not be so small that fine details of the sample are lost. Thus, a size of no less than 20px is selected for template.
  • 6. A Review of Optical Character Recognition System for Recognition of Printed Text DOI: 10.9790/0661-17322833 www.iosrjournals.org 33 | Page The above algorithm is used for only a single line of text at a time. Once the characters in a line have been marked, the character marking system pauses. The list of leftForeground and rightForeground values along with the top and bottom boundary of current line is passed to the extraction module. The system only restarts when all the characters in the current line have been processed by the following modules. When the system restarts, the next value of top and bottom boundary are selected and the corresponding characters on the line are marked. This process is repeated for each line in the text region. Recognition We have selected image correlation for recognition of sample image with the template image. The sample image is compared to each of the template images using the correlation function. The function returns a scalar value with a value ranging between 0 and 1. Higher the value indicates better correlation between the sample and the template images. Thus, we determine the character on the basis of the best value of correlation. The recognized character is added to an output buffer string. IV. Conclusion And Future Scope We have reviewed the process of recognition of characters present on an image. This paper presents a system which may be implemented as is or can be further enhanced. The current system is good enough for recognition of simple characters and numbers along with the detection of white spaces. However, it may be further extended to support text characters with a variety of fonts and special symbols. The use of neural networks and adaptive learning for recognition may enhance the recognition capabilities of the system extensively. The possible applications for this OCR system may be in the form of license plate recognition, business card to phone contact conversion, document image to editable document conversion. References [1]. Disha Bhattacharjee, Deepti Tripathi, Rubi Debnath, Vivek Hanumante, Sahadev Roy. “A Novel Approach for Character Recognition”, International Journal of Engineering Trends and Technology (IJETT) – Volume 10 Number 6 - Apr 2014 [2]. Ayatullah Faruk Mollah, Nabamita Majumder, Subhadip Basu and Mita Nasipuri. “Design of an Optical Character Recognition System for Camera Based Handheld Devices”, IJCSI International Journal of Computer Science Issues, Vol. 8, Issue 4, No 1, July 2011 [3]. A. F. Mollah, S. Basu, N. Das, R. Sarkar, M. Nasipuri, M. Kundu, “Text/Graphics Separation and Skew Correction of Text Regions of Business Card Images for Mobile Devices”, Journal of Computing, Vol. 2, Issue 2, February 2010, ISSN 2151-9617 [4]. Ravi Kumar, Anurag Anand, Nikunj Sharma, “Recognition of English Characters by Codes Generated Using Neighbour Identification”. International Journal of Application or Innovation in Engineering and Management (IJAIEM) – Vol. 2, Issue 4, April 2013, ISSN 2319-4847 [5]. N. Otsu, "A threshold selection method from gray level histogram”, IEEE Transactions in Systems, Man, and Cybernetics, Vol. 9, pp. 62 – 66 [6]. Ghugardare, Rakhi P., Sandip P. Narote, P. Mukherji, and Prathamesh M. Kulkarni. "Optical character recognition system for seven segment display images of measuring instruments." In TENCON 2009-2009 IEEE Region 10 Conference, pp. 1-6. IEEE, 2009.