SlideShare a Scribd company logo
1 of 7
Download to read offline
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 179
Text Detection and Recognition: A Review
Chaitanya R. Kulkarni, Ashwini B. Barbadekar
------------------------------------------------------------------------------------------------------------------------------------
Abstract - This paper identifies and compares different
stages in the process of text detection and recognition and
analyses different approaches used for text extraction from
color images. Two commonly used methods for this problem
are stepwise methods and integrated methods, whereas this
task is further divided into text detection and localization,
classification, segmentation and text recognition. Important
approaches used to undergo these stages and their
corresponding advantages, disadvantages and applications
are presented in this paper. Various text related applications
for imagery are also presented over here. This review
performs comparative analysis of fundamental processes in
this field.
Key Words - Text detection, Text Recognition,
Localization, Classification, Segmentation
1. INTRODUCTION
Text detection and recognition has emerged as an
important problem in the past few years. Advancements in
the field of computer vision and machine learning as well
as increase in the applications based on text detection and
recognition has resulted in this trend. Various workshops
and conferences like International Conference on
Document Analysis and Recognition (ICDAR) are being
organized on international level giving further rise to
developments in field of text processing from imagery.
Text detection and recognition from video captions as
well as web pages is also getting attention. Huge work has
been done in the field of text detection and extraction from
natural scenes imagery. Various optical character
recognition techniques are also available. Still problem of
text detection and recognition is not thoroughly solved.
Segmentation and extraction of text from natural scenes is
still very difficult to achieve.
This paper studies various stages in process of text
detection and recognition and analyses and compares
different approaches used to undergo these stages. It
presents importance of every processing stage and
advantages, disadvantages and applications of approaches
used by various contributors to solve these problems.
Various applications of text detection and recognition are
also reviewed in this paper.
The paper is organized as follows. Section 2 presents
methodologies used for text detection and recognition.
Important stages of text detection and recognition are
presented in section 3. Various applications of text
detection and recognition are discussed in section 4 and
the paper concludes in section 5.
2. METHODOLOGIES
The process of detecting and recognizing text is divided
into text detection stage and recognition stage [1]. Text
detection deals with finding text area from input image,
whereas recognition deals with converting obtained text
into characters and words. Methods used for this purpose
are categorized as stepwise methods and integrated
methods. Stepwise methods have separate stages of
detection and recognition and they proceed through
detection, classification, segmentation and recognition.
Integrated methods have information sharing amongst
detection and recognition stages and these methods aim at
recognizing words from text available. Fig. 1 shows block
diagram of stepwise and integrated methods.
2.1. Stepwise methods
Stepwise methods follow stages of text detection and
localization, classification, segmentation and recognition
and remove background part from text in each stage. Since
they are independent of lexicon size, they can be used to
recognize text from images independent of volume of text.
Elagouni et al. used stepwise approach in [2] for text
recognition in videos using convolutional neural networks
based classifier. Text detection, tracking, character
segmentation, recognition and correction are important
processing steps used in this approach. Neural networks
based horizontal text detection is performed, followed by
statistical intensity based shortest path algorithm for
character segmentation. Convolutional neural
classification is used for recognition along with language
model. Ref. [3] also follows stepwise methods to detect
texts of arbitrary orientation from natural images.
Component extraction, component analysis, candidate
linking and chain analysis are four stages through which
proposed system proceeds. Connected components are
extracted in component extraction stage using edge
detection followed by SWT. Component analysis stage
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 180
eliminates non-text candidates and remaining text candidates are paired based on their adjacency in the
Fig -1: Block diagram of a) Stepwise methods b) Integrated methods [1]
candidate linking stage. Chains formed at this stage are
analyzed by chain level classifier in chain analysis stage.
2.2. Integrated methods
Integrated methods focus on detecting particular words
from images. Integrated methods often avoid
segmentation stage or replace it with word recognition or
matching stage. These methods are used for applications
with small size of lexicon recognizing fixed set of words.
Wang and Belongie used integrated method for word
spotting from natural scenes in [4]. It uses character
recognition and word configuration stages in this system.
It crops region around the text from an image and uses it
as input image along with lexicon for word recognition. It
uses Histogram of Oriented Gradients (HOG) features with
nearest neighbor classifier for character detection. Word
recognition represents each word in lexicon in form of
chain of connected characters and matches it with output
of character recognition stage to obtain nearest word for
set of characters. Ref. [5] performs end-to-end text scene
localization and recognition by keeping multiple
segmentations of single characters until last stage of
processing where character contexts in text line are
known. This system detects character as external regions,
i.e. regions whose outer boundary pixels have higher
values than the region. System uses threshold, adjacency
and color space projections as three parameters for every
detected character and stores their multiple
segmentations from which optimal values are selected
based on contexts of characters in text line. Any single
parameter does not guarantee efficient results, which
causes proposal of storing multiple segmentations of three
parameters. For sequence selection process, text regions
are divided into text lines using exhaustive search method,
followed by rejection of low confidence regions in text
lines and construction of directed graph by assigning
scores to each node and edge from which correct sequence
of characters is selected.
3. Important stages of text detection and
recognition
In this section, four important stages of text detection
and recognition are described. Text detection and
localization, classification, segmentation and text
recognition are described with their role and importance.
Various approaches followed for this stages are also
explained in this section.
3.1. Text detection and localization
Text detection deals with detecting presence of the text
in the input image whereas text localization localizes
position of the text and forms groups of text regions by
eliminating maximum of the background. Text detection
and localization process is performed using connected
component analysis or region based methods.
Connected component (cc) analysis method forms
graph of connected points based on color or edge features
from binarized image. In [6], [7] and [8], connected
component analysis is used to detect and localize text
regions. Ref. [8] uses efficient cc extraction methods
instead of using cc filtering approach. Ref. [7] extracts text
in form of connected components by applying pixel-based
constraints on components. Fig. 2 represents method of
extraction of connected components.
Region based methods divide images into small regions
using windows and search these regions for the presence
of text using texture or morphological operations since
text and non-text regions have different textual properties.
Ref. [10] uses 64×32 pixels window and applies Modest
AdaBoost classifier on 16 different spatial scales of image
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 181
to classify text and non-text considering large variations in
text size. Unsupervised learning approach is used by [11],
on small 8×8 grayscale patches of image for feature
learning. It further uses 32×32 pixels of image for feature
extraction, text detector training and character classifier
training.
3.2. Classification
After text detection and localization stage output may
contain non-text regions along with text regions as false
positives. Classification stage verifies text regions and
eliminates non-text regions using classification algorithms.
This stage can also be called as verification. Classification
algorithms are either supervised or unsupervised.
Supervised algorithms knows properties of text such as
color, size, texture, etc. before classification. Unsupervised
algorithms do not have prior knowledge about text
features.
Supervised classification algorithms need training
before classification. These algorithms undergo training to
be able to extract features of the text to be classified and
use these features in classification phase. Ref.[12] and [13]
uses supervised learning approach. Constraints on edge
area as well as area, height and width constraints on block
obtained in detection stage are used in [13] for
classification.
a) b)
c) d) e)
f)
Fig -2: Extraction of connected components. a) Original
image b) Maximum Difference map of fig.2.a c)Text
cluster d) Connected component (CC) e) Skeleton of CC
f) End points and Intersection points of CC [9]
Projection profile of text which is a representation of
spatial pixel content distribution is used as constraint for
separation of non-text regions from text regions in [12] as
shown in fig. 3 .
Unsupervised classification algorithm do not undergo
training. They extract features of text during the
classification phase only unlike supervised classification
and they use features extracted in previous classification
for next one. This is similar to adaptive learning. Wavelet
transform which gives successive approximation through
low-pass filter and details of edges and other features from
high-pass filter is used in [14]. It divides image using
16×16 sized window and obtains 36 features from each
windowed image for classification of text and non-text by
unsupervised method. Ref. [15] uses features like variance
of stroke width, difference between contrast of text and
background as well as aspect ratio of bounding box are
used to form connected components which are classified
using k-means based classifier. These are global features
extracted from entire image unlike method used in [14]
where image is divided in sub-regions.
3.3. Segmentation
Segmentation process is used to separate text from
background and to extract bounded text from image.
Integrated methods which focuses on word
matching/recognition often combine or replace complex
segmentation stage with recognition stage however
stepwise methods undergo segmentation to obtain
precisely extracted characters which are fed to recognition
stage. Binarization, character segmentation are few of the
segmentation algorithms studied in this paper.
Binarization converts color or gray-scale image into
black and white image. To achieve good segmentation
result irrespective of dark or bright text or background,
Kim et al. uses adaptive thresholding for binarization in
[16]. Ref. [17] uses k-means clustering algorithm for
binarization. It uses k=3 and k=4 as cluster parameters
and classify binarized image into texts using probabilistic
models as shown in fig. 4.
Character segmentation is process of converting text
into multiple sets of single characters. It is suitable in case
of degraded text or connected characters. Gradient vector
flow based method is used in [18], which is applied
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 182
directly on grayscale images eliminating need of
binarization. It initially identifies candidate cut pixels
from the characters and then uses two pass path finding
process that finds out potential cuts in forward pass and
verify true cuts and remove false cuts in backward pass.
Fig -3: Use of Projection profile for identification of text
lines and words [12]
Ref. [19] uses mathematical morphology (MM) operator
based adaptive thresholding and extracting approach for
segmenting characters from degraded text as shown in fig.
5. Along with MM operator, it uses heuristics to identify
candidate points for segmentation.
3.4. Text Recognition
Text recognition stage converts images of text into
string of characters or words. It is important to convert
images of text into words as word is an elementary entity
used by human for his visual recognition. Different
approaches of recognition are character recognition and
word recognition.
Character recognition methods divide text image into
multiple cut-outs of single characters. Separation between
adjacent characters is very important for these methods.
Fig -4: Binarization of text from street view [17]
Fig -5: Results of character segmentation of degraded
characters [19]
Character recognition approach using Optical Character
Recognition module (OCR) is used in [20], where initially
images are segmented into k classes followed by binary
text image hypothesis generation which passes through
connected components analysis and gray scale consistency
constraint module before getting fed to OCR. Support
Vector Machine (SVM) based classifier is used for
character recognition in [21]. SVM gives good support for
multi-class classification, which is tested in [21] on Indic
language Kannada which has total 578 characters formed
by altering 34 base consonants using 16 vowels. Many of
these characters falls in similar classes which has made
use of SVM based layered classification approach easier.
Word recognition uses character recognition outputs
along with language models or lexicons to recognize words
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 183
from text image. It can be used in case of degraded
characters. For applications with limited number of word
possibilities in input images, word recognition is better
approach than character recognition. Wachenfeld, et al.
used graphical approach for word recognition in [22].
During segmentation stage, when each character is
segmented, a hypothesis graph is formed to represent
every segmentation and each path of graph is an ordered
segmentation sequence which leads to formation of words
from characters. Ref. [23] uses similarity between
characters as feature along with lexicon, appearance and
language properties to accurately recognize characters and
words.
Table 1 and Table 2 compares methods used for text
detection and recognition and approaches used for its
processing stages respectively.
Table -1: Comparison of methods used for Text Detection and Recognition
Method Features of the method
Stepwise Methods
 Separate Detection and Recognition modules
 Divided into localization, classification, segmentation and recognition stages
 Suitable for detection of large number of words
 Less computation cost; but complexity increases due to more steps
Integrated Methods
 Detection and Recognition modules are not separate
 Can avoid segmentation or replace it with word recognition
 Suitable for identifying specific words from image i.e. small lexicon
 Increase in lexicon size makes recognition difficult
TABLE -2: Comparison of approaches used for processing stages of Text Detection and Recognition
Processing Stage Approach for
processing
Features
Text Detection and
Localization
Connected Components
analysis
 Graph based method
 High speed
 Uses color or edge features
 Not efficient for noisy images
Region based methods
 Windowing based approach
 Less speed
 Use texture features or morphological operations
 Efficient for noisy images also
Supervised Approach
 Supervised classifier has training phase
 Classifier knows features of the text before classification starts
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 184
Classification
 Features like color, size, projection profile are used
Unsupervised approach
 Unsupervised classifier do not have training phase
 Classifier learns from features extracted in previous classification
 Wavelet, stroke width, contrast etc. are used as features
Segmentation
Segmentation
Binarization
Binarization
 Converts color or gray-scale image into black and white image
 Uses simple or adaptive thesholding
 Simple algorithm
 Not suitable for connected characters or degraded text
Character
Segmentation
 Converts text into multiple sets of single characters
 Uses properties of characters for segmentation
 Complex algorithm
 Suitable for degraded text as well as connected characters
Text Recognition
Character Recognition
 Divide text into cut-outs of single characters
 Independent of lexicon
 Used when number of words to be recognized are not limited
Word Recognition
 Identifies word from text image
 Recognizes small number of words provided by lexicon
 Suitable only for recognizing limited number of words
4. APPLICATIONS
Various applications of text detection and recognition
from images and videos have been emerged in past few
years with advancements in image processing techniques.
Developments of various embedded systems and
increasing work in the field of computer vision and
machine learning gives further rise to the increase in
applications of text detection and recognition.
Text detection and recognition is used in industries for
reading package labels, numbers etc. It is used to retrieve
video captions as well as specific text contents from web
pages. It is used for automatic number plate recognition at
toll booths as well as for street boards reading purpose in
case of unmanned vehicles. Text detection and recognition
has very important application in form of assisting blind or
visually impaired people for reading, making their daily
life easy. It is also used in automatic cheque signature
reading. Automatic document scanning is another
application of text recognition.
5. CONCLUSION
In this paper, stages of text detection and recognition
and various methods used for that have been presented.
This process is further divided into text detection and
localization, classification, segmentation and text
recognition. These stages are presented in this paper along
with comparison of approaches used to undergo the above
mentioned stages. Analysis of advantages, disadvantages
and applications of different approaches have also been
performed over here.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 185
6. REFERENCES
[1] Qixiang Ye, David Doermann, “Text Detection and
Recognition in Imagery: A Survey” IEEE Transactions on
Pattern Analysis and Machine Intelligence, 2014
[2] K. Elagouni, C. Garcia and P. Sbillot,” A Comprehensive
Neural-Based Approach for Text Recognition in Videos
using Natural Language Processing,” in Proc. ACM Conf.
Multimedia Retrieval, 2011.
[3] C. Yao, X. Zhang, X. Bai, W. Liu, Y. Ma, and Z. Tu,”
Detecting Texts of Arbitrary Orientations in Natural
Images,” in Proc. IEEE Int’l Conf. Computer Vision and
Pattern Recognition, pp.1083-1090, 2012.
[4] K. Wang, S. Belongie,” Word Spotting in the Wild”, in
Proc. European Conference on Computer Vision, pp. 591-
604, 2010.
[5] L. Neumann, J. Matas,” On Combining Multiple
Segmentations in Scene Text Recognition,” in Proc. IEEE
Int’l Conf. Document Analysis and Recognition, pp. 523-
527, 2013.
[6] A.K. Jain and B. Yu,” Automatic Text Location in Images
and Video Frames,” Pattern Recognition, vol. 31, no. 12, pp.
2055-2076, 1998.
[7] S.M. Hanif, L. Prevost, P.A. Negri,” A Cascade Detector
for Text Detection in Natural Scene Images,” in Proc. IEEE
Int’l Conf. Pattern Recognition, pp. 1-4, 2008.
[8] H. Koo, D.H. Kim,” Scene Text Detection via Connected
Component Clustering and Non-text Filtering,” IEEE Trans.
Image Processing, vol. 22, no. 6, pp. 2296-2305, 2013.
[9] P. Shivakumara, T.Q. Phan and C.L. Tan,” A Laplacian
Approach to Multi-Oriented Text Detection in Video,” IEEE
Trans. Pattern Analysis and Machine Intelligence, vol. 33,
no. 2, pp. 412-419, 2011.
[10] J. Lee, P. Lee, S. Lee, A. Yuille and C. Koch,” AdaBoost
for Text Detection in Natural Scene,” in Proc. IEEE Int’l
Conf. Document Analysis and Recognition, pp. 429-434,
2011.
[11] A. Coates, B. Carpenter, C. Case, S. Satheesh, B. Suresh,
T. Wang, D. J. Wu, Andrew Y. Ng,” Text Detection and
Character Recognition in Scene Images with Unsupervised
Feature Learning,” in Proc. IEEE Int’l Conf. Document
Analysis and Recognition, pp. 440-445, 2011.
[12] R. Lienhart and A. Wernicke, ”Localizing and
Segmenting Text in Images and Videos,” IEEE Trans.
Circuits System on Video Technology, vol. 12, no. 4, pp.
256-268, 2002.
[13] M. Li, C. Wang,” An Adaptive Text Detection Approach
in Images and Video Frames,” in Proc. Int’l Joint Conf.
Neural Network, pp. 72-77, 2008.
[14] H. Li, D. Doermann, and O. Kia,” Automatic Text
Detection and Tracking in Digital Video,” IEEE Trans.
Image Processing, vol. 9, pp. 147-156, 2000.
[15] A. Mosleh, N. Bouguila, A. Ben Hamza,” Image Text
Detection Using a Bandlet-Based Edge Detector and Stroke
Width Transform,” in Proc. British Machine Vision
Conference, pp. 1-2, 2012.
[16] W. Kim and C. Kim,” A New Approach for Overlay Text
Detection and Extraction from Complex Video Scene,” IEEE
Trans. Image Processing, vol. 18, no. 2, pp. 401-411, 2009.
[17] J.J. Weinman, Z. Butler, D. Knoll, J. Feild,” Toward
Integrated Scene Text Reading,” IEEE Trans. Pattern
Analysis and Machine Intelligence, vol. 3, no. 2, pp. 375-
387, 2014.
[18] T. Phan, P. Shivakumara, B. Su and C.L. Tan,” A
Gradient Vector Flow-Based Method for Video Character
Segmentation,” in Proc. IEEE Int’l Conf. Document Analysis
and Recognition, pp. 1024-1028, 2011.
[19] S. Nomura, K. Yamanak, O. Katai, H. Kawakami, T.
Shiose, ”A Novel Adaptive Morphological Approach for
Degraded Character Image Segmentation,” Pattern
Recognition, vol. 38, no. 11, pp. 1961-1975, 2005.
[20] D. Chen, J.M. Odobez, H. Bourlard,”Text Detection and
Recognition in Images and Video Frames,” Pattern
Recognition, vol. 37, no. 3, pp. 596-608, 2004.
[21] K. Sheshadri, S.K. Divvala,” Exemplar Driven Character
Recognition in the Wild,” in Proc. British Machine Vision
Conference, pp. 1-10, 2012.
[22] S. Wachenfeld, H. Klein and X. Jiang,” Recognition of
Screen-Rendered Text,” in Proc. IEEE Int’l Conf. Pattern
Recognition, pp. 1086-1089, 2006.1733-1746, 2009.
[23] J.J. Weinman, E. Learned-Miller and A. Hanson,” Scene
Text Recognition using Similarity and a Lexicon with
Sparse Belief Propagation,” IEEE Trans. Pattern Analysis
and Machine Intelligence, vol. 31, no. 10, pp.

More Related Content

What's hot

Segmentation of Handwritten Chinese Character Strings Based on improved Algor...
Segmentation of Handwritten Chinese Character Strings Based on improved Algor...Segmentation of Handwritten Chinese Character Strings Based on improved Algor...
Segmentation of Handwritten Chinese Character Strings Based on improved Algor...ijeei-iaes
 
A case study in computer
A case study in computerA case study in computer
A case study in computercsandit
 
IRJET - A Survey on Recognition of Handwritten ZIP Codes in a Postal Sorting ...
IRJET - A Survey on Recognition of Handwritten ZIP Codes in a Postal Sorting ...IRJET - A Survey on Recognition of Handwritten ZIP Codes in a Postal Sorting ...
IRJET - A Survey on Recognition of Handwritten ZIP Codes in a Postal Sorting ...IRJET Journal
 
A novel method for arabic multi word term extraction
A novel method for arabic multi word term extractionA novel method for arabic multi word term extraction
A novel method for arabic multi word term extractionijdms
 
A MODEL TO CONVERT WAVE–FORM-TEXT TO LINEAR-FORM-TEXT FOR BETTER READABILITY ...
A MODEL TO CONVERT WAVE–FORM-TEXT TO LINEAR-FORM-TEXT FOR BETTER READABILITY ...A MODEL TO CONVERT WAVE–FORM-TEXT TO LINEAR-FORM-TEXT FOR BETTER READABILITY ...
A MODEL TO CONVERT WAVE–FORM-TEXT TO LINEAR-FORM-TEXT FOR BETTER READABILITY ...ijma
 
IRJET- Object Detection using Hausdorff Distance
IRJET-  	  Object Detection using Hausdorff DistanceIRJET-  	  Object Detection using Hausdorff Distance
IRJET- Object Detection using Hausdorff DistanceIRJET Journal
 
Text Extraction of Colour Images using Mathematical Morphology & HAAR Transform
Text Extraction of Colour Images using Mathematical Morphology & HAAR TransformText Extraction of Colour Images using Mathematical Morphology & HAAR Transform
Text Extraction of Colour Images using Mathematical Morphology & HAAR TransformIOSR Journals
 
IRJET- Optical Character Recognition using Image Processing
IRJET-  	  Optical Character Recognition using Image ProcessingIRJET-  	  Optical Character Recognition using Image Processing
IRJET- Optical Character Recognition using Image ProcessingIRJET Journal
 
IRJET- Document Layout analysis using Inverse Support Vector Machine (I-SV...
IRJET- 	  Document Layout analysis using Inverse Support Vector Machine (I-SV...IRJET- 	  Document Layout analysis using Inverse Support Vector Machine (I-SV...
IRJET- Document Layout analysis using Inverse Support Vector Machine (I-SV...IRJET Journal
 
An effective approach to offline arabic handwriting recognition
An effective approach to offline arabic handwriting recognitionAn effective approach to offline arabic handwriting recognition
An effective approach to offline arabic handwriting recognitionijaia
 
A New Concept Extraction Method for Ontology Construction From Arabic Text
A New Concept Extraction Method for Ontology Construction From Arabic TextA New Concept Extraction Method for Ontology Construction From Arabic Text
A New Concept Extraction Method for Ontology Construction From Arabic TextCSCJournals
 
Determining Basis Test Paths Using Genetic Algorithm and J48
Determining Basis Test Paths Using Genetic Algorithm and J48 Determining Basis Test Paths Using Genetic Algorithm and J48
Determining Basis Test Paths Using Genetic Algorithm and J48 IJECEIAES
 
The Framework of Image Recognition based on Modified Freeman Chain Code
The Framework of Image Recognition based on Modified Freeman Chain CodeThe Framework of Image Recognition based on Modified Freeman Chain Code
The Framework of Image Recognition based on Modified Freeman Chain CodeCSCJournals
 
A Comprehensive Study On Handwritten Character Recognition System
A Comprehensive Study On Handwritten Character Recognition SystemA Comprehensive Study On Handwritten Character Recognition System
A Comprehensive Study On Handwritten Character Recognition Systemiosrjce
 
IEEE 2014 DOTNET DATA MINING PROJECTS A probabilistic approach to string tran...
IEEE 2014 DOTNET DATA MINING PROJECTS A probabilistic approach to string tran...IEEE 2014 DOTNET DATA MINING PROJECTS A probabilistic approach to string tran...
IEEE 2014 DOTNET DATA MINING PROJECTS A probabilistic approach to string tran...IEEEMEMTECHSTUDENTPROJECTS
 
DEVNAGARI NUMERALS CLASSIFICATION AND RECOGNITION USING AN INTEGRATED APPROACH
DEVNAGARI NUMERALS CLASSIFICATION AND RECOGNITION USING AN INTEGRATED APPROACHDEVNAGARI NUMERALS CLASSIFICATION AND RECOGNITION USING AN INTEGRATED APPROACH
DEVNAGARI NUMERALS CLASSIFICATION AND RECOGNITION USING AN INTEGRATED APPROACHijfcstjournal
 

What's hot (18)

Segmentation of Handwritten Chinese Character Strings Based on improved Algor...
Segmentation of Handwritten Chinese Character Strings Based on improved Algor...Segmentation of Handwritten Chinese Character Strings Based on improved Algor...
Segmentation of Handwritten Chinese Character Strings Based on improved Algor...
 
A case study in computer
A case study in computerA case study in computer
A case study in computer
 
IRJET - A Survey on Recognition of Handwritten ZIP Codes in a Postal Sorting ...
IRJET - A Survey on Recognition of Handwritten ZIP Codes in a Postal Sorting ...IRJET - A Survey on Recognition of Handwritten ZIP Codes in a Postal Sorting ...
IRJET - A Survey on Recognition of Handwritten ZIP Codes in a Postal Sorting ...
 
A novel method for arabic multi word term extraction
A novel method for arabic multi word term extractionA novel method for arabic multi word term extraction
A novel method for arabic multi word term extraction
 
A MODEL TO CONVERT WAVE–FORM-TEXT TO LINEAR-FORM-TEXT FOR BETTER READABILITY ...
A MODEL TO CONVERT WAVE–FORM-TEXT TO LINEAR-FORM-TEXT FOR BETTER READABILITY ...A MODEL TO CONVERT WAVE–FORM-TEXT TO LINEAR-FORM-TEXT FOR BETTER READABILITY ...
A MODEL TO CONVERT WAVE–FORM-TEXT TO LINEAR-FORM-TEXT FOR BETTER READABILITY ...
 
IRJET- Object Detection using Hausdorff Distance
IRJET-  	  Object Detection using Hausdorff DistanceIRJET-  	  Object Detection using Hausdorff Distance
IRJET- Object Detection using Hausdorff Distance
 
Text Extraction of Colour Images using Mathematical Morphology & HAAR Transform
Text Extraction of Colour Images using Mathematical Morphology & HAAR TransformText Extraction of Colour Images using Mathematical Morphology & HAAR Transform
Text Extraction of Colour Images using Mathematical Morphology & HAAR Transform
 
IRJET- Optical Character Recognition using Image Processing
IRJET-  	  Optical Character Recognition using Image ProcessingIRJET-  	  Optical Character Recognition using Image Processing
IRJET- Optical Character Recognition using Image Processing
 
IRJET- Document Layout analysis using Inverse Support Vector Machine (I-SV...
IRJET- 	  Document Layout analysis using Inverse Support Vector Machine (I-SV...IRJET- 	  Document Layout analysis using Inverse Support Vector Machine (I-SV...
IRJET- Document Layout analysis using Inverse Support Vector Machine (I-SV...
 
An effective approach to offline arabic handwriting recognition
An effective approach to offline arabic handwriting recognitionAn effective approach to offline arabic handwriting recognition
An effective approach to offline arabic handwriting recognition
 
A New Concept Extraction Method for Ontology Construction From Arabic Text
A New Concept Extraction Method for Ontology Construction From Arabic TextA New Concept Extraction Method for Ontology Construction From Arabic Text
A New Concept Extraction Method for Ontology Construction From Arabic Text
 
50120130406021
5012013040602150120130406021
50120130406021
 
Determining Basis Test Paths Using Genetic Algorithm and J48
Determining Basis Test Paths Using Genetic Algorithm and J48 Determining Basis Test Paths Using Genetic Algorithm and J48
Determining Basis Test Paths Using Genetic Algorithm and J48
 
The Framework of Image Recognition based on Modified Freeman Chain Code
The Framework of Image Recognition based on Modified Freeman Chain CodeThe Framework of Image Recognition based on Modified Freeman Chain Code
The Framework of Image Recognition based on Modified Freeman Chain Code
 
A Comprehensive Study On Handwritten Character Recognition System
A Comprehensive Study On Handwritten Character Recognition SystemA Comprehensive Study On Handwritten Character Recognition System
A Comprehensive Study On Handwritten Character Recognition System
 
Ijcatr04041021
Ijcatr04041021Ijcatr04041021
Ijcatr04041021
 
IEEE 2014 DOTNET DATA MINING PROJECTS A probabilistic approach to string tran...
IEEE 2014 DOTNET DATA MINING PROJECTS A probabilistic approach to string tran...IEEE 2014 DOTNET DATA MINING PROJECTS A probabilistic approach to string tran...
IEEE 2014 DOTNET DATA MINING PROJECTS A probabilistic approach to string tran...
 
DEVNAGARI NUMERALS CLASSIFICATION AND RECOGNITION USING AN INTEGRATED APPROACH
DEVNAGARI NUMERALS CLASSIFICATION AND RECOGNITION USING AN INTEGRATED APPROACHDEVNAGARI NUMERALS CLASSIFICATION AND RECOGNITION USING AN INTEGRATED APPROACH
DEVNAGARI NUMERALS CLASSIFICATION AND RECOGNITION USING AN INTEGRATED APPROACH
 

Similar to Text Detection and Recognition: A Review

­­­­Cursive Handwriting Recognition System using Feature Extraction and Artif...
­­­­Cursive Handwriting Recognition System using Feature Extraction and Artif...­­­­Cursive Handwriting Recognition System using Feature Extraction and Artif...
­­­­Cursive Handwriting Recognition System using Feature Extraction and Artif...IRJET Journal
 
IRJET - Object Detection using Hausdorff Distance
IRJET -  	  Object Detection using Hausdorff DistanceIRJET -  	  Object Detection using Hausdorff Distance
IRJET - Object Detection using Hausdorff DistanceIRJET Journal
 
IRJET-MText Extraction from Images using Convolutional Neural Network
IRJET-MText Extraction from Images using Convolutional Neural NetworkIRJET-MText Extraction from Images using Convolutional Neural Network
IRJET-MText Extraction from Images using Convolutional Neural NetworkIRJET Journal
 
Document Layout analysis using Inverse Support Vector Machine (I-SVM) for Hin...
Document Layout analysis using Inverse Support Vector Machine (I-SVM) for Hin...Document Layout analysis using Inverse Support Vector Machine (I-SVM) for Hin...
Document Layout analysis using Inverse Support Vector Machine (I-SVM) for Hin...IRJET Journal
 
Study on Relavance Feature Selection Methods
Study on Relavance Feature Selection MethodsStudy on Relavance Feature Selection Methods
Study on Relavance Feature Selection MethodsIRJET Journal
 
Text Extraction System by Eliminating Non-Text Regions
Text Extraction System by Eliminating Non-Text RegionsText Extraction System by Eliminating Non-Text Regions
Text Extraction System by Eliminating Non-Text RegionsIJCSIS Research Publications
 
Text Extraction from Image Using GAMMA Correction Method.
Text Extraction from Image Using GAMMA Correction Method.Text Extraction from Image Using GAMMA Correction Method.
Text Extraction from Image Using GAMMA Correction Method.IRJET Journal
 
Cc31331335
Cc31331335Cc31331335
Cc31331335IJMER
 
Volume 2-issue-6-2119-2124
Volume 2-issue-6-2119-2124Volume 2-issue-6-2119-2124
Volume 2-issue-6-2119-2124Editor IJARCET
 
Volume 2-issue-6-2119-2124
Volume 2-issue-6-2119-2124Volume 2-issue-6-2119-2124
Volume 2-issue-6-2119-2124Editor IJARCET
 
A Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In VideosA Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In VideosCSCJournals
 
A Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In VideosA Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In VideosCSCJournals
 
IRJET- Image to Text Conversion using Tesseract
IRJET-  	  Image to Text Conversion using TesseractIRJET-  	  Image to Text Conversion using Tesseract
IRJET- Image to Text Conversion using TesseractIRJET Journal
 
A Review Paper on Personal Identification with An Efficient Method Of Combina...
A Review Paper on Personal Identification with An Efficient Method Of Combina...A Review Paper on Personal Identification with An Efficient Method Of Combina...
A Review Paper on Personal Identification with An Efficient Method Of Combina...IRJET Journal
 
Paper id 24201433
Paper id 24201433Paper id 24201433
Paper id 24201433IJRAT
 
IRJET - Text Detection in Natural Scene Images: A Survey
IRJET - Text Detection in Natural Scene Images: A SurveyIRJET - Text Detection in Natural Scene Images: A Survey
IRJET - Text Detection in Natural Scene Images: A SurveyIRJET Journal
 
IRJET- Text Line Detection in Camera Caputerd Images using Matlab GUI
IRJET- Text Line Detection in Camera Caputerd Images using Matlab GUIIRJET- Text Line Detection in Camera Caputerd Images using Matlab GUI
IRJET- Text Line Detection in Camera Caputerd Images using Matlab GUIIRJET Journal
 
A Texture Based Methodology for Text Region Extraction from Low Resolution Na...
A Texture Based Methodology for Text Region Extraction from Low Resolution Na...A Texture Based Methodology for Text Region Extraction from Low Resolution Na...
A Texture Based Methodology for Text Region Extraction from Low Resolution Na...CSCJournals
 
Design and Description of Feature Extraction Algorithm for Old English Font
Design and Description of Feature Extraction Algorithm for Old English FontDesign and Description of Feature Extraction Algorithm for Old English Font
Design and Description of Feature Extraction Algorithm for Old English FontIRJET Journal
 

Similar to Text Detection and Recognition: A Review (20)

40120140501009
4012014050100940120140501009
40120140501009
 
­­­­Cursive Handwriting Recognition System using Feature Extraction and Artif...
­­­­Cursive Handwriting Recognition System using Feature Extraction and Artif...­­­­Cursive Handwriting Recognition System using Feature Extraction and Artif...
­­­­Cursive Handwriting Recognition System using Feature Extraction and Artif...
 
IRJET - Object Detection using Hausdorff Distance
IRJET -  	  Object Detection using Hausdorff DistanceIRJET -  	  Object Detection using Hausdorff Distance
IRJET - Object Detection using Hausdorff Distance
 
IRJET-MText Extraction from Images using Convolutional Neural Network
IRJET-MText Extraction from Images using Convolutional Neural NetworkIRJET-MText Extraction from Images using Convolutional Neural Network
IRJET-MText Extraction from Images using Convolutional Neural Network
 
Document Layout analysis using Inverse Support Vector Machine (I-SVM) for Hin...
Document Layout analysis using Inverse Support Vector Machine (I-SVM) for Hin...Document Layout analysis using Inverse Support Vector Machine (I-SVM) for Hin...
Document Layout analysis using Inverse Support Vector Machine (I-SVM) for Hin...
 
Study on Relavance Feature Selection Methods
Study on Relavance Feature Selection MethodsStudy on Relavance Feature Selection Methods
Study on Relavance Feature Selection Methods
 
Text Extraction System by Eliminating Non-Text Regions
Text Extraction System by Eliminating Non-Text RegionsText Extraction System by Eliminating Non-Text Regions
Text Extraction System by Eliminating Non-Text Regions
 
Text Extraction from Image Using GAMMA Correction Method.
Text Extraction from Image Using GAMMA Correction Method.Text Extraction from Image Using GAMMA Correction Method.
Text Extraction from Image Using GAMMA Correction Method.
 
Cc31331335
Cc31331335Cc31331335
Cc31331335
 
Volume 2-issue-6-2119-2124
Volume 2-issue-6-2119-2124Volume 2-issue-6-2119-2124
Volume 2-issue-6-2119-2124
 
Volume 2-issue-6-2119-2124
Volume 2-issue-6-2119-2124Volume 2-issue-6-2119-2124
Volume 2-issue-6-2119-2124
 
A Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In VideosA Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In Videos
 
A Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In VideosA Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In Videos
 
IRJET- Image to Text Conversion using Tesseract
IRJET-  	  Image to Text Conversion using TesseractIRJET-  	  Image to Text Conversion using Tesseract
IRJET- Image to Text Conversion using Tesseract
 
A Review Paper on Personal Identification with An Efficient Method Of Combina...
A Review Paper on Personal Identification with An Efficient Method Of Combina...A Review Paper on Personal Identification with An Efficient Method Of Combina...
A Review Paper on Personal Identification with An Efficient Method Of Combina...
 
Paper id 24201433
Paper id 24201433Paper id 24201433
Paper id 24201433
 
IRJET - Text Detection in Natural Scene Images: A Survey
IRJET - Text Detection in Natural Scene Images: A SurveyIRJET - Text Detection in Natural Scene Images: A Survey
IRJET - Text Detection in Natural Scene Images: A Survey
 
IRJET- Text Line Detection in Camera Caputerd Images using Matlab GUI
IRJET- Text Line Detection in Camera Caputerd Images using Matlab GUIIRJET- Text Line Detection in Camera Caputerd Images using Matlab GUI
IRJET- Text Line Detection in Camera Caputerd Images using Matlab GUI
 
A Texture Based Methodology for Text Region Extraction from Low Resolution Na...
A Texture Based Methodology for Text Region Extraction from Low Resolution Na...A Texture Based Methodology for Text Region Extraction from Low Resolution Na...
A Texture Based Methodology for Text Region Extraction from Low Resolution Na...
 
Design and Description of Feature Extraction Algorithm for Old English Font
Design and Description of Feature Extraction Algorithm for Old English FontDesign and Description of Feature Extraction Algorithm for Old English Font
Design and Description of Feature Extraction Algorithm for Old English Font
 

More from IRJET Journal

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...IRJET Journal
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTUREIRJET Journal
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...IRJET Journal
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsIRJET Journal
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...IRJET Journal
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...IRJET Journal
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...IRJET Journal
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...IRJET Journal
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASIRJET Journal
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...IRJET Journal
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProIRJET Journal
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...IRJET Journal
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemIRJET Journal
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesIRJET Journal
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web applicationIRJET Journal
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...IRJET Journal
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.IRJET Journal
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...IRJET Journal
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignIRJET Journal
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...IRJET Journal
 

More from IRJET Journal (20)

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
 

Recently uploaded

Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...srsj9000
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learningmisbanausheenparvam
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSCAESB
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 

Recently uploaded (20)

Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learning
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentation
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 

Text Detection and Recognition: A Review

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 179 Text Detection and Recognition: A Review Chaitanya R. Kulkarni, Ashwini B. Barbadekar ------------------------------------------------------------------------------------------------------------------------------------ Abstract - This paper identifies and compares different stages in the process of text detection and recognition and analyses different approaches used for text extraction from color images. Two commonly used methods for this problem are stepwise methods and integrated methods, whereas this task is further divided into text detection and localization, classification, segmentation and text recognition. Important approaches used to undergo these stages and their corresponding advantages, disadvantages and applications are presented in this paper. Various text related applications for imagery are also presented over here. This review performs comparative analysis of fundamental processes in this field. Key Words - Text detection, Text Recognition, Localization, Classification, Segmentation 1. INTRODUCTION Text detection and recognition has emerged as an important problem in the past few years. Advancements in the field of computer vision and machine learning as well as increase in the applications based on text detection and recognition has resulted in this trend. Various workshops and conferences like International Conference on Document Analysis and Recognition (ICDAR) are being organized on international level giving further rise to developments in field of text processing from imagery. Text detection and recognition from video captions as well as web pages is also getting attention. Huge work has been done in the field of text detection and extraction from natural scenes imagery. Various optical character recognition techniques are also available. Still problem of text detection and recognition is not thoroughly solved. Segmentation and extraction of text from natural scenes is still very difficult to achieve. This paper studies various stages in process of text detection and recognition and analyses and compares different approaches used to undergo these stages. It presents importance of every processing stage and advantages, disadvantages and applications of approaches used by various contributors to solve these problems. Various applications of text detection and recognition are also reviewed in this paper. The paper is organized as follows. Section 2 presents methodologies used for text detection and recognition. Important stages of text detection and recognition are presented in section 3. Various applications of text detection and recognition are discussed in section 4 and the paper concludes in section 5. 2. METHODOLOGIES The process of detecting and recognizing text is divided into text detection stage and recognition stage [1]. Text detection deals with finding text area from input image, whereas recognition deals with converting obtained text into characters and words. Methods used for this purpose are categorized as stepwise methods and integrated methods. Stepwise methods have separate stages of detection and recognition and they proceed through detection, classification, segmentation and recognition. Integrated methods have information sharing amongst detection and recognition stages and these methods aim at recognizing words from text available. Fig. 1 shows block diagram of stepwise and integrated methods. 2.1. Stepwise methods Stepwise methods follow stages of text detection and localization, classification, segmentation and recognition and remove background part from text in each stage. Since they are independent of lexicon size, they can be used to recognize text from images independent of volume of text. Elagouni et al. used stepwise approach in [2] for text recognition in videos using convolutional neural networks based classifier. Text detection, tracking, character segmentation, recognition and correction are important processing steps used in this approach. Neural networks based horizontal text detection is performed, followed by statistical intensity based shortest path algorithm for character segmentation. Convolutional neural classification is used for recognition along with language model. Ref. [3] also follows stepwise methods to detect texts of arbitrary orientation from natural images. Component extraction, component analysis, candidate linking and chain analysis are four stages through which proposed system proceeds. Connected components are extracted in component extraction stage using edge detection followed by SWT. Component analysis stage
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 180 eliminates non-text candidates and remaining text candidates are paired based on their adjacency in the Fig -1: Block diagram of a) Stepwise methods b) Integrated methods [1] candidate linking stage. Chains formed at this stage are analyzed by chain level classifier in chain analysis stage. 2.2. Integrated methods Integrated methods focus on detecting particular words from images. Integrated methods often avoid segmentation stage or replace it with word recognition or matching stage. These methods are used for applications with small size of lexicon recognizing fixed set of words. Wang and Belongie used integrated method for word spotting from natural scenes in [4]. It uses character recognition and word configuration stages in this system. It crops region around the text from an image and uses it as input image along with lexicon for word recognition. It uses Histogram of Oriented Gradients (HOG) features with nearest neighbor classifier for character detection. Word recognition represents each word in lexicon in form of chain of connected characters and matches it with output of character recognition stage to obtain nearest word for set of characters. Ref. [5] performs end-to-end text scene localization and recognition by keeping multiple segmentations of single characters until last stage of processing where character contexts in text line are known. This system detects character as external regions, i.e. regions whose outer boundary pixels have higher values than the region. System uses threshold, adjacency and color space projections as three parameters for every detected character and stores their multiple segmentations from which optimal values are selected based on contexts of characters in text line. Any single parameter does not guarantee efficient results, which causes proposal of storing multiple segmentations of three parameters. For sequence selection process, text regions are divided into text lines using exhaustive search method, followed by rejection of low confidence regions in text lines and construction of directed graph by assigning scores to each node and edge from which correct sequence of characters is selected. 3. Important stages of text detection and recognition In this section, four important stages of text detection and recognition are described. Text detection and localization, classification, segmentation and text recognition are described with their role and importance. Various approaches followed for this stages are also explained in this section. 3.1. Text detection and localization Text detection deals with detecting presence of the text in the input image whereas text localization localizes position of the text and forms groups of text regions by eliminating maximum of the background. Text detection and localization process is performed using connected component analysis or region based methods. Connected component (cc) analysis method forms graph of connected points based on color or edge features from binarized image. In [6], [7] and [8], connected component analysis is used to detect and localize text regions. Ref. [8] uses efficient cc extraction methods instead of using cc filtering approach. Ref. [7] extracts text in form of connected components by applying pixel-based constraints on components. Fig. 2 represents method of extraction of connected components. Region based methods divide images into small regions using windows and search these regions for the presence of text using texture or morphological operations since text and non-text regions have different textual properties. Ref. [10] uses 64×32 pixels window and applies Modest AdaBoost classifier on 16 different spatial scales of image
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 181 to classify text and non-text considering large variations in text size. Unsupervised learning approach is used by [11], on small 8×8 grayscale patches of image for feature learning. It further uses 32×32 pixels of image for feature extraction, text detector training and character classifier training. 3.2. Classification After text detection and localization stage output may contain non-text regions along with text regions as false positives. Classification stage verifies text regions and eliminates non-text regions using classification algorithms. This stage can also be called as verification. Classification algorithms are either supervised or unsupervised. Supervised algorithms knows properties of text such as color, size, texture, etc. before classification. Unsupervised algorithms do not have prior knowledge about text features. Supervised classification algorithms need training before classification. These algorithms undergo training to be able to extract features of the text to be classified and use these features in classification phase. Ref.[12] and [13] uses supervised learning approach. Constraints on edge area as well as area, height and width constraints on block obtained in detection stage are used in [13] for classification. a) b) c) d) e) f) Fig -2: Extraction of connected components. a) Original image b) Maximum Difference map of fig.2.a c)Text cluster d) Connected component (CC) e) Skeleton of CC f) End points and Intersection points of CC [9] Projection profile of text which is a representation of spatial pixel content distribution is used as constraint for separation of non-text regions from text regions in [12] as shown in fig. 3 . Unsupervised classification algorithm do not undergo training. They extract features of text during the classification phase only unlike supervised classification and they use features extracted in previous classification for next one. This is similar to adaptive learning. Wavelet transform which gives successive approximation through low-pass filter and details of edges and other features from high-pass filter is used in [14]. It divides image using 16×16 sized window and obtains 36 features from each windowed image for classification of text and non-text by unsupervised method. Ref. [15] uses features like variance of stroke width, difference between contrast of text and background as well as aspect ratio of bounding box are used to form connected components which are classified using k-means based classifier. These are global features extracted from entire image unlike method used in [14] where image is divided in sub-regions. 3.3. Segmentation Segmentation process is used to separate text from background and to extract bounded text from image. Integrated methods which focuses on word matching/recognition often combine or replace complex segmentation stage with recognition stage however stepwise methods undergo segmentation to obtain precisely extracted characters which are fed to recognition stage. Binarization, character segmentation are few of the segmentation algorithms studied in this paper. Binarization converts color or gray-scale image into black and white image. To achieve good segmentation result irrespective of dark or bright text or background, Kim et al. uses adaptive thresholding for binarization in [16]. Ref. [17] uses k-means clustering algorithm for binarization. It uses k=3 and k=4 as cluster parameters and classify binarized image into texts using probabilistic models as shown in fig. 4. Character segmentation is process of converting text into multiple sets of single characters. It is suitable in case of degraded text or connected characters. Gradient vector flow based method is used in [18], which is applied
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 182 directly on grayscale images eliminating need of binarization. It initially identifies candidate cut pixels from the characters and then uses two pass path finding process that finds out potential cuts in forward pass and verify true cuts and remove false cuts in backward pass. Fig -3: Use of Projection profile for identification of text lines and words [12] Ref. [19] uses mathematical morphology (MM) operator based adaptive thresholding and extracting approach for segmenting characters from degraded text as shown in fig. 5. Along with MM operator, it uses heuristics to identify candidate points for segmentation. 3.4. Text Recognition Text recognition stage converts images of text into string of characters or words. It is important to convert images of text into words as word is an elementary entity used by human for his visual recognition. Different approaches of recognition are character recognition and word recognition. Character recognition methods divide text image into multiple cut-outs of single characters. Separation between adjacent characters is very important for these methods. Fig -4: Binarization of text from street view [17] Fig -5: Results of character segmentation of degraded characters [19] Character recognition approach using Optical Character Recognition module (OCR) is used in [20], where initially images are segmented into k classes followed by binary text image hypothesis generation which passes through connected components analysis and gray scale consistency constraint module before getting fed to OCR. Support Vector Machine (SVM) based classifier is used for character recognition in [21]. SVM gives good support for multi-class classification, which is tested in [21] on Indic language Kannada which has total 578 characters formed by altering 34 base consonants using 16 vowels. Many of these characters falls in similar classes which has made use of SVM based layered classification approach easier. Word recognition uses character recognition outputs along with language models or lexicons to recognize words
  • 5. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 183 from text image. It can be used in case of degraded characters. For applications with limited number of word possibilities in input images, word recognition is better approach than character recognition. Wachenfeld, et al. used graphical approach for word recognition in [22]. During segmentation stage, when each character is segmented, a hypothesis graph is formed to represent every segmentation and each path of graph is an ordered segmentation sequence which leads to formation of words from characters. Ref. [23] uses similarity between characters as feature along with lexicon, appearance and language properties to accurately recognize characters and words. Table 1 and Table 2 compares methods used for text detection and recognition and approaches used for its processing stages respectively. Table -1: Comparison of methods used for Text Detection and Recognition Method Features of the method Stepwise Methods  Separate Detection and Recognition modules  Divided into localization, classification, segmentation and recognition stages  Suitable for detection of large number of words  Less computation cost; but complexity increases due to more steps Integrated Methods  Detection and Recognition modules are not separate  Can avoid segmentation or replace it with word recognition  Suitable for identifying specific words from image i.e. small lexicon  Increase in lexicon size makes recognition difficult TABLE -2: Comparison of approaches used for processing stages of Text Detection and Recognition Processing Stage Approach for processing Features Text Detection and Localization Connected Components analysis  Graph based method  High speed  Uses color or edge features  Not efficient for noisy images Region based methods  Windowing based approach  Less speed  Use texture features or morphological operations  Efficient for noisy images also Supervised Approach  Supervised classifier has training phase  Classifier knows features of the text before classification starts
  • 6. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 184 Classification  Features like color, size, projection profile are used Unsupervised approach  Unsupervised classifier do not have training phase  Classifier learns from features extracted in previous classification  Wavelet, stroke width, contrast etc. are used as features Segmentation Segmentation Binarization Binarization  Converts color or gray-scale image into black and white image  Uses simple or adaptive thesholding  Simple algorithm  Not suitable for connected characters or degraded text Character Segmentation  Converts text into multiple sets of single characters  Uses properties of characters for segmentation  Complex algorithm  Suitable for degraded text as well as connected characters Text Recognition Character Recognition  Divide text into cut-outs of single characters  Independent of lexicon  Used when number of words to be recognized are not limited Word Recognition  Identifies word from text image  Recognizes small number of words provided by lexicon  Suitable only for recognizing limited number of words 4. APPLICATIONS Various applications of text detection and recognition from images and videos have been emerged in past few years with advancements in image processing techniques. Developments of various embedded systems and increasing work in the field of computer vision and machine learning gives further rise to the increase in applications of text detection and recognition. Text detection and recognition is used in industries for reading package labels, numbers etc. It is used to retrieve video captions as well as specific text contents from web pages. It is used for automatic number plate recognition at toll booths as well as for street boards reading purpose in case of unmanned vehicles. Text detection and recognition has very important application in form of assisting blind or visually impaired people for reading, making their daily life easy. It is also used in automatic cheque signature reading. Automatic document scanning is another application of text recognition. 5. CONCLUSION In this paper, stages of text detection and recognition and various methods used for that have been presented. This process is further divided into text detection and localization, classification, segmentation and text recognition. These stages are presented in this paper along with comparison of approaches used to undergo the above mentioned stages. Analysis of advantages, disadvantages and applications of different approaches have also been performed over here.
  • 7. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 06 | June-2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 185 6. REFERENCES [1] Qixiang Ye, David Doermann, “Text Detection and Recognition in Imagery: A Survey” IEEE Transactions on Pattern Analysis and Machine Intelligence, 2014 [2] K. Elagouni, C. Garcia and P. Sbillot,” A Comprehensive Neural-Based Approach for Text Recognition in Videos using Natural Language Processing,” in Proc. ACM Conf. Multimedia Retrieval, 2011. [3] C. Yao, X. Zhang, X. Bai, W. Liu, Y. Ma, and Z. Tu,” Detecting Texts of Arbitrary Orientations in Natural Images,” in Proc. IEEE Int’l Conf. Computer Vision and Pattern Recognition, pp.1083-1090, 2012. [4] K. Wang, S. Belongie,” Word Spotting in the Wild”, in Proc. European Conference on Computer Vision, pp. 591- 604, 2010. [5] L. Neumann, J. Matas,” On Combining Multiple Segmentations in Scene Text Recognition,” in Proc. IEEE Int’l Conf. Document Analysis and Recognition, pp. 523- 527, 2013. [6] A.K. Jain and B. Yu,” Automatic Text Location in Images and Video Frames,” Pattern Recognition, vol. 31, no. 12, pp. 2055-2076, 1998. [7] S.M. Hanif, L. Prevost, P.A. Negri,” A Cascade Detector for Text Detection in Natural Scene Images,” in Proc. IEEE Int’l Conf. Pattern Recognition, pp. 1-4, 2008. [8] H. Koo, D.H. Kim,” Scene Text Detection via Connected Component Clustering and Non-text Filtering,” IEEE Trans. Image Processing, vol. 22, no. 6, pp. 2296-2305, 2013. [9] P. Shivakumara, T.Q. Phan and C.L. Tan,” A Laplacian Approach to Multi-Oriented Text Detection in Video,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 33, no. 2, pp. 412-419, 2011. [10] J. Lee, P. Lee, S. Lee, A. Yuille and C. Koch,” AdaBoost for Text Detection in Natural Scene,” in Proc. IEEE Int’l Conf. Document Analysis and Recognition, pp. 429-434, 2011. [11] A. Coates, B. Carpenter, C. Case, S. Satheesh, B. Suresh, T. Wang, D. J. Wu, Andrew Y. Ng,” Text Detection and Character Recognition in Scene Images with Unsupervised Feature Learning,” in Proc. IEEE Int’l Conf. Document Analysis and Recognition, pp. 440-445, 2011. [12] R. Lienhart and A. Wernicke, ”Localizing and Segmenting Text in Images and Videos,” IEEE Trans. Circuits System on Video Technology, vol. 12, no. 4, pp. 256-268, 2002. [13] M. Li, C. Wang,” An Adaptive Text Detection Approach in Images and Video Frames,” in Proc. Int’l Joint Conf. Neural Network, pp. 72-77, 2008. [14] H. Li, D. Doermann, and O. Kia,” Automatic Text Detection and Tracking in Digital Video,” IEEE Trans. Image Processing, vol. 9, pp. 147-156, 2000. [15] A. Mosleh, N. Bouguila, A. Ben Hamza,” Image Text Detection Using a Bandlet-Based Edge Detector and Stroke Width Transform,” in Proc. British Machine Vision Conference, pp. 1-2, 2012. [16] W. Kim and C. Kim,” A New Approach for Overlay Text Detection and Extraction from Complex Video Scene,” IEEE Trans. Image Processing, vol. 18, no. 2, pp. 401-411, 2009. [17] J.J. Weinman, Z. Butler, D. Knoll, J. Feild,” Toward Integrated Scene Text Reading,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 3, no. 2, pp. 375- 387, 2014. [18] T. Phan, P. Shivakumara, B. Su and C.L. Tan,” A Gradient Vector Flow-Based Method for Video Character Segmentation,” in Proc. IEEE Int’l Conf. Document Analysis and Recognition, pp. 1024-1028, 2011. [19] S. Nomura, K. Yamanak, O. Katai, H. Kawakami, T. Shiose, ”A Novel Adaptive Morphological Approach for Degraded Character Image Segmentation,” Pattern Recognition, vol. 38, no. 11, pp. 1961-1975, 2005. [20] D. Chen, J.M. Odobez, H. Bourlard,”Text Detection and Recognition in Images and Video Frames,” Pattern Recognition, vol. 37, no. 3, pp. 596-608, 2004. [21] K. Sheshadri, S.K. Divvala,” Exemplar Driven Character Recognition in the Wild,” in Proc. British Machine Vision Conference, pp. 1-10, 2012. [22] S. Wachenfeld, H. Klein and X. Jiang,” Recognition of Screen-Rendered Text,” in Proc. IEEE Int’l Conf. Pattern Recognition, pp. 1086-1089, 2006.1733-1746, 2009. [23] J.J. Weinman, E. Learned-Miller and A. Hanson,” Scene Text Recognition using Similarity and a Lexicon with Sparse Belief Propagation,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 31, no. 10, pp.