SlideShare a Scribd company logo
1 of 5
Download to read offline
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 01 | Jan-2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 995
TEXT DETECTION AND RECOGNITION IN NATURAL IMAGES
Shabana M A1, Alphy Jose2, Ani Sunny3
1,2 Student, Dept. of Computer Science and Engineering, Mar Athanasius College of Engineering, Kerala, India
3Professor, Dept. of Computer Science and Engineering, Mar Athanasius College of Engineering, Kerala, India
---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract - Text embodied in scene texts are both rich and
clear and thus can serve as important cues for a wide range of
vision applications, for instance, image understanding, image
indexing, video search, geolocation, andautomaticnavigation.
Here we present a unified framework for text detection,
recognition in natural images. The proposed system consist of
mainly five steps; Candidate generation, Component level
classifier, Non text filtering and Text recognition. Among a
number of CC (Connected Component) extraction methods,we
have adopted the MSER algorithm because it shows good
performance with a small computation cost. Filtering of the
input image is done using Discrete wavelettransform(DWT)in
which edge map of the image is constructed in all directions.
The output of the MSER extraction andDWTiscombinedusing
logical OR. Filtering by occupation ratio is done by examining
the area of the bounding box with respect to the area of the
object contained in it. The image is then filtered with the
values of stroke widths. Component level classifier will
determine adjacency relationship between CCs. Then a
classifier that rejects non text blocks among normalized
images. This is done by using SVM as classifier. Finally we get
the detected text as first output. The detected text is passed to
an OCR engine in order to recognize the text.
Key Words: Connected Component(CC),MSER,Discrete
Wavelet Transform (DWT), Support Vector Machine
(SVM), Stroke Width Transform(SWT), OCR.
1.INTRODUCTION
With the increasing popularity of practical vision systems
and smart phones, text detection in natural scenes becomes
a critical yet challenging task. The proposed algorithm can
assist numerous applications that require text information
extraction from images or videos, such as video search,
target geolocation, and automatic navigation. Mainly there
are two modules in this project.
 Text Detection
 Text Recognition
In the proposed system consist of mainly four steps.
Candidate generation, Component level classifier, Non text
filtering and Text recognition. Again candidate generation
consist of a number of steps. Among a number of CC
(Connected Component) extraction methods, we have
adopted the MSER algorithm because it shows good
performance with a small computation cost. The
MSER(Maximally Stable Extremal Region) algorithm yields
CCs that are either darker or brighter than their
surroundings.
Filtering of the input image is done using Discrete wavelet
transform(DWT) in which edge map of the image is
constructed in all directions. The output of the MSER
extraction and DWT is combined using logical OR. Filtering
by occupation ratio is done by examining the area of the
bounding box with respect to the area of the object
contained in it. Next step is to filter the image withthevalues
of stroke widths.
Then the candidate generation process is completed. We
have trained component level classifier that determines
adjacency relationship between CCs. In training, we have
selected efficient features and trained the classifier with the
AdaBoost algorithm. In this way, we are able to detect text.
we develop a text or non text classifier that rejects non text
blocksamong normalized images. This is done by usingSVM
as classifier. Finally we get the detected text as first output.
The detected text is passed to an OCR engine in order to
recognize the text.
2. PROPOSED SYSTEM
In the proposed system consist of mainly four steps.
Candidate generation, Component level classifier, Non text
filtering and Text recognition.
2.1 MSER Extraction
Among a number of CC extraction methods,wehaveadopted
the MSER algorithm because it shows good performance
with a small computation cost . This algorithm can be
considered as a process to find local binarizationresultsthat
are stable over a range of thresholds, and this property
allows us to find most of the text components. The MSER
algorithm yields CCs that are either darker or brighter than
their surroundings. We have illustrated brighter CCs by
assigning random colors to them. Note that many CCs are
overlapping due to the properties of stable regions. More
MSER extraction examples which show brighter and darker
CCs, respectively.
2.2 Discrete Wavelet Transform
It is a novel method image edge detection using 2-D Discrete
Wavelet Transform (DWT) & the results are compared with
one of the best classical edge detector: Sobel method for
performing edge detection at multiple scales, even in the
presence of noise also, that can extract information and
perform required processing even at much smaller parts of
an image and handle noisy images more efficiently as
compared to classical edge detectors.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 01 | Jan-2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 996
Wavelet transform is a representation of signals in terms of
basic functions that are obtained by dilating and translation
a basic wavelet function. We can take a wavelet transformas
a tool of low-pass and high-pass filters for edge detection.
The wavelet transform has the properties of locality, multi
resolution, compression, clustering and persistence. These
properties are suitable for most applications in image
processing including edge detection.
Classical methods of edge detection involve convolving the
image with an operator (a2-D filter), which is constructedto
be sensitive to large gradients in the image while returning
values of zero in uniform regions. There are an extremely
large number of classical edge detection operators like
Roberts, Sobel, Prewitt, Laplacian operators and canny.
Operators can be optimized to look for horizontal,verticalor
diagonal edges. Classical Edge detectors usually fail to
handle images with dull object outline or noisy images,since
both the noise and the edges contain high frequencycontent.
Attempts to reduce the noise result in blurred and distorted
edges. To reduce the influence of noise, many techniques
have been developed. Filtering of images can be done with
the Gaussian before edge detection. Methodshave also been
proposed to approximate the image with a smooth function.
Where the classical edge detectors fail to detect edges in
noisy images, the proposed edge detection method works
efficiently on images corrupted by noise and is able to
differentiate between noise and real edges, thus detecting
the actual edges.
2.3 Occupation Ratio
It consists of a heuristic rule. This filter runs onthestatistical
and geometric propertiesof components, whichareveryfast
to compute. For a connected component c with qforeground
pixels, we first compute its bounding box bb(c) (its width
and height are denoted by w(c) and h(c), respectively) and
the mean as well as standard deviation of the stroke widths,
μ(c) and ϭ(c). The definitions of these basic properties and
the corresponding valid ranges are summarized below. The
components with one or more invalid properties will be
taken as non-text regions and discarded. Here we are
considering only the occupation ratio as the filtering rule.
Table -1: Heuristic Rules
2.4 Stroke Width Transform
SWT is a novel image operator that seeksto find the value of
stroke width for each image pixel, and demonstrate its use
on the task of text detection in natural images.Thesuggested
operator is local and data dependent, which makes it fast
and robust enough to eliminate the need for multi-scale
computation or scanning windows. Extensive testing shows
that the suggested scheme outperformsthe latest published
algorithms. Its simplicity allowsthe algorithmtodetecttexts
in many fonts and languages.
The Stroke Width Transform (SWT for short) isalocalimage
operator which computes per pixel the width of the most
likely stroke containing the pixel. The output of the SWT is
an image of size equal to the size of the input image where
each element contains the width of the stroke associated
with the pixel. We define a stroke to be a contiguous part of
an image that formsa band of a nearly constantwidth.Wedo
not assume to know the actual width of the stroke butrather
recover it.
The initial value of each element of the SWT is set to ∞. In
order to recover strokes, we first compute edges in the
image using Canny edge detector. After that, a gradient
direction dp of each e edge pixel P is considered. If P lies on a
stroke boundary, then dp must be roughly perpendicular to
the orientation of the stroke. We follow the ray r= p+n*dp,
n>0 until another edge pixel q is found. We considerthenthe
gradient direction dq at pixel q. The SWT operator described
Fig - 1: Overview of proposed system
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 01 | Jan-2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 997
Here is linear in the number of edge pixels in the image and
also linear in the maximal stroke width, determined at the
training stage.
To extract connected components from the image, SWT is
adopted for its effectiveness and efficiency. In addition, it
provides a way to discover connectedcomponentsfromedge
map directly. . The next step of this stage is to group the
pixels in the SWT image into connected components. The
pixels are associated using a simple rule that the ratio of
SWT values of neighboring pixels is less than 0.01.
2.5. AdaBoost
Our classifier is based on pair wise relations between CCs,
and let us first consider cases that can happen for a CC pair
we address a relatively simple problem by adopting an idea
in region-based approaches. That is, we adopt a non text
filtering block, and we are no longer required to care about
other cases. If we have ci ~ cj for some ci , c j ∈ N, it will yield
a candidate consisting of non text CCs and this candidatewill
be rejected at the non text rejection step. Also, we will
perform word segmentation asa post processingstep.Based
on the observations, we build training sets. Specifically, we
first obtained sets of CCs by applying the MSER algorithm to
a training set. Then, for every pair (ci,cj) ∈ C ×C (i ≠j ), we
identify its category among 5 cases. A positive set is built by
gathering samples corresponding to the case (1) and a
negative set by gathering samplescorresponding to thecase
(3) or (4). Samples from other cases were discarded.
2.6. Support Vector Machine
Support vector machines (SVMs, also support vector
networks) are supervised learning models with associated
learning algorithms that analyse data used for classification
and regression analysis. An SVM model is a representationof
the examples as points in space, mapped so that the
examples of the separate categories are divided by a clear
gap that is as wide as possible. New examples are then
mapped into that same space and predicted to belong to a
category based on which side of the gap they fall on.
“Support Vector Machine” (SVM) is a supervised machine
learning algorithm which can be used for both classification
or regression challenges. However, it is mostly used in
classification problems. In this algorithm, each data item is
plotted as a point in n-dimensional space (where n is
number of features you have) with the value of each feature
being the value of a particular coordinate. Then,
classification is performed by finding the hyper-plane that
differentiate the two classes very well.
In order to get final result, we develop a text/nontext
classifier that rejects non text blocks among normalized
images. In our classification, we do not adopt sophisticated
techniques such as cascade structures, since the number of
samples to be classified is usually small. However, one
challenge for our approach is the variable aspect ratio. One
possible approach to this problem is to split the normalized
imagesinto patchescovering one of the lettersanddevelopa
character/non-character classifier. However, character
segmentation is not an easy problem andthereareexamples
where this approach may fail. Rather, we split a normalized
block into overlapping squares, and develop a classifier that
assigns a textness value to each square block. Finally,
decision results for all square blocks are integrated so that
the original block is classified.
Our feature vector is based on mesh and gradient features.
We divide each square into 4 horizontal and vertical ones
and extract features.
For a horizontal block Hi (i = 1, 2, 3, 4), we consider
1) The number of white pixels,
2) The number of vertical white-black transitions,
Fig - 2: four horizontal and four vertical blocks.
3) The number of vertical black-white transitions as
features, and featuresfor a vertical block issimilarlydefined.
Here the two classes are text and non text. Initially we train
the SVM classifier with a number of images. A number of
bounding boxes are created using the region properties of
the image. Then we analyse every combination of the
bounding box regions. If both are text then 1 is assigned as
label otherwise 0 is assigned aslabel. In the prediction stage
these labels for the new image is checked and categorize as
text or non-text.
2.7. OCR Engine
Optical character recognition (OCR) method has been used
in converting printed text into editable text. OCR is very
useful and popular method in variousapplications.Accuracy
of OCR can be dependent on text pre processing and
segmentation algorithms. Sometimesit is difficulttoretrieve
text from the image because of different size, style,
orientation, complex background of image etc.
Tesseract is an open source optical character recognition
engine. It was modified and improved in 1995 with greater
accuracy. It is highly portable. It is more focused towards
providing less rejection than accuracy.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 01 | Jan-2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 998
Optical character Recognition (OCR) is a conversion of
scanned or printed text images, handwritten text into
editable text for further processing. This technology allows
machine to recognize the text automatically. It is like
combination of eye and mind of human body. An eye can
view the text from the images but actually the brain
processes as well as interprets that extracted text read by
eye. In development of computerized OCR system, few
problems can occur. First: there is very little visible
difference between some letters and digits for computers to
understand. For example it might be difficult for the
computer to differentiate between digit “0” and letter “o”.
Second: It might be very difficult to extract text, which is
embedded in very dark background or printed on other
words or graphics. In 1955, the first commercialsystem was
installed at the reader’s digest, which used OCR to input
sales report into a computer and then after OCRmethod has
become very helpful in computerizing the physical office
documents. There are many applications of OCR, which
includes: License plate recognition, image text extraction
from natural scene images, extracting text from scanned
documents etc.
Today many types of OCR software available in the markets
like: Desktop OCR, Server OCR, Web OCR etc. Accuracy rate
of any OCR tool varies from 71%to 98%. Many OCRtoolsare
available as of now but only few of them are opensourceand
free. Tesseract is the one of the open source and free OCR
software.
An image with the text is given as input to the Tesseract
engine that is command based tool. Then it is processed by
Tesseract command. Tesseract command takes two
arguments: First argument is image file name that contains
text and second argument is output text file in which,
extracted text is stored. The output file extension is given as
.txt by Tesseract, so no need to specify the file extension
while specifying the output file name as a second argument
in Tesseract command. As Tesseract supports various
languages, the language training data file must be keptinthe
tessdata folder.
After processing is completed, the content of the output file.
In simple images with or without colour (gray scale),
Tesseract provides results with 100% accuracy. But in the
case of some complex images Tesseract provides better
accuracy results if the images are in the gray scale mode as
compared to colour images.
3. PERFORMANCE ANALYSIS
The experimentation of the proposed algorithm was
carried out on a data set consisting of different images such
as indoor, outdoor, posters etc. These test images vary with
respect to scale, lighting and orientation of text in the image.
Currently the data set consists of 10 images.
3.1 Performance of Text Detection
Text detection is done on various images in which the
number of object in the image varies. The results show that
asthe number of objects in the image variesthe accuracyget
reduced since the image is becoming complex.
Based on such an analysis with different types of images
with varying complexity we have drawn a graph. In which
the x-axis represents the images and y-axis represents the
accuracy. As we move from left to right in x-axis the
complexity of images is increasing.
Fig - 3: Image 1 Stages
The system is tested with various images having different
orientations. Some examples of such images in different
stages is shown bellow. From the analysis it is clear that the
system isefficient to detect text having differentorientations
up to a large extent from the images. The analysisshowsthat
the system fails to images in which text is making a large
angle with the horizontal.
Chart - 1: Performance Analysis 1
Fig - 4: Image 2 Stages
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 01 | Jan-2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 999
Based on such an analysis with different types of images
with varying orientation we have drawn a graph. In which
the x-axis represents the images and y-axis represents the
accuracy. As we move from left to right in x-axis the
Orientation of images with respect to horizontal line is
increasing.
3.2 Performance of Text Recognition
Chart - 2: Performance Analysis 2
Text recognition is done on various images in which the
number of object in the image varies. The results show that
as the number of objects in the image, orientation and the
connection between the characters varies the accuracy get
affected since the image is becoming complex to recognize
each character.
3. CONCLUSIONS
High level semantics embodied in scene texts are both rich
and clear and thus can serve as important cues for a wide
Chart - 3: Performance Analysis 3
range of vision applications, for instance, image
understanding, image indexing, video search, geolocation,
and automatic navigation. Here we implemented a unified
framework for text detection, recognition and retrieval of
details about the recognized text in natural images. Mainly
there are two modules in this project. Text detection and
recognition. The text detection is done with MSER regions,
DWT, SWT etc. Also a clustering and a classification is also
done. The text recognition is done with OCR engine. The
results shows that as the complexity, orientation, and
connection between two charactersvariesthe possibilityfor
detection and recognition also varies.
REFERENCES
[1] X. Chen and A. L. Yuille, “Detecting and reading text
in natural scenes,” in Proc. IEEE CVPR, Jun./Jul.
2004, pp. II-366–II-373.
[2] D. Chen, J.-M. Odobez, and H. Bourlard, “Text
detection and recognition in images and video
frames,” Pattern Recognit., vol. 37, no. 3, pp. 595–
608, 2004.
[3] L. Neumann and J. Matas, “A method for text
localization and recognition in real-world images,”
in Proc. 10th ACCV, 2010, pp. 770–783.
[4] K. Wang, B. Babenko, and S. Belongie, “End-to-end
scene text recognition,” in Proc. IEEE ICCV, Nov.
2011, pp. 1457–1464.
[5] B. Epshtein, E. Ofek, and Y. Wexler, “Detectingtextin
natural scenes with stroke width transform,” in
Proc. IEEE CVPR, Jun. 2010, pp. 2963–2970.
[6] C. Yao, X. Bai, W. Liu, Y. Ma, and Z. Tu, “Detecting
textsof arbitrary orientations in natural images,”in
Proc. IEEE CVPR, Jun. 2012, pp. 1083–1090.
[7] Cong Yao, Xiang Bai, “A Unified Framework for
Multioriented Text Detection and Recognition”,
IEEE Transactions On Image Processing,Vol.23,No.
11, November 2014.
[8] Hyung Il Koo, “Scene Text Detection via Connected
Component Clustering and Nontext Filtering”, IEEE
Transactions On Image Processing, Vol. 22, No. 6,
June 2013.

More Related Content

What's hot

IRJET- Image based Approach for Indian Fake Note Detection by Dark Channe...
IRJET-  	  Image based Approach for Indian Fake Note Detection by Dark Channe...IRJET-  	  Image based Approach for Indian Fake Note Detection by Dark Channe...
IRJET- Image based Approach for Indian Fake Note Detection by Dark Channe...IRJET Journal
 
IRJET- Traffic Sign Classification and Detection using Deep Learning
IRJET- Traffic Sign Classification and Detection using Deep LearningIRJET- Traffic Sign Classification and Detection using Deep Learning
IRJET- Traffic Sign Classification and Detection using Deep LearningIRJET Journal
 
Automatic Image Registration Using 2D-DWT
Automatic Image Registration Using 2D-DWTAutomatic Image Registration Using 2D-DWT
Automatic Image Registration Using 2D-DWTinventionjournals
 
IRJET- Document Layout analysis using Inverse Support Vector Machine (I-SV...
IRJET- 	  Document Layout analysis using Inverse Support Vector Machine (I-SV...IRJET- 	  Document Layout analysis using Inverse Support Vector Machine (I-SV...
IRJET- Document Layout analysis using Inverse Support Vector Machine (I-SV...IRJET Journal
 
Object extraction using edge, motion and saliency information from videos
Object extraction using edge, motion and saliency information from videosObject extraction using edge, motion and saliency information from videos
Object extraction using edge, motion and saliency information from videoseSAT Journals
 
Image Steganography Using Wavelet Transform And Genetic Algorithm
Image Steganography Using Wavelet Transform And Genetic AlgorithmImage Steganography Using Wavelet Transform And Genetic Algorithm
Image Steganography Using Wavelet Transform And Genetic AlgorithmAM Publications
 
Analysis of Various Single Frame Super Resolution Techniques for better PSNR
Analysis of Various Single Frame Super Resolution Techniques for better PSNRAnalysis of Various Single Frame Super Resolution Techniques for better PSNR
Analysis of Various Single Frame Super Resolution Techniques for better PSNRIRJET Journal
 
Video object tracking with classification and recognition of objects
Video object tracking with classification and recognition of objectsVideo object tracking with classification and recognition of objects
Video object tracking with classification and recognition of objectsManish Khare
 
Presentation Object Recognition And Tracking Project
Presentation Object Recognition And Tracking ProjectPresentation Object Recognition And Tracking Project
Presentation Object Recognition And Tracking ProjectPrathamesh Joshi
 
IRJET - Hand Gesture Recognition to Perform System Operations
IRJET -  	  Hand Gesture Recognition to Perform System OperationsIRJET -  	  Hand Gesture Recognition to Perform System Operations
IRJET - Hand Gesture Recognition to Perform System OperationsIRJET Journal
 
Text Recognition using Convolutional Neural Network: A Review
Text Recognition using Convolutional Neural Network: A ReviewText Recognition using Convolutional Neural Network: A Review
Text Recognition using Convolutional Neural Network: A ReviewIRJET Journal
 
Motion Detection and Clustering Using PCA and NN in Color Image Sequence
Motion Detection and Clustering Using PCA and NN in Color Image SequenceMotion Detection and Clustering Using PCA and NN in Color Image Sequence
Motion Detection and Clustering Using PCA and NN in Color Image SequenceTELKOMNIKA JOURNAL
 
IRJET- Chord Classification of an Audio Signal using Artificial Neural Network
IRJET- Chord Classification of an Audio Signal using Artificial Neural NetworkIRJET- Chord Classification of an Audio Signal using Artificial Neural Network
IRJET- Chord Classification of an Audio Signal using Artificial Neural NetworkIRJET Journal
 
IRJET- Fabric Defect Detection using Discrete Wavelet Transform
IRJET- Fabric Defect Detection using Discrete Wavelet TransformIRJET- Fabric Defect Detection using Discrete Wavelet Transform
IRJET- Fabric Defect Detection using Discrete Wavelet TransformIRJET Journal
 
Performance Evaluation of Image Edge Detection Techniques
Performance Evaluation of Image Edge Detection Techniques Performance Evaluation of Image Edge Detection Techniques
Performance Evaluation of Image Edge Detection Techniques CSCJournals
 
High Performance Traffic Sign Detection
High Performance Traffic Sign DetectionHigh Performance Traffic Sign Detection
High Performance Traffic Sign DetectionCraig Ferguson
 
IRJET- Wavelet Transform along with SPIHT Algorithm used for Image Compre...
IRJET-  	  Wavelet Transform along with SPIHT Algorithm used for Image Compre...IRJET-  	  Wavelet Transform along with SPIHT Algorithm used for Image Compre...
IRJET- Wavelet Transform along with SPIHT Algorithm used for Image Compre...IRJET Journal
 

What's hot (20)

IRJET- Image based Approach for Indian Fake Note Detection by Dark Channe...
IRJET-  	  Image based Approach for Indian Fake Note Detection by Dark Channe...IRJET-  	  Image based Approach for Indian Fake Note Detection by Dark Channe...
IRJET- Image based Approach for Indian Fake Note Detection by Dark Channe...
 
IRJET- Traffic Sign Classification and Detection using Deep Learning
IRJET- Traffic Sign Classification and Detection using Deep LearningIRJET- Traffic Sign Classification and Detection using Deep Learning
IRJET- Traffic Sign Classification and Detection using Deep Learning
 
B0441418
B0441418B0441418
B0441418
 
Automatic Image Registration Using 2D-DWT
Automatic Image Registration Using 2D-DWTAutomatic Image Registration Using 2D-DWT
Automatic Image Registration Using 2D-DWT
 
IRJET- Document Layout analysis using Inverse Support Vector Machine (I-SV...
IRJET- 	  Document Layout analysis using Inverse Support Vector Machine (I-SV...IRJET- 	  Document Layout analysis using Inverse Support Vector Machine (I-SV...
IRJET- Document Layout analysis using Inverse Support Vector Machine (I-SV...
 
Object extraction using edge, motion and saliency information from videos
Object extraction using edge, motion and saliency information from videosObject extraction using edge, motion and saliency information from videos
Object extraction using edge, motion and saliency information from videos
 
Image Steganography Using Wavelet Transform And Genetic Algorithm
Image Steganography Using Wavelet Transform And Genetic AlgorithmImage Steganography Using Wavelet Transform And Genetic Algorithm
Image Steganography Using Wavelet Transform And Genetic Algorithm
 
Analysis of Various Single Frame Super Resolution Techniques for better PSNR
Analysis of Various Single Frame Super Resolution Techniques for better PSNRAnalysis of Various Single Frame Super Resolution Techniques for better PSNR
Analysis of Various Single Frame Super Resolution Techniques for better PSNR
 
Kq3518291832
Kq3518291832Kq3518291832
Kq3518291832
 
Video object tracking with classification and recognition of objects
Video object tracking with classification and recognition of objectsVideo object tracking with classification and recognition of objects
Video object tracking with classification and recognition of objects
 
Presentation Object Recognition And Tracking Project
Presentation Object Recognition And Tracking ProjectPresentation Object Recognition And Tracking Project
Presentation Object Recognition And Tracking Project
 
IRJET - Hand Gesture Recognition to Perform System Operations
IRJET -  	  Hand Gesture Recognition to Perform System OperationsIRJET -  	  Hand Gesture Recognition to Perform System Operations
IRJET - Hand Gesture Recognition to Perform System Operations
 
Text Recognition using Convolutional Neural Network: A Review
Text Recognition using Convolutional Neural Network: A ReviewText Recognition using Convolutional Neural Network: A Review
Text Recognition using Convolutional Neural Network: A Review
 
Motion Detection and Clustering Using PCA and NN in Color Image Sequence
Motion Detection and Clustering Using PCA and NN in Color Image SequenceMotion Detection and Clustering Using PCA and NN in Color Image Sequence
Motion Detection and Clustering Using PCA and NN in Color Image Sequence
 
Object tracking
Object trackingObject tracking
Object tracking
 
IRJET- Chord Classification of an Audio Signal using Artificial Neural Network
IRJET- Chord Classification of an Audio Signal using Artificial Neural NetworkIRJET- Chord Classification of an Audio Signal using Artificial Neural Network
IRJET- Chord Classification of an Audio Signal using Artificial Neural Network
 
IRJET- Fabric Defect Detection using Discrete Wavelet Transform
IRJET- Fabric Defect Detection using Discrete Wavelet TransformIRJET- Fabric Defect Detection using Discrete Wavelet Transform
IRJET- Fabric Defect Detection using Discrete Wavelet Transform
 
Performance Evaluation of Image Edge Detection Techniques
Performance Evaluation of Image Edge Detection Techniques Performance Evaluation of Image Edge Detection Techniques
Performance Evaluation of Image Edge Detection Techniques
 
High Performance Traffic Sign Detection
High Performance Traffic Sign DetectionHigh Performance Traffic Sign Detection
High Performance Traffic Sign Detection
 
IRJET- Wavelet Transform along with SPIHT Algorithm used for Image Compre...
IRJET-  	  Wavelet Transform along with SPIHT Algorithm used for Image Compre...IRJET-  	  Wavelet Transform along with SPIHT Algorithm used for Image Compre...
IRJET- Wavelet Transform along with SPIHT Algorithm used for Image Compre...
 

Similar to Text Detection and Recognition in Natural Images

Document Layout analysis using Inverse Support Vector Machine (I-SVM) for Hin...
Document Layout analysis using Inverse Support Vector Machine (I-SVM) for Hin...Document Layout analysis using Inverse Support Vector Machine (I-SVM) for Hin...
Document Layout analysis using Inverse Support Vector Machine (I-SVM) for Hin...IRJET Journal
 
Automatic License Plate Detection in Foggy Condition using Enhanced OTSU Tech...
Automatic License Plate Detection in Foggy Condition using Enhanced OTSU Tech...Automatic License Plate Detection in Foggy Condition using Enhanced OTSU Tech...
Automatic License Plate Detection in Foggy Condition using Enhanced OTSU Tech...IRJET Journal
 
Text Extraction and Recognition Using Median Filter
Text Extraction and Recognition Using Median FilterText Extraction and Recognition Using Median Filter
Text Extraction and Recognition Using Median FilterIRJET Journal
 
IRJET- Identification of Scene Images using Convolutional Neural Networks - A...
IRJET- Identification of Scene Images using Convolutional Neural Networks - A...IRJET- Identification of Scene Images using Convolutional Neural Networks - A...
IRJET- Identification of Scene Images using Convolutional Neural Networks - A...IRJET Journal
 
LANE DETECTION USING IMAGE PROCESSING IN PYTHON
LANE DETECTION USING IMAGE PROCESSING IN PYTHONLANE DETECTION USING IMAGE PROCESSING IN PYTHON
LANE DETECTION USING IMAGE PROCESSING IN PYTHONIRJET Journal
 
Real Time Implementation of Ede Detection Technique for Angiogram Images on FPGA
Real Time Implementation of Ede Detection Technique for Angiogram Images on FPGAReal Time Implementation of Ede Detection Technique for Angiogram Images on FPGA
Real Time Implementation of Ede Detection Technique for Angiogram Images on FPGAIRJET Journal
 
IRJET - Smart Vet Locator for Hybrid Pets
IRJET -  	  Smart Vet Locator for Hybrid PetsIRJET -  	  Smart Vet Locator for Hybrid Pets
IRJET - Smart Vet Locator for Hybrid PetsIRJET Journal
 
Number Plate Recognition of Still Images in Vehicular Parking System
Number Plate Recognition of Still Images in Vehicular Parking SystemNumber Plate Recognition of Still Images in Vehicular Parking System
Number Plate Recognition of Still Images in Vehicular Parking SystemIRJET Journal
 
IRJET- Automatic Data Collection from Forms using Optical Character Recognition
IRJET- Automatic Data Collection from Forms using Optical Character RecognitionIRJET- Automatic Data Collection from Forms using Optical Character Recognition
IRJET- Automatic Data Collection from Forms using Optical Character RecognitionIRJET Journal
 
IRJET - Simulation of Colour Image Processing Techniques on VHDL
IRJET - Simulation of Colour Image Processing Techniques on VHDLIRJET - Simulation of Colour Image Processing Techniques on VHDL
IRJET - Simulation of Colour Image Processing Techniques on VHDLIRJET Journal
 
IRJET-MText Extraction from Images using Convolutional Neural Network
IRJET-MText Extraction from Images using Convolutional Neural NetworkIRJET-MText Extraction from Images using Convolutional Neural Network
IRJET-MText Extraction from Images using Convolutional Neural NetworkIRJET Journal
 
IRJET- Automatic Traffic Sign Detection and Recognition using CNN
IRJET- Automatic Traffic Sign Detection and Recognition using CNNIRJET- Automatic Traffic Sign Detection and Recognition using CNN
IRJET- Automatic Traffic Sign Detection and Recognition using CNNIRJET Journal
 
A Survey on Portable Camera-Based Assistive Text and Product Label Reading Fr...
A Survey on Portable Camera-Based Assistive Text and Product Label Reading Fr...A Survey on Portable Camera-Based Assistive Text and Product Label Reading Fr...
A Survey on Portable Camera-Based Assistive Text and Product Label Reading Fr...IRJET Journal
 
IRJET - Change Detection in Satellite Images using Convolutional Neural N...
IRJET -  	  Change Detection in Satellite Images using Convolutional Neural N...IRJET -  	  Change Detection in Satellite Images using Convolutional Neural N...
IRJET - Change Detection in Satellite Images using Convolutional Neural N...IRJET Journal
 
Trajectory Based Unusual Human Movement Identification for ATM System
	 Trajectory Based Unusual Human Movement Identification for ATM System	 Trajectory Based Unusual Human Movement Identification for ATM System
Trajectory Based Unusual Human Movement Identification for ATM SystemIRJET Journal
 
IRJET - Object Detection using Hausdorff Distance
IRJET -  	  Object Detection using Hausdorff DistanceIRJET -  	  Object Detection using Hausdorff Distance
IRJET - Object Detection using Hausdorff DistanceIRJET Journal
 
Hardware software co simulation of edge detection for image processing system...
Hardware software co simulation of edge detection for image processing system...Hardware software co simulation of edge detection for image processing system...
Hardware software co simulation of edge detection for image processing system...eSAT Publishing House
 
Lane Detection and Traffic Sign Recognition using OpenCV and Deep Learning fo...
Lane Detection and Traffic Sign Recognition using OpenCV and Deep Learning fo...Lane Detection and Traffic Sign Recognition using OpenCV and Deep Learning fo...
Lane Detection and Traffic Sign Recognition using OpenCV and Deep Learning fo...IRJET Journal
 
IRJET- Object Detection using Hausdorff Distance
IRJET-  	  Object Detection using Hausdorff DistanceIRJET-  	  Object Detection using Hausdorff Distance
IRJET- Object Detection using Hausdorff DistanceIRJET Journal
 

Similar to Text Detection and Recognition in Natural Images (20)

Document Layout analysis using Inverse Support Vector Machine (I-SVM) for Hin...
Document Layout analysis using Inverse Support Vector Machine (I-SVM) for Hin...Document Layout analysis using Inverse Support Vector Machine (I-SVM) for Hin...
Document Layout analysis using Inverse Support Vector Machine (I-SVM) for Hin...
 
Automatic License Plate Detection in Foggy Condition using Enhanced OTSU Tech...
Automatic License Plate Detection in Foggy Condition using Enhanced OTSU Tech...Automatic License Plate Detection in Foggy Condition using Enhanced OTSU Tech...
Automatic License Plate Detection in Foggy Condition using Enhanced OTSU Tech...
 
Text Extraction and Recognition Using Median Filter
Text Extraction and Recognition Using Median FilterText Extraction and Recognition Using Median Filter
Text Extraction and Recognition Using Median Filter
 
IRJET- Identification of Scene Images using Convolutional Neural Networks - A...
IRJET- Identification of Scene Images using Convolutional Neural Networks - A...IRJET- Identification of Scene Images using Convolutional Neural Networks - A...
IRJET- Identification of Scene Images using Convolutional Neural Networks - A...
 
LANE DETECTION USING IMAGE PROCESSING IN PYTHON
LANE DETECTION USING IMAGE PROCESSING IN PYTHONLANE DETECTION USING IMAGE PROCESSING IN PYTHON
LANE DETECTION USING IMAGE PROCESSING IN PYTHON
 
Real Time Implementation of Ede Detection Technique for Angiogram Images on FPGA
Real Time Implementation of Ede Detection Technique for Angiogram Images on FPGAReal Time Implementation of Ede Detection Technique for Angiogram Images on FPGA
Real Time Implementation of Ede Detection Technique for Angiogram Images on FPGA
 
IRJET - Smart Vet Locator for Hybrid Pets
IRJET -  	  Smart Vet Locator for Hybrid PetsIRJET -  	  Smart Vet Locator for Hybrid Pets
IRJET - Smart Vet Locator for Hybrid Pets
 
Number Plate Recognition of Still Images in Vehicular Parking System
Number Plate Recognition of Still Images in Vehicular Parking SystemNumber Plate Recognition of Still Images in Vehicular Parking System
Number Plate Recognition of Still Images in Vehicular Parking System
 
IRJET- Automatic Data Collection from Forms using Optical Character Recognition
IRJET- Automatic Data Collection from Forms using Optical Character RecognitionIRJET- Automatic Data Collection from Forms using Optical Character Recognition
IRJET- Automatic Data Collection from Forms using Optical Character Recognition
 
IRJET - Simulation of Colour Image Processing Techniques on VHDL
IRJET - Simulation of Colour Image Processing Techniques on VHDLIRJET - Simulation of Colour Image Processing Techniques on VHDL
IRJET - Simulation of Colour Image Processing Techniques on VHDL
 
IRJET-MText Extraction from Images using Convolutional Neural Network
IRJET-MText Extraction from Images using Convolutional Neural NetworkIRJET-MText Extraction from Images using Convolutional Neural Network
IRJET-MText Extraction from Images using Convolutional Neural Network
 
IRJET- Automatic Traffic Sign Detection and Recognition using CNN
IRJET- Automatic Traffic Sign Detection and Recognition using CNNIRJET- Automatic Traffic Sign Detection and Recognition using CNN
IRJET- Automatic Traffic Sign Detection and Recognition using CNN
 
A Survey on Portable Camera-Based Assistive Text and Product Label Reading Fr...
A Survey on Portable Camera-Based Assistive Text and Product Label Reading Fr...A Survey on Portable Camera-Based Assistive Text and Product Label Reading Fr...
A Survey on Portable Camera-Based Assistive Text and Product Label Reading Fr...
 
IRJET - Change Detection in Satellite Images using Convolutional Neural N...
IRJET -  	  Change Detection in Satellite Images using Convolutional Neural N...IRJET -  	  Change Detection in Satellite Images using Convolutional Neural N...
IRJET - Change Detection in Satellite Images using Convolutional Neural N...
 
Trajectory Based Unusual Human Movement Identification for ATM System
	 Trajectory Based Unusual Human Movement Identification for ATM System	 Trajectory Based Unusual Human Movement Identification for ATM System
Trajectory Based Unusual Human Movement Identification for ATM System
 
project final ppt.pptx
project final ppt.pptxproject final ppt.pptx
project final ppt.pptx
 
IRJET - Object Detection using Hausdorff Distance
IRJET -  	  Object Detection using Hausdorff DistanceIRJET -  	  Object Detection using Hausdorff Distance
IRJET - Object Detection using Hausdorff Distance
 
Hardware software co simulation of edge detection for image processing system...
Hardware software co simulation of edge detection for image processing system...Hardware software co simulation of edge detection for image processing system...
Hardware software co simulation of edge detection for image processing system...
 
Lane Detection and Traffic Sign Recognition using OpenCV and Deep Learning fo...
Lane Detection and Traffic Sign Recognition using OpenCV and Deep Learning fo...Lane Detection and Traffic Sign Recognition using OpenCV and Deep Learning fo...
Lane Detection and Traffic Sign Recognition using OpenCV and Deep Learning fo...
 
IRJET- Object Detection using Hausdorff Distance
IRJET-  	  Object Detection using Hausdorff DistanceIRJET-  	  Object Detection using Hausdorff Distance
IRJET- Object Detection using Hausdorff Distance
 

More from IRJET Journal

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...IRJET Journal
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTUREIRJET Journal
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...IRJET Journal
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsIRJET Journal
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...IRJET Journal
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...IRJET Journal
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...IRJET Journal
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...IRJET Journal
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASIRJET Journal
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...IRJET Journal
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProIRJET Journal
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...IRJET Journal
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemIRJET Journal
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesIRJET Journal
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web applicationIRJET Journal
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...IRJET Journal
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.IRJET Journal
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...IRJET Journal
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignIRJET Journal
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...IRJET Journal
 

More from IRJET Journal (20)

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
 

Recently uploaded

(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024Mark Billinghurst
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZTE
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineeringmalavadedarshan25
 

Recently uploaded (20)

(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineering
 

Text Detection and Recognition in Natural Images

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 01 | Jan-2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 995 TEXT DETECTION AND RECOGNITION IN NATURAL IMAGES Shabana M A1, Alphy Jose2, Ani Sunny3 1,2 Student, Dept. of Computer Science and Engineering, Mar Athanasius College of Engineering, Kerala, India 3Professor, Dept. of Computer Science and Engineering, Mar Athanasius College of Engineering, Kerala, India ---------------------------------------------------------------------***--------------------------------------------------------------------- Abstract - Text embodied in scene texts are both rich and clear and thus can serve as important cues for a wide range of vision applications, for instance, image understanding, image indexing, video search, geolocation, andautomaticnavigation. Here we present a unified framework for text detection, recognition in natural images. The proposed system consist of mainly five steps; Candidate generation, Component level classifier, Non text filtering and Text recognition. Among a number of CC (Connected Component) extraction methods,we have adopted the MSER algorithm because it shows good performance with a small computation cost. Filtering of the input image is done using Discrete wavelettransform(DWT)in which edge map of the image is constructed in all directions. The output of the MSER extraction andDWTiscombinedusing logical OR. Filtering by occupation ratio is done by examining the area of the bounding box with respect to the area of the object contained in it. The image is then filtered with the values of stroke widths. Component level classifier will determine adjacency relationship between CCs. Then a classifier that rejects non text blocks among normalized images. This is done by using SVM as classifier. Finally we get the detected text as first output. The detected text is passed to an OCR engine in order to recognize the text. Key Words: Connected Component(CC),MSER,Discrete Wavelet Transform (DWT), Support Vector Machine (SVM), Stroke Width Transform(SWT), OCR. 1.INTRODUCTION With the increasing popularity of practical vision systems and smart phones, text detection in natural scenes becomes a critical yet challenging task. The proposed algorithm can assist numerous applications that require text information extraction from images or videos, such as video search, target geolocation, and automatic navigation. Mainly there are two modules in this project.  Text Detection  Text Recognition In the proposed system consist of mainly four steps. Candidate generation, Component level classifier, Non text filtering and Text recognition. Again candidate generation consist of a number of steps. Among a number of CC (Connected Component) extraction methods, we have adopted the MSER algorithm because it shows good performance with a small computation cost. The MSER(Maximally Stable Extremal Region) algorithm yields CCs that are either darker or brighter than their surroundings. Filtering of the input image is done using Discrete wavelet transform(DWT) in which edge map of the image is constructed in all directions. The output of the MSER extraction and DWT is combined using logical OR. Filtering by occupation ratio is done by examining the area of the bounding box with respect to the area of the object contained in it. Next step is to filter the image withthevalues of stroke widths. Then the candidate generation process is completed. We have trained component level classifier that determines adjacency relationship between CCs. In training, we have selected efficient features and trained the classifier with the AdaBoost algorithm. In this way, we are able to detect text. we develop a text or non text classifier that rejects non text blocksamong normalized images. This is done by usingSVM as classifier. Finally we get the detected text as first output. The detected text is passed to an OCR engine in order to recognize the text. 2. PROPOSED SYSTEM In the proposed system consist of mainly four steps. Candidate generation, Component level classifier, Non text filtering and Text recognition. 2.1 MSER Extraction Among a number of CC extraction methods,wehaveadopted the MSER algorithm because it shows good performance with a small computation cost . This algorithm can be considered as a process to find local binarizationresultsthat are stable over a range of thresholds, and this property allows us to find most of the text components. The MSER algorithm yields CCs that are either darker or brighter than their surroundings. We have illustrated brighter CCs by assigning random colors to them. Note that many CCs are overlapping due to the properties of stable regions. More MSER extraction examples which show brighter and darker CCs, respectively. 2.2 Discrete Wavelet Transform It is a novel method image edge detection using 2-D Discrete Wavelet Transform (DWT) & the results are compared with one of the best classical edge detector: Sobel method for performing edge detection at multiple scales, even in the presence of noise also, that can extract information and perform required processing even at much smaller parts of an image and handle noisy images more efficiently as compared to classical edge detectors.
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 01 | Jan-2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 996 Wavelet transform is a representation of signals in terms of basic functions that are obtained by dilating and translation a basic wavelet function. We can take a wavelet transformas a tool of low-pass and high-pass filters for edge detection. The wavelet transform has the properties of locality, multi resolution, compression, clustering and persistence. These properties are suitable for most applications in image processing including edge detection. Classical methods of edge detection involve convolving the image with an operator (a2-D filter), which is constructedto be sensitive to large gradients in the image while returning values of zero in uniform regions. There are an extremely large number of classical edge detection operators like Roberts, Sobel, Prewitt, Laplacian operators and canny. Operators can be optimized to look for horizontal,verticalor diagonal edges. Classical Edge detectors usually fail to handle images with dull object outline or noisy images,since both the noise and the edges contain high frequencycontent. Attempts to reduce the noise result in blurred and distorted edges. To reduce the influence of noise, many techniques have been developed. Filtering of images can be done with the Gaussian before edge detection. Methodshave also been proposed to approximate the image with a smooth function. Where the classical edge detectors fail to detect edges in noisy images, the proposed edge detection method works efficiently on images corrupted by noise and is able to differentiate between noise and real edges, thus detecting the actual edges. 2.3 Occupation Ratio It consists of a heuristic rule. This filter runs onthestatistical and geometric propertiesof components, whichareveryfast to compute. For a connected component c with qforeground pixels, we first compute its bounding box bb(c) (its width and height are denoted by w(c) and h(c), respectively) and the mean as well as standard deviation of the stroke widths, μ(c) and ϭ(c). The definitions of these basic properties and the corresponding valid ranges are summarized below. The components with one or more invalid properties will be taken as non-text regions and discarded. Here we are considering only the occupation ratio as the filtering rule. Table -1: Heuristic Rules 2.4 Stroke Width Transform SWT is a novel image operator that seeksto find the value of stroke width for each image pixel, and demonstrate its use on the task of text detection in natural images.Thesuggested operator is local and data dependent, which makes it fast and robust enough to eliminate the need for multi-scale computation or scanning windows. Extensive testing shows that the suggested scheme outperformsthe latest published algorithms. Its simplicity allowsthe algorithmtodetecttexts in many fonts and languages. The Stroke Width Transform (SWT for short) isalocalimage operator which computes per pixel the width of the most likely stroke containing the pixel. The output of the SWT is an image of size equal to the size of the input image where each element contains the width of the stroke associated with the pixel. We define a stroke to be a contiguous part of an image that formsa band of a nearly constantwidth.Wedo not assume to know the actual width of the stroke butrather recover it. The initial value of each element of the SWT is set to ∞. In order to recover strokes, we first compute edges in the image using Canny edge detector. After that, a gradient direction dp of each e edge pixel P is considered. If P lies on a stroke boundary, then dp must be roughly perpendicular to the orientation of the stroke. We follow the ray r= p+n*dp, n>0 until another edge pixel q is found. We considerthenthe gradient direction dq at pixel q. The SWT operator described Fig - 1: Overview of proposed system
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 01 | Jan-2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 997 Here is linear in the number of edge pixels in the image and also linear in the maximal stroke width, determined at the training stage. To extract connected components from the image, SWT is adopted for its effectiveness and efficiency. In addition, it provides a way to discover connectedcomponentsfromedge map directly. . The next step of this stage is to group the pixels in the SWT image into connected components. The pixels are associated using a simple rule that the ratio of SWT values of neighboring pixels is less than 0.01. 2.5. AdaBoost Our classifier is based on pair wise relations between CCs, and let us first consider cases that can happen for a CC pair we address a relatively simple problem by adopting an idea in region-based approaches. That is, we adopt a non text filtering block, and we are no longer required to care about other cases. If we have ci ~ cj for some ci , c j ∈ N, it will yield a candidate consisting of non text CCs and this candidatewill be rejected at the non text rejection step. Also, we will perform word segmentation asa post processingstep.Based on the observations, we build training sets. Specifically, we first obtained sets of CCs by applying the MSER algorithm to a training set. Then, for every pair (ci,cj) ∈ C ×C (i ≠j ), we identify its category among 5 cases. A positive set is built by gathering samples corresponding to the case (1) and a negative set by gathering samplescorresponding to thecase (3) or (4). Samples from other cases were discarded. 2.6. Support Vector Machine Support vector machines (SVMs, also support vector networks) are supervised learning models with associated learning algorithms that analyse data used for classification and regression analysis. An SVM model is a representationof the examples as points in space, mapped so that the examples of the separate categories are divided by a clear gap that is as wide as possible. New examples are then mapped into that same space and predicted to belong to a category based on which side of the gap they fall on. “Support Vector Machine” (SVM) is a supervised machine learning algorithm which can be used for both classification or regression challenges. However, it is mostly used in classification problems. In this algorithm, each data item is plotted as a point in n-dimensional space (where n is number of features you have) with the value of each feature being the value of a particular coordinate. Then, classification is performed by finding the hyper-plane that differentiate the two classes very well. In order to get final result, we develop a text/nontext classifier that rejects non text blocks among normalized images. In our classification, we do not adopt sophisticated techniques such as cascade structures, since the number of samples to be classified is usually small. However, one challenge for our approach is the variable aspect ratio. One possible approach to this problem is to split the normalized imagesinto patchescovering one of the lettersanddevelopa character/non-character classifier. However, character segmentation is not an easy problem andthereareexamples where this approach may fail. Rather, we split a normalized block into overlapping squares, and develop a classifier that assigns a textness value to each square block. Finally, decision results for all square blocks are integrated so that the original block is classified. Our feature vector is based on mesh and gradient features. We divide each square into 4 horizontal and vertical ones and extract features. For a horizontal block Hi (i = 1, 2, 3, 4), we consider 1) The number of white pixels, 2) The number of vertical white-black transitions, Fig - 2: four horizontal and four vertical blocks. 3) The number of vertical black-white transitions as features, and featuresfor a vertical block issimilarlydefined. Here the two classes are text and non text. Initially we train the SVM classifier with a number of images. A number of bounding boxes are created using the region properties of the image. Then we analyse every combination of the bounding box regions. If both are text then 1 is assigned as label otherwise 0 is assigned aslabel. In the prediction stage these labels for the new image is checked and categorize as text or non-text. 2.7. OCR Engine Optical character recognition (OCR) method has been used in converting printed text into editable text. OCR is very useful and popular method in variousapplications.Accuracy of OCR can be dependent on text pre processing and segmentation algorithms. Sometimesit is difficulttoretrieve text from the image because of different size, style, orientation, complex background of image etc. Tesseract is an open source optical character recognition engine. It was modified and improved in 1995 with greater accuracy. It is highly portable. It is more focused towards providing less rejection than accuracy.
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 01 | Jan-2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 998 Optical character Recognition (OCR) is a conversion of scanned or printed text images, handwritten text into editable text for further processing. This technology allows machine to recognize the text automatically. It is like combination of eye and mind of human body. An eye can view the text from the images but actually the brain processes as well as interprets that extracted text read by eye. In development of computerized OCR system, few problems can occur. First: there is very little visible difference between some letters and digits for computers to understand. For example it might be difficult for the computer to differentiate between digit “0” and letter “o”. Second: It might be very difficult to extract text, which is embedded in very dark background or printed on other words or graphics. In 1955, the first commercialsystem was installed at the reader’s digest, which used OCR to input sales report into a computer and then after OCRmethod has become very helpful in computerizing the physical office documents. There are many applications of OCR, which includes: License plate recognition, image text extraction from natural scene images, extracting text from scanned documents etc. Today many types of OCR software available in the markets like: Desktop OCR, Server OCR, Web OCR etc. Accuracy rate of any OCR tool varies from 71%to 98%. Many OCRtoolsare available as of now but only few of them are opensourceand free. Tesseract is the one of the open source and free OCR software. An image with the text is given as input to the Tesseract engine that is command based tool. Then it is processed by Tesseract command. Tesseract command takes two arguments: First argument is image file name that contains text and second argument is output text file in which, extracted text is stored. The output file extension is given as .txt by Tesseract, so no need to specify the file extension while specifying the output file name as a second argument in Tesseract command. As Tesseract supports various languages, the language training data file must be keptinthe tessdata folder. After processing is completed, the content of the output file. In simple images with or without colour (gray scale), Tesseract provides results with 100% accuracy. But in the case of some complex images Tesseract provides better accuracy results if the images are in the gray scale mode as compared to colour images. 3. PERFORMANCE ANALYSIS The experimentation of the proposed algorithm was carried out on a data set consisting of different images such as indoor, outdoor, posters etc. These test images vary with respect to scale, lighting and orientation of text in the image. Currently the data set consists of 10 images. 3.1 Performance of Text Detection Text detection is done on various images in which the number of object in the image varies. The results show that asthe number of objects in the image variesthe accuracyget reduced since the image is becoming complex. Based on such an analysis with different types of images with varying complexity we have drawn a graph. In which the x-axis represents the images and y-axis represents the accuracy. As we move from left to right in x-axis the complexity of images is increasing. Fig - 3: Image 1 Stages The system is tested with various images having different orientations. Some examples of such images in different stages is shown bellow. From the analysis it is clear that the system isefficient to detect text having differentorientations up to a large extent from the images. The analysisshowsthat the system fails to images in which text is making a large angle with the horizontal. Chart - 1: Performance Analysis 1 Fig - 4: Image 2 Stages
  • 5. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 01 | Jan-2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 999 Based on such an analysis with different types of images with varying orientation we have drawn a graph. In which the x-axis represents the images and y-axis represents the accuracy. As we move from left to right in x-axis the Orientation of images with respect to horizontal line is increasing. 3.2 Performance of Text Recognition Chart - 2: Performance Analysis 2 Text recognition is done on various images in which the number of object in the image varies. The results show that as the number of objects in the image, orientation and the connection between the characters varies the accuracy get affected since the image is becoming complex to recognize each character. 3. CONCLUSIONS High level semantics embodied in scene texts are both rich and clear and thus can serve as important cues for a wide Chart - 3: Performance Analysis 3 range of vision applications, for instance, image understanding, image indexing, video search, geolocation, and automatic navigation. Here we implemented a unified framework for text detection, recognition and retrieval of details about the recognized text in natural images. Mainly there are two modules in this project. Text detection and recognition. The text detection is done with MSER regions, DWT, SWT etc. Also a clustering and a classification is also done. The text recognition is done with OCR engine. The results shows that as the complexity, orientation, and connection between two charactersvariesthe possibilityfor detection and recognition also varies. REFERENCES [1] X. Chen and A. L. Yuille, “Detecting and reading text in natural scenes,” in Proc. IEEE CVPR, Jun./Jul. 2004, pp. II-366–II-373. [2] D. Chen, J.-M. Odobez, and H. Bourlard, “Text detection and recognition in images and video frames,” Pattern Recognit., vol. 37, no. 3, pp. 595– 608, 2004. [3] L. Neumann and J. Matas, “A method for text localization and recognition in real-world images,” in Proc. 10th ACCV, 2010, pp. 770–783. [4] K. Wang, B. Babenko, and S. Belongie, “End-to-end scene text recognition,” in Proc. IEEE ICCV, Nov. 2011, pp. 1457–1464. [5] B. Epshtein, E. Ofek, and Y. Wexler, “Detectingtextin natural scenes with stroke width transform,” in Proc. IEEE CVPR, Jun. 2010, pp. 2963–2970. [6] C. Yao, X. Bai, W. Liu, Y. Ma, and Z. Tu, “Detecting textsof arbitrary orientations in natural images,”in Proc. IEEE CVPR, Jun. 2012, pp. 1083–1090. [7] Cong Yao, Xiang Bai, “A Unified Framework for Multioriented Text Detection and Recognition”, IEEE Transactions On Image Processing,Vol.23,No. 11, November 2014. [8] Hyung Il Koo, “Scene Text Detection via Connected Component Clustering and Nontext Filtering”, IEEE Transactions On Image Processing, Vol. 22, No. 6, June 2013.