SlideShare a Scribd company logo
IOSR Journal of Engineering (IOSRJEN) www.iosrjen.org
ISSN (e): 2250-3021, ISSN (p): 2278-8719
Vol. 04, Issue 07 (July. 2014), ||V4|| PP 19-20
International organization of Scientific Research 13 |
P a g e
A Hybrid Approach To Detect And Recognize Text In Images
1
Miss.Poonam B. Kadam, 2
Prof. Latika R. Desai
1
(Dept of Computer Engineering, 2
(Dept of Computer Engineering
Abstract: - Text detection and recognition in images is a research area which attempts to develop a computer
system with the ability to automatically read text from images or videos. This problem is challenging due to the
complex background and large variations of these components features like color, size, shape, orientation or
texture. Here new proposed framework is explored which can automatically detect and recognize aligned text
from urban scene images e.g.shop name,landmark,street.The proposed framework evaluated on new generated
dataset.It consist of a three main step 1) image partition perform to segment text based on color information.2)
character grouping to detect text character in every text string depend on character size differences, distance
between neighboring characters.3) the detected text recognition using neural network. This proposed method
efficiently and accurately detects and recognizes the text with a low false positive.
Keywords: - Text detection, preprocessing, segmentation, character recognition.
I. INTRODUCTION
Many facilities are involved in human reading and similarly, there has been a large amount of research
on computational methods for text recognition. The problem of developing a model for reading is broad,
surrounding many facets of information. Also there are many typefaces, and they may be encountered at
different legible sizes. Text is frequently printed on a surface and viewers generally doesn't want that the surface
be fronto-parallel to their eyes in order to read it. Additionally, changing lighting conditions and designers use
many colors for background and text. All of these factors leads to make the robust reading problem very
challenging. Text understanding system consist of four stages: text detection, text localization, text extraction,
text recognition. These stages are used interchangeably. Text detection consists of determination of the
occurrence of text in images. Text localization is the process of determining the location of text. In text
extraction stage the text components in images are segmented from background. After, the extracted text images
can be converted into plain text using OCR technology. Through Text detection and recognition in images is
coupling of text-based searching technologies and optical character recognition (OCR).Text in images or video
is a powerful source of information. Caption embedded with news videos provide information about name of
related location, people, date and time, purpose. Caption provide an summary or abstract of the image.
II. LITERATURE REVIEW
There have been a different methods dealing with text detection and recognition in images [4, 17].
Comprehensive surveys can be found in [8].Some approaches to text detection classified into three categories:
texture-based methods, region based methods and hybrid methods. Texture-based methods [3,16] involves
texture properties of text such as style, orientation and the construction of gray-level co-occurrence matrix.
These methods are computation demanding as all locations and scales are exhaustively scanned. Moreover,
these algorithms mostly concentrate to detect horizontal texts. Region based methods [7,18] use the properties of
the color or gray scale or alignment in a text region or their differences in properties of the background. First
extract candidate text regions through segmentation or clustering and then remove non-text regions.
The third category, hybrid methods [9]is a fusion of region-based and texture-based methods. Different
document or web, e-mail images, in which text characters are normalized and proper resolutions, natural scene
images, embed text can be in size, shapes and orientations into complex background. It is impossible to
recognize text in images directly through OCR software because of complex background. Thus, we need to
detect image regions containing text and their corresponding orientations. This is compatible with the detection
procedure described text extraction algorithms survey[4,6].
Yi Method work in two stages for text detection first, the connected component in image partition based on
gradient and color feature, then through text line grouping and adjacent character grouping [1]done which gives
set of candidate or text line in image obtained. Then, Haar features are extracted from gradient maps and stroke
by the block patterns presented in [2].
A Hybrid Approach To Detect And Recognize Text In Images
International organization of Scientific Research 14 | P a g e
III. SYSTEM DESIGN
Fig 1. System Architecture
The proposed framework used to detect and recognize scheme efficient in urban scene context. This is
an unified framework for text detection and recognition. The framework consist of three stages 1) image
partition perform to segment text based on color information.2) character grouping to detect text characters in
same text string depend on character size differences, distance between neighboring characters.3) the text
recognition using neural network of detected text. After study of previous methods we found that combination
color based partition and adjacent character grouping achieves better results in text detection. Therefore here by
using effective color segmentation approach for text detection which help to get accurate result for recognition.
Brief description of our proposed work.
3.1 Module I: Preprocessing
First, preprocessing and segmentation of input urban scene image, involves removing low frequency
background noise, removing reflections, it is used for efficient extraction of text region information, a text
region detector is designed to estimate the text confidence and the corresponding scale, based on which text
candidate components are segmented and analyzed accurately. This stage significantly increase the reliability of
an optical inspection. Here new hybrid color image segmentation approach is used. In most of images, color
uniformity acts as stronger indicator to distinguish text characters of text string from background. Here k-means
algorithm used.
K-Means Algorithm
The whole process of clustering is depicted in Figure 11. Given a set of observations (x1, x2,..., xn),
where each observation is a d-dimensional real vector, k-means clustering aims to partition the n observations
into k sets (k ≤ n) S = S1, S2,..., Sk so as to minimize the within-cluster sum of squares (WCSS):
where µi is the mean of points in Si.
The initial step is choosing k cluster centroid positions, sometimes called as seeds randomly from the image, the
clusters are named as m1,m2,……,mk. Then, the algorithmruns iteratively, which mainly consists of two steps.
 Assignment step:
Assign each observation to the cluster whose mean yields the least within cluster sum of squares (WCSS). Since
the sum of squares is the squared Euclidean distance, this is intuitively the nearest mean:
(1)
(2)
A Hybrid Approach To Detect And Recognize Text In Images
International organization of Scientific Research 15 | P a g e
where each xp is assigned to exactly one St , even if it could be is assigned to two or more of them.
 Update step:
Calculate the new means as the centroids of the observations in the new clusters:
In second step, calculate the new means as centroid of the clusters. The iteration done until assignments no
longer change, which is a convergence to a local optimum. The k-means clustering is that we must specify the
number of clusters in advance.
After the color quantization, the image is divided into different clusters, and we treat the connected
components with the same cluster labels as a region. Each region is represented by its average color
components. The output of the stage is the image partitioned into thousands of tiny regions quantized by a small
number of colors.
3.2. Module II: Segmentation
The first step in a process of character recognition is a detection of a text area. After detecting the image text
area segment it using horizontal projection. After text is segmented then characters are extracted from horizontal
segments. Extracted characters are normalized by checking light related parameters like brightness etc. Then,
recognized using by NN algorithm.
Edge Detection
Let us define input image as a rectangular area with horizontal and vertical edges. The high density of
vertical and horizontal edges on a small area is in different cases caused by contrast characters, but not in every
case. This process can cause sometimes detect a wrong area that does not correspond to actual text in image.
Because of this, we detect several candidates using this algorithm, and then select the best one by a further
heuristic analysis. The input image edge can detected vertically and horizontally.
Here segmentation using a horizontal projection is done. After cropping text area is deskewed, segment it by
detecting spaces in its horizontal projection The adaptive thresholding is used to separate dark foreground from
light background with non-uniform illumination before segmentation. After the thresholding horizontal
projection perform f (x,y) to find horizontal boundaries between segmented characters. These boundaries
correspond to peaks in the graph of the horizontal projection.
3.3. Module III: Character Recognition
After pre-processing and segmentation step feature set is extracted this is further used for training and
recognition step. Feature extraction stage where features of the characters that are crucial for classifying them at
recognition stage are extracted.
After this Neural network used as learning mechanism .In this comparison of input character pattern with stored
character training set and find their matching probability. In the used method, various characters are taught to
the network in a supervised manner. A labeled input character also have several variant patterns of the same
character are taught to the network under the same label. Using it the network learns various possible variations
of a single pattern and becomes adaptive in nature. This process also know as Character Matching which
compare the segments against characters in a database, it check matching score of each database template to
input character segment.
IV. RESULT
We evaluated proposed method on the most challenging street scene text dataset ,namely SVT (street
view, landmarks, boards).The images in used dataset similar to traditional OCR setting which not contain low
contrast or low resolution images. The motivation of this approach is to detect significant and accurate text in
image and recognize it. The combination of color and texture based segmentation and after that using neural
network for highly accurate character recognition. In recognition, the classifiers are evaluated with several
metrics like accuracy refers to the percentage of the testing samples which were correctly recognized. Accuracy
refers to the percentage of the characters of the testing samples which were correctly recognized. We have tested
many random sample urban scene images which are collected as test set. We show performance of our method
in GUI mention below.
(3)
A Hybrid Approach To Detect And Recognize Text In Images
International organization of Scientific Research 16 | P a g e
4.1. Dataset and Evaluation
We used the street view text(SVT)images used for robust word recognition in our system. Our street
view text dataset contain images taken from Google. We have tested 100 sample images. Most of the images are
from urban signage. Our work focus on detecting words and recognizing respectively.
In our experiment color based segmentation and neural network are performed to partion and
recognition of detected text. After testing 100 sample referred images from outdoor (street) indoor(mall and
office).Our dataset the resulting precision, recall and f-measure are 0.83,0.93,0.25 respectively. Based on
definition of precision and recall are: Precision (P) is the proportion of the predicted positive cases(tp) that were
correct,as calculated using the equation:
precision=|tp| / |tp + fp|
The recall or true positive rate (tp) is the proportion of positive:
recall= |tp| / |tp + fn|
where false positive(fp) and false negative(fn).The f measures is combination of the precision and recall
f = 2.precision.recall / (precision + recall)
Sample Input Image from street scene text dataset.
Step-I Preprocessing and Segmentation
As shown in Figure we get cropped words separately after color segmentation. The Figure 2.shows different
color segments in image depending that we get partitioned image for text detection.
Fig 2.After Preprocessing and Segmentation
Step-II Recognition
All adjacent character groups i.e cropped words are recognized accurately as shown in Figure .Also we can
separately recognize each word as shown in Figure3,4.
A Hybrid Approach To Detect And Recognize Text In Images
International organization of Scientific Research 17 | P a g e
Fig 3. Character Recognition
Fig 4. Separately Recognition of subimage
A Hybrid Approach To Detect And Recognize Text In Images
International organization of Scientific Research 18 | P a g e
4.2. Result Table
Table 1. Sample Images with Result Table
V. CONCLUSION
This is unified approach on learning based methods for text detection from image having complex
background and normalized text recognition. We have proposed the detection model eliminates the need for
layout rules by learning spatial context parameters. Also, eliminate the peephole view of the sliding window
approach and unify the entire process under a probabilistic model. Finally, robust reader needs rough text
detection windows and completes both word and character segmentation in a recognition-driven process. To
deal with the problems involved in training a computer to read from detection, to segmentation and recognition.
To read text information embedded in those images, we propose a new framework which is more concentrating
on how to give detected text having less false positive input to OCR give efficient and accurate recognized text.
Here we choose texture color based image segmentation method for text detection and fast, accurate recognition
perform using neural network framework.
VI. FUTURE ENHANCEMENT
There are different challenges occurs in text extraction from image because of size,style, orientation,
shadow which makes recognition difficult. To improve the efficiency and transplant these algorithms into a
system prepared for way finding of visually impaired people, keyword search engine application.
REFERENCES
[1] C. Yi and Y. Tian,Text string detection from natural scenes by structure-based partition and grouping,
vol. 19, no. 12,2011.
[2] Chen and A. L. Yuille,Detecting and reading text in natural scenes in Proc. IEEE Conf. Comput. Vis.
Pattern Recognit., 2004, pp. 366.
[3] N. Zhong,B. Epshtein, E. Ofek, and Y. Wexler, Detecting text in nature scenes with stroke width
transform, in Proc.IEEE Conf. Comput. Vis. Pattern Recognit., 2010, pp. 2963.
[4] K. C. Jung, K. I. Kim, and A. K. Jain,Text information extraction in images and video: A survey, Pattern
Recognit., vol. 5, pp. 977.May 2004.
[5] Datong Chen, Jean-Marc Odobez,Herve Bourlard,Text detection and recognition in images and video
frames Pattern Recognition, vol.37 ,2004,pp 595.
[6] Asif Shahab, Faisal Shafait, Andreas Dengel,ICDAR 2011 Robust Reading Competition Challenge 2:
Reading Text in Scene Images, Int. Jour.on Document Analysis and Recognition,2011.
A Hybrid Approach To Detect And Recognize Text In Images
International organization of Scientific Research 19 | P a g e
[7] Yassin M. Y. Hasan and Lina J. Karam,Morphological Text Extraction from Images, IEEE Transactions
on Image Processing, 9 (11) (2000) 1978-1983.
[8] Y. Pan, X. Hou, and C. Liu, A hybrid approach to detect and localize texts in natural scene images, IEEE
Trans. on Image Processing, vol. 20, no. 3, pp.800, Mar. 2011.
[9] K. I. Kim, K. Jung, and J. H. Kim.Texture based approach for text detection in images using support
vector machines and continuously adaptive mean shift algorithm. IEEE Trans. PAMI, 2003.
[10] L. Neumann and J. Matas.A method for text localization and recognition in real-world images. In Proc. of
ACCV, 2010.
[11] Chucai Yi ,Yingli Tain Localizing text in scene images by boundary clustering,stroke segmentation,and
string fragment classi_cation IEEE trans.on Image processing Vol 21 No.9 Sep 2012.
[12] Anand Mishra, Karteek Alahari,C.V.JawaharTop down and Bottom Up cues for text recognition IEEE
conference of Image processing,2012.
[13] R.Minetto, N.Thome,M.cord,Text detection and Recognition in Urban Scene IEEE 2011.
[14] Ning-Yu An ,Chi-Man PunColor image segmentation using adaptive color quantization and
multiresolution texture characterization. Springer-Verlag London Limited 25 May 2012.
[15] Fakhraddin Mamedov, Jamal Fathi Abu Hasna.Character Recognition Using Neural Networks, Near East
University, North Cyprus, Turkey via Mersin-10, KKT C2005.
[16] R. Jiang, F. Qi, L. Xu, and G.Wu, A learning-based method to detect and segment text from scene
images, J. Zhejiang Univ., vol. 8, pp.568-574, Apr. 2007.
[17] Pranob, K Charles, V.Harish,A Review on the Various Techniques used for Optical Character
Recognition, Vol. 2, Issue 1,Jan-Feb 2012.
[18] Neural Network Approach To Mathematical Expression Recognition System, IJERT Dec 2012.
[19] Anil.K.Jain TextFeature extraction methods for character recognition-A Survey, Pattern Recognition, vol.
29, no. 4, pp. 641-662, 2006.

More Related Content

What's hot

An implementation of novel genetic based clustering algorithm for color image...
An implementation of novel genetic based clustering algorithm for color image...An implementation of novel genetic based clustering algorithm for color image...
An implementation of novel genetic based clustering algorithm for color image...
TELKOMNIKA JOURNAL
 
Image to text Converter
Image to text ConverterImage to text Converter
Image to text Converter
Dhiraj Raj
 
Texture features based text extraction from images using DWT and K-means clus...
Texture features based text extraction from images using DWT and K-means clus...Texture features based text extraction from images using DWT and K-means clus...
Texture features based text extraction from images using DWT and K-means clus...
Divya Gera
 
SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRU...
SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRU...SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRU...
SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRU...
Cheriyan K M
 
Text-Image Separation in Document Images using Boundary/Perimeter Detection
Text-Image Separation in Document Images using Boundary/Perimeter DetectionText-Image Separation in Document Images using Boundary/Perimeter Detection
Text-Image Separation in Document Images using Boundary/Perimeter Detection
IDES Editor
 
Text Detection Strategies
Text Detection StrategiesText Detection Strategies
Text Detection Strategies
Anyline
 
Text Extraction System by Eliminating Non-Text Regions
Text Extraction System by Eliminating Non-Text RegionsText Extraction System by Eliminating Non-Text Regions
Text Extraction System by Eliminating Non-Text Regions
IJCSIS Research Publications
 
Content-based Image Retrieval Using The knowledge of Color, Texture in Binary...
Content-based Image Retrieval Using The knowledge of Color, Texture in Binary...Content-based Image Retrieval Using The knowledge of Color, Texture in Binary...
Content-based Image Retrieval Using The knowledge of Color, Texture in Binary...
Zahra Mansoori
 
Sample Paper Techscribe
Sample  Paper TechscribeSample  Paper Techscribe
Sample Paper Techscribeguest533af374
 
Ay32333339
Ay32333339Ay32333339
Ay32333339
IJERA Editor
 
Evaluation of Texture in CBIR
Evaluation of Texture in CBIREvaluation of Texture in CBIR
Evaluation of Texture in CBIR
Zahra Mansoori
 
C1803011419
C1803011419C1803011419
C1803011419
IOSR Journals
 
Handwritten and Machine Printed Text Separation in Document Images using the ...
Handwritten and Machine Printed Text Separation in Document Images using the ...Handwritten and Machine Printed Text Separation in Document Images using the ...
Handwritten and Machine Printed Text Separation in Document Images using the ...
Konstantinos Zagoris
 
IMAGE RETRIEVAL USING QUADRATIC DISTANCE BASED ON COLOR FEATURE AND PYRAMID S...
IMAGE RETRIEVAL USING QUADRATIC DISTANCE BASED ON COLOR FEATURE AND PYRAMID S...IMAGE RETRIEVAL USING QUADRATIC DISTANCE BASED ON COLOR FEATURE AND PYRAMID S...
IMAGE RETRIEVAL USING QUADRATIC DISTANCE BASED ON COLOR FEATURE AND PYRAMID S...
International Journal of Technical Research & Application
 
A Texture Based Methodology for Text Region Extraction from Low Resolution Na...
A Texture Based Methodology for Text Region Extraction from Low Resolution Na...A Texture Based Methodology for Text Region Extraction from Low Resolution Na...
A Texture Based Methodology for Text Region Extraction from Low Resolution Na...
CSCJournals
 
Dj31514517
Dj31514517Dj31514517
Dj31514517IJMER
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
researchinventy
 
AVC based Compression of Compound Images Using Block Classification Scheme
AVC based Compression of Compound Images Using Block Classification SchemeAVC based Compression of Compound Images Using Block Classification Scheme
AVC based Compression of Compound Images Using Block Classification Scheme
DR.P.S.JAGADEESH KUMAR
 

What's hot (20)

An implementation of novel genetic based clustering algorithm for color image...
An implementation of novel genetic based clustering algorithm for color image...An implementation of novel genetic based clustering algorithm for color image...
An implementation of novel genetic based clustering algorithm for color image...
 
40120140501009
4012014050100940120140501009
40120140501009
 
Image to text Converter
Image to text ConverterImage to text Converter
Image to text Converter
 
Texture features based text extraction from images using DWT and K-means clus...
Texture features based text extraction from images using DWT and K-means clus...Texture features based text extraction from images using DWT and K-means clus...
Texture features based text extraction from images using DWT and K-means clus...
 
SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRU...
SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRU...SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRU...
SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRU...
 
Text-Image Separation in Document Images using Boundary/Perimeter Detection
Text-Image Separation in Document Images using Boundary/Perimeter DetectionText-Image Separation in Document Images using Boundary/Perimeter Detection
Text-Image Separation in Document Images using Boundary/Perimeter Detection
 
Text Detection Strategies
Text Detection StrategiesText Detection Strategies
Text Detection Strategies
 
Text Extraction System by Eliminating Non-Text Regions
Text Extraction System by Eliminating Non-Text RegionsText Extraction System by Eliminating Non-Text Regions
Text Extraction System by Eliminating Non-Text Regions
 
Content-based Image Retrieval Using The knowledge of Color, Texture in Binary...
Content-based Image Retrieval Using The knowledge of Color, Texture in Binary...Content-based Image Retrieval Using The knowledge of Color, Texture in Binary...
Content-based Image Retrieval Using The knowledge of Color, Texture in Binary...
 
Sample Paper Techscribe
Sample  Paper TechscribeSample  Paper Techscribe
Sample Paper Techscribe
 
Ay32333339
Ay32333339Ay32333339
Ay32333339
 
Evaluation of Texture in CBIR
Evaluation of Texture in CBIREvaluation of Texture in CBIR
Evaluation of Texture in CBIR
 
C1803011419
C1803011419C1803011419
C1803011419
 
Handwritten and Machine Printed Text Separation in Document Images using the ...
Handwritten and Machine Printed Text Separation in Document Images using the ...Handwritten and Machine Printed Text Separation in Document Images using the ...
Handwritten and Machine Printed Text Separation in Document Images using the ...
 
IMAGE RETRIEVAL USING QUADRATIC DISTANCE BASED ON COLOR FEATURE AND PYRAMID S...
IMAGE RETRIEVAL USING QUADRATIC DISTANCE BASED ON COLOR FEATURE AND PYRAMID S...IMAGE RETRIEVAL USING QUADRATIC DISTANCE BASED ON COLOR FEATURE AND PYRAMID S...
IMAGE RETRIEVAL USING QUADRATIC DISTANCE BASED ON COLOR FEATURE AND PYRAMID S...
 
A Texture Based Methodology for Text Region Extraction from Low Resolution Na...
A Texture Based Methodology for Text Region Extraction from Low Resolution Na...A Texture Based Methodology for Text Region Extraction from Low Resolution Na...
A Texture Based Methodology for Text Region Extraction from Low Resolution Na...
 
Dj31514517
Dj31514517Dj31514517
Dj31514517
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
 
AVC based Compression of Compound Images Using Block Classification Scheme
AVC based Compression of Compound Images Using Block Classification SchemeAVC based Compression of Compound Images Using Block Classification Scheme
AVC based Compression of Compound Images Using Block Classification Scheme
 
Ac03401600163.
Ac03401600163.Ac03401600163.
Ac03401600163.
 

Similar to C04741319

Optical Character Recognition from Text Image
Optical Character Recognition from Text ImageOptical Character Recognition from Text Image
Optical Character Recognition from Text Image
Editor IJCATR
 
Handwriting_Recognition_using_KNN_classificatiob_algorithm_ijariie6729 (1).pdf
Handwriting_Recognition_using_KNN_classificatiob_algorithm_ijariie6729 (1).pdfHandwriting_Recognition_using_KNN_classificatiob_algorithm_ijariie6729 (1).pdf
Handwriting_Recognition_using_KNN_classificatiob_algorithm_ijariie6729 (1).pdf
Sachin414679
 
A Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In VideosA Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In Videos
CSCJournals
 
A Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In VideosA Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In Videos
CSCJournals
 
Cc31331335
Cc31331335Cc31331335
Cc31331335IJMER
 
Query Image Searching With Integrated Textual and Visual Relevance Feedback f...
Query Image Searching With Integrated Textual and Visual Relevance Feedback f...Query Image Searching With Integrated Textual and Visual Relevance Feedback f...
Query Image Searching With Integrated Textual and Visual Relevance Feedback f...
IJERA Editor
 
Volume 2-issue-6-2119-2124
Volume 2-issue-6-2119-2124Volume 2-issue-6-2119-2124
Volume 2-issue-6-2119-2124Editor IJARCET
 
Volume 2-issue-6-2119-2124
Volume 2-issue-6-2119-2124Volume 2-issue-6-2119-2124
Volume 2-issue-6-2119-2124Editor IJARCET
 
Scene text recognition in mobile applications by character descriptor and str...
Scene text recognition in mobile applications by character descriptor and str...Scene text recognition in mobile applications by character descriptor and str...
Scene text recognition in mobile applications by character descriptor and str...
eSAT Journals
 
Fc4301935938
Fc4301935938Fc4301935938
Fc4301935938
IJERA Editor
 
Anatomical Survey Based Feature Vector for Text Pattern Detection
Anatomical Survey Based Feature Vector for Text Pattern DetectionAnatomical Survey Based Feature Vector for Text Pattern Detection
Anatomical Survey Based Feature Vector for Text Pattern Detection
IJEACS
 
E1803012329
E1803012329E1803012329
E1803012329
IOSR Journals
 
E017322833
E017322833E017322833
E017322833
IOSR Journals
 
A Review of Optical Character Recognition System for Recognition of Printed Text
A Review of Optical Character Recognition System for Recognition of Printed TextA Review of Optical Character Recognition System for Recognition of Printed Text
A Review of Optical Character Recognition System for Recognition of Printed Text
iosrjce
 
Feature integration for image information retrieval using image mining techni...
Feature integration for image information retrieval using image mining techni...Feature integration for image information retrieval using image mining techni...
Feature integration for image information retrieval using image mining techni...iaemedu
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and Development
IJERD Editor
 
A comparative study on content based image retrieval methods
A comparative study on content based image retrieval methodsA comparative study on content based image retrieval methods
A comparative study on content based image retrieval methods
IJLT EMAS
 
Image Segmentation from RGBD Images by 3D Point Cloud Attributes and High-Lev...
Image Segmentation from RGBD Images by 3D Point Cloud Attributes and High-Lev...Image Segmentation from RGBD Images by 3D Point Cloud Attributes and High-Lev...
Image Segmentation from RGBD Images by 3D Point Cloud Attributes and High-Lev...
CSCJournals
 
Reconstructing the Path of the Object based on Time and Date OCR in Surveilla...
Reconstructing the Path of the Object based on Time and Date OCR in Surveilla...Reconstructing the Path of the Object based on Time and Date OCR in Surveilla...
Reconstructing the Path of the Object based on Time and Date OCR in Surveilla...
ijtsrd
 

Similar to C04741319 (20)

Optical Character Recognition from Text Image
Optical Character Recognition from Text ImageOptical Character Recognition from Text Image
Optical Character Recognition from Text Image
 
Handwriting_Recognition_using_KNN_classificatiob_algorithm_ijariie6729 (1).pdf
Handwriting_Recognition_using_KNN_classificatiob_algorithm_ijariie6729 (1).pdfHandwriting_Recognition_using_KNN_classificatiob_algorithm_ijariie6729 (1).pdf
Handwriting_Recognition_using_KNN_classificatiob_algorithm_ijariie6729 (1).pdf
 
A Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In VideosA Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In Videos
 
A Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In VideosA Survey On Thresholding Operators of Text Extraction In Videos
A Survey On Thresholding Operators of Text Extraction In Videos
 
Cc31331335
Cc31331335Cc31331335
Cc31331335
 
Query Image Searching With Integrated Textual and Visual Relevance Feedback f...
Query Image Searching With Integrated Textual and Visual Relevance Feedback f...Query Image Searching With Integrated Textual and Visual Relevance Feedback f...
Query Image Searching With Integrated Textual and Visual Relevance Feedback f...
 
Volume 2-issue-6-2119-2124
Volume 2-issue-6-2119-2124Volume 2-issue-6-2119-2124
Volume 2-issue-6-2119-2124
 
Volume 2-issue-6-2119-2124
Volume 2-issue-6-2119-2124Volume 2-issue-6-2119-2124
Volume 2-issue-6-2119-2124
 
Scene text recognition in mobile applications by character descriptor and str...
Scene text recognition in mobile applications by character descriptor and str...Scene text recognition in mobile applications by character descriptor and str...
Scene text recognition in mobile applications by character descriptor and str...
 
Fc4301935938
Fc4301935938Fc4301935938
Fc4301935938
 
Anatomical Survey Based Feature Vector for Text Pattern Detection
Anatomical Survey Based Feature Vector for Text Pattern DetectionAnatomical Survey Based Feature Vector for Text Pattern Detection
Anatomical Survey Based Feature Vector for Text Pattern Detection
 
E1803012329
E1803012329E1803012329
E1803012329
 
E017322833
E017322833E017322833
E017322833
 
A Review of Optical Character Recognition System for Recognition of Printed Text
A Review of Optical Character Recognition System for Recognition of Printed TextA Review of Optical Character Recognition System for Recognition of Printed Text
A Review of Optical Character Recognition System for Recognition of Printed Text
 
Feature integration for image information retrieval using image mining techni...
Feature integration for image information retrieval using image mining techni...Feature integration for image information retrieval using image mining techni...
Feature integration for image information retrieval using image mining techni...
 
Q0460398103
Q0460398103Q0460398103
Q0460398103
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and Development
 
A comparative study on content based image retrieval methods
A comparative study on content based image retrieval methodsA comparative study on content based image retrieval methods
A comparative study on content based image retrieval methods
 
Image Segmentation from RGBD Images by 3D Point Cloud Attributes and High-Lev...
Image Segmentation from RGBD Images by 3D Point Cloud Attributes and High-Lev...Image Segmentation from RGBD Images by 3D Point Cloud Attributes and High-Lev...
Image Segmentation from RGBD Images by 3D Point Cloud Attributes and High-Lev...
 
Reconstructing the Path of the Object based on Time and Date OCR in Surveilla...
Reconstructing the Path of the Object based on Time and Date OCR in Surveilla...Reconstructing the Path of the Object based on Time and Date OCR in Surveilla...
Reconstructing the Path of the Object based on Time and Date OCR in Surveilla...
 

More from IOSR-JEN

C05921721
C05921721C05921721
C05921721
IOSR-JEN
 
B05921016
B05921016B05921016
B05921016
IOSR-JEN
 
A05920109
A05920109A05920109
A05920109
IOSR-JEN
 
J05915457
J05915457J05915457
J05915457
IOSR-JEN
 
I05914153
I05914153I05914153
I05914153
IOSR-JEN
 
H05913540
H05913540H05913540
H05913540
IOSR-JEN
 
G05913234
G05913234G05913234
G05913234
IOSR-JEN
 
F05912731
F05912731F05912731
F05912731
IOSR-JEN
 
E05912226
E05912226E05912226
E05912226
IOSR-JEN
 
D05911621
D05911621D05911621
D05911621
IOSR-JEN
 
C05911315
C05911315C05911315
C05911315
IOSR-JEN
 
B05910712
B05910712B05910712
B05910712
IOSR-JEN
 
A05910106
A05910106A05910106
A05910106
IOSR-JEN
 
B05840510
B05840510B05840510
B05840510
IOSR-JEN
 
I05844759
I05844759I05844759
I05844759
IOSR-JEN
 
H05844346
H05844346H05844346
H05844346
IOSR-JEN
 
G05843942
G05843942G05843942
G05843942
IOSR-JEN
 
F05843238
F05843238F05843238
F05843238
IOSR-JEN
 
E05842831
E05842831E05842831
E05842831
IOSR-JEN
 
D05842227
D05842227D05842227
D05842227
IOSR-JEN
 

More from IOSR-JEN (20)

C05921721
C05921721C05921721
C05921721
 
B05921016
B05921016B05921016
B05921016
 
A05920109
A05920109A05920109
A05920109
 
J05915457
J05915457J05915457
J05915457
 
I05914153
I05914153I05914153
I05914153
 
H05913540
H05913540H05913540
H05913540
 
G05913234
G05913234G05913234
G05913234
 
F05912731
F05912731F05912731
F05912731
 
E05912226
E05912226E05912226
E05912226
 
D05911621
D05911621D05911621
D05911621
 
C05911315
C05911315C05911315
C05911315
 
B05910712
B05910712B05910712
B05910712
 
A05910106
A05910106A05910106
A05910106
 
B05840510
B05840510B05840510
B05840510
 
I05844759
I05844759I05844759
I05844759
 
H05844346
H05844346H05844346
H05844346
 
G05843942
G05843942G05843942
G05843942
 
F05843238
F05843238F05843238
F05843238
 
E05842831
E05842831E05842831
E05842831
 
D05842227
D05842227D05842227
D05842227
 

Recently uploaded

To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Ramesh Iyer
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
RTTS
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Tobias Schneck
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
Product School
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Product School
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
DianaGray10
 

Recently uploaded (20)

To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 

C04741319

  • 1. IOSR Journal of Engineering (IOSRJEN) www.iosrjen.org ISSN (e): 2250-3021, ISSN (p): 2278-8719 Vol. 04, Issue 07 (July. 2014), ||V4|| PP 19-20 International organization of Scientific Research 13 | P a g e A Hybrid Approach To Detect And Recognize Text In Images 1 Miss.Poonam B. Kadam, 2 Prof. Latika R. Desai 1 (Dept of Computer Engineering, 2 (Dept of Computer Engineering Abstract: - Text detection and recognition in images is a research area which attempts to develop a computer system with the ability to automatically read text from images or videos. This problem is challenging due to the complex background and large variations of these components features like color, size, shape, orientation or texture. Here new proposed framework is explored which can automatically detect and recognize aligned text from urban scene images e.g.shop name,landmark,street.The proposed framework evaluated on new generated dataset.It consist of a three main step 1) image partition perform to segment text based on color information.2) character grouping to detect text character in every text string depend on character size differences, distance between neighboring characters.3) the detected text recognition using neural network. This proposed method efficiently and accurately detects and recognizes the text with a low false positive. Keywords: - Text detection, preprocessing, segmentation, character recognition. I. INTRODUCTION Many facilities are involved in human reading and similarly, there has been a large amount of research on computational methods for text recognition. The problem of developing a model for reading is broad, surrounding many facets of information. Also there are many typefaces, and they may be encountered at different legible sizes. Text is frequently printed on a surface and viewers generally doesn't want that the surface be fronto-parallel to their eyes in order to read it. Additionally, changing lighting conditions and designers use many colors for background and text. All of these factors leads to make the robust reading problem very challenging. Text understanding system consist of four stages: text detection, text localization, text extraction, text recognition. These stages are used interchangeably. Text detection consists of determination of the occurrence of text in images. Text localization is the process of determining the location of text. In text extraction stage the text components in images are segmented from background. After, the extracted text images can be converted into plain text using OCR technology. Through Text detection and recognition in images is coupling of text-based searching technologies and optical character recognition (OCR).Text in images or video is a powerful source of information. Caption embedded with news videos provide information about name of related location, people, date and time, purpose. Caption provide an summary or abstract of the image. II. LITERATURE REVIEW There have been a different methods dealing with text detection and recognition in images [4, 17]. Comprehensive surveys can be found in [8].Some approaches to text detection classified into three categories: texture-based methods, region based methods and hybrid methods. Texture-based methods [3,16] involves texture properties of text such as style, orientation and the construction of gray-level co-occurrence matrix. These methods are computation demanding as all locations and scales are exhaustively scanned. Moreover, these algorithms mostly concentrate to detect horizontal texts. Region based methods [7,18] use the properties of the color or gray scale or alignment in a text region or their differences in properties of the background. First extract candidate text regions through segmentation or clustering and then remove non-text regions. The third category, hybrid methods [9]is a fusion of region-based and texture-based methods. Different document or web, e-mail images, in which text characters are normalized and proper resolutions, natural scene images, embed text can be in size, shapes and orientations into complex background. It is impossible to recognize text in images directly through OCR software because of complex background. Thus, we need to detect image regions containing text and their corresponding orientations. This is compatible with the detection procedure described text extraction algorithms survey[4,6]. Yi Method work in two stages for text detection first, the connected component in image partition based on gradient and color feature, then through text line grouping and adjacent character grouping [1]done which gives set of candidate or text line in image obtained. Then, Haar features are extracted from gradient maps and stroke by the block patterns presented in [2].
  • 2. A Hybrid Approach To Detect And Recognize Text In Images International organization of Scientific Research 14 | P a g e III. SYSTEM DESIGN Fig 1. System Architecture The proposed framework used to detect and recognize scheme efficient in urban scene context. This is an unified framework for text detection and recognition. The framework consist of three stages 1) image partition perform to segment text based on color information.2) character grouping to detect text characters in same text string depend on character size differences, distance between neighboring characters.3) the text recognition using neural network of detected text. After study of previous methods we found that combination color based partition and adjacent character grouping achieves better results in text detection. Therefore here by using effective color segmentation approach for text detection which help to get accurate result for recognition. Brief description of our proposed work. 3.1 Module I: Preprocessing First, preprocessing and segmentation of input urban scene image, involves removing low frequency background noise, removing reflections, it is used for efficient extraction of text region information, a text region detector is designed to estimate the text confidence and the corresponding scale, based on which text candidate components are segmented and analyzed accurately. This stage significantly increase the reliability of an optical inspection. Here new hybrid color image segmentation approach is used. In most of images, color uniformity acts as stronger indicator to distinguish text characters of text string from background. Here k-means algorithm used. K-Means Algorithm The whole process of clustering is depicted in Figure 11. Given a set of observations (x1, x2,..., xn), where each observation is a d-dimensional real vector, k-means clustering aims to partition the n observations into k sets (k ≤ n) S = S1, S2,..., Sk so as to minimize the within-cluster sum of squares (WCSS): where µi is the mean of points in Si. The initial step is choosing k cluster centroid positions, sometimes called as seeds randomly from the image, the clusters are named as m1,m2,……,mk. Then, the algorithmruns iteratively, which mainly consists of two steps.  Assignment step: Assign each observation to the cluster whose mean yields the least within cluster sum of squares (WCSS). Since the sum of squares is the squared Euclidean distance, this is intuitively the nearest mean: (1) (2)
  • 3. A Hybrid Approach To Detect And Recognize Text In Images International organization of Scientific Research 15 | P a g e where each xp is assigned to exactly one St , even if it could be is assigned to two or more of them.  Update step: Calculate the new means as the centroids of the observations in the new clusters: In second step, calculate the new means as centroid of the clusters. The iteration done until assignments no longer change, which is a convergence to a local optimum. The k-means clustering is that we must specify the number of clusters in advance. After the color quantization, the image is divided into different clusters, and we treat the connected components with the same cluster labels as a region. Each region is represented by its average color components. The output of the stage is the image partitioned into thousands of tiny regions quantized by a small number of colors. 3.2. Module II: Segmentation The first step in a process of character recognition is a detection of a text area. After detecting the image text area segment it using horizontal projection. After text is segmented then characters are extracted from horizontal segments. Extracted characters are normalized by checking light related parameters like brightness etc. Then, recognized using by NN algorithm. Edge Detection Let us define input image as a rectangular area with horizontal and vertical edges. The high density of vertical and horizontal edges on a small area is in different cases caused by contrast characters, but not in every case. This process can cause sometimes detect a wrong area that does not correspond to actual text in image. Because of this, we detect several candidates using this algorithm, and then select the best one by a further heuristic analysis. The input image edge can detected vertically and horizontally. Here segmentation using a horizontal projection is done. After cropping text area is deskewed, segment it by detecting spaces in its horizontal projection The adaptive thresholding is used to separate dark foreground from light background with non-uniform illumination before segmentation. After the thresholding horizontal projection perform f (x,y) to find horizontal boundaries between segmented characters. These boundaries correspond to peaks in the graph of the horizontal projection. 3.3. Module III: Character Recognition After pre-processing and segmentation step feature set is extracted this is further used for training and recognition step. Feature extraction stage where features of the characters that are crucial for classifying them at recognition stage are extracted. After this Neural network used as learning mechanism .In this comparison of input character pattern with stored character training set and find their matching probability. In the used method, various characters are taught to the network in a supervised manner. A labeled input character also have several variant patterns of the same character are taught to the network under the same label. Using it the network learns various possible variations of a single pattern and becomes adaptive in nature. This process also know as Character Matching which compare the segments against characters in a database, it check matching score of each database template to input character segment. IV. RESULT We evaluated proposed method on the most challenging street scene text dataset ,namely SVT (street view, landmarks, boards).The images in used dataset similar to traditional OCR setting which not contain low contrast or low resolution images. The motivation of this approach is to detect significant and accurate text in image and recognize it. The combination of color and texture based segmentation and after that using neural network for highly accurate character recognition. In recognition, the classifiers are evaluated with several metrics like accuracy refers to the percentage of the testing samples which were correctly recognized. Accuracy refers to the percentage of the characters of the testing samples which were correctly recognized. We have tested many random sample urban scene images which are collected as test set. We show performance of our method in GUI mention below. (3)
  • 4. A Hybrid Approach To Detect And Recognize Text In Images International organization of Scientific Research 16 | P a g e 4.1. Dataset and Evaluation We used the street view text(SVT)images used for robust word recognition in our system. Our street view text dataset contain images taken from Google. We have tested 100 sample images. Most of the images are from urban signage. Our work focus on detecting words and recognizing respectively. In our experiment color based segmentation and neural network are performed to partion and recognition of detected text. After testing 100 sample referred images from outdoor (street) indoor(mall and office).Our dataset the resulting precision, recall and f-measure are 0.83,0.93,0.25 respectively. Based on definition of precision and recall are: Precision (P) is the proportion of the predicted positive cases(tp) that were correct,as calculated using the equation: precision=|tp| / |tp + fp| The recall or true positive rate (tp) is the proportion of positive: recall= |tp| / |tp + fn| where false positive(fp) and false negative(fn).The f measures is combination of the precision and recall f = 2.precision.recall / (precision + recall) Sample Input Image from street scene text dataset. Step-I Preprocessing and Segmentation As shown in Figure we get cropped words separately after color segmentation. The Figure 2.shows different color segments in image depending that we get partitioned image for text detection. Fig 2.After Preprocessing and Segmentation Step-II Recognition All adjacent character groups i.e cropped words are recognized accurately as shown in Figure .Also we can separately recognize each word as shown in Figure3,4.
  • 5. A Hybrid Approach To Detect And Recognize Text In Images International organization of Scientific Research 17 | P a g e Fig 3. Character Recognition Fig 4. Separately Recognition of subimage
  • 6. A Hybrid Approach To Detect And Recognize Text In Images International organization of Scientific Research 18 | P a g e 4.2. Result Table Table 1. Sample Images with Result Table V. CONCLUSION This is unified approach on learning based methods for text detection from image having complex background and normalized text recognition. We have proposed the detection model eliminates the need for layout rules by learning spatial context parameters. Also, eliminate the peephole view of the sliding window approach and unify the entire process under a probabilistic model. Finally, robust reader needs rough text detection windows and completes both word and character segmentation in a recognition-driven process. To deal with the problems involved in training a computer to read from detection, to segmentation and recognition. To read text information embedded in those images, we propose a new framework which is more concentrating on how to give detected text having less false positive input to OCR give efficient and accurate recognized text. Here we choose texture color based image segmentation method for text detection and fast, accurate recognition perform using neural network framework. VI. FUTURE ENHANCEMENT There are different challenges occurs in text extraction from image because of size,style, orientation, shadow which makes recognition difficult. To improve the efficiency and transplant these algorithms into a system prepared for way finding of visually impaired people, keyword search engine application. REFERENCES [1] C. Yi and Y. Tian,Text string detection from natural scenes by structure-based partition and grouping, vol. 19, no. 12,2011. [2] Chen and A. L. Yuille,Detecting and reading text in natural scenes in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2004, pp. 366. [3] N. Zhong,B. Epshtein, E. Ofek, and Y. Wexler, Detecting text in nature scenes with stroke width transform, in Proc.IEEE Conf. Comput. Vis. Pattern Recognit., 2010, pp. 2963. [4] K. C. Jung, K. I. Kim, and A. K. Jain,Text information extraction in images and video: A survey, Pattern Recognit., vol. 5, pp. 977.May 2004. [5] Datong Chen, Jean-Marc Odobez,Herve Bourlard,Text detection and recognition in images and video frames Pattern Recognition, vol.37 ,2004,pp 595. [6] Asif Shahab, Faisal Shafait, Andreas Dengel,ICDAR 2011 Robust Reading Competition Challenge 2: Reading Text in Scene Images, Int. Jour.on Document Analysis and Recognition,2011.
  • 7. A Hybrid Approach To Detect And Recognize Text In Images International organization of Scientific Research 19 | P a g e [7] Yassin M. Y. Hasan and Lina J. Karam,Morphological Text Extraction from Images, IEEE Transactions on Image Processing, 9 (11) (2000) 1978-1983. [8] Y. Pan, X. Hou, and C. Liu, A hybrid approach to detect and localize texts in natural scene images, IEEE Trans. on Image Processing, vol. 20, no. 3, pp.800, Mar. 2011. [9] K. I. Kim, K. Jung, and J. H. Kim.Texture based approach for text detection in images using support vector machines and continuously adaptive mean shift algorithm. IEEE Trans. PAMI, 2003. [10] L. Neumann and J. Matas.A method for text localization and recognition in real-world images. In Proc. of ACCV, 2010. [11] Chucai Yi ,Yingli Tain Localizing text in scene images by boundary clustering,stroke segmentation,and string fragment classi_cation IEEE trans.on Image processing Vol 21 No.9 Sep 2012. [12] Anand Mishra, Karteek Alahari,C.V.JawaharTop down and Bottom Up cues for text recognition IEEE conference of Image processing,2012. [13] R.Minetto, N.Thome,M.cord,Text detection and Recognition in Urban Scene IEEE 2011. [14] Ning-Yu An ,Chi-Man PunColor image segmentation using adaptive color quantization and multiresolution texture characterization. Springer-Verlag London Limited 25 May 2012. [15] Fakhraddin Mamedov, Jamal Fathi Abu Hasna.Character Recognition Using Neural Networks, Near East University, North Cyprus, Turkey via Mersin-10, KKT C2005. [16] R. Jiang, F. Qi, L. Xu, and G.Wu, A learning-based method to detect and segment text from scene images, J. Zhejiang Univ., vol. 8, pp.568-574, Apr. 2007. [17] Pranob, K Charles, V.Harish,A Review on the Various Techniques used for Optical Character Recognition, Vol. 2, Issue 1,Jan-Feb 2012. [18] Neural Network Approach To Mathematical Expression Recognition System, IJERT Dec 2012. [19] Anil.K.Jain TextFeature extraction methods for character recognition-A Survey, Pattern Recognition, vol. 29, no. 4, pp. 641-662, 2006.