SlideShare a Scribd company logo
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 01 | Jan -2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 57
DOCUMENT LAYOUT ANALYSIS USING INVERSE SUPPORT VECTOR MACHINE (I-SVM) FOR HINDI NEWS PAPER IN
IMAGE PROCESSING
Rimmy Kathuria
M .Tech Student of MRIU, Faridabad.
Abstract: Optical character recognition (OCR) is the
translation of handwritten, typed or impressed word
images into a form that the computer can manipulate.
There are various steps in OCR. One of them is
classification. To do the Classification, we must have to
compare a training data with many feature vectors. A
classifier is needed to compare the feature vector of input
and the feature vector of training data using inverse
support vector machine. This paper proposed
classification techniques, which are used to recognize
character or word, and about some related work which has
been done. It will also present a novel learning based
framework to extract articles from newspaper images
using bounding box method. The input to the system
comprises blocks of text and graphics, obtained using
standard image processing techniques. The fixed point
model uses contextual information and features of each
block to learn the layout of newspaper images and attains
a contraction mapping to assign a unique label to every
block. Experimental results show the applicability of our
algorithm in document newspaper layout labeling and
article extraction.
Keywords: OCR, I-SVM, CANNY, SOBEL, Hindi newspaper
script.
I. INTRODUCTION
This Research gives an overview about document layout
analysis. It presents in detail one method which is part of
the pre-processing phase, called Morphological operator.
In doing so it introduces a special technique of how
determining the bounding box and inverse support vector
machine. Furthermore
it presents numerous ways of approaches to speed up the
process of document layout.The second part deals with
page segmentation which is one of the huge parts in
document image analysis. It illustrates a means of
partitioning using bounding boxes of different entities.
Reasonable success has been achieved at developing mono
lingual OCR systems in Indian scripts. Scientists,
optimistically, have started to look beyond. Development
of bilingual OCR systems and OCR systems with capability
to identify the text areas are some of the pointers to future
activities in Indian scenario. The separation of text and
non-text regions before considering the document layout
for OCR is an important task.
In this paper, we present a biologically inspired,
multichannel filtering scheme for page layout analysis. The
same scheme has been used for script recognition as well.
Parameter tuning is mostly done heuristically. It has also
been seen to be computationally viable for commercial
OCR system development.
Figure 1: Block diagram of the OCR
II. OCR
OCR Stands for Optical Character Recognition. It is Extracts
the text from a given image. It is invented by Gustav
Tauschek. Tauschek obtained a patent on OCR 1929 in
Germany and 1935 in USA.
Figure 2: Example of OCR
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 01 | Jan -2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 58
III. Data for work
Data in a newspaper document analysis are usually
captured by optical scanning and stored in a file of picture
elements, called pixels that are sampled in a grid pattern
throughout the document. These pixels may have values:
OFF (0) or ON (1) for binary images, 0–255 for gray-scale
images and 3 channels of 0–255 colour values for colour
images.
At a typical sampling resolution of 120 pixels per
centimeter, a 20 x 30 cm page would yield an image of
2400x3600 pixels. When the document is on a different
medium such as microfilm, palm leaves, or fabric,
photographic methods are often used to capture images. In
any case, it is important to understand that the image of
the document contains only raw data that must be further
analyzed to collect the information.
IV. PROPOSED IMPLEMENTATION
We have used scanned images of Hindi newspapers for our
experimentation. Each document provides for a number of
blocks, typically an average of multiple blocks exists in a
single newspaper image. We use Bounding box to obtain
these blocks.
This is the most important stage of document layout OCR
system. Relevant information from the selected data has
been extracted for classification. Different shapes of the
character parts have been selected from feature selection
algorithm. They might be curves or points or linear shapes.
But most of them are open curve shaped. In feature
extraction, two techniques are proposed here. One is
support vector based sub line direction and the other one
is bounding box based shape detection. Here, directions
are extracted by sub line direction and bounding box
procedures from selected portion of character image.
A. Preprocessing Of newspaper data
The gray scale image is first binaries using the method
described in our earlier work [2]. Horizontal and vertical
lines are then removed on the basis of aspect ratio of
connected components.
B. Inverse-SVM classification
The classification strategy we have adopted combines both
deterministic and probabilistic decision making. First,
using training samples, we try to learn the various conflicts
among classes and equivalent representations for every
class. . From here, we use probabilistic classification to
assign a label to the test pattern. The important point to
observe here is, the switch to conflict resolution can
happen at any level (from 1 to 4), since conflicts can be
defined at various levels. Usually this will be dictated by
certain trade-offs between accuracy and precision-recall,
as we show later. The classifier we use to resolve conflicts
is the Inverse Support Vector Machine (I-SVM). Though it
is not mandatory to use SVMs, their utility in character
recognition has been well documented. We have chosen
RBF kernel, and the SVM formulation “support vector
classification” for multi-class classification.
C. SOBEL OPERATOR
The most popular approximation of equation (1) but using
a 33 mask is the following.
)2()2()2()2( 741963321987 zzzzzzzzzzzzf 
(1)
This approximation is known as the Sobel operator.
If we consider the left of mask the Sobel operator, this
causes differentiation along the y direction. If we isolate
the following part of the mask and treat it as a one
dimensional mask, we are interested in finding the effects
of that mask. We will therefore, treat this mask as a one
dimensional impulse response ][nh of the form
Sobel operator
-
1
00
2
-
2
1
0
-
1
1
-1
0-2
0
0
1
2
1
-1
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 01 | Jan -2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 59











otherwise0
11
02
11
][
n
n
n
nh
The above response applied to a signal ][nx yields a signal
]1[][2]1[][  nxnxnxny or in z-transform domain
)()1(cos2)()()2()( 1
 jXjYzXzzzY  
. (2)
Therefore, ][nh is the impulse response of a system with
transfer function
)()1(cos2)(  jHjH  Shown in the figure
below for ],0[  . This is a low pass filter type of response.
Therefore, we can claim that the Sobel operator has a
differentiation effect along one of the two directions and a
smoothing effect along the other direction.
D. Finding text lines
Precise identification of text lines is an important part for
most OCR systems. It is also very useful for document
layout analysis. Numerous methods have been proposed
for text line and baseline finding. Some methods attempt to
find just text lines, e.g. by using Hough transforms,
projection profiles, and Radon transforms. Others find text
lines as part of more general document layout analysis
tasks, e.g. XY cuts, whitespace segmentation, Voronoi
diagrams, and distance-based grouping. Usually, such
methods start by performing a rough analysis of the
layout, often based on the proximity of bounding boxes of
connected components or connected components
themselves. More precise base- and text-line models are
employed in subsequent steps, as required. The text line
fitting algorithm correctly identified all lines present in the
documents when the extracted bounding boxes allowed it
to do so. No spurious text lines were detected, and some
short one-word lines were ignored. One particularly
interesting property of this algorithm is that it allows
variations in text line orientations. This permits the
algorithm to be applied to document images captured by
photo or video cameras rather than by scanners.
E. MORPHOLOGICAL OPERATION
A morphological operation is used to remove the noise and
smooth the shape of the candidate text areas. The element
used here is also cross-shaped with size 11x45. The size of
the elements for the morphological operations and the
geometrical constraints give to the algorithm the ability to
detect text in a specific range of character sizes (12-48
pixels).
F. Canny edge detection
We use canny edge detector applied in grey scale images.
Canny uses Sobel masks in order to find the edge
magnitude of the image, in gray scale, and then uses non-
Maxima suppression and hysteresis thresholding. With
these two post-processing operations canny edge detector
manage to remove no maxima pixels, preserving the
connectivity of the contours. After computing the Canny
edge map, dilation by an element 5x21 is performed to
connect the character contours of every text line. Dilation
by a cross-shaped element 5x21 is performed to connect
the character contours of every text line.
G. Pseudo code of proposed method
1. Initialize input image from dataset
2. If input image in RGB format then convert in gray
image
3. If no then go to next step
4. Now do the preprocessing step using Bounding
box
5. Also apply the morphology operation
6. Then extract the feature
7. Testing and classification using Inverse support
vector machine
8. Find the F-measure, Accuracy and Precision Recall
value.
9. Finally get the output result
10. End
2
1
-1 10
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 01 | Jan -2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 60
Figure 3: Proposed flow chart
V.RESULT
The present growth of document analysis of books and
manuscripts demands an immediate solution to access
them electronically. This requires research in the area of
document image understanding, specifically in the area of
document layout analysis. There is an immense scope for
such a feature extraction system for a digital Document
Images. This paper presents an efficient Inverse support
vector machine system for a Hindi newspaper document
image collection. A recognition-free approach is followed
because recognition based approach is inefficient in terms
of performance. The data is pre-processed and segmented
for faster matching and extraction. An efficient search
technique - Correlation method is used to search in large
collection of document images. Performance evaluation
using different datasets of documents shows the
effectiveness of the approach.
Figure 5.1: Input Image with Noise
For the next test scenario we used for training only the
features corresponding to Hindi newspaper OCR letters.
The image used for testing contained data, and the
construction of the training set, which consisted of images
containing examples of each Hindi OCR letter in the Hindi
alphabet. We used gray scale for image without noise.
Figure 5.2: Extracted character from input image
Figure 5.3: Binary image
Figure 5.4: Image after applying Median filter on
binary image.
Figure 5.5: Segmented image
INPUT IMAGE WITH NOISE
Binary Hindi Text for Morphological Operator
Average Median Filter
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 01 | Jan -2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 61
Figure 5.6: comparisons between ISVM-morphology
and SVM X-Y cut for f-measure and recall value
Figure 5.7 comparisons between ISVM-morphology
and SVM X-Y cut for Accuracy on iteration value
Figure 5.8 comparisons between ISVM-morphology
and SVM X-Y cut for precision and recall value
Evaluation strategy
A text line must have influence to the final evaluation
measure proportional to the number of containing
characters and not to the number of its pixels
The number of characters in a box cannot be defined by
the algorithm but it can be approximated by the ratio
width/height of the bounding box
hgi is the height of the ith ground truth bounding box . hdi is
the height of the ith detection bounding box.
EGi is the number of pixel of the ith ground truth
bounding box.
EDi is the number of pixel of the ith detection bounding
box.
EGD is the number of pixel of the intersection that
belongs to ith ground truth bounding box.
EGDi is the number of pixel of the intersection that
belongs to ith detection bounding box.
MSE = 13.1506
MSE=13.1506
PSNR = 85.0606
VI.CONCLUSION
0 20 40 60 80 100 120
0
0.2
0.4
0.6
0.8
1
1.2
1.4
recall
Fmeasure
Fmeasure results
I-SVM Morphology
SVM-XY cut
1 2 3 4 5 6 7 8 9
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Iteration
Accuracy
Accuracy results
SVM-XY cut
I-SVM Morphology
0 10 20 30 40 50 60 70 80 90
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
recall
precision
precision-recall for SVM-CC-BBX and ISVM-CC-BBX
I-SVM Morphology
SVM-XY cut
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 01 | Jan -2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 62
In this paper, document layout analysis text extraction
techniques such as I-SVM, bounding box, sobel operator,
morphological method etc. have been discussed. The
performance comparison of these methods for document
text extraction on the basis of accuracy, precision rate,
recall rate, processing time has been done. It is observed
that better accuracy is best for document texture Analysis)
approach and edge based text extraction techniques.
Precision and recall rate is best in case of I-SVM algorithm
and segmentation method.
Future work
We plan to exploit the colour homogeneity of text
temporal, text detection from frame to frame.Multi-frame
integration for image enhancement
REFERENCES
[1] Vijay Singh and Bhupendra Kumar “Document layout
analysis for Indian newspapers using contour based
symbiotic approach” 2014 International Conference on
Computer Communication and Informatics (ICCCI -2014),
Jan. 03 – 05, 2014, Coimbatore, INDIA.
[2] S. Malakar, S. Halder, R. Sarker, N. Das, S. Basu, M.
Nasipuri, Text line Extraction from Handwritten Document
pages using spiral run length smearing algorithm,
International Conference on communications, Devices and
Intelligent Systems, Kolkata, Dec. 28-29 (2012) 616-619.
[3] S.J. Ha, B. Jin, N.I. Cho, Fast Text Line Extraction in
Document Images, 19th IEEE International Conference on
Image Processing, Orlando, Sept. 30-Oct 3 (2012) 797-800.
[4] S.V. Seeri, S. Giraddi, Prashant B.M, A Novel Approach
for Kannada Text Extraction, Proceedings of the
International Conference on Pattern Recognition,
Informatics and Medical Engineering, Tamil Naidu, Mar.
21-23 (2012) 444-448.
[5] Z. Li, J. Luo, Resolution Enhancement from Document
Images for Text Extraction, 5th International Conference on
Multimedia and Ubiquitous Engineering, Loutraki, June 28-
30 (2011) 251-256.
[6] D. Zaravi, H. Rostami, A. Malahzaheh, S.S Mortazavi,
Journals Subheadlines Text Extraction Using Wavelet
Thresholding and New Projection Profile, World Academy of
Science, Engineering and Technology, 49 (2011) 686-689.
[7] T.V. Hoang, S. Tabbone, Text Extraction From Graphical
Document Images Using Sparse Representation,
International Workshop on Document Analysis Systems,
June 9-11 (2010) 143-150.
[8] P. Nagabhushan, S. Nirmala, Text Extraction in Complex
Color Document Images for Enhanced Readability,
Intelligent Information Management, 2 (2010) 120-133.
[9] D. Dunn, W. E. Higgins, and J. Wakeley, “Texture
segmentation using 2-D Gabor elementary functions,” IEEE
transaction on Pattern Analysis and
Machine Intelligence, vol. 16, no. 2, pp. 130-149,
1994.
[l0] W. Chan and G. Coghill, “Text analysis using local
energy,” Pattern Recognition, vol. 34,
pp. 2523-2532,2001.
[l1] U. Pal and B. B. Chaudhuri, “Script line separation from
Indian muliti-script document,” in Proceedings of the
International Conference on Document Analysis and
Recognition, pp. 406-409, 1999.
[I21 D. Dhanya, A. G. Ramakrishnan, and P. B. Pati,
“Script identification in printed bilingual docuements,”
Sadhana, vol. 27, pp. 73-82,2002.
[13] Chih-Wei Hsu, Chih-len Lin. A comparison of methods
for multiclass support vector machines. IEEE Trans Neural
Networks. 13: 415-425, March (2002).
[14] Vijay singh and Bhupendra kumar “document layout
analysis for Indian newspapers using contour based
symbiotic approach”, 2014 International Conference on
Computer Communication and Informatics (ICCCI -
2014),IEEE.
[15] Chih-Chung Chang and Chih-Jen Lin, LIBSVM: a library
for support vector machines, Software available at
http://www.csie.ntu.edu.tw/ cjlin/libsvm.
[16] F. Liu, Y. Luo, D. Hu, and M. Yoshikawa. A new
component based
algorithm for newspaper layout analysis. pages 1176 1180.
IEEE Computer Society, 2001.
[17] S. Mao, A. Rosenfeld, and T. Kanungo. Document
structure analysis algorithms: a literature survey. volume
5010 of SPIE Proceedings, pages 197–207. SPIE, 2003.

More Related Content

What's hot

Content Based Image Retrieval Approach Based on Top-Hat Transform And Modifie...
Content Based Image Retrieval Approach Based on Top-Hat Transform And Modifie...Content Based Image Retrieval Approach Based on Top-Hat Transform And Modifie...
Content Based Image Retrieval Approach Based on Top-Hat Transform And Modifie...
cscpconf
 
Template matching03
Template matching03Template matching03
Template matching03
amitkhanna1991
 
Image to text Converter
Image to text ConverterImage to text Converter
Image to text Converter
Dhiraj Raj
 
The Application Of Bayes Ying-Yang Harmony Based Gmms In On-Line Signature Ve...
The Application Of Bayes Ying-Yang Harmony Based Gmms In On-Line Signature Ve...The Application Of Bayes Ying-Yang Harmony Based Gmms In On-Line Signature Ve...
The Application Of Bayes Ying-Yang Harmony Based Gmms In On-Line Signature Ve...
ijaia
 
E046012533
E046012533E046012533
E046012533
IJERA Editor
 
GRAY SCALE IMAGE SEGMENTATION USING OTSU THRESHOLDING OPTIMAL APPROACH
GRAY SCALE IMAGE SEGMENTATION USING OTSU THRESHOLDING OPTIMAL APPROACHGRAY SCALE IMAGE SEGMENTATION USING OTSU THRESHOLDING OPTIMAL APPROACH
GRAY SCALE IMAGE SEGMENTATION USING OTSU THRESHOLDING OPTIMAL APPROACH
Journal For Research
 
K2 Algorithm-based Text Detection with An Adaptive Classifier Threshold
K2 Algorithm-based Text Detection with An Adaptive Classifier ThresholdK2 Algorithm-based Text Detection with An Adaptive Classifier Threshold
K2 Algorithm-based Text Detection with An Adaptive Classifier Threshold
CSCJournals
 
Image Processing for Automated Flaw Detection and CMYK model for Color Image ...
Image Processing for Automated Flaw Detection and CMYK model for Color Image ...Image Processing for Automated Flaw Detection and CMYK model for Color Image ...
Image Processing for Automated Flaw Detection and CMYK model for Color Image ...
IOSR Journals
 
Object Recognition Using Shape Context with Canberra Distance
Object Recognition Using Shape Context with Canberra DistanceObject Recognition Using Shape Context with Canberra Distance
Object Recognition Using Shape Context with Canberra Distance
Associate Professor in VSB Coimbatore
 
Segmentation and recognition of handwritten digit numeral string using a mult...
Segmentation and recognition of handwritten digit numeral string using a mult...Segmentation and recognition of handwritten digit numeral string using a mult...
Segmentation and recognition of handwritten digit numeral string using a mult...
ijfcstjournal
 
Face recognition using selected topographical features
Face recognition using selected topographical features Face recognition using selected topographical features
Face recognition using selected topographical features
IJECEIAES
 
Successive Geometric Center Based Dynamic Signature Recognition
Successive Geometric Center Based Dynamic Signature RecognitionSuccessive Geometric Center Based Dynamic Signature Recognition
Successive Geometric Center Based Dynamic Signature Recognition
Dr. Vinayak Bharadi
 
Face recognition using gaussian mixture model & artificial neural network
Face recognition using gaussian mixture model & artificial neural networkFace recognition using gaussian mixture model & artificial neural network
Face recognition using gaussian mixture model & artificial neural network
eSAT Journals
 
IRJET- Note to Coin Converter
IRJET- Note to Coin ConverterIRJET- Note to Coin Converter
IRJET- Note to Coin Converter
IRJET Journal
 
IRJET- Shape based Image Classification using Geometric ­–Properties
IRJET-  	  Shape based Image Classification using Geometric ­–PropertiesIRJET-  	  Shape based Image Classification using Geometric ­–Properties
IRJET- Shape based Image Classification using Geometric ­–Properties
IRJET Journal
 
Scene Text Detection of Curved Text Using Gradiant Vector Flow Method
Scene Text Detection of Curved Text Using Gradiant Vector Flow MethodScene Text Detection of Curved Text Using Gradiant Vector Flow Method
Scene Text Detection of Curved Text Using Gradiant Vector Flow Method
IJTET Journal
 
A Simple Signature Recognition System
A Simple Signature Recognition System A Simple Signature Recognition System
A Simple Signature Recognition System
iosrjce
 
A combined method of fractal and glcm features for mri and ct scan images cla...
A combined method of fractal and glcm features for mri and ct scan images cla...A combined method of fractal and glcm features for mri and ct scan images cla...
A combined method of fractal and glcm features for mri and ct scan images cla...
sipij
 
Analysis of computational
Analysis of computationalAnalysis of computational
Analysis of computational
csandit
 

What's hot (19)

Content Based Image Retrieval Approach Based on Top-Hat Transform And Modifie...
Content Based Image Retrieval Approach Based on Top-Hat Transform And Modifie...Content Based Image Retrieval Approach Based on Top-Hat Transform And Modifie...
Content Based Image Retrieval Approach Based on Top-Hat Transform And Modifie...
 
Template matching03
Template matching03Template matching03
Template matching03
 
Image to text Converter
Image to text ConverterImage to text Converter
Image to text Converter
 
The Application Of Bayes Ying-Yang Harmony Based Gmms In On-Line Signature Ve...
The Application Of Bayes Ying-Yang Harmony Based Gmms In On-Line Signature Ve...The Application Of Bayes Ying-Yang Harmony Based Gmms In On-Line Signature Ve...
The Application Of Bayes Ying-Yang Harmony Based Gmms In On-Line Signature Ve...
 
E046012533
E046012533E046012533
E046012533
 
GRAY SCALE IMAGE SEGMENTATION USING OTSU THRESHOLDING OPTIMAL APPROACH
GRAY SCALE IMAGE SEGMENTATION USING OTSU THRESHOLDING OPTIMAL APPROACHGRAY SCALE IMAGE SEGMENTATION USING OTSU THRESHOLDING OPTIMAL APPROACH
GRAY SCALE IMAGE SEGMENTATION USING OTSU THRESHOLDING OPTIMAL APPROACH
 
K2 Algorithm-based Text Detection with An Adaptive Classifier Threshold
K2 Algorithm-based Text Detection with An Adaptive Classifier ThresholdK2 Algorithm-based Text Detection with An Adaptive Classifier Threshold
K2 Algorithm-based Text Detection with An Adaptive Classifier Threshold
 
Image Processing for Automated Flaw Detection and CMYK model for Color Image ...
Image Processing for Automated Flaw Detection and CMYK model for Color Image ...Image Processing for Automated Flaw Detection and CMYK model for Color Image ...
Image Processing for Automated Flaw Detection and CMYK model for Color Image ...
 
Object Recognition Using Shape Context with Canberra Distance
Object Recognition Using Shape Context with Canberra DistanceObject Recognition Using Shape Context with Canberra Distance
Object Recognition Using Shape Context with Canberra Distance
 
Segmentation and recognition of handwritten digit numeral string using a mult...
Segmentation and recognition of handwritten digit numeral string using a mult...Segmentation and recognition of handwritten digit numeral string using a mult...
Segmentation and recognition of handwritten digit numeral string using a mult...
 
Face recognition using selected topographical features
Face recognition using selected topographical features Face recognition using selected topographical features
Face recognition using selected topographical features
 
Successive Geometric Center Based Dynamic Signature Recognition
Successive Geometric Center Based Dynamic Signature RecognitionSuccessive Geometric Center Based Dynamic Signature Recognition
Successive Geometric Center Based Dynamic Signature Recognition
 
Face recognition using gaussian mixture model & artificial neural network
Face recognition using gaussian mixture model & artificial neural networkFace recognition using gaussian mixture model & artificial neural network
Face recognition using gaussian mixture model & artificial neural network
 
IRJET- Note to Coin Converter
IRJET- Note to Coin ConverterIRJET- Note to Coin Converter
IRJET- Note to Coin Converter
 
IRJET- Shape based Image Classification using Geometric ­–Properties
IRJET-  	  Shape based Image Classification using Geometric ­–PropertiesIRJET-  	  Shape based Image Classification using Geometric ­–Properties
IRJET- Shape based Image Classification using Geometric ­–Properties
 
Scene Text Detection of Curved Text Using Gradiant Vector Flow Method
Scene Text Detection of Curved Text Using Gradiant Vector Flow MethodScene Text Detection of Curved Text Using Gradiant Vector Flow Method
Scene Text Detection of Curved Text Using Gradiant Vector Flow Method
 
A Simple Signature Recognition System
A Simple Signature Recognition System A Simple Signature Recognition System
A Simple Signature Recognition System
 
A combined method of fractal and glcm features for mri and ct scan images cla...
A combined method of fractal and glcm features for mri and ct scan images cla...A combined method of fractal and glcm features for mri and ct scan images cla...
A combined method of fractal and glcm features for mri and ct scan images cla...
 
Analysis of computational
Analysis of computationalAnalysis of computational
Analysis of computational
 

Viewers also liked

IRJET-A Design Of Modified SSTBC Encoder to Noise Free Mimo Communication in ...
IRJET-A Design Of Modified SSTBC Encoder to Noise Free Mimo Communication in ...IRJET-A Design Of Modified SSTBC Encoder to Noise Free Mimo Communication in ...
IRJET-A Design Of Modified SSTBC Encoder to Noise Free Mimo Communication in ...
IRJET Journal
 
IRJET-CFD Analysis of conceptual Aircraft body
IRJET-CFD Analysis of conceptual Aircraft bodyIRJET-CFD Analysis of conceptual Aircraft body
IRJET-CFD Analysis of conceptual Aircraft body
IRJET Journal
 
IRJET-A Review on Two Stroke Single Cylinder Compressed Air Engine
IRJET-A Review on Two Stroke Single Cylinder Compressed Air EngineIRJET-A Review on Two Stroke Single Cylinder Compressed Air Engine
IRJET-A Review on Two Stroke Single Cylinder Compressed Air Engine
IRJET Journal
 
IRJET-A Virtual Grid-Based Dynamic Routes Adjustment (VGDRA) Scheme for Wir...
IRJET-A Virtual Grid-Based Dynamic Routes Adjustment (VGDRA) Scheme for   Wir...IRJET-A Virtual Grid-Based Dynamic Routes Adjustment (VGDRA) Scheme for   Wir...
IRJET-A Virtual Grid-Based Dynamic Routes Adjustment (VGDRA) Scheme for Wir...
IRJET Journal
 
IRJET- Simulation based design and analysis of combined effect of various ...
IRJET- 	  Simulation based design and analysis of combined effect of various ...IRJET- 	  Simulation based design and analysis of combined effect of various ...
IRJET- Simulation based design and analysis of combined effect of various ...
IRJET Journal
 
IRJET-A Survey on Stealthy Denial of Service Strategy in Cloud Computing
IRJET-A Survey on Stealthy Denial of Service Strategy in Cloud ComputingIRJET-A Survey on Stealthy Denial of Service Strategy in Cloud Computing
IRJET-A Survey on Stealthy Denial of Service Strategy in Cloud Computing
IRJET Journal
 
IRJET-Second Throat Diffuser System at Different Back Pressure for High Altit...
IRJET-Second Throat Diffuser System at Different Back Pressure for High Altit...IRJET-Second Throat Diffuser System at Different Back Pressure for High Altit...
IRJET-Second Throat Diffuser System at Different Back Pressure for High Altit...
IRJET Journal
 
IRJET- Review of Vapour Absorption System and Vapour Compression System.
IRJET- 	  Review of Vapour Absorption System and Vapour Compression System.IRJET- 	  Review of Vapour Absorption System and Vapour Compression System.
IRJET- Review of Vapour Absorption System and Vapour Compression System.
IRJET Journal
 
IRJET- Sensrless Luenberger Observer Based Sliding Mode Control of DC Motor
IRJET- 	  Sensrless Luenberger Observer Based Sliding Mode Control of DC MotorIRJET- 	  Sensrless Luenberger Observer Based Sliding Mode Control of DC Motor
IRJET- Sensrless Luenberger Observer Based Sliding Mode Control of DC Motor
IRJET Journal
 
IRJET-Power Flow & Voltage Stability Analysis using MATLAB
IRJET-Power Flow & Voltage Stability Analysis using MATLAB IRJET-Power Flow & Voltage Stability Analysis using MATLAB
IRJET-Power Flow & Voltage Stability Analysis using MATLAB
IRJET Journal
 
IRJET- Assessment of Environmental Impacts during Operational Phase of a T...
IRJET- 	  Assessment of Environmental Impacts during Operational Phase of a T...IRJET- 	  Assessment of Environmental Impacts during Operational Phase of a T...
IRJET- Assessment of Environmental Impacts during Operational Phase of a T...
IRJET Journal
 
IRJET-Accessing Information about Programs and Services through a Voice Site ...
IRJET-Accessing Information about Programs and Services through a Voice Site ...IRJET-Accessing Information about Programs and Services through a Voice Site ...
IRJET-Accessing Information about Programs and Services through a Voice Site ...
IRJET Journal
 
IRJET-Design of Pile Foundation for site in Sangli district of Maharashtra: C...
IRJET-Design of Pile Foundation for site in Sangli district of Maharashtra: C...IRJET-Design of Pile Foundation for site in Sangli district of Maharashtra: C...
IRJET-Design of Pile Foundation for site in Sangli district of Maharashtra: C...
IRJET Journal
 
IRJET- Missing Value Evaluation in SQL Queries: A Survey
IRJET- 	  Missing Value Evaluation in SQL Queries: A SurveyIRJET- 	  Missing Value Evaluation in SQL Queries: A Survey
IRJET- Missing Value Evaluation in SQL Queries: A Survey
IRJET Journal
 
IRJET-Design And Development Of Sugar Cane Sprout Cutter Machine By Human Pow...
IRJET-Design And Development Of Sugar Cane Sprout Cutter Machine By Human Pow...IRJET-Design And Development Of Sugar Cane Sprout Cutter Machine By Human Pow...
IRJET-Design And Development Of Sugar Cane Sprout Cutter Machine By Human Pow...
IRJET Journal
 
IRJET-Solar Power Generation with Capacitor Based Seven Level Inverter System
IRJET-Solar Power Generation with Capacitor Based Seven Level Inverter SystemIRJET-Solar Power Generation with Capacitor Based Seven Level Inverter System
IRJET-Solar Power Generation with Capacitor Based Seven Level Inverter System
IRJET Journal
 
IRJET-Evaluation of the Back Propagation Neural Network for Gravity Mapping
IRJET-Evaluation of the Back Propagation Neural Network for Gravity Mapping IRJET-Evaluation of the Back Propagation Neural Network for Gravity Mapping
IRJET-Evaluation of the Back Propagation Neural Network for Gravity Mapping
IRJET Journal
 
IRJET-Survival and Growth Rate of Clarias gariepinus Larvae Fed with Artemia ...
IRJET-Survival and Growth Rate of Clarias gariepinus Larvae Fed with Artemia ...IRJET-Survival and Growth Rate of Clarias gariepinus Larvae Fed with Artemia ...
IRJET-Survival and Growth Rate of Clarias gariepinus Larvae Fed with Artemia ...
IRJET Journal
 
IRJET-A Survey On Opportunistic Piggyback Marking For IP Trace Back
IRJET-A Survey On Opportunistic Piggyback Marking For IP Trace BackIRJET-A Survey On Opportunistic Piggyback Marking For IP Trace Back
IRJET-A Survey On Opportunistic Piggyback Marking For IP Trace Back
IRJET Journal
 
IRJET-Review Paper On Usage Of Ferrocement Panels In Lightweight Sandwich Con...
IRJET-Review Paper On Usage Of Ferrocement Panels In Lightweight Sandwich Con...IRJET-Review Paper On Usage Of Ferrocement Panels In Lightweight Sandwich Con...
IRJET-Review Paper On Usage Of Ferrocement Panels In Lightweight Sandwich Con...
IRJET Journal
 

Viewers also liked (20)

IRJET-A Design Of Modified SSTBC Encoder to Noise Free Mimo Communication in ...
IRJET-A Design Of Modified SSTBC Encoder to Noise Free Mimo Communication in ...IRJET-A Design Of Modified SSTBC Encoder to Noise Free Mimo Communication in ...
IRJET-A Design Of Modified SSTBC Encoder to Noise Free Mimo Communication in ...
 
IRJET-CFD Analysis of conceptual Aircraft body
IRJET-CFD Analysis of conceptual Aircraft bodyIRJET-CFD Analysis of conceptual Aircraft body
IRJET-CFD Analysis of conceptual Aircraft body
 
IRJET-A Review on Two Stroke Single Cylinder Compressed Air Engine
IRJET-A Review on Two Stroke Single Cylinder Compressed Air EngineIRJET-A Review on Two Stroke Single Cylinder Compressed Air Engine
IRJET-A Review on Two Stroke Single Cylinder Compressed Air Engine
 
IRJET-A Virtual Grid-Based Dynamic Routes Adjustment (VGDRA) Scheme for Wir...
IRJET-A Virtual Grid-Based Dynamic Routes Adjustment (VGDRA) Scheme for   Wir...IRJET-A Virtual Grid-Based Dynamic Routes Adjustment (VGDRA) Scheme for   Wir...
IRJET-A Virtual Grid-Based Dynamic Routes Adjustment (VGDRA) Scheme for Wir...
 
IRJET- Simulation based design and analysis of combined effect of various ...
IRJET- 	  Simulation based design and analysis of combined effect of various ...IRJET- 	  Simulation based design and analysis of combined effect of various ...
IRJET- Simulation based design and analysis of combined effect of various ...
 
IRJET-A Survey on Stealthy Denial of Service Strategy in Cloud Computing
IRJET-A Survey on Stealthy Denial of Service Strategy in Cloud ComputingIRJET-A Survey on Stealthy Denial of Service Strategy in Cloud Computing
IRJET-A Survey on Stealthy Denial of Service Strategy in Cloud Computing
 
IRJET-Second Throat Diffuser System at Different Back Pressure for High Altit...
IRJET-Second Throat Diffuser System at Different Back Pressure for High Altit...IRJET-Second Throat Diffuser System at Different Back Pressure for High Altit...
IRJET-Second Throat Diffuser System at Different Back Pressure for High Altit...
 
IRJET- Review of Vapour Absorption System and Vapour Compression System.
IRJET- 	  Review of Vapour Absorption System and Vapour Compression System.IRJET- 	  Review of Vapour Absorption System and Vapour Compression System.
IRJET- Review of Vapour Absorption System and Vapour Compression System.
 
IRJET- Sensrless Luenberger Observer Based Sliding Mode Control of DC Motor
IRJET- 	  Sensrless Luenberger Observer Based Sliding Mode Control of DC MotorIRJET- 	  Sensrless Luenberger Observer Based Sliding Mode Control of DC Motor
IRJET- Sensrless Luenberger Observer Based Sliding Mode Control of DC Motor
 
IRJET-Power Flow & Voltage Stability Analysis using MATLAB
IRJET-Power Flow & Voltage Stability Analysis using MATLAB IRJET-Power Flow & Voltage Stability Analysis using MATLAB
IRJET-Power Flow & Voltage Stability Analysis using MATLAB
 
IRJET- Assessment of Environmental Impacts during Operational Phase of a T...
IRJET- 	  Assessment of Environmental Impacts during Operational Phase of a T...IRJET- 	  Assessment of Environmental Impacts during Operational Phase of a T...
IRJET- Assessment of Environmental Impacts during Operational Phase of a T...
 
IRJET-Accessing Information about Programs and Services through a Voice Site ...
IRJET-Accessing Information about Programs and Services through a Voice Site ...IRJET-Accessing Information about Programs and Services through a Voice Site ...
IRJET-Accessing Information about Programs and Services through a Voice Site ...
 
IRJET-Design of Pile Foundation for site in Sangli district of Maharashtra: C...
IRJET-Design of Pile Foundation for site in Sangli district of Maharashtra: C...IRJET-Design of Pile Foundation for site in Sangli district of Maharashtra: C...
IRJET-Design of Pile Foundation for site in Sangli district of Maharashtra: C...
 
IRJET- Missing Value Evaluation in SQL Queries: A Survey
IRJET- 	  Missing Value Evaluation in SQL Queries: A SurveyIRJET- 	  Missing Value Evaluation in SQL Queries: A Survey
IRJET- Missing Value Evaluation in SQL Queries: A Survey
 
IRJET-Design And Development Of Sugar Cane Sprout Cutter Machine By Human Pow...
IRJET-Design And Development Of Sugar Cane Sprout Cutter Machine By Human Pow...IRJET-Design And Development Of Sugar Cane Sprout Cutter Machine By Human Pow...
IRJET-Design And Development Of Sugar Cane Sprout Cutter Machine By Human Pow...
 
IRJET-Solar Power Generation with Capacitor Based Seven Level Inverter System
IRJET-Solar Power Generation with Capacitor Based Seven Level Inverter SystemIRJET-Solar Power Generation with Capacitor Based Seven Level Inverter System
IRJET-Solar Power Generation with Capacitor Based Seven Level Inverter System
 
IRJET-Evaluation of the Back Propagation Neural Network for Gravity Mapping
IRJET-Evaluation of the Back Propagation Neural Network for Gravity Mapping IRJET-Evaluation of the Back Propagation Neural Network for Gravity Mapping
IRJET-Evaluation of the Back Propagation Neural Network for Gravity Mapping
 
IRJET-Survival and Growth Rate of Clarias gariepinus Larvae Fed with Artemia ...
IRJET-Survival and Growth Rate of Clarias gariepinus Larvae Fed with Artemia ...IRJET-Survival and Growth Rate of Clarias gariepinus Larvae Fed with Artemia ...
IRJET-Survival and Growth Rate of Clarias gariepinus Larvae Fed with Artemia ...
 
IRJET-A Survey On Opportunistic Piggyback Marking For IP Trace Back
IRJET-A Survey On Opportunistic Piggyback Marking For IP Trace BackIRJET-A Survey On Opportunistic Piggyback Marking For IP Trace Back
IRJET-A Survey On Opportunistic Piggyback Marking For IP Trace Back
 
IRJET-Review Paper On Usage Of Ferrocement Panels In Lightweight Sandwich Con...
IRJET-Review Paper On Usage Of Ferrocement Panels In Lightweight Sandwich Con...IRJET-Review Paper On Usage Of Ferrocement Panels In Lightweight Sandwich Con...
IRJET-Review Paper On Usage Of Ferrocement Panels In Lightweight Sandwich Con...
 

Similar to IRJET- Document Layout analysis using Inverse Support Vector Machine (I-SVM) for Hindi News Paper in Image Processing

Text Detection and Recognition in Natural Images
Text Detection and Recognition in Natural ImagesText Detection and Recognition in Natural Images
Text Detection and Recognition in Natural Images
IRJET Journal
 
Text Extraction and Recognition Using Median Filter
Text Extraction and Recognition Using Median FilterText Extraction and Recognition Using Median Filter
Text Extraction and Recognition Using Median Filter
IRJET Journal
 
Brain Tumor Classification using Support Vector Machine
Brain Tumor Classification using Support Vector MachineBrain Tumor Classification using Support Vector Machine
Brain Tumor Classification using Support Vector Machine
IRJET Journal
 
Feature Extraction and Feature Selection using Textual Analysis
Feature Extraction and Feature Selection using Textual AnalysisFeature Extraction and Feature Selection using Textual Analysis
Feature Extraction and Feature Selection using Textual Analysis
vivatechijri
 
A Comparative study of K-SVD and WSQ Algorithms in Fingerprint Compression Te...
A Comparative study of K-SVD and WSQ Algorithms in Fingerprint Compression Te...A Comparative study of K-SVD and WSQ Algorithms in Fingerprint Compression Te...
A Comparative study of K-SVD and WSQ Algorithms in Fingerprint Compression Te...
IRJET Journal
 
Design and Description of Feature Extraction Algorithm for Old English Font
Design and Description of Feature Extraction Algorithm for Old English FontDesign and Description of Feature Extraction Algorithm for Old English Font
Design and Description of Feature Extraction Algorithm for Old English Font
IRJET Journal
 
IRJET- Optical Character Recognition using Image Processing
IRJET-  	  Optical Character Recognition using Image ProcessingIRJET-  	  Optical Character Recognition using Image Processing
IRJET- Optical Character Recognition using Image Processing
IRJET Journal
 
IRJET - Object Detection using Hausdorff Distance
IRJET -  	  Object Detection using Hausdorff DistanceIRJET -  	  Object Detection using Hausdorff Distance
IRJET - Object Detection using Hausdorff Distance
IRJET Journal
 
IRJET-MText Extraction from Images using Convolutional Neural Network
IRJET-MText Extraction from Images using Convolutional Neural NetworkIRJET-MText Extraction from Images using Convolutional Neural Network
IRJET-MText Extraction from Images using Convolutional Neural Network
IRJET Journal
 
IRJET- Cheque Bounce Detection System using Image Processing
IRJET- Cheque Bounce Detection System using Image ProcessingIRJET- Cheque Bounce Detection System using Image Processing
IRJET- Cheque Bounce Detection System using Image Processing
IRJET Journal
 
Handwriting_Recognition_using_KNN_classificatiob_algorithm_ijariie6729 (1).pdf
Handwriting_Recognition_using_KNN_classificatiob_algorithm_ijariie6729 (1).pdfHandwriting_Recognition_using_KNN_classificatiob_algorithm_ijariie6729 (1).pdf
Handwriting_Recognition_using_KNN_classificatiob_algorithm_ijariie6729 (1).pdf
Sachin414679
 
­­­­Cursive Handwriting Recognition System using Feature Extraction and Artif...
­­­­Cursive Handwriting Recognition System using Feature Extraction and Artif...­­­­Cursive Handwriting Recognition System using Feature Extraction and Artif...
­­­­Cursive Handwriting Recognition System using Feature Extraction and Artif...
IRJET Journal
 
Finding similarities between structured documents as a crucial stage for gene...
Finding similarities between structured documents as a crucial stage for gene...Finding similarities between structured documents as a crucial stage for gene...
Finding similarities between structured documents as a crucial stage for gene...
Alexander Decker
 
Inpainting scheme for text in video a survey
Inpainting scheme for text in video   a surveyInpainting scheme for text in video   a survey
Inpainting scheme for text in video a survey
eSAT Journals
 
Text Recognition using Convolutional Neural Network: A Review
Text Recognition using Convolutional Neural Network: A ReviewText Recognition using Convolutional Neural Network: A Review
Text Recognition using Convolutional Neural Network: A Review
IRJET Journal
 
Assignment-1-NF.docx
Assignment-1-NF.docxAssignment-1-NF.docx
Assignment-1-NF.docx
KhondokerAbuNaim
 
IRJET- Intelligent Character Recognition of Handwritten Characters
IRJET- Intelligent Character Recognition of Handwritten CharactersIRJET- Intelligent Character Recognition of Handwritten Characters
IRJET- Intelligent Character Recognition of Handwritten Characters
IRJET Journal
 
A Survey on Portable Camera-Based Assistive Text and Product Label Reading Fr...
A Survey on Portable Camera-Based Assistive Text and Product Label Reading Fr...A Survey on Portable Camera-Based Assistive Text and Product Label Reading Fr...
A Survey on Portable Camera-Based Assistive Text and Product Label Reading Fr...
IRJET Journal
 
IRJET - Simulation of Colour Image Processing Techniques on VHDL
IRJET - Simulation of Colour Image Processing Techniques on VHDLIRJET - Simulation of Colour Image Processing Techniques on VHDL
IRJET - Simulation of Colour Image Processing Techniques on VHDL
IRJET Journal
 

Similar to IRJET- Document Layout analysis using Inverse Support Vector Machine (I-SVM) for Hindi News Paper in Image Processing (20)

Text Detection and Recognition in Natural Images
Text Detection and Recognition in Natural ImagesText Detection and Recognition in Natural Images
Text Detection and Recognition in Natural Images
 
Text Extraction and Recognition Using Median Filter
Text Extraction and Recognition Using Median FilterText Extraction and Recognition Using Median Filter
Text Extraction and Recognition Using Median Filter
 
Brain Tumor Classification using Support Vector Machine
Brain Tumor Classification using Support Vector MachineBrain Tumor Classification using Support Vector Machine
Brain Tumor Classification using Support Vector Machine
 
Feature Extraction and Feature Selection using Textual Analysis
Feature Extraction and Feature Selection using Textual AnalysisFeature Extraction and Feature Selection using Textual Analysis
Feature Extraction and Feature Selection using Textual Analysis
 
A Comparative study of K-SVD and WSQ Algorithms in Fingerprint Compression Te...
A Comparative study of K-SVD and WSQ Algorithms in Fingerprint Compression Te...A Comparative study of K-SVD and WSQ Algorithms in Fingerprint Compression Te...
A Comparative study of K-SVD and WSQ Algorithms in Fingerprint Compression Te...
 
Design and Description of Feature Extraction Algorithm for Old English Font
Design and Description of Feature Extraction Algorithm for Old English FontDesign and Description of Feature Extraction Algorithm for Old English Font
Design and Description of Feature Extraction Algorithm for Old English Font
 
IRJET- Optical Character Recognition using Image Processing
IRJET-  	  Optical Character Recognition using Image ProcessingIRJET-  	  Optical Character Recognition using Image Processing
IRJET- Optical Character Recognition using Image Processing
 
IRJET - Object Detection using Hausdorff Distance
IRJET -  	  Object Detection using Hausdorff DistanceIRJET -  	  Object Detection using Hausdorff Distance
IRJET - Object Detection using Hausdorff Distance
 
IRJET-MText Extraction from Images using Convolutional Neural Network
IRJET-MText Extraction from Images using Convolutional Neural NetworkIRJET-MText Extraction from Images using Convolutional Neural Network
IRJET-MText Extraction from Images using Convolutional Neural Network
 
IRJET- Cheque Bounce Detection System using Image Processing
IRJET- Cheque Bounce Detection System using Image ProcessingIRJET- Cheque Bounce Detection System using Image Processing
IRJET- Cheque Bounce Detection System using Image Processing
 
Handwriting_Recognition_using_KNN_classificatiob_algorithm_ijariie6729 (1).pdf
Handwriting_Recognition_using_KNN_classificatiob_algorithm_ijariie6729 (1).pdfHandwriting_Recognition_using_KNN_classificatiob_algorithm_ijariie6729 (1).pdf
Handwriting_Recognition_using_KNN_classificatiob_algorithm_ijariie6729 (1).pdf
 
­­­­Cursive Handwriting Recognition System using Feature Extraction and Artif...
­­­­Cursive Handwriting Recognition System using Feature Extraction and Artif...­­­­Cursive Handwriting Recognition System using Feature Extraction and Artif...
­­­­Cursive Handwriting Recognition System using Feature Extraction and Artif...
 
Finding similarities between structured documents as a crucial stage for gene...
Finding similarities between structured documents as a crucial stage for gene...Finding similarities between structured documents as a crucial stage for gene...
Finding similarities between structured documents as a crucial stage for gene...
 
Inpainting scheme for text in video a survey
Inpainting scheme for text in video   a surveyInpainting scheme for text in video   a survey
Inpainting scheme for text in video a survey
 
Text Recognition using Convolutional Neural Network: A Review
Text Recognition using Convolutional Neural Network: A ReviewText Recognition using Convolutional Neural Network: A Review
Text Recognition using Convolutional Neural Network: A Review
 
Assignment-1-NF.docx
Assignment-1-NF.docxAssignment-1-NF.docx
Assignment-1-NF.docx
 
IRJET- Intelligent Character Recognition of Handwritten Characters
IRJET- Intelligent Character Recognition of Handwritten CharactersIRJET- Intelligent Character Recognition of Handwritten Characters
IRJET- Intelligent Character Recognition of Handwritten Characters
 
40120140501009
4012014050100940120140501009
40120140501009
 
A Survey on Portable Camera-Based Assistive Text and Product Label Reading Fr...
A Survey on Portable Camera-Based Assistive Text and Product Label Reading Fr...A Survey on Portable Camera-Based Assistive Text and Product Label Reading Fr...
A Survey on Portable Camera-Based Assistive Text and Product Label Reading Fr...
 
IRJET - Simulation of Colour Image Processing Techniques on VHDL
IRJET - Simulation of Colour Image Processing Techniques on VHDLIRJET - Simulation of Colour Image Processing Techniques on VHDL
IRJET - Simulation of Colour Image Processing Techniques on VHDL
 

More from IRJET Journal

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
IRJET Journal
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
IRJET Journal
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
IRJET Journal
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
IRJET Journal
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
IRJET Journal
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
IRJET Journal
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
IRJET Journal
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
IRJET Journal
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
IRJET Journal
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
IRJET Journal
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
IRJET Journal
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
IRJET Journal
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
IRJET Journal
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
IRJET Journal
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
IRJET Journal
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
IRJET Journal
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
IRJET Journal
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
IRJET Journal
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
IRJET Journal
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
IRJET Journal
 

More from IRJET Journal (20)

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
 

Recently uploaded

H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
H.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdfH.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdf
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
MLILAB
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
Kamal Acharya
 
block diagram and signal flow graph representation
block diagram and signal flow graph representationblock diagram and signal flow graph representation
block diagram and signal flow graph representation
Divya Somashekar
 
Event Management System Vb Net Project Report.pdf
Event Management System Vb Net  Project Report.pdfEvent Management System Vb Net  Project Report.pdf
Event Management System Vb Net Project Report.pdf
Kamal Acharya
 
Gen AI Study Jams _ For the GDSC Leads in India.pdf
Gen AI Study Jams _ For the GDSC Leads in India.pdfGen AI Study Jams _ For the GDSC Leads in India.pdf
Gen AI Study Jams _ For the GDSC Leads in India.pdf
gdsczhcet
 
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
AJAYKUMARPUND1
 
Forklift Classes Overview by Intella Parts
Forklift Classes Overview by Intella PartsForklift Classes Overview by Intella Parts
Forklift Classes Overview by Intella Parts
Intella Parts
 
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdfTop 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Teleport Manpower Consultant
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
gerogepatton
 
power quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptxpower quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptx
ViniHema
 
weather web application report.pdf
weather web application report.pdfweather web application report.pdf
weather web application report.pdf
Pratik Pawar
 
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
Amil Baba Dawood bangali
 
Railway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdfRailway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdf
TeeVichai
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
Kamal Acharya
 
LIGA(E)11111111111111111111111111111111111111111.ppt
LIGA(E)11111111111111111111111111111111111111111.pptLIGA(E)11111111111111111111111111111111111111111.ppt
LIGA(E)11111111111111111111111111111111111111111.ppt
ssuser9bd3ba
 
MCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdfMCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdf
Osamah Alsalih
 
The Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfThe Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdf
Pipe Restoration Solutions
 
Standard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - NeometrixStandard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - Neometrix
Neometrix_Engineering_Pvt_Ltd
 
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Dr.Costas Sachpazis
 
Automobile Management System Project Report.pdf
Automobile Management System Project Report.pdfAutomobile Management System Project Report.pdf
Automobile Management System Project Report.pdf
Kamal Acharya
 

Recently uploaded (20)

H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
H.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdfH.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdf
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
 
block diagram and signal flow graph representation
block diagram and signal flow graph representationblock diagram and signal flow graph representation
block diagram and signal flow graph representation
 
Event Management System Vb Net Project Report.pdf
Event Management System Vb Net  Project Report.pdfEvent Management System Vb Net  Project Report.pdf
Event Management System Vb Net Project Report.pdf
 
Gen AI Study Jams _ For the GDSC Leads in India.pdf
Gen AI Study Jams _ For the GDSC Leads in India.pdfGen AI Study Jams _ For the GDSC Leads in India.pdf
Gen AI Study Jams _ For the GDSC Leads in India.pdf
 
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
 
Forklift Classes Overview by Intella Parts
Forklift Classes Overview by Intella PartsForklift Classes Overview by Intella Parts
Forklift Classes Overview by Intella Parts
 
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdfTop 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
 
power quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptxpower quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptx
 
weather web application report.pdf
weather web application report.pdfweather web application report.pdf
weather web application report.pdf
 
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
 
Railway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdfRailway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdf
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
 
LIGA(E)11111111111111111111111111111111111111111.ppt
LIGA(E)11111111111111111111111111111111111111111.pptLIGA(E)11111111111111111111111111111111111111111.ppt
LIGA(E)11111111111111111111111111111111111111111.ppt
 
MCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdfMCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdf
 
The Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfThe Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdf
 
Standard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - NeometrixStandard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - Neometrix
 
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
 
Automobile Management System Project Report.pdf
Automobile Management System Project Report.pdfAutomobile Management System Project Report.pdf
Automobile Management System Project Report.pdf
 

IRJET- Document Layout analysis using Inverse Support Vector Machine (I-SVM) for Hindi News Paper in Image Processing

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 01 | Jan -2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 57 DOCUMENT LAYOUT ANALYSIS USING INVERSE SUPPORT VECTOR MACHINE (I-SVM) FOR HINDI NEWS PAPER IN IMAGE PROCESSING Rimmy Kathuria M .Tech Student of MRIU, Faridabad. Abstract: Optical character recognition (OCR) is the translation of handwritten, typed or impressed word images into a form that the computer can manipulate. There are various steps in OCR. One of them is classification. To do the Classification, we must have to compare a training data with many feature vectors. A classifier is needed to compare the feature vector of input and the feature vector of training data using inverse support vector machine. This paper proposed classification techniques, which are used to recognize character or word, and about some related work which has been done. It will also present a novel learning based framework to extract articles from newspaper images using bounding box method. The input to the system comprises blocks of text and graphics, obtained using standard image processing techniques. The fixed point model uses contextual information and features of each block to learn the layout of newspaper images and attains a contraction mapping to assign a unique label to every block. Experimental results show the applicability of our algorithm in document newspaper layout labeling and article extraction. Keywords: OCR, I-SVM, CANNY, SOBEL, Hindi newspaper script. I. INTRODUCTION This Research gives an overview about document layout analysis. It presents in detail one method which is part of the pre-processing phase, called Morphological operator. In doing so it introduces a special technique of how determining the bounding box and inverse support vector machine. Furthermore it presents numerous ways of approaches to speed up the process of document layout.The second part deals with page segmentation which is one of the huge parts in document image analysis. It illustrates a means of partitioning using bounding boxes of different entities. Reasonable success has been achieved at developing mono lingual OCR systems in Indian scripts. Scientists, optimistically, have started to look beyond. Development of bilingual OCR systems and OCR systems with capability to identify the text areas are some of the pointers to future activities in Indian scenario. The separation of text and non-text regions before considering the document layout for OCR is an important task. In this paper, we present a biologically inspired, multichannel filtering scheme for page layout analysis. The same scheme has been used for script recognition as well. Parameter tuning is mostly done heuristically. It has also been seen to be computationally viable for commercial OCR system development. Figure 1: Block diagram of the OCR II. OCR OCR Stands for Optical Character Recognition. It is Extracts the text from a given image. It is invented by Gustav Tauschek. Tauschek obtained a patent on OCR 1929 in Germany and 1935 in USA. Figure 2: Example of OCR
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 01 | Jan -2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 58 III. Data for work Data in a newspaper document analysis are usually captured by optical scanning and stored in a file of picture elements, called pixels that are sampled in a grid pattern throughout the document. These pixels may have values: OFF (0) or ON (1) for binary images, 0–255 for gray-scale images and 3 channels of 0–255 colour values for colour images. At a typical sampling resolution of 120 pixels per centimeter, a 20 x 30 cm page would yield an image of 2400x3600 pixels. When the document is on a different medium such as microfilm, palm leaves, or fabric, photographic methods are often used to capture images. In any case, it is important to understand that the image of the document contains only raw data that must be further analyzed to collect the information. IV. PROPOSED IMPLEMENTATION We have used scanned images of Hindi newspapers for our experimentation. Each document provides for a number of blocks, typically an average of multiple blocks exists in a single newspaper image. We use Bounding box to obtain these blocks. This is the most important stage of document layout OCR system. Relevant information from the selected data has been extracted for classification. Different shapes of the character parts have been selected from feature selection algorithm. They might be curves or points or linear shapes. But most of them are open curve shaped. In feature extraction, two techniques are proposed here. One is support vector based sub line direction and the other one is bounding box based shape detection. Here, directions are extracted by sub line direction and bounding box procedures from selected portion of character image. A. Preprocessing Of newspaper data The gray scale image is first binaries using the method described in our earlier work [2]. Horizontal and vertical lines are then removed on the basis of aspect ratio of connected components. B. Inverse-SVM classification The classification strategy we have adopted combines both deterministic and probabilistic decision making. First, using training samples, we try to learn the various conflicts among classes and equivalent representations for every class. . From here, we use probabilistic classification to assign a label to the test pattern. The important point to observe here is, the switch to conflict resolution can happen at any level (from 1 to 4), since conflicts can be defined at various levels. Usually this will be dictated by certain trade-offs between accuracy and precision-recall, as we show later. The classifier we use to resolve conflicts is the Inverse Support Vector Machine (I-SVM). Though it is not mandatory to use SVMs, their utility in character recognition has been well documented. We have chosen RBF kernel, and the SVM formulation “support vector classification” for multi-class classification. C. SOBEL OPERATOR The most popular approximation of equation (1) but using a 33 mask is the following. )2()2()2()2( 741963321987 zzzzzzzzzzzzf  (1) This approximation is known as the Sobel operator. If we consider the left of mask the Sobel operator, this causes differentiation along the y direction. If we isolate the following part of the mask and treat it as a one dimensional mask, we are interested in finding the effects of that mask. We will therefore, treat this mask as a one dimensional impulse response ][nh of the form Sobel operator - 1 00 2 - 2 1 0 - 1 1 -1 0-2 0 0 1 2 1 -1
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 01 | Jan -2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 59            otherwise0 11 02 11 ][ n n n nh The above response applied to a signal ][nx yields a signal ]1[][2]1[][  nxnxnxny or in z-transform domain )()1(cos2)()()2()( 1  jXjYzXzzzY   . (2) Therefore, ][nh is the impulse response of a system with transfer function )()1(cos2)(  jHjH  Shown in the figure below for ],0[  . This is a low pass filter type of response. Therefore, we can claim that the Sobel operator has a differentiation effect along one of the two directions and a smoothing effect along the other direction. D. Finding text lines Precise identification of text lines is an important part for most OCR systems. It is also very useful for document layout analysis. Numerous methods have been proposed for text line and baseline finding. Some methods attempt to find just text lines, e.g. by using Hough transforms, projection profiles, and Radon transforms. Others find text lines as part of more general document layout analysis tasks, e.g. XY cuts, whitespace segmentation, Voronoi diagrams, and distance-based grouping. Usually, such methods start by performing a rough analysis of the layout, often based on the proximity of bounding boxes of connected components or connected components themselves. More precise base- and text-line models are employed in subsequent steps, as required. The text line fitting algorithm correctly identified all lines present in the documents when the extracted bounding boxes allowed it to do so. No spurious text lines were detected, and some short one-word lines were ignored. One particularly interesting property of this algorithm is that it allows variations in text line orientations. This permits the algorithm to be applied to document images captured by photo or video cameras rather than by scanners. E. MORPHOLOGICAL OPERATION A morphological operation is used to remove the noise and smooth the shape of the candidate text areas. The element used here is also cross-shaped with size 11x45. The size of the elements for the morphological operations and the geometrical constraints give to the algorithm the ability to detect text in a specific range of character sizes (12-48 pixels). F. Canny edge detection We use canny edge detector applied in grey scale images. Canny uses Sobel masks in order to find the edge magnitude of the image, in gray scale, and then uses non- Maxima suppression and hysteresis thresholding. With these two post-processing operations canny edge detector manage to remove no maxima pixels, preserving the connectivity of the contours. After computing the Canny edge map, dilation by an element 5x21 is performed to connect the character contours of every text line. Dilation by a cross-shaped element 5x21 is performed to connect the character contours of every text line. G. Pseudo code of proposed method 1. Initialize input image from dataset 2. If input image in RGB format then convert in gray image 3. If no then go to next step 4. Now do the preprocessing step using Bounding box 5. Also apply the morphology operation 6. Then extract the feature 7. Testing and classification using Inverse support vector machine 8. Find the F-measure, Accuracy and Precision Recall value. 9. Finally get the output result 10. End 2 1 -1 10
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 01 | Jan -2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 60 Figure 3: Proposed flow chart V.RESULT The present growth of document analysis of books and manuscripts demands an immediate solution to access them electronically. This requires research in the area of document image understanding, specifically in the area of document layout analysis. There is an immense scope for such a feature extraction system for a digital Document Images. This paper presents an efficient Inverse support vector machine system for a Hindi newspaper document image collection. A recognition-free approach is followed because recognition based approach is inefficient in terms of performance. The data is pre-processed and segmented for faster matching and extraction. An efficient search technique - Correlation method is used to search in large collection of document images. Performance evaluation using different datasets of documents shows the effectiveness of the approach. Figure 5.1: Input Image with Noise For the next test scenario we used for training only the features corresponding to Hindi newspaper OCR letters. The image used for testing contained data, and the construction of the training set, which consisted of images containing examples of each Hindi OCR letter in the Hindi alphabet. We used gray scale for image without noise. Figure 5.2: Extracted character from input image Figure 5.3: Binary image Figure 5.4: Image after applying Median filter on binary image. Figure 5.5: Segmented image INPUT IMAGE WITH NOISE Binary Hindi Text for Morphological Operator Average Median Filter
  • 5. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 01 | Jan -2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 61 Figure 5.6: comparisons between ISVM-morphology and SVM X-Y cut for f-measure and recall value Figure 5.7 comparisons between ISVM-morphology and SVM X-Y cut for Accuracy on iteration value Figure 5.8 comparisons between ISVM-morphology and SVM X-Y cut for precision and recall value Evaluation strategy A text line must have influence to the final evaluation measure proportional to the number of containing characters and not to the number of its pixels The number of characters in a box cannot be defined by the algorithm but it can be approximated by the ratio width/height of the bounding box hgi is the height of the ith ground truth bounding box . hdi is the height of the ith detection bounding box. EGi is the number of pixel of the ith ground truth bounding box. EDi is the number of pixel of the ith detection bounding box. EGD is the number of pixel of the intersection that belongs to ith ground truth bounding box. EGDi is the number of pixel of the intersection that belongs to ith detection bounding box. MSE = 13.1506 MSE=13.1506 PSNR = 85.0606 VI.CONCLUSION 0 20 40 60 80 100 120 0 0.2 0.4 0.6 0.8 1 1.2 1.4 recall Fmeasure Fmeasure results I-SVM Morphology SVM-XY cut 1 2 3 4 5 6 7 8 9 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Iteration Accuracy Accuracy results SVM-XY cut I-SVM Morphology 0 10 20 30 40 50 60 70 80 90 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 recall precision precision-recall for SVM-CC-BBX and ISVM-CC-BBX I-SVM Morphology SVM-XY cut
  • 6. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 01 | Jan -2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 62 In this paper, document layout analysis text extraction techniques such as I-SVM, bounding box, sobel operator, morphological method etc. have been discussed. The performance comparison of these methods for document text extraction on the basis of accuracy, precision rate, recall rate, processing time has been done. It is observed that better accuracy is best for document texture Analysis) approach and edge based text extraction techniques. Precision and recall rate is best in case of I-SVM algorithm and segmentation method. Future work We plan to exploit the colour homogeneity of text temporal, text detection from frame to frame.Multi-frame integration for image enhancement REFERENCES [1] Vijay Singh and Bhupendra Kumar “Document layout analysis for Indian newspapers using contour based symbiotic approach” 2014 International Conference on Computer Communication and Informatics (ICCCI -2014), Jan. 03 – 05, 2014, Coimbatore, INDIA. [2] S. Malakar, S. Halder, R. Sarker, N. Das, S. Basu, M. Nasipuri, Text line Extraction from Handwritten Document pages using spiral run length smearing algorithm, International Conference on communications, Devices and Intelligent Systems, Kolkata, Dec. 28-29 (2012) 616-619. [3] S.J. Ha, B. Jin, N.I. Cho, Fast Text Line Extraction in Document Images, 19th IEEE International Conference on Image Processing, Orlando, Sept. 30-Oct 3 (2012) 797-800. [4] S.V. Seeri, S. Giraddi, Prashant B.M, A Novel Approach for Kannada Text Extraction, Proceedings of the International Conference on Pattern Recognition, Informatics and Medical Engineering, Tamil Naidu, Mar. 21-23 (2012) 444-448. [5] Z. Li, J. Luo, Resolution Enhancement from Document Images for Text Extraction, 5th International Conference on Multimedia and Ubiquitous Engineering, Loutraki, June 28- 30 (2011) 251-256. [6] D. Zaravi, H. Rostami, A. Malahzaheh, S.S Mortazavi, Journals Subheadlines Text Extraction Using Wavelet Thresholding and New Projection Profile, World Academy of Science, Engineering and Technology, 49 (2011) 686-689. [7] T.V. Hoang, S. Tabbone, Text Extraction From Graphical Document Images Using Sparse Representation, International Workshop on Document Analysis Systems, June 9-11 (2010) 143-150. [8] P. Nagabhushan, S. Nirmala, Text Extraction in Complex Color Document Images for Enhanced Readability, Intelligent Information Management, 2 (2010) 120-133. [9] D. Dunn, W. E. Higgins, and J. Wakeley, “Texture segmentation using 2-D Gabor elementary functions,” IEEE transaction on Pattern Analysis and Machine Intelligence, vol. 16, no. 2, pp. 130-149, 1994. [l0] W. Chan and G. Coghill, “Text analysis using local energy,” Pattern Recognition, vol. 34, pp. 2523-2532,2001. [l1] U. Pal and B. B. Chaudhuri, “Script line separation from Indian muliti-script document,” in Proceedings of the International Conference on Document Analysis and Recognition, pp. 406-409, 1999. [I21 D. Dhanya, A. G. Ramakrishnan, and P. B. Pati, “Script identification in printed bilingual docuements,” Sadhana, vol. 27, pp. 73-82,2002. [13] Chih-Wei Hsu, Chih-len Lin. A comparison of methods for multiclass support vector machines. IEEE Trans Neural Networks. 13: 415-425, March (2002). [14] Vijay singh and Bhupendra kumar “document layout analysis for Indian newspapers using contour based symbiotic approach”, 2014 International Conference on Computer Communication and Informatics (ICCCI - 2014),IEEE. [15] Chih-Chung Chang and Chih-Jen Lin, LIBSVM: a library for support vector machines, Software available at http://www.csie.ntu.edu.tw/ cjlin/libsvm. [16] F. Liu, Y. Luo, D. Hu, and M. Yoshikawa. A new component based algorithm for newspaper layout analysis. pages 1176 1180. IEEE Computer Society, 2001. [17] S. Mao, A. Rosenfeld, and T. Kanungo. Document structure analysis algorithms: a literature survey. volume 5010 of SPIE Proceedings, pages 197–207. SPIE, 2003.