SlideShare a Scribd company logo
SEGMENTATION-BASED HISTORICAL HANDWRITTEN 
WORD SPOTTING USING DOCUMENT-SPECIFIC LOCAL 
FEATURES 
KONSTANTINOS ZAGORIS1,2 
IOANNIS PRATIKAKIS1 
BASILIS GATOS2 
Visual Computing Group 
Democritus University of Thrace 
Dept. of Electrical and Computer Engineering 
Xanthi, Greece 
National Centre of Scientific Research “Demokritos” 
Institute of Informatics and Telecommunications 
Athens, Greece 
1 
2
WHAT IS KEY WORD SPOTTING? 
• It is the task of identifying locations on a document image which have high 
probability to contain an instance of a queried word 
• without explicitly recognizing it. 
• It is related to Content-Based Image Retrieval systems. 
• Searching a word image from a set of unindexed document images using the image 
content as the only information source.
CURRENT LITERATURE TRENDS 
• Currently there are two distinct trends. 
(i) Segmentation-based and (ii) Segmentation-free approaches. 
• Their fundamental difference concerns the search space 
- segmented word images (segmentation-based) 
- complete document image (segmentation-free). 
We address the word spotting problem with a segmentation-based 
approach.
PREVIOUS LITERATURE 
Rath and Manmatha calculate two families of feature sets. 
• scalar type features that include aspect ratio, area, etc. 
• profile-based features that are based on horizontal and vertical words 
projections and the upper and lower word profiles. 
Zagoris et. al. created a similar set of profile-based features but: 
• encoded Discrete Cosine Transformation and 
• quantize through the Gustafson - Kessel fuzzy algorithm. 
Rodriguez and Perronnin extract features from a sliding window, based 
on the first gradient and inspired by the SIFT keypoint descriptor.
BAG-OF-VISUAL WORDS MODEL 
Recently, there was an influx of works based on the local features in the form of the 
Bag-of-Visual Words model. 
Llados et. al. evaluate the performance of various word descriptors : 
• a bag of visual words procedure (BoVW), 
• a pseudo-structural representation based on Loci Features, 
• a structural approach by using words as graphs, and 
• sequences of column features based on DTW. 
They found that the statistical approach of the BoVW produces the best results, 
although the memory requirements to store the descriptors are significant.
PROBLEMS WITH CURRENT LOCAL FEATURES 
Most works using local features are based on the Scale Invariant Feature 
Transform (SIFT) in order to describe the local information 
• The original application of these local features are the natural images which they 
have many structural differences compared to document images 
• The detection of the most powerful edges through pyramid scaling creates local 
points between text lines. 
• Invariant properties in the descriptor results in noise amplification so they are 
more sensitive to the noise and the complex texture of the background.
TEXTURE VS SHAPE FEATURES 
Features for word spotting which rely only on word shape characteristics are not 
effective in dealing with a document collection created by different writers, containing 
significant writing style variations. 
Although slant and skew preprocessing techniques can reduce the shape variations, 
they cannot eliminate the problem as the whole structure of the word is different in 
most of the cases. 
In this respect, we argue that although the shape information is meaningful, the 
texture information in a spatial context is more reliable.
DOCUMENT SPECIFIC LOCAL FEATURES (DSLF) 
Taking into account the aforementioned considerations, we propose: 
• novel local features which are specific for documents and a 
• matching procedure that does not rely on codebook creation (as on 
BoVW).
PROPOSED WORD SPOTTING FRAMEWORK 
Document 
Dataset 
Word Image 
Segmentation 
Word Image 
DSLF Calculation 
Matching Process 
Display Results 
Database DSLF Calculation 
ONLINE 
OFFLINE
KEYPOINT DETECTION AND SELECTION 
Keypoint Detection 
CCs Analysis 
Local Point Selection 
Gradient 
Orientation 
Quantization 
Convex Hull 
Corner 
Detection 
Entropy-based Final Keypoints Keypoint Filtering
KEYPOINT DETECTION AND SELECTION 
original document 
image 
orientation of the gradient 
vector 
quantization of the gradient 
vector orientation 
initial keypoints final keypoints
FEATURE EXTRACTION 
• The feature for the local keypoint is calculated upon the quantized gradient angles 
• An area of 18x18 pixels around the kP, is divided into 9 cells with size 6x6 for each of them. 
• Each cell is represented by a 3-bin histogram (each bin corresponds to a quantization level). 
• Each pixel accumulates a vote in the corresponding angle histogram bin. The strength of 
voting depends on the norm of the gradient vector and on the distance from the location of 
local point as shown at the following equation: 
x, y x, y x, y V  s  G 
    2 2 
, 
2 
1 
LP LP 
3 9 2 
x y 
x x y y 
s 
   
   
• The task of the 푠푥,푦 variable is to weigh the pixel participation to the histogram taking into account 
its distance from the kP.
MATCHING PROCEDURE 
• In the case of segmentation-based word spotting, the aim is to match the query 
keypoints to the corresponding keypoints of any word image in the document. 
• Local Proximity Nearest Neighbor (LPNN) search is implemented. 
• The advantage of LPNN search is two-fold: 
 it enables a search in focused areas instead of searching in a brute force manner and 
 it goes beyond the typical use of a descriptor by the incorporation of spatial context in the 
local search addressed.
MATCHING PROCEDURE 
Update the location for each keypoint to a 
new normalized space: 
′ 
= 
푝푥푖 
푝푥푖 
−푐푥 
퐷푥 
′ 
= 
, 푝푦푖 
푝푦푖 
−푐푦 
퐷푦 
where: 
푐푥, 푐푦 = 
푘 푖=푥1 
푖 
푝푘 
, 
푘 푖=푦1 
푖 
푝푘 
퐷푥 = 
푘 푝푥푖 
푖=1 
− 푐푥 
푘 
, 퐷푦 = 
푘 푝푦푖 
푖=1 
− 푐푦 
푘 
k denotes the total number of the keypoints in 
a word image.
EVALUATION - DATASETS 
BENTHAM DATASET 
• It consists of 50 high quality (approximately 3000 
pixel width and 4000 pixel height) handwritten 
manuscripts written by Jeremy Bentham (1748- 
1832). 
• The variation of the same word is extreme and 
involves writing style, font size, noise as well as their 
combination.
EVALUATION - DATASETS 
WASHINGTON DATASET 
• It consists of 20 document images from George 
Washington Collection of the Library of Congress 
• The documents are were scanned from microfilm in 
300 dpi resolution.
EVALUATION STRATEGY 
• Two evaluation metrics: Precision at the k Top Retrieved words (P@k) and the Mean Average 
Precision (MAP). 
• P@5 is the precision at top 5 retrieved words. This metric defines how successfully the 
algorithms produce relevant results to the first 5 positions of the ranking list 
• MAP is a typical measure for the performance of information retrieval systems 
• For the experiments, the word image segmentation information is taken from the ground truth 
corpora. 
• The total word image queries for the Washington dataset was 1570 and for the Bentham dataset 
was 3668. 
• Both query sets contain words appearing in various frequencies and sizes 
• Evaluated against two previous segmentation-based profile-based strategies 
• Then, in order to highlight the advantage of the proposed DSLF, it was replaced by the SIFT but the 
proposed matching algorithm remained the same.
OVERALL PERFORMANCE EVALUATION RESULTS 
Washington 
0.660 0.637 
0.631 0.600 0.608 0.577 
0.436 0.440 
0.800 
0.700 
0.600 
0.500 
0.400 
0.300 
0.200 
0.100 
0.000 
P@5 MAP 
WS CSPD SIFT Proposed Method 
Bentham 
0.701 0.680 
0.642 0.630 
0.629 0.615 
0.528 0.506 
0.800 
0.700 
0.600 
0.500 
0.400 
0.300 
0.200 
0.100 
0.000 
P@5 MAP 
WS CSPD SIFT Proposed Method
CONCLUSION 
In this work, novel local features are proposed driven by the challenges presented in 
historical handwritten word spotting scenarios. 
The proposed method outperformed both the profile-based strategies and the SIFT 
local features. 
Moreover, a matching procedure was presented based on Local Proximity Nearest 
Neighbour, that augments performance in terms of effectiveness and efficiency 
incorporating spatial context. 
The proposed framework achieves better performance after a consistent evaluation 
against two profile-based approaches as well as the proposed approach with the 
popular SIFT local features in two different handwritten datasets.
ΕΥΧΑΡΙΣΤΩ ΠΟΛY! 
THANK YOU!

More Related Content

What's hot

Self-Directing Text Detection and Removal from Images with Smoothing
Self-Directing Text Detection and Removal from Images with SmoothingSelf-Directing Text Detection and Removal from Images with Smoothing
Self-Directing Text Detection and Removal from Images with Smoothing
Priyanka Wagh
 
IRJET- Object Detection using Hausdorff Distance
IRJET-  	  Object Detection using Hausdorff DistanceIRJET-  	  Object Detection using Hausdorff Distance
IRJET- Object Detection using Hausdorff Distance
IRJET Journal
 
Text extraction from images
Text extraction from imagesText extraction from images
Text extraction from images
Garby Baby
 
Enhancement and Segmentation of Historical Records
Enhancement and Segmentation of Historical RecordsEnhancement and Segmentation of Historical Records
Enhancement and Segmentation of Historical Records
csandit
 
Currency recognition on mobile phones
Currency recognition on mobile phonesCurrency recognition on mobile phones
Currency recognition on mobile phones
habeebsab
 
Das09112008
Das09112008Das09112008
Das09112008
sunnyjohn
 
Integrated Hidden Markov Model and Kalman Filter for Online Object Tracking
Integrated Hidden Markov Model and Kalman Filter for Online Object TrackingIntegrated Hidden Markov Model and Kalman Filter for Online Object Tracking
Integrated Hidden Markov Model and Kalman Filter for Online Object Tracking
ijsrd.com
 
Improved wolf algorithm on document images detection using optimum mean techn...
Improved wolf algorithm on document images detection using optimum mean techn...Improved wolf algorithm on document images detection using optimum mean techn...
Improved wolf algorithm on document images detection using optimum mean techn...
journalBEEI
 
A Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detectionA Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detection
vivatechijri
 
C04741319
C04741319C04741319
C04741319
IOSR-JEN
 
AN IMPROVED MULTI-SOM ALGORITHM
AN IMPROVED MULTI-SOM ALGORITHMAN IMPROVED MULTI-SOM ALGORITHM
AN IMPROVED MULTI-SOM ALGORITHM
IJNSA Journal
 
Kernel based similarity estimation and real time tracking of moving
Kernel based similarity estimation and real time tracking of movingKernel based similarity estimation and real time tracking of moving
Kernel based similarity estimation and real time tracking of movingIAEME Publication
 
DETECTION OF DENSE, OVERLAPPING, GEOMETRIC OBJECTS
DETECTION OF DENSE, OVERLAPPING, GEOMETRIC OBJECTSDETECTION OF DENSE, OVERLAPPING, GEOMETRIC OBJECTS
DETECTION OF DENSE, OVERLAPPING, GEOMETRIC OBJECTS
ijaia
 
Text detection and recognition from natural scenes
Text detection and recognition from natural scenesText detection and recognition from natural scenes
Text detection and recognition from natural scenes
hemanthmcqueen
 
Self-organizing map
Self-organizing mapSelf-organizing map
Self-organizing map
Tarat Diloksawatdikul
 

What's hot (17)

Self-Directing Text Detection and Removal from Images with Smoothing
Self-Directing Text Detection and Removal from Images with SmoothingSelf-Directing Text Detection and Removal from Images with Smoothing
Self-Directing Text Detection and Removal from Images with Smoothing
 
IRJET- Object Detection using Hausdorff Distance
IRJET-  	  Object Detection using Hausdorff DistanceIRJET-  	  Object Detection using Hausdorff Distance
IRJET- Object Detection using Hausdorff Distance
 
Text extraction from images
Text extraction from imagesText extraction from images
Text extraction from images
 
Enhancement and Segmentation of Historical Records
Enhancement and Segmentation of Historical RecordsEnhancement and Segmentation of Historical Records
Enhancement and Segmentation of Historical Records
 
Currency recognition on mobile phones
Currency recognition on mobile phonesCurrency recognition on mobile phones
Currency recognition on mobile phones
 
Das09112008
Das09112008Das09112008
Das09112008
 
Integrated Hidden Markov Model and Kalman Filter for Online Object Tracking
Integrated Hidden Markov Model and Kalman Filter for Online Object TrackingIntegrated Hidden Markov Model and Kalman Filter for Online Object Tracking
Integrated Hidden Markov Model and Kalman Filter for Online Object Tracking
 
50120140501016
5012014050101650120140501016
50120140501016
 
Improved wolf algorithm on document images detection using optimum mean techn...
Improved wolf algorithm on document images detection using optimum mean techn...Improved wolf algorithm on document images detection using optimum mean techn...
Improved wolf algorithm on document images detection using optimum mean techn...
 
A Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detectionA Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detection
 
C04741319
C04741319C04741319
C04741319
 
AN IMPROVED MULTI-SOM ALGORITHM
AN IMPROVED MULTI-SOM ALGORITHMAN IMPROVED MULTI-SOM ALGORITHM
AN IMPROVED MULTI-SOM ALGORITHM
 
Technical Portion of PhD Research
Technical Portion of PhD ResearchTechnical Portion of PhD Research
Technical Portion of PhD Research
 
Kernel based similarity estimation and real time tracking of moving
Kernel based similarity estimation and real time tracking of movingKernel based similarity estimation and real time tracking of moving
Kernel based similarity estimation and real time tracking of moving
 
DETECTION OF DENSE, OVERLAPPING, GEOMETRIC OBJECTS
DETECTION OF DENSE, OVERLAPPING, GEOMETRIC OBJECTSDETECTION OF DENSE, OVERLAPPING, GEOMETRIC OBJECTS
DETECTION OF DENSE, OVERLAPPING, GEOMETRIC OBJECTS
 
Text detection and recognition from natural scenes
Text detection and recognition from natural scenesText detection and recognition from natural scenes
Text detection and recognition from natural scenes
 
Self-organizing map
Self-organizing mapSelf-organizing map
Self-organizing map
 

Viewers also liked

Svm based cbir of breast masses on mammograms
Svm based cbir of breast masses on mammogramsSvm based cbir of breast masses on mammograms
Svm based cbir of breast masses on mammograms
Konstantinos Zagoris
 
Spotting Customers in Trouble
Spotting Customers in TroubleSpotting Customers in Trouble
Spotting Customers in Trouble
Credit Management Association
 
A Semi-Automatic Annotation Tool For Arabic Online Handwritten Text
A Semi-Automatic Annotation Tool For Arabic Online Handwritten TextA Semi-Automatic Annotation Tool For Arabic Online Handwritten Text
A Semi-Automatic Annotation Tool For Arabic Online Handwritten Text
Randa Elanwar
 
Content and Metadata Based Image Document Retrieval (in Greek)
Content and Metadata Based Image Document Retrieval (in Greek)Content and Metadata Based Image Document Retrieval (in Greek)
Content and Metadata Based Image Document Retrieval (in Greek)
Konstantinos Zagoris
 
Comparative Performance Evaluation of Image Descriptors Over IEEE 802.11b Noi...
Comparative Performance Evaluation of Image Descriptors Over IEEE 802.11b Noi...Comparative Performance Evaluation of Image Descriptors Over IEEE 802.11b Noi...
Comparative Performance Evaluation of Image Descriptors Over IEEE 802.11b Noi...
Konstantinos Zagoris
 
A simple text ine segmentation method for handwritten documents1
A simple text ine segmentation method for handwritten documents1A simple text ine segmentation method for handwritten documents1
A simple text ine segmentation method for handwritten documents1
Prasad Babu
 
Word segmentation method for handwritten documents based on structured learning
Word segmentation method for handwritten documents based on structured learningWord segmentation method for handwritten documents based on structured learning
Word segmentation method for handwritten documents based on structured learning
I3E Technologies
 
Holistic Approach for Arabic Word Recognition
Holistic Approach for Arabic Word RecognitionHolistic Approach for Arabic Word Recognition
Holistic Approach for Arabic Word Recognition
Editor IJCATR
 
Query expansion based on visual content new
Query expansion based on visual content newQuery expansion based on visual content new
Query expansion based on visual content new
Lazaros Tsochatzidis
 
Segmentation and recognition of handwritten gurmukhi script
Segmentation  and recognition of handwritten gurmukhi scriptSegmentation  and recognition of handwritten gurmukhi script
Segmentation and recognition of handwritten gurmukhi script
RAJENDRA VERMA
 
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases
Dynamic Two-Stage Image Retrieval from Large Multimodal DatabasesDynamic Two-Stage Image Retrieval from Large Multimodal Databases
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases
Konstantinos Zagoris
 
Online handwritten script recognition
Online handwritten script recognitionOnline handwritten script recognition
Online handwritten script recognitionDhiraj Singh
 
Performance of Statistics Based Line Segmentation System for Unconstrained H...
Performance of Statistics Based Line Segmentation  System for Unconstrained H...Performance of Statistics Based Line Segmentation  System for Unconstrained H...
Performance of Statistics Based Line Segmentation System for Unconstrained H...
AM Publications
 
Ppt On Lcm & Hcf Questions For Cat Preparation
Ppt On Lcm & Hcf Questions For Cat PreparationPpt On Lcm & Hcf Questions For Cat Preparation
Ppt On Lcm & Hcf Questions For Cat Preparation
TCY Learning Solutions (P) Ltd.
 
Arabic Handwritten Script Recognition Towards Generalization: A Survey
Arabic Handwritten Script Recognition Towards Generalization: A Survey Arabic Handwritten Script Recognition Towards Generalization: A Survey
Arabic Handwritten Script Recognition Towards Generalization: A Survey
Randa Elanwar
 
Automatic handwriting recognition
Automatic handwriting recognitionAutomatic handwriting recognition
Automatic handwriting recognitionBIJIT GHOSH
 

Viewers also liked (16)

Svm based cbir of breast masses on mammograms
Svm based cbir of breast masses on mammogramsSvm based cbir of breast masses on mammograms
Svm based cbir of breast masses on mammograms
 
Spotting Customers in Trouble
Spotting Customers in TroubleSpotting Customers in Trouble
Spotting Customers in Trouble
 
A Semi-Automatic Annotation Tool For Arabic Online Handwritten Text
A Semi-Automatic Annotation Tool For Arabic Online Handwritten TextA Semi-Automatic Annotation Tool For Arabic Online Handwritten Text
A Semi-Automatic Annotation Tool For Arabic Online Handwritten Text
 
Content and Metadata Based Image Document Retrieval (in Greek)
Content and Metadata Based Image Document Retrieval (in Greek)Content and Metadata Based Image Document Retrieval (in Greek)
Content and Metadata Based Image Document Retrieval (in Greek)
 
Comparative Performance Evaluation of Image Descriptors Over IEEE 802.11b Noi...
Comparative Performance Evaluation of Image Descriptors Over IEEE 802.11b Noi...Comparative Performance Evaluation of Image Descriptors Over IEEE 802.11b Noi...
Comparative Performance Evaluation of Image Descriptors Over IEEE 802.11b Noi...
 
A simple text ine segmentation method for handwritten documents1
A simple text ine segmentation method for handwritten documents1A simple text ine segmentation method for handwritten documents1
A simple text ine segmentation method for handwritten documents1
 
Word segmentation method for handwritten documents based on structured learning
Word segmentation method for handwritten documents based on structured learningWord segmentation method for handwritten documents based on structured learning
Word segmentation method for handwritten documents based on structured learning
 
Holistic Approach for Arabic Word Recognition
Holistic Approach for Arabic Word RecognitionHolistic Approach for Arabic Word Recognition
Holistic Approach for Arabic Word Recognition
 
Query expansion based on visual content new
Query expansion based on visual content newQuery expansion based on visual content new
Query expansion based on visual content new
 
Segmentation and recognition of handwritten gurmukhi script
Segmentation  and recognition of handwritten gurmukhi scriptSegmentation  and recognition of handwritten gurmukhi script
Segmentation and recognition of handwritten gurmukhi script
 
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases
Dynamic Two-Stage Image Retrieval from Large Multimodal DatabasesDynamic Two-Stage Image Retrieval from Large Multimodal Databases
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases
 
Online handwritten script recognition
Online handwritten script recognitionOnline handwritten script recognition
Online handwritten script recognition
 
Performance of Statistics Based Line Segmentation System for Unconstrained H...
Performance of Statistics Based Line Segmentation  System for Unconstrained H...Performance of Statistics Based Line Segmentation  System for Unconstrained H...
Performance of Statistics Based Line Segmentation System for Unconstrained H...
 
Ppt On Lcm & Hcf Questions For Cat Preparation
Ppt On Lcm & Hcf Questions For Cat PreparationPpt On Lcm & Hcf Questions For Cat Preparation
Ppt On Lcm & Hcf Questions For Cat Preparation
 
Arabic Handwritten Script Recognition Towards Generalization: A Survey
Arabic Handwritten Script Recognition Towards Generalization: A Survey Arabic Handwritten Script Recognition Towards Generalization: A Survey
Arabic Handwritten Script Recognition Towards Generalization: A Survey
 
Automatic handwriting recognition
Automatic handwriting recognitionAutomatic handwriting recognition
Automatic handwriting recognition
 

Similar to Segmentation - based Historical Handwritten Word Spotting using document-specific Local Features

Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
Nimrita Koul
 
Reference Scope Identification of Citances Using Convolutional Neural Network
Reference Scope Identification of Citances Using Convolutional Neural NetworkReference Scope Identification of Citances Using Convolutional Neural Network
Reference Scope Identification of Citances Using Convolutional Neural Network
Saurav Jha
 
large_scale_search.pdf
large_scale_search.pdflarge_scale_search.pdf
large_scale_search.pdf
Emerald72
 
Enhanced characterness for text detection in the wild
Enhanced characterness for text detection in the wildEnhanced characterness for text detection in the wild
Enhanced characterness for text detection in the wild
Prerana Mukherjee
 
Spatial Data Mining : Seminar
Spatial Data Mining : SeminarSpatial Data Mining : Seminar
Spatial Data Mining : SeminarIpsit Dash
 
A Visual Exploration of Distance, Documents, and Distributions
A Visual Exploration of Distance, Documents, and DistributionsA Visual Exploration of Distance, Documents, and Distributions
A Visual Exploration of Distance, Documents, and Distributions
Rebecca Bilbro
 
Words in Space - Rebecca Bilbro
Words in Space - Rebecca BilbroWords in Space - Rebecca Bilbro
Words in Space - Rebecca Bilbro
PyData
 
2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong Lee2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong Lee
Moazzem Hossain
 
2019 cvpr paper_overview
2019 cvpr paper_overview2019 cvpr paper_overview
2019 cvpr paper_overview
LEE HOSEONG
 
Top-k Exploration of Query Candidates for Efficient Keyword Search on Graph-S...
Top-k Exploration of Query Candidates for Efficient Keyword Search on Graph-S...Top-k Exploration of Query Candidates for Efficient Keyword Search on Graph-S...
Top-k Exploration of Query Candidates for Efficient Keyword Search on Graph-S...
Thanh Tran
 
Efficient Query Processing in Geographic Web Search Engines
Efficient Query Processing in Geographic Web Search EnginesEfficient Query Processing in Geographic Web Search Engines
Efficient Query Processing in Geographic Web Search Engines
Yen-Yu Chen
 
Cork AI Meetup Number 3
Cork AI Meetup Number 3Cork AI Meetup Number 3
Cork AI Meetup Number 3
Nick Grattan
 
Presentation @SIGIR2015
Presentation @SIGIR2015Presentation @SIGIR2015
Presentation @SIGIR2015
Michele Trevisiol
 
Show observe and tell giang nguyen
Show observe and tell   giang nguyenShow observe and tell   giang nguyen
Show observe and tell giang nguyen
Nguyen Giang
 
P.maria sheeba 15 mco010
P.maria sheeba 15 mco010P.maria sheeba 15 mco010
P.maria sheeba 15 mco010
W3Edify
 
Dagstuhl 2013 - Montali - On the Relationship between OBDA and Relational Map...
Dagstuhl 2013 - Montali - On the Relationship between OBDA and Relational Map...Dagstuhl 2013 - Montali - On the Relationship between OBDA and Relational Map...
Dagstuhl 2013 - Montali - On the Relationship between OBDA and Relational Map...
Faculty of Computer Science - Free University of Bozen-Bolzano
 
Domainspecificsubgraph extraction ieee-bigdata2016
Domainspecificsubgraph extraction ieee-bigdata2016Domainspecificsubgraph extraction ieee-bigdata2016
Domainspecificsubgraph extraction ieee-bigdata2016
Sarasi Sarangi
 
Domainspecificsubgraph extraction ieee-bigdata2016
Domainspecificsubgraph extraction ieee-bigdata2016Domainspecificsubgraph extraction ieee-bigdata2016
Domainspecificsubgraph extraction ieee-bigdata2016
Artificial Intelligence Institute at UofSC
 
COHESIVE MULTI-ORIENTED TEXT DETECTION AND RECOGNITION STRUCTURE IN NATURAL S...
COHESIVE MULTI-ORIENTED TEXT DETECTION AND RECOGNITION STRUCTURE IN NATURAL S...COHESIVE MULTI-ORIENTED TEXT DETECTION AND RECOGNITION STRUCTURE IN NATURAL S...
COHESIVE MULTI-ORIENTED TEXT DETECTION AND RECOGNITION STRUCTURE IN NATURAL S...
ijdpsjournal
 
COHESIVE MULTI-ORIENTED TEXT DETECTION AND RECOGNITION STRUCTURE IN NATURAL S...
COHESIVE MULTI-ORIENTED TEXT DETECTION AND RECOGNITION STRUCTURE IN NATURAL S...COHESIVE MULTI-ORIENTED TEXT DETECTION AND RECOGNITION STRUCTURE IN NATURAL S...
COHESIVE MULTI-ORIENTED TEXT DETECTION AND RECOGNITION STRUCTURE IN NATURAL S...
ijdpsjournal
 

Similar to Segmentation - based Historical Handwritten Word Spotting using document-specific Local Features (20)

Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Reference Scope Identification of Citances Using Convolutional Neural Network
Reference Scope Identification of Citances Using Convolutional Neural NetworkReference Scope Identification of Citances Using Convolutional Neural Network
Reference Scope Identification of Citances Using Convolutional Neural Network
 
large_scale_search.pdf
large_scale_search.pdflarge_scale_search.pdf
large_scale_search.pdf
 
Enhanced characterness for text detection in the wild
Enhanced characterness for text detection in the wildEnhanced characterness for text detection in the wild
Enhanced characterness for text detection in the wild
 
Spatial Data Mining : Seminar
Spatial Data Mining : SeminarSpatial Data Mining : Seminar
Spatial Data Mining : Seminar
 
A Visual Exploration of Distance, Documents, and Distributions
A Visual Exploration of Distance, Documents, and DistributionsA Visual Exploration of Distance, Documents, and Distributions
A Visual Exploration of Distance, Documents, and Distributions
 
Words in Space - Rebecca Bilbro
Words in Space - Rebecca BilbroWords in Space - Rebecca Bilbro
Words in Space - Rebecca Bilbro
 
2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong Lee2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong Lee
 
2019 cvpr paper_overview
2019 cvpr paper_overview2019 cvpr paper_overview
2019 cvpr paper_overview
 
Top-k Exploration of Query Candidates for Efficient Keyword Search on Graph-S...
Top-k Exploration of Query Candidates for Efficient Keyword Search on Graph-S...Top-k Exploration of Query Candidates for Efficient Keyword Search on Graph-S...
Top-k Exploration of Query Candidates for Efficient Keyword Search on Graph-S...
 
Efficient Query Processing in Geographic Web Search Engines
Efficient Query Processing in Geographic Web Search EnginesEfficient Query Processing in Geographic Web Search Engines
Efficient Query Processing in Geographic Web Search Engines
 
Cork AI Meetup Number 3
Cork AI Meetup Number 3Cork AI Meetup Number 3
Cork AI Meetup Number 3
 
Presentation @SIGIR2015
Presentation @SIGIR2015Presentation @SIGIR2015
Presentation @SIGIR2015
 
Show observe and tell giang nguyen
Show observe and tell   giang nguyenShow observe and tell   giang nguyen
Show observe and tell giang nguyen
 
P.maria sheeba 15 mco010
P.maria sheeba 15 mco010P.maria sheeba 15 mco010
P.maria sheeba 15 mco010
 
Dagstuhl 2013 - Montali - On the Relationship between OBDA and Relational Map...
Dagstuhl 2013 - Montali - On the Relationship between OBDA and Relational Map...Dagstuhl 2013 - Montali - On the Relationship between OBDA and Relational Map...
Dagstuhl 2013 - Montali - On the Relationship between OBDA and Relational Map...
 
Domainspecificsubgraph extraction ieee-bigdata2016
Domainspecificsubgraph extraction ieee-bigdata2016Domainspecificsubgraph extraction ieee-bigdata2016
Domainspecificsubgraph extraction ieee-bigdata2016
 
Domainspecificsubgraph extraction ieee-bigdata2016
Domainspecificsubgraph extraction ieee-bigdata2016Domainspecificsubgraph extraction ieee-bigdata2016
Domainspecificsubgraph extraction ieee-bigdata2016
 
COHESIVE MULTI-ORIENTED TEXT DETECTION AND RECOGNITION STRUCTURE IN NATURAL S...
COHESIVE MULTI-ORIENTED TEXT DETECTION AND RECOGNITION STRUCTURE IN NATURAL S...COHESIVE MULTI-ORIENTED TEXT DETECTION AND RECOGNITION STRUCTURE IN NATURAL S...
COHESIVE MULTI-ORIENTED TEXT DETECTION AND RECOGNITION STRUCTURE IN NATURAL S...
 
COHESIVE MULTI-ORIENTED TEXT DETECTION AND RECOGNITION STRUCTURE IN NATURAL S...
COHESIVE MULTI-ORIENTED TEXT DETECTION AND RECOGNITION STRUCTURE IN NATURAL S...COHESIVE MULTI-ORIENTED TEXT DETECTION AND RECOGNITION STRUCTURE IN NATURAL S...
COHESIVE MULTI-ORIENTED TEXT DETECTION AND RECOGNITION STRUCTURE IN NATURAL S...
 

Recently uploaded

Richard's entangled aventures in wonderland
Richard's entangled aventures in wonderlandRichard's entangled aventures in wonderland
Richard's entangled aventures in wonderland
Richard Gill
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
silvermistyshot
 
extra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdfextra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdf
DiyaBiswas10
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
RenuJangid3
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
Richard Gill
 
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
NathanBaughman3
 
erythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptxerythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptx
muralinath2
 
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptxBody fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
muralinath2
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
AlaminAfendy1
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
muralinath2
 
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsGBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
Areesha Ahmad
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Sérgio Sacani
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
IvanMallco1
 
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCINGRNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
AADYARAJPANDEY1
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
University of Maribor
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
ossaicprecious19
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
pablovgd
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Erdal Coalmaker
 
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
Health Advances
 
Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...
Sérgio Sacani
 

Recently uploaded (20)

Richard's entangled aventures in wonderland
Richard's entangled aventures in wonderlandRichard's entangled aventures in wonderland
Richard's entangled aventures in wonderland
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
 
extra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdfextra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdf
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
 
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
 
erythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptxerythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptx
 
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptxBody fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
 
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsGBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
 
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCINGRNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
 
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
 
Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...
 

Segmentation - based Historical Handwritten Word Spotting using document-specific Local Features

  • 1. SEGMENTATION-BASED HISTORICAL HANDWRITTEN WORD SPOTTING USING DOCUMENT-SPECIFIC LOCAL FEATURES KONSTANTINOS ZAGORIS1,2 IOANNIS PRATIKAKIS1 BASILIS GATOS2 Visual Computing Group Democritus University of Thrace Dept. of Electrical and Computer Engineering Xanthi, Greece National Centre of Scientific Research “Demokritos” Institute of Informatics and Telecommunications Athens, Greece 1 2
  • 2. WHAT IS KEY WORD SPOTTING? • It is the task of identifying locations on a document image which have high probability to contain an instance of a queried word • without explicitly recognizing it. • It is related to Content-Based Image Retrieval systems. • Searching a word image from a set of unindexed document images using the image content as the only information source.
  • 3. CURRENT LITERATURE TRENDS • Currently there are two distinct trends. (i) Segmentation-based and (ii) Segmentation-free approaches. • Their fundamental difference concerns the search space - segmented word images (segmentation-based) - complete document image (segmentation-free). We address the word spotting problem with a segmentation-based approach.
  • 4. PREVIOUS LITERATURE Rath and Manmatha calculate two families of feature sets. • scalar type features that include aspect ratio, area, etc. • profile-based features that are based on horizontal and vertical words projections and the upper and lower word profiles. Zagoris et. al. created a similar set of profile-based features but: • encoded Discrete Cosine Transformation and • quantize through the Gustafson - Kessel fuzzy algorithm. Rodriguez and Perronnin extract features from a sliding window, based on the first gradient and inspired by the SIFT keypoint descriptor.
  • 5. BAG-OF-VISUAL WORDS MODEL Recently, there was an influx of works based on the local features in the form of the Bag-of-Visual Words model. Llados et. al. evaluate the performance of various word descriptors : • a bag of visual words procedure (BoVW), • a pseudo-structural representation based on Loci Features, • a structural approach by using words as graphs, and • sequences of column features based on DTW. They found that the statistical approach of the BoVW produces the best results, although the memory requirements to store the descriptors are significant.
  • 6. PROBLEMS WITH CURRENT LOCAL FEATURES Most works using local features are based on the Scale Invariant Feature Transform (SIFT) in order to describe the local information • The original application of these local features are the natural images which they have many structural differences compared to document images • The detection of the most powerful edges through pyramid scaling creates local points between text lines. • Invariant properties in the descriptor results in noise amplification so they are more sensitive to the noise and the complex texture of the background.
  • 7. TEXTURE VS SHAPE FEATURES Features for word spotting which rely only on word shape characteristics are not effective in dealing with a document collection created by different writers, containing significant writing style variations. Although slant and skew preprocessing techniques can reduce the shape variations, they cannot eliminate the problem as the whole structure of the word is different in most of the cases. In this respect, we argue that although the shape information is meaningful, the texture information in a spatial context is more reliable.
  • 8. DOCUMENT SPECIFIC LOCAL FEATURES (DSLF) Taking into account the aforementioned considerations, we propose: • novel local features which are specific for documents and a • matching procedure that does not rely on codebook creation (as on BoVW).
  • 9. PROPOSED WORD SPOTTING FRAMEWORK Document Dataset Word Image Segmentation Word Image DSLF Calculation Matching Process Display Results Database DSLF Calculation ONLINE OFFLINE
  • 10. KEYPOINT DETECTION AND SELECTION Keypoint Detection CCs Analysis Local Point Selection Gradient Orientation Quantization Convex Hull Corner Detection Entropy-based Final Keypoints Keypoint Filtering
  • 11. KEYPOINT DETECTION AND SELECTION original document image orientation of the gradient vector quantization of the gradient vector orientation initial keypoints final keypoints
  • 12. FEATURE EXTRACTION • The feature for the local keypoint is calculated upon the quantized gradient angles • An area of 18x18 pixels around the kP, is divided into 9 cells with size 6x6 for each of them. • Each cell is represented by a 3-bin histogram (each bin corresponds to a quantization level). • Each pixel accumulates a vote in the corresponding angle histogram bin. The strength of voting depends on the norm of the gradient vector and on the distance from the location of local point as shown at the following equation: x, y x, y x, y V  s  G     2 2 , 2 1 LP LP 3 9 2 x y x x y y s       • The task of the 푠푥,푦 variable is to weigh the pixel participation to the histogram taking into account its distance from the kP.
  • 13. MATCHING PROCEDURE • In the case of segmentation-based word spotting, the aim is to match the query keypoints to the corresponding keypoints of any word image in the document. • Local Proximity Nearest Neighbor (LPNN) search is implemented. • The advantage of LPNN search is two-fold:  it enables a search in focused areas instead of searching in a brute force manner and  it goes beyond the typical use of a descriptor by the incorporation of spatial context in the local search addressed.
  • 14. MATCHING PROCEDURE Update the location for each keypoint to a new normalized space: ′ = 푝푥푖 푝푥푖 −푐푥 퐷푥 ′ = , 푝푦푖 푝푦푖 −푐푦 퐷푦 where: 푐푥, 푐푦 = 푘 푖=푥1 푖 푝푘 , 푘 푖=푦1 푖 푝푘 퐷푥 = 푘 푝푥푖 푖=1 − 푐푥 푘 , 퐷푦 = 푘 푝푦푖 푖=1 − 푐푦 푘 k denotes the total number of the keypoints in a word image.
  • 15. EVALUATION - DATASETS BENTHAM DATASET • It consists of 50 high quality (approximately 3000 pixel width and 4000 pixel height) handwritten manuscripts written by Jeremy Bentham (1748- 1832). • The variation of the same word is extreme and involves writing style, font size, noise as well as their combination.
  • 16. EVALUATION - DATASETS WASHINGTON DATASET • It consists of 20 document images from George Washington Collection of the Library of Congress • The documents are were scanned from microfilm in 300 dpi resolution.
  • 17. EVALUATION STRATEGY • Two evaluation metrics: Precision at the k Top Retrieved words (P@k) and the Mean Average Precision (MAP). • P@5 is the precision at top 5 retrieved words. This metric defines how successfully the algorithms produce relevant results to the first 5 positions of the ranking list • MAP is a typical measure for the performance of information retrieval systems • For the experiments, the word image segmentation information is taken from the ground truth corpora. • The total word image queries for the Washington dataset was 1570 and for the Bentham dataset was 3668. • Both query sets contain words appearing in various frequencies and sizes • Evaluated against two previous segmentation-based profile-based strategies • Then, in order to highlight the advantage of the proposed DSLF, it was replaced by the SIFT but the proposed matching algorithm remained the same.
  • 18. OVERALL PERFORMANCE EVALUATION RESULTS Washington 0.660 0.637 0.631 0.600 0.608 0.577 0.436 0.440 0.800 0.700 0.600 0.500 0.400 0.300 0.200 0.100 0.000 P@5 MAP WS CSPD SIFT Proposed Method Bentham 0.701 0.680 0.642 0.630 0.629 0.615 0.528 0.506 0.800 0.700 0.600 0.500 0.400 0.300 0.200 0.100 0.000 P@5 MAP WS CSPD SIFT Proposed Method
  • 19. CONCLUSION In this work, novel local features are proposed driven by the challenges presented in historical handwritten word spotting scenarios. The proposed method outperformed both the profile-based strategies and the SIFT local features. Moreover, a matching procedure was presented based on Local Proximity Nearest Neighbour, that augments performance in terms of effectiveness and efficiency incorporating spatial context. The proposed framework achieves better performance after a consistent evaluation against two profile-based approaches as well as the proposed approach with the popular SIFT local features in two different handwritten datasets.

Editor's Notes

  1. Although there is an abundance of systems suitable for both modern and historical printed material, very few of these systems are suitable to handwritten documents due to noise sensitivity, character variation and text layout complexity:
  2. we argue that it is not beneficial in document images to incorporate invariant properties in the descriptor of the local points as it results in noise amplification. We believe that the features that are invariant to rotation are more sensitive to the noise and the complex texture of the background.
  3. Features for word spotting which rely only on word shape characteristics are not effective in dealing with a document collection created by different writers, containing significant writing style variations. Although slant and skew preprocessing techniques can reduce the shape variations, they cannot eliminate the problem as the whole structure of the word is different in most of the cases. In this respect, we argue that although the shape information is meaningful, the texture information in a spatial context is more reliable.
  4. For the sake of clarity, it is worth to note that since the focus of this work is on features extraction and matching, the segmented word images used in the proposed approach are achieved from the available ground truth dataset without involving any particular word image segmentation method.
  5. But initially let’s look the proposed word spotting framework
  6. In the next stage, the LPNN for each keypoint that resides on the query image is addressed. LPNN is realized in a search area which is computed by taking into account a percentage (25%) of the already calculated distances Dx,Dy. During search, if there is one or more word keypoints in the proximity of the query keypoint under consideration, the Euclidean distance between their descriptors is calculated and the minimum distance is kept. This is repeated for each keypoint in the query image. The final similarity measure is the sum of all the minimal distances. If there is not a local point in its proximity then a penalty value is added to the similarity measure and it is equal to maximum Euclidean distance that can be calculated between the keypoint descriptors. As a final stage, the system presents to the user all the word images based on ascending sort order of the calculated similarity measure.
  7. The proposed method outperformed both the profile-based strategies and the SIFT local features. It is worth to note, that the profile-based features were applied in words that were binarized, denoised, de-skew and de-slant as opposed to the local features that were applied to the original word images. Moreover, although the SIFT descriptor contains more information than the proposed local features (128 values against only 27), the latter performed better in both datasets