SlideShare a Scribd company logo
www.projectsatbangalore.com 09591912372
2916 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 23, NO. 7, JULY 2014
Phase-Based Binarization of Ancient Document
Images: Model and Applications
Hossein Ziaei Nafchi, Reza Farrahi Moghaddam, Member, IEEE, and Mohamed Cheriet, Senior Member, IEEE
Abstract—In this paper, a phase-based binarization model for
ancient document images is proposed, as well as a postprocessing
method that can improve any binarization method and a ground
truth generation tool. Three feature maps derived from the phase
information of an input document image constitute the core of
this binarization model. These features are the maximum moment
of phase congruency covariance, a locally weighted mean phase
angle, and a phase preserved denoised image. The proposed
model consists of three standard steps: 1) preprocessing; 2) main
binarization; and 3) postprocessing. In the preprocessing and
main binarization steps, the features used are mainly phase
derived, while in the postprocessing step, specialized adaptive
Gaussian and median filters are considered. One of the outputs
of the binarization step, which shows high recall performance, is
used in a proposed postprocessing method to improve the perfor-
mance of other binarization methodologies. Finally, we develop
a ground truth generation tool, called PhaseGT, to simplify
and speed up the ground truth generation process for ancient
document images. The comprehensive experimental results on the
DIBCO’09, H-DIBCO’10, DIBCO’11, H-DIBCO’12, DIBCO’13,
PHIBD’12, and BICKLEY DIARY data sets show the robust-
ness of the proposed binarization method on various types of
degradation and document images.
Index Terms—Historical document binarization, phase-derived
features, ground truthing, document enhancement.
I. INTRODUCTION
LIBRARIES and archives around the world store an
abundance of old and historically important documents
and manuscripts. These documents accumulate a significant
amount of human heritage over time. However, many envi-
ronmental factors, improper handling, and the poor quality of
the materials used in their creation cause them to suffer a high
degree of degradation of various types. Today, there is a strong
move toward digitization of these manuscripts to preserve their
content for future generations. The huge amount of digital
data produced requires automatic processing, enhancement,
and recognition. A key step in all document image processing
workflows is binarization, but this is not a very sophisti-
cated process, which is unfortunate, as its performance has
Manuscript received August 8, 2013; revised January 29, 2014 and April 22,
2014; accepted April 26, 2014. Date of publication May 7, 2014; date of
current version May 27, 2014. This work was supported by the Natural
Sciences and Engineering Research Council of Canada. The associate editor
coordinating the review of this manuscript and approving it for publication
was Dr. Debargha Mukherjee.
The authors are with the Synchromedia Laboratory for Multime-
dia Communication in Telepresence, École de Technologie Supérieure,
Montreal, QC H3C 1K3, Canada (e-mail: hossein.zi@synchromedia.ca;
rfarrahi@synchromedia.ca; mohamed.cheriet@etsmtl.ca).
Color versions of one or more of the figures in this paper are available
online at http://ieeexplore.ieee.org.
Digital Object Identifier 10.1109/TIP.2014.2322451
Fig. 1. Sample document images selected from the DIBCO’09 [21],
H-DIBCO’10 [22], and DIBCO’11 datasets [23].
a significant influence on the quality of OCR results. Many
research studies have been carried out to solve the problems
that arise in the binarization of old document images char-
acterized by many types of degradation [1]–[19], including
faded ink, bleed-through, show-through, uneven illumination,
variations in image contrast, and deterioration of the cellulose
structure [1], [20]. There are also differences in patterns of
hand-written and machine-printed documents, which add to the
difficulties associated with the binarization of old document
images.
To the best of our knowledge, none of the proposed methods
can deal with all types of documents and degradation. For
more details, see the Related Work section. Fig. 1 shows some
of the degraded document images used in this paper.
In this paper, a robust phase-based binarization method is
proposed for the binarization and enhancement of historical
documents and manuscripts. The three main steps in the
proposed method are: preprocessing, main binarization, and
post-processing. The preprocessing step mainly involves image
denoising with phase preservation [24], followed by some
morphological operations. We incorporate the Canny edge
detector [25] and a denoised image to obtain a binarized image
in rough form.
Then, we use the phase congruency features [18], [19],
[26] for the main binarization step. Phase congruency is
widely used in the machine vision and image processing
1057-7149 © 2014 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.

More Related Content

Viewers also liked

Portafolio de dibujos de despiece
Portafolio de dibujos de despiecePortafolio de dibujos de despiece
Portafolio de dibujos de despiece
diego_ruiz81
 
Momento 4 201420_59_2016_16_1
Momento 4 201420_59_2016_16_1Momento 4 201420_59_2016_16_1
Momento 4 201420_59_2016_16_1
alejkari_13
 
Momento 3 201420_59_2016_16_1
Momento 3 201420_59_2016_16_1Momento 3 201420_59_2016_16_1
Momento 3 201420_59_2016_16_1
alejkari_13
 
Operação de Câmera de Vídeo
Operação de Câmera de VídeoOperação de Câmera de Vídeo
Operação de Câmera de Vídeo
Luciano Dias
 
Thai Luxury furniture exporter : Sarnn company profile
Thai Luxury furniture exporter : Sarnn company profileThai Luxury furniture exporter : Sarnn company profile
Thai Luxury furniture exporter : Sarnn company profile
Anunyaporn Mongkol
 
Equipes e Fases da Produção Audiovisual
Equipes e Fases da Produção AudiovisualEquipes e Fases da Produção Audiovisual
Equipes e Fases da Produção Audiovisual
Luciano Dias
 

Viewers also liked (7)

Portafolio de dibujos de despiece
Portafolio de dibujos de despiecePortafolio de dibujos de despiece
Portafolio de dibujos de despiece
 
Momento 4 201420_59_2016_16_1
Momento 4 201420_59_2016_16_1Momento 4 201420_59_2016_16_1
Momento 4 201420_59_2016_16_1
 
Diapositivas de johana 2
Diapositivas de johana 2Diapositivas de johana 2
Diapositivas de johana 2
 
Momento 3 201420_59_2016_16_1
Momento 3 201420_59_2016_16_1Momento 3 201420_59_2016_16_1
Momento 3 201420_59_2016_16_1
 
Operação de Câmera de Vídeo
Operação de Câmera de VídeoOperação de Câmera de Vídeo
Operação de Câmera de Vídeo
 
Thai Luxury furniture exporter : Sarnn company profile
Thai Luxury furniture exporter : Sarnn company profileThai Luxury furniture exporter : Sarnn company profile
Thai Luxury furniture exporter : Sarnn company profile
 
Equipes e Fases da Produção Audiovisual
Equipes e Fases da Produção AudiovisualEquipes e Fases da Produção Audiovisual
Equipes e Fases da Produção Audiovisual
 

Similar to Phase-Based Binarization of Ancient Document Images: Model and Applications

Guides
GuidesGuides
Guides
caycho123
 
Improvement of binarization performance using local otsu thresholding
Improvement of binarization performance using local otsu thresholdingImprovement of binarization performance using local otsu thresholding
Improvement of binarization performance using local otsu thresholding
IJECEIAES
 
IEEE 2014 MATLAB IMAGE PROCESSING PROJECTS Phase based-binarization-of-ancie...
IEEE 2014 MATLAB IMAGE PROCESSING PROJECTS  Phase based-binarization-of-ancie...IEEE 2014 MATLAB IMAGE PROCESSING PROJECTS  Phase based-binarization-of-ancie...
IEEE 2014 MATLAB IMAGE PROCESSING PROJECTS Phase based-binarization-of-ancie...
IEEEBEBTECHSTUDENTPROJECTS
 
Modified Approach of Hough Transform for Skew Detection and Correction in Doc...
Modified Approach of Hough Transform for Skew Detection and Correction in Doc...Modified Approach of Hough Transform for Skew Detection and Correction in Doc...
Modified Approach of Hough Transform for Skew Detection and Correction in Doc...
IJORCS
 
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, RomeWorkflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome
Carole Goble
 
An effective and robust technique for the binarization of degraded document i...
An effective and robust technique for the binarization of degraded document i...An effective and robust technique for the binarization of degraded document i...
An effective and robust technique for the binarization of degraded document i...
eSAT Publishing House
 
Web Mining Research Issues and Future Directions – A Survey
Web Mining Research Issues and Future Directions – A SurveyWeb Mining Research Issues and Future Directions – A Survey
Web Mining Research Issues and Future Directions – A Survey
IOSR Journals
 
DSD-INT 2019 Modelling in DANUBIUS-RI-Bellafiore
DSD-INT 2019 Modelling in DANUBIUS-RI-BellafioreDSD-INT 2019 Modelling in DANUBIUS-RI-Bellafiore
DSD-INT 2019 Modelling in DANUBIUS-RI-Bellafiore
Deltares
 
DCW Data Quality 1992
DCW Data Quality 1992DCW Data Quality 1992
DCW Data Quality 1992
Robert (Bob) Williams
 
Ic3414861499
Ic3414861499Ic3414861499
Ic3414861499
IJERA Editor
 
Digital Pathology Information Web Services (DPIWS): Convergence in Digital Pa...
Digital Pathology Information Web Services (DPIWS): Convergence in Digital Pa...Digital Pathology Information Web Services (DPIWS): Convergence in Digital Pa...
Digital Pathology Information Web Services (DPIWS): Convergence in Digital Pa...
Yves Sucaet
 
Simplifying Database Normalization within a Visual Interactive Simulation Model
Simplifying Database Normalization within a Visual Interactive Simulation ModelSimplifying Database Normalization within a Visual Interactive Simulation Model
Simplifying Database Normalization within a Visual Interactive Simulation Model
ijdms
 
DNA Query Language DNAQL: A Novel Approach
DNA Query Language DNAQL: A Novel ApproachDNA Query Language DNAQL: A Novel Approach
DNA Query Language DNAQL: A Novel ApproachEditor IJCATR
 
Enhancement of Degraded Document Images using Retinex and Morphological Opera...
Enhancement of Degraded Document Images using Retinex and Morphological Opera...Enhancement of Degraded Document Images using Retinex and Morphological Opera...
Enhancement of Degraded Document Images using Retinex and Morphological Opera...
IJCSIS Research Publications
 
Diagnostic hypothesis refinement in reproducible workflows for advanced medic...
Diagnostic hypothesis refinement in reproducible workflows for advanced medic...Diagnostic hypothesis refinement in reproducible workflows for advanced medic...
Diagnostic hypothesis refinement in reproducible workflows for advanced medic...
Cezary Mazurek
 
Design for Additively Manufactured Structure An Assessment
Design for Additively Manufactured Structure An AssessmentDesign for Additively Manufactured Structure An Assessment
Design for Additively Manufactured Structure An Assessment
ijtsrd
 

Similar to Phase-Based Binarization of Ancient Document Images: Model and Applications (20)

Guides
GuidesGuides
Guides
 
Improvement of binarization performance using local otsu thresholding
Improvement of binarization performance using local otsu thresholdingImprovement of binarization performance using local otsu thresholding
Improvement of binarization performance using local otsu thresholding
 
Sub1586
Sub1586Sub1586
Sub1586
 
IEEE 2014 MATLAB IMAGE PROCESSING PROJECTS Phase based-binarization-of-ancie...
IEEE 2014 MATLAB IMAGE PROCESSING PROJECTS  Phase based-binarization-of-ancie...IEEE 2014 MATLAB IMAGE PROCESSING PROJECTS  Phase based-binarization-of-ancie...
IEEE 2014 MATLAB IMAGE PROCESSING PROJECTS Phase based-binarization-of-ancie...
 
Modified Approach of Hough Transform for Skew Detection and Correction in Doc...
Modified Approach of Hough Transform for Skew Detection and Correction in Doc...Modified Approach of Hough Transform for Skew Detection and Correction in Doc...
Modified Approach of Hough Transform for Skew Detection and Correction in Doc...
 
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, RomeWorkflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome
 
An effective and robust technique for the binarization of degraded document i...
An effective and robust technique for the binarization of degraded document i...An effective and robust technique for the binarization of degraded document i...
An effective and robust technique for the binarization of degraded document i...
 
Web Mining Research Issues and Future Directions – A Survey
Web Mining Research Issues and Future Directions – A SurveyWeb Mining Research Issues and Future Directions – A Survey
Web Mining Research Issues and Future Directions – A Survey
 
DSD-INT 2019 Modelling in DANUBIUS-RI-Bellafiore
DSD-INT 2019 Modelling in DANUBIUS-RI-BellafioreDSD-INT 2019 Modelling in DANUBIUS-RI-Bellafiore
DSD-INT 2019 Modelling in DANUBIUS-RI-Bellafiore
 
DCW Data Quality 1992
DCW Data Quality 1992DCW Data Quality 1992
DCW Data Quality 1992
 
Ic3414861499
Ic3414861499Ic3414861499
Ic3414861499
 
project documentation
project documentationproject documentation
project documentation
 
Digital Pathology Information Web Services (DPIWS): Convergence in Digital Pa...
Digital Pathology Information Web Services (DPIWS): Convergence in Digital Pa...Digital Pathology Information Web Services (DPIWS): Convergence in Digital Pa...
Digital Pathology Information Web Services (DPIWS): Convergence in Digital Pa...
 
Digital Library
Digital LibraryDigital Library
Digital Library
 
Hyper3dpaper s
Hyper3dpaper sHyper3dpaper s
Hyper3dpaper s
 
Simplifying Database Normalization within a Visual Interactive Simulation Model
Simplifying Database Normalization within a Visual Interactive Simulation ModelSimplifying Database Normalization within a Visual Interactive Simulation Model
Simplifying Database Normalization within a Visual Interactive Simulation Model
 
DNA Query Language DNAQL: A Novel Approach
DNA Query Language DNAQL: A Novel ApproachDNA Query Language DNAQL: A Novel Approach
DNA Query Language DNAQL: A Novel Approach
 
Enhancement of Degraded Document Images using Retinex and Morphological Opera...
Enhancement of Degraded Document Images using Retinex and Morphological Opera...Enhancement of Degraded Document Images using Retinex and Morphological Opera...
Enhancement of Degraded Document Images using Retinex and Morphological Opera...
 
Diagnostic hypothesis refinement in reproducible workflows for advanced medic...
Diagnostic hypothesis refinement in reproducible workflows for advanced medic...Diagnostic hypothesis refinement in reproducible workflows for advanced medic...
Diagnostic hypothesis refinement in reproducible workflows for advanced medic...
 
Design for Additively Manufactured Structure An Assessment
Design for Additively Manufactured Structure An AssessmentDesign for Additively Manufactured Structure An Assessment
Design for Additively Manufactured Structure An Assessment
 

More from john236zaq

Images as Occlusions of Textures: A Framework for Segmentation
Images as Occlusions of Textures: A Framework for SegmentationImages as Occlusions of Textures: A Framework for Segmentation
Images as Occlusions of Textures: A Framework for Segmentation
john236zaq
 
Image Restoration Using Joint Statistical Modeling in a Space-Transform Domain
Image Restoration Using Joint Statistical Modeling in a Space-Transform DomainImage Restoration Using Joint Statistical Modeling in a Space-Transform Domain
Image Restoration Using Joint Statistical Modeling in a Space-Transform Domain
john236zaq
 
Low-Complexity DFT-Based Channel Estimation with Leakage Nulling for OFDM Sys...
Low-Complexity DFT-Based Channel Estimation with Leakage Nulling for OFDM Sys...Low-Complexity DFT-Based Channel Estimation with Leakage Nulling for OFDM Sys...
Low-Complexity DFT-Based Channel Estimation with Leakage Nulling for OFDM Sys...
john236zaq
 
Low-Rank Neighbor Embedding for Single Image Super-Resolution
Low-Rank Neighbor Embedding for Single Image Super-ResolutionLow-Rank Neighbor Embedding for Single Image Super-Resolution
Low-Rank Neighbor Embedding for Single Image Super-Resolution
john236zaq
 
Mining Weakly Labeled Web Facial Images for Search-Based Face Annotation
Mining Weakly Labeled Web Facial Images for Search-Based Face AnnotationMining Weakly Labeled Web Facial Images for Search-Based Face Annotation
Mining Weakly Labeled Web Facial Images for Search-Based Face Annotation
john236zaq
 
Modeling and Estimation of Transient Carrier Frequency Offset in Wireless Tra...
Modeling and Estimation of Transient Carrier Frequency Offset in Wireless Tra...Modeling and Estimation of Transient Carrier Frequency Offset in Wireless Tra...
Modeling and Estimation of Transient Carrier Frequency Offset in Wireless Tra...
john236zaq
 
OFDM Synthetic Aperture Radar Imaging With Sufficient Cyclic Prefix
OFDM Synthetic Aperture Radar Imaging With Sufficient Cyclic PrefixOFDM Synthetic Aperture Radar Imaging With Sufficient Cyclic Prefix
OFDM Synthetic Aperture Radar Imaging With Sufficient Cyclic Prefix
john236zaq
 
Progressive Image Denoising Through Hybrid Graph Laplacian Regularization: A ...
Progressive Image Denoising Through Hybrid Graph Laplacian Regularization: A ...Progressive Image Denoising Through Hybrid Graph Laplacian Regularization: A ...
Progressive Image Denoising Through Hybrid Graph Laplacian Regularization: A ...
john236zaq
 
Reversible De-Identification for Lossless Image Compression using Reversible ...
Reversible De-Identification for Lossless Image Compression using Reversible ...Reversible De-Identification for Lossless Image Compression using Reversible ...
Reversible De-Identification for Lossless Image Compression using Reversible ...
john236zaq
 
Stochastic Analysis of the LMS and NLMS Algorithms for Cyclostationary White ...
Stochastic Analysis of the LMS and NLMS Algorithms for Cyclostationary White ...Stochastic Analysis of the LMS and NLMS Algorithms for Cyclostationary White ...
Stochastic Analysis of the LMS and NLMS Algorithms for Cyclostationary White ...
john236zaq
 

More from john236zaq (10)

Images as Occlusions of Textures: A Framework for Segmentation
Images as Occlusions of Textures: A Framework for SegmentationImages as Occlusions of Textures: A Framework for Segmentation
Images as Occlusions of Textures: A Framework for Segmentation
 
Image Restoration Using Joint Statistical Modeling in a Space-Transform Domain
Image Restoration Using Joint Statistical Modeling in a Space-Transform DomainImage Restoration Using Joint Statistical Modeling in a Space-Transform Domain
Image Restoration Using Joint Statistical Modeling in a Space-Transform Domain
 
Low-Complexity DFT-Based Channel Estimation with Leakage Nulling for OFDM Sys...
Low-Complexity DFT-Based Channel Estimation with Leakage Nulling for OFDM Sys...Low-Complexity DFT-Based Channel Estimation with Leakage Nulling for OFDM Sys...
Low-Complexity DFT-Based Channel Estimation with Leakage Nulling for OFDM Sys...
 
Low-Rank Neighbor Embedding for Single Image Super-Resolution
Low-Rank Neighbor Embedding for Single Image Super-ResolutionLow-Rank Neighbor Embedding for Single Image Super-Resolution
Low-Rank Neighbor Embedding for Single Image Super-Resolution
 
Mining Weakly Labeled Web Facial Images for Search-Based Face Annotation
Mining Weakly Labeled Web Facial Images for Search-Based Face AnnotationMining Weakly Labeled Web Facial Images for Search-Based Face Annotation
Mining Weakly Labeled Web Facial Images for Search-Based Face Annotation
 
Modeling and Estimation of Transient Carrier Frequency Offset in Wireless Tra...
Modeling and Estimation of Transient Carrier Frequency Offset in Wireless Tra...Modeling and Estimation of Transient Carrier Frequency Offset in Wireless Tra...
Modeling and Estimation of Transient Carrier Frequency Offset in Wireless Tra...
 
OFDM Synthetic Aperture Radar Imaging With Sufficient Cyclic Prefix
OFDM Synthetic Aperture Radar Imaging With Sufficient Cyclic PrefixOFDM Synthetic Aperture Radar Imaging With Sufficient Cyclic Prefix
OFDM Synthetic Aperture Radar Imaging With Sufficient Cyclic Prefix
 
Progressive Image Denoising Through Hybrid Graph Laplacian Regularization: A ...
Progressive Image Denoising Through Hybrid Graph Laplacian Regularization: A ...Progressive Image Denoising Through Hybrid Graph Laplacian Regularization: A ...
Progressive Image Denoising Through Hybrid Graph Laplacian Regularization: A ...
 
Reversible De-Identification for Lossless Image Compression using Reversible ...
Reversible De-Identification for Lossless Image Compression using Reversible ...Reversible De-Identification for Lossless Image Compression using Reversible ...
Reversible De-Identification for Lossless Image Compression using Reversible ...
 
Stochastic Analysis of the LMS and NLMS Algorithms for Cyclostationary White ...
Stochastic Analysis of the LMS and NLMS Algorithms for Cyclostationary White ...Stochastic Analysis of the LMS and NLMS Algorithms for Cyclostationary White ...
Stochastic Analysis of the LMS and NLMS Algorithms for Cyclostationary White ...
 

Recently uploaded

ML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptxML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptx
Vijay Dialani, PhD
 
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
ydteq
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
Kamal Acharya
 
Investor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptxInvestor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptx
AmarGB2
 
The Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfThe Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdf
Pipe Restoration Solutions
 
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&BDesign and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Sreedhar Chowdam
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
R&R Consult
 
Railway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdfRailway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdf
TeeVichai
 
Runway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptxRunway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptx
SupreethSP4
 
Fundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptxFundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptx
manasideore6
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
gerogepatton
 
weather web application report.pdf
weather web application report.pdfweather web application report.pdf
weather web application report.pdf
Pratik Pawar
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
bakpo1
 
ASME IX(9) 2007 Full Version .pdf
ASME IX(9)  2007 Full Version       .pdfASME IX(9)  2007 Full Version       .pdf
ASME IX(9) 2007 Full Version .pdf
AhmedHussein950959
 
The role of big data in decision making.
The role of big data in decision making.The role of big data in decision making.
The role of big data in decision making.
ankuprajapati0525
 
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
thanhdowork
 
Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
Massimo Talia
 
CME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional ElectiveCME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional Elective
karthi keyan
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
Kerry Sado
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
zwunae
 

Recently uploaded (20)

ML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptxML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptx
 
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
 
Investor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptxInvestor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptx
 
The Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfThe Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdf
 
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&BDesign and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
 
Railway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdfRailway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdf
 
Runway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptxRunway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptx
 
Fundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptxFundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptx
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
 
weather web application report.pdf
weather web application report.pdfweather web application report.pdf
weather web application report.pdf
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
 
ASME IX(9) 2007 Full Version .pdf
ASME IX(9)  2007 Full Version       .pdfASME IX(9)  2007 Full Version       .pdf
ASME IX(9) 2007 Full Version .pdf
 
The role of big data in decision making.
The role of big data in decision making.The role of big data in decision making.
The role of big data in decision making.
 
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
 
Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
 
CME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional ElectiveCME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional Elective
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
 

Phase-Based Binarization of Ancient Document Images: Model and Applications

  • 1. www.projectsatbangalore.com 09591912372 2916 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 23, NO. 7, JULY 2014 Phase-Based Binarization of Ancient Document Images: Model and Applications Hossein Ziaei Nafchi, Reza Farrahi Moghaddam, Member, IEEE, and Mohamed Cheriet, Senior Member, IEEE Abstract—In this paper, a phase-based binarization model for ancient document images is proposed, as well as a postprocessing method that can improve any binarization method and a ground truth generation tool. Three feature maps derived from the phase information of an input document image constitute the core of this binarization model. These features are the maximum moment of phase congruency covariance, a locally weighted mean phase angle, and a phase preserved denoised image. The proposed model consists of three standard steps: 1) preprocessing; 2) main binarization; and 3) postprocessing. In the preprocessing and main binarization steps, the features used are mainly phase derived, while in the postprocessing step, specialized adaptive Gaussian and median filters are considered. One of the outputs of the binarization step, which shows high recall performance, is used in a proposed postprocessing method to improve the perfor- mance of other binarization methodologies. Finally, we develop a ground truth generation tool, called PhaseGT, to simplify and speed up the ground truth generation process for ancient document images. The comprehensive experimental results on the DIBCO’09, H-DIBCO’10, DIBCO’11, H-DIBCO’12, DIBCO’13, PHIBD’12, and BICKLEY DIARY data sets show the robust- ness of the proposed binarization method on various types of degradation and document images. Index Terms—Historical document binarization, phase-derived features, ground truthing, document enhancement. I. INTRODUCTION LIBRARIES and archives around the world store an abundance of old and historically important documents and manuscripts. These documents accumulate a significant amount of human heritage over time. However, many envi- ronmental factors, improper handling, and the poor quality of the materials used in their creation cause them to suffer a high degree of degradation of various types. Today, there is a strong move toward digitization of these manuscripts to preserve their content for future generations. The huge amount of digital data produced requires automatic processing, enhancement, and recognition. A key step in all document image processing workflows is binarization, but this is not a very sophisti- cated process, which is unfortunate, as its performance has Manuscript received August 8, 2013; revised January 29, 2014 and April 22, 2014; accepted April 26, 2014. Date of publication May 7, 2014; date of current version May 27, 2014. This work was supported by the Natural Sciences and Engineering Research Council of Canada. The associate editor coordinating the review of this manuscript and approving it for publication was Dr. Debargha Mukherjee. The authors are with the Synchromedia Laboratory for Multime- dia Communication in Telepresence, École de Technologie Supérieure, Montreal, QC H3C 1K3, Canada (e-mail: hossein.zi@synchromedia.ca; rfarrahi@synchromedia.ca; mohamed.cheriet@etsmtl.ca). Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org. Digital Object Identifier 10.1109/TIP.2014.2322451 Fig. 1. Sample document images selected from the DIBCO’09 [21], H-DIBCO’10 [22], and DIBCO’11 datasets [23]. a significant influence on the quality of OCR results. Many research studies have been carried out to solve the problems that arise in the binarization of old document images char- acterized by many types of degradation [1]–[19], including faded ink, bleed-through, show-through, uneven illumination, variations in image contrast, and deterioration of the cellulose structure [1], [20]. There are also differences in patterns of hand-written and machine-printed documents, which add to the difficulties associated with the binarization of old document images. To the best of our knowledge, none of the proposed methods can deal with all types of documents and degradation. For more details, see the Related Work section. Fig. 1 shows some of the degraded document images used in this paper. In this paper, a robust phase-based binarization method is proposed for the binarization and enhancement of historical documents and manuscripts. The three main steps in the proposed method are: preprocessing, main binarization, and post-processing. The preprocessing step mainly involves image denoising with phase preservation [24], followed by some morphological operations. We incorporate the Canny edge detector [25] and a denoised image to obtain a binarized image in rough form. Then, we use the phase congruency features [18], [19], [26] for the main binarization step. Phase congruency is widely used in the machine vision and image processing 1057-7149 © 2014 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.