SlideShare a Scribd company logo
1 of 31
Mr. Sovann EN
5th
Year Engineering student
Dept. Computer Science & Communication
Institute of Technology of Cambodia
Phnom Penh, Cambodia
Khmer OCR Using Generic Fourier
Descriptor
Content
Khmer OCR Using Generic Fourier Descriptor
2
 Introduction
 Khmer OCR System
 Pre-processing
 Segmentation
 Feature Extraction
 Recognition Process
 Generic Fourier Descriptor In detail
 Future work
Introduction
 Optical Character Recognition (OCR) is the
electronic conversion of scanned images into machine-
encoded text.
 OCR makes it possible to edit the text, to search for
a word or a phrase, to display or to print a copy free of
scanning artifacts and so on.
3
Khmer OCR Using Generic Fourier Descriptor Back
Introduction
 The ideas started prior to World War II.
Throughout the years, many reliable commercial and
academic prototypes have been developed in many
natural languages.
 Still, due to the lacked effort in Khmer OCR, there
is no reliable Khmer OCR software.
4
Khmer OCR Using Generic Fourier Descriptor Back
Introduction
 The ideas started prior to World War II.
Throughout the years, many reliable commercial and
academic prototypes have been developed in many
natural languages.
 Still, due to the lacked effort in Khmer OCR, there
is no reliable Khmer OCR software.
5
Khmer OCR Using Generic Fourier Descriptor Back
Introduction
 The ideas started prior to World War II.
Throughout the years, many reliable commercial and
academic prototypes have been developed in many
natural languages.
 Still, due to the lacked effort in Khmer OCR, there
is no reliable Khmer OCR software.
6
Khmer OCR Using Generic Fourier Descriptor Back
Introduction
7
Khmer OCR Using Generic Fourier Descriptor Back
 The current system is based on Mr. Vanna Kruy’s
work for his master degree at Waseda University.
 The Objective is to produce a reliable Khmer OCR
system which is independent of Font and Size.
Khmer OCR System
Khmer OCR Using Generic Fourier Descriptor
 OCR System consist of four majors stages :
8
 Pre-processing
 Segmentation
 Feature Extraction
 Post processing
Pre-processing
Khmer OCR Using Generic Fourier Descriptor
9
 Pre-processing aims to produce data that are easy
for the OCR systems to operate accurately.
 The main objectives of pre-processing are :
 Binarization
 Noise removal
Binarization
10
 Image linearization (thresholding) refers to the
conversion of a gray-scale image into a binary image.
Khmer OCR Using Generic Fourier Descriptor
Binarization of Input Image
Salt-Pepper Noise removal
11
Salt-and-pepper noise is a kind of noise which is
usually caused by small unnecessary dots produced by
either the scanner or the source document itself.
Khmer OCR Using Generic Fourier Descriptor Back
Particle removal
Segmentation
12
 Segmentation aims to produce each component to
be recognized by the system.
 The process is to separate the text of a page into
each separate line, then to separate each line into
Vertical Component, and finally produce each
independent symbol.
Khmer OCR Using Generic Fourier Descriptor Back
Connected Component Analysis
13
 Connected Component is a group of pixels
accumulating together to form a shape.
 The process is to separate the text of a page into
each separate line, then to separate each line into
Vertical Component, and finally produce each
independent symbol.
Khmer OCR Using Generic Fourier Descriptor Back
14
Example using CCA
Khmer OCR Using Generic Fourier Descriptor Back
Word Segmentation using CCA
15
Example using CCA
Main part :
Sup script :
Sub script :
Ccdown :
Khmer OCR Using Generic Fourier Descriptor Back
Feature extraction
16
 In feature extraction stage, each character is
represented as a feature vector which becomes its
identity.
The major goal of feature extraction is to extract a
set of features which maximizes the recognition rate
with the least amount of elements.
Khmer OCR Using Generic Fourier Descriptor Back
Generic Fourier Descriptor
Khmer OCR Using Generic Fourier Descriptor
17
 GFD is derived by applying two-dimensional
Fourier transform on a polar-raster sampled shape
image.
 Polar-raster sampled : the image obtained after
circularly sampling an object in an image up to its
maximum radius.
Example of GFD
18
Khmer OCR Using Generic Fourier Descriptor Back
 The similarity between two shapes are measured
by the city block distance between the two set of
GFDs.
ររkhmer OS UI font and its GFD
Recognition Process
19
Khmer OCR Using Generic Fourier Descriptor Back
 To recognize, we compare the GFD’s score
 The score is obtain by comparing the CCs features
vector with CCs feature in database.
Recognition Process : Score
20
Khmer OCR Using Generic Fourier Descriptor Back
 The similarity between two shapes is measured by
the City-Block distance of the two feature vectors of
the shape.
 The lower value means the more similar the shapes
are.
Generic Fourier Descriptor In detail
Khmer OCR Using Generic Fourier Descriptor
21
 GFD is derived by applying two-dimensional
Fourier transform on a polar-raster sampled shape
image.
 A region-base shape descriptor proposed by
Dengsheng Zhang & Guojun Lu. It is confirm to
outperforms common contour-based and region-based
shape descriptors.
Algorithm to computer GFD
Khmer OCR Using Generic Fourier Descriptor
22
 Input the binary shape image data f(x, y)
 Get centroid of the shape (xc, yc)
 Set the centroid as the origin
 Compute Polar Fourier Transform
 Calculate Fourier Descriptor
 Output GFD
Polar Transform
23
 It is the transforming into polar image f(r, θ), for an
input image f(x, y).
c
c
cc
xx
yy
yyxxr
−
−
=−+−= arctan,)()( 22
θ
∑∑
−
=
−
=
==
1
0
1
0
1
and
1
where
M
y
c
N
x
c y
N
yx
M
x
shapetheof(barycent)centroidofrcoordinatotheis),( cc yx
Khmer OCR Using Generic Fourier Descriptor Back
Polar Raster Grid
24
 Polar raster sampling
Polar image Polar raster sample image in Cartesian space
Khmer OCR Using Generic Fourier Descriptor Back
Polar Raster Grid
25
 Binary polar raster sample shape images
Khmer OCR Using Generic Fourier Descriptor Back
Shape, shape normalization and its polar transform
2D- Fourier Transform
26
 2-D Fourier transform on polar raster sample
image f(r, θ ):
where 0≤r<R and θi
= i(2π/T) (0≤ i<T); 0≤ρ<R,
0≤φ<T. R and T are the radial frequency resolution and
angular frequency resolution respectively.
∑∑ +−=
r i
i
T
i
R
r
jrfPF )]
2
(2exp[),(),( φ
π
ρπθφρ
Khmer OCR Using Generic Fourier Descriptor Back
Normalization
27
 Translation invariant due to using centroid as
origin.
 Scale normalization is archived due to the
proportional equality:
}
|)0,0(|
|),(|
,...,
|)0,0(|
|)0,(|
,...,
|)0,0(|
|),0(|
,...,
|)0,0(|
|)1,0(|
,
|)0,0(|
{
PF
nmPF
PF
mPF
PF
nPF
PF
PF
area
PF
GFD =
Khmer OCR Using Generic Fourier Descriptor Back
Future Work
28
 Integrate GFD into the system
 Exploit the possibility to improve the accuracy by
using Classifier for Training module
Khmer OCR Using Generic Fourier Descriptor Back
Khmer OCR Using Generic Fourier Descriptor Back
Thank for your attention !!!
Reference
30
Khmer OCR Using Generic Fourier Descriptor Back
[1] V. Kruy. Preliminary Experiment on Khmer OCR. Kameyama Laboratory,
Waseda Univerisy, Japan.
[2] Thesis for master degree, Khmer OCR, Vanna Kruy.
[3] D. Zhang and G. Lu. Shape-based image retrieval using generic Fourier descriptor.
Gippsland School of Computing and Information
Technology. Monash University. Churchill, Victoria 3842, Australia.
[4] Thesis for Doctoral Degree, chapter 6: Generic Fourier Descriptor, Dengsheng
Zhang.
[5] J.C.Rupe. Vision-Based Hand Shape Identification for Sign Language
Recognition. Department of Computer Engineering Kate Gleason College of
Engineering Rochester Institute of Technology Rochester, NY.
Reference
31
Khmer OCR Using Generic Fourier Descriptor Back
[6] D. Dimov. A polar-Fourier-Wavelet’s Transform for Effective CBIR. 3rd
ADBIS
workshop on Data mining & Knowledge Discovery
[7] I. Lengieng, K. Sochenda and C. Sokhour. , Khmer OCR for Limon R1 Size 22
Report, PAN Localization Cambodia (PLC) of IDRC.er OCR
[8] A. Averbuch, R.R. Coifmany , D.L. Donohoz M. Eladx M. Israeli. Fast and
Accurate Polar Fourier Transform.
Department of Computer Science, Tel-Aviv University, Tel-Aviv 69978, Israel.
Department of mathematics, Yale University, New Haven CT 06520-8283 USA
Department of Statistics, Stanford University, Stanford 94305-9025 CA. USA.

More Related Content

Viewers also liked (6)

cs15 slide_v2
cs15 slide_v2cs15 slide_v2
cs15 slide_v2
 
0 tutorial contents
0 tutorial contents0 tutorial contents
0 tutorial contents
 
8 modernization efforts
8 modernization efforts8 modernization efforts
8 modernization efforts
 
7 layout analysis
7 layout analysis7 layout analysis
7 layout analysis
 
CS_rapport_final_fr_v3_1
CS_rapport_final_fr_v3_1CS_rapport_final_fr_v3_1
CS_rapport_final_fr_v3_1
 
1 intro history
1 intro history1 intro history
1 intro history
 

Similar to Khmer ocr using gfd

A computationally efficient method to find transformed residue
A computationally efficient method to find transformed residueA computationally efficient method to find transformed residue
A computationally efficient method to find transformed residue
iaemedu
 
nilesh-Mtech-presentation
nilesh-Mtech-presentationnilesh-Mtech-presentation
nilesh-Mtech-presentation
Nilesh Heda
 
Farsi character recognition using new hybrid feature extraction methods
Farsi character recognition using new hybrid feature extraction methodsFarsi character recognition using new hybrid feature extraction methods
Farsi character recognition using new hybrid feature extraction methods
ijcseit
 
International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...
ijcseit
 

Similar to Khmer ocr using gfd (20)

Orb feature by nitin
Orb feature by nitinOrb feature by nitin
Orb feature by nitin
 
A computationally efficient method to find transformed residue
A computationally efficient method to find transformed residueA computationally efficient method to find transformed residue
A computationally efficient method to find transformed residue
 
Improvement of the Recognition Rate by Random Forest
Improvement of the Recognition Rate by Random ForestImprovement of the Recognition Rate by Random Forest
Improvement of the Recognition Rate by Random Forest
 
Improvement oh the recognition rate by random forest
Improvement oh the recognition rate by random forestImprovement oh the recognition rate by random forest
Improvement oh the recognition rate by random forest
 
Text Detection and Recognition
Text Detection and RecognitionText Detection and Recognition
Text Detection and Recognition
 
A Framework for Curved Videotext Detection and Extraction
A Framework for Curved Videotext Detection and ExtractionA Framework for Curved Videotext Detection and Extraction
A Framework for Curved Videotext Detection and Extraction
 
A Framework for Curved Videotext Detection and Extraction
A Framework for Curved Videotext Detection and ExtractionA Framework for Curved Videotext Detection and Extraction
A Framework for Curved Videotext Detection and Extraction
 
Opticalcharacter recognition
Opticalcharacter recognition Opticalcharacter recognition
Opticalcharacter recognition
 
Optical Character Recognition from Text Image
Optical Character Recognition from Text ImageOptical Character Recognition from Text Image
Optical Character Recognition from Text Image
 
Script Identification for printed document images at text-line level using DC...
Script Identification for printed document images at text-line level using DC...Script Identification for printed document images at text-line level using DC...
Script Identification for printed document images at text-line level using DC...
 
Assignment-1-NF.docx
Assignment-1-NF.docxAssignment-1-NF.docx
Assignment-1-NF.docx
 
Offline Omni Font Arabic Optical Text Recognition System using Prolog Classif...
Offline Omni Font Arabic Optical Text Recognition System using Prolog Classif...Offline Omni Font Arabic Optical Text Recognition System using Prolog Classif...
Offline Omni Font Arabic Optical Text Recognition System using Prolog Classif...
 
Aocr Hmm Presentation
Aocr Hmm PresentationAocr Hmm Presentation
Aocr Hmm Presentation
 
Deblurring, Localization and Geometry Correction of 2D QR Bar Codes Using Ric...
Deblurring, Localization and Geometry Correction of 2D QR Bar Codes Using Ric...Deblurring, Localization and Geometry Correction of 2D QR Bar Codes Using Ric...
Deblurring, Localization and Geometry Correction of 2D QR Bar Codes Using Ric...
 
nilesh-Mtech-presentation
nilesh-Mtech-presentationnilesh-Mtech-presentation
nilesh-Mtech-presentation
 
Design and Description of Feature Extraction Algorithm for Old English Font
Design and Description of Feature Extraction Algorithm for Old English FontDesign and Description of Feature Extraction Algorithm for Old English Font
Design and Description of Feature Extraction Algorithm for Old English Font
 
Farsi character recognition using new hybrid feature extraction methods
Farsi character recognition using new hybrid feature extraction methodsFarsi character recognition using new hybrid feature extraction methods
Farsi character recognition using new hybrid feature extraction methods
 
International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...
 
New Approach of Preprocessing For Numeral Recognition
New Approach of Preprocessing For Numeral RecognitionNew Approach of Preprocessing For Numeral Recognition
New Approach of Preprocessing For Numeral Recognition
 
Two Methods for Recognition of Hand Written Farsi Characters
Two Methods for Recognition of Hand Written Farsi CharactersTwo Methods for Recognition of Hand Written Farsi Characters
Two Methods for Recognition of Hand Written Farsi Characters
 

Recently uploaded

Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
amitlee9823
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
amitlee9823
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
amitlee9823
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
amitlee9823
 

Recently uploaded (20)

Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 

Khmer ocr using gfd

  • 1. Mr. Sovann EN 5th Year Engineering student Dept. Computer Science & Communication Institute of Technology of Cambodia Phnom Penh, Cambodia Khmer OCR Using Generic Fourier Descriptor
  • 2. Content Khmer OCR Using Generic Fourier Descriptor 2  Introduction  Khmer OCR System  Pre-processing  Segmentation  Feature Extraction  Recognition Process  Generic Fourier Descriptor In detail  Future work
  • 3. Introduction  Optical Character Recognition (OCR) is the electronic conversion of scanned images into machine- encoded text.  OCR makes it possible to edit the text, to search for a word or a phrase, to display or to print a copy free of scanning artifacts and so on. 3 Khmer OCR Using Generic Fourier Descriptor Back
  • 4. Introduction  The ideas started prior to World War II. Throughout the years, many reliable commercial and academic prototypes have been developed in many natural languages.  Still, due to the lacked effort in Khmer OCR, there is no reliable Khmer OCR software. 4 Khmer OCR Using Generic Fourier Descriptor Back
  • 5. Introduction  The ideas started prior to World War II. Throughout the years, many reliable commercial and academic prototypes have been developed in many natural languages.  Still, due to the lacked effort in Khmer OCR, there is no reliable Khmer OCR software. 5 Khmer OCR Using Generic Fourier Descriptor Back
  • 6. Introduction  The ideas started prior to World War II. Throughout the years, many reliable commercial and academic prototypes have been developed in many natural languages.  Still, due to the lacked effort in Khmer OCR, there is no reliable Khmer OCR software. 6 Khmer OCR Using Generic Fourier Descriptor Back
  • 7. Introduction 7 Khmer OCR Using Generic Fourier Descriptor Back  The current system is based on Mr. Vanna Kruy’s work for his master degree at Waseda University.  The Objective is to produce a reliable Khmer OCR system which is independent of Font and Size.
  • 8. Khmer OCR System Khmer OCR Using Generic Fourier Descriptor  OCR System consist of four majors stages : 8  Pre-processing  Segmentation  Feature Extraction  Post processing
  • 9. Pre-processing Khmer OCR Using Generic Fourier Descriptor 9  Pre-processing aims to produce data that are easy for the OCR systems to operate accurately.  The main objectives of pre-processing are :  Binarization  Noise removal
  • 10. Binarization 10  Image linearization (thresholding) refers to the conversion of a gray-scale image into a binary image. Khmer OCR Using Generic Fourier Descriptor Binarization of Input Image
  • 11. Salt-Pepper Noise removal 11 Salt-and-pepper noise is a kind of noise which is usually caused by small unnecessary dots produced by either the scanner or the source document itself. Khmer OCR Using Generic Fourier Descriptor Back Particle removal
  • 12. Segmentation 12  Segmentation aims to produce each component to be recognized by the system.  The process is to separate the text of a page into each separate line, then to separate each line into Vertical Component, and finally produce each independent symbol. Khmer OCR Using Generic Fourier Descriptor Back
  • 13. Connected Component Analysis 13  Connected Component is a group of pixels accumulating together to form a shape.  The process is to separate the text of a page into each separate line, then to separate each line into Vertical Component, and finally produce each independent symbol. Khmer OCR Using Generic Fourier Descriptor Back
  • 14. 14 Example using CCA Khmer OCR Using Generic Fourier Descriptor Back Word Segmentation using CCA
  • 15. 15 Example using CCA Main part : Sup script : Sub script : Ccdown : Khmer OCR Using Generic Fourier Descriptor Back
  • 16. Feature extraction 16  In feature extraction stage, each character is represented as a feature vector which becomes its identity. The major goal of feature extraction is to extract a set of features which maximizes the recognition rate with the least amount of elements. Khmer OCR Using Generic Fourier Descriptor Back
  • 17. Generic Fourier Descriptor Khmer OCR Using Generic Fourier Descriptor 17  GFD is derived by applying two-dimensional Fourier transform on a polar-raster sampled shape image.  Polar-raster sampled : the image obtained after circularly sampling an object in an image up to its maximum radius.
  • 18. Example of GFD 18 Khmer OCR Using Generic Fourier Descriptor Back  The similarity between two shapes are measured by the city block distance between the two set of GFDs. ររkhmer OS UI font and its GFD
  • 19. Recognition Process 19 Khmer OCR Using Generic Fourier Descriptor Back  To recognize, we compare the GFD’s score  The score is obtain by comparing the CCs features vector with CCs feature in database.
  • 20. Recognition Process : Score 20 Khmer OCR Using Generic Fourier Descriptor Back  The similarity between two shapes is measured by the City-Block distance of the two feature vectors of the shape.  The lower value means the more similar the shapes are.
  • 21. Generic Fourier Descriptor In detail Khmer OCR Using Generic Fourier Descriptor 21  GFD is derived by applying two-dimensional Fourier transform on a polar-raster sampled shape image.  A region-base shape descriptor proposed by Dengsheng Zhang & Guojun Lu. It is confirm to outperforms common contour-based and region-based shape descriptors.
  • 22. Algorithm to computer GFD Khmer OCR Using Generic Fourier Descriptor 22  Input the binary shape image data f(x, y)  Get centroid of the shape (xc, yc)  Set the centroid as the origin  Compute Polar Fourier Transform  Calculate Fourier Descriptor  Output GFD
  • 23. Polar Transform 23  It is the transforming into polar image f(r, θ), for an input image f(x, y). c c cc xx yy yyxxr − − =−+−= arctan,)()( 22 θ ∑∑ − = − = == 1 0 1 0 1 and 1 where M y c N x c y N yx M x shapetheof(barycent)centroidofrcoordinatotheis),( cc yx Khmer OCR Using Generic Fourier Descriptor Back
  • 24. Polar Raster Grid 24  Polar raster sampling Polar image Polar raster sample image in Cartesian space Khmer OCR Using Generic Fourier Descriptor Back
  • 25. Polar Raster Grid 25  Binary polar raster sample shape images Khmer OCR Using Generic Fourier Descriptor Back Shape, shape normalization and its polar transform
  • 26. 2D- Fourier Transform 26  2-D Fourier transform on polar raster sample image f(r, θ ): where 0≤r<R and θi = i(2π/T) (0≤ i<T); 0≤ρ<R, 0≤φ<T. R and T are the radial frequency resolution and angular frequency resolution respectively. ∑∑ +−= r i i T i R r jrfPF )] 2 (2exp[),(),( φ π ρπθφρ Khmer OCR Using Generic Fourier Descriptor Back
  • 27. Normalization 27  Translation invariant due to using centroid as origin.  Scale normalization is archived due to the proportional equality: } |)0,0(| |),(| ,..., |)0,0(| |)0,(| ,..., |)0,0(| |),0(| ,..., |)0,0(| |)1,0(| , |)0,0(| { PF nmPF PF mPF PF nPF PF PF area PF GFD = Khmer OCR Using Generic Fourier Descriptor Back
  • 28. Future Work 28  Integrate GFD into the system  Exploit the possibility to improve the accuracy by using Classifier for Training module Khmer OCR Using Generic Fourier Descriptor Back
  • 29. Khmer OCR Using Generic Fourier Descriptor Back Thank for your attention !!!
  • 30. Reference 30 Khmer OCR Using Generic Fourier Descriptor Back [1] V. Kruy. Preliminary Experiment on Khmer OCR. Kameyama Laboratory, Waseda Univerisy, Japan. [2] Thesis for master degree, Khmer OCR, Vanna Kruy. [3] D. Zhang and G. Lu. Shape-based image retrieval using generic Fourier descriptor. Gippsland School of Computing and Information Technology. Monash University. Churchill, Victoria 3842, Australia. [4] Thesis for Doctoral Degree, chapter 6: Generic Fourier Descriptor, Dengsheng Zhang. [5] J.C.Rupe. Vision-Based Hand Shape Identification for Sign Language Recognition. Department of Computer Engineering Kate Gleason College of Engineering Rochester Institute of Technology Rochester, NY.
  • 31. Reference 31 Khmer OCR Using Generic Fourier Descriptor Back [6] D. Dimov. A polar-Fourier-Wavelet’s Transform for Effective CBIR. 3rd ADBIS workshop on Data mining & Knowledge Discovery [7] I. Lengieng, K. Sochenda and C. Sokhour. , Khmer OCR for Limon R1 Size 22 Report, PAN Localization Cambodia (PLC) of IDRC.er OCR [8] A. Averbuch, R.R. Coifmany , D.L. Donohoz M. Eladx M. Israeli. Fast and Accurate Polar Fourier Transform. Department of Computer Science, Tel-Aviv University, Tel-Aviv 69978, Israel. Department of mathematics, Yale University, New Haven CT 06520-8283 USA Department of Statistics, Stanford University, Stanford 94305-9025 CA. USA.