SlideShare a Scribd company logo
1 of 32
Download to read offline
Offline Handwritten Thai Character
Recognition Using Single Tier Classifier
and Local Features
Ferdin Joe John Joseph1
Panatchakorn Anantaprayoon2
2Kamnoetvidya Science Academy, Rayong
1Faculty of Information Technology, Thai – Nichi Institute of Technology, Bangkok
Introduction
Character Recognition
conversion of typewritten or handwritten text into machine-encoded
one.
http://www.cbnco.com/idsys/id-readers-eagle.php
http://cdn.iphonehacks.com/wp-
content/uploads/2013/01/mzl.bmcrwhfl.320x480-75.jpg
1
Types of Character Recognition
Offline Character Recognition
https://cs.stanford.edu/people/adityaj/HandwritingRecognition.pdf
Online Character Recognition
https://nl.pcmag.com/google-1/1723/news/handschrift-app-van-google-nu-beschikbaar
Introduction
2
Preprocessing
Input image
Feature Extraction
Collect data features
Classifier
Start
End
Classification
Class
Preprocessing
Input image
Feature Extraction
Classifier and
Validation
Start
End
Y
N
TRAINING TESTING
Character Recognition
Objective
To develop handwritten Thai alphabets recognition methods
using Local Binary Pattern (LBP)
3
Methodology
I. Experiment Design
II. Datasets Preparation
III. Coding and Testing
4
I. Experiment Design: Overall
5
1) Grayscaling,
2) 2D median filter
3) Binarization
4) Complementation
5) Edge detection
6.1) Cropping for LBP calculation, 6.2) Cropping for shape matrices calculation
LBP Shape Matrix
Classification
Class
Preprocessing
Input image
Feature Extraction
All images?
Start
End
Preprocessing
6
1) Grayscaling, 2) 2D median filter, 3) Binarization, 4) Complementation, 5) Edge detection, 6.1)
Cropping for LBP calculation, 6.2) Cropping for shape matrices calculation
Local Binary Pattern: LBP
7
LBP feature
Local Binary Pattern: LBP
8
1 image, 1 histogram!
Histogram of LBP features
LBP
20
Rotation Invariant LBP
21
1113
3
19
4
9717
6
18
8
4422
LBP
feature
The 58 different uniform
patterns in (8,R) neighborhood
http://www.codeproject.com/KB/graphics/741559/lbp5.png
Uniform LBP
22
Uniform LBP
23
https://i.stack.imgur.com/tH5jp.png
Uniform Rotation Invariant LBP
24
Unifor
m
Non-
uniform
(LBP
code=9)
Shape Matrix
9
0000001110011100101000000
Shape feature
Visual Program
10
I. Experiment Design
11
Experiment 2: Additional conditions that help to increase accuracy
Variables
• Size of shape matrix–25, 121, 144, 169, 196, 225 bits
Experiment 1: LBP types that yield the highest accuracy
Variables
• LBP types–Normal, Rotation invariant, Uniform, Uniform rotation invariant
MAIN EXPERIMENT
12
44 Thai alphabets/form
Total: 5,148 letters
(117 forms)
Used: 4,400 letters
II. Datasets
Preparation
II. Datasets Preparation
13
Naming: XX-XXX.jpg
Class
(00,01,…,44) Sample No.
(001,002,…)
III. Coding/Testing
14
Query Matching
1 1 0 1
0 0 0 0
1 1 1 1
Tested feature
Sample features
1 1 0 1
0 0 1 0
1
2
p1 p2 p3 p4
q1 q2 q3 q4
q1 q2 q3 q4
(𝒑𝒑𝒊𝒊 − 𝒒𝒒𝒊𝒊)𝟐𝟐
:
(𝒑𝒑𝒊𝒊 − 𝒒𝒒𝒊𝒊)𝟐𝟐
:
25
Query Matching
0 0 0 0
1 1 1 1
Sample features
1 1 0 1
0 0 1 0
3
1
Euclidean distance
1
2
26
Results & Conclusion
 Experiment 1: LBP types
 Experiment 2: Additional conditions
15
Experiment 1: LBP types
16
Number Type of LBP / bins Accuracy (%)
1 Normal LBP/ 64 68.05
2 Rotation invariant LBP/ 36 68.89
3 Uniform LBP / 59 68.96
4 Uniform rotation invariant LBP/ 10 67.42
Comparison with other Methods
S.No Methodology Classification Accuracy Classification basis of characters
1 [3] Ant Miner algorithm 82.7% Grouping
2 [2] Genetic Algorithm NA Grouping
3 [4] Single stage classifier with global features alone 54.61% Grouping
4 [4] Single stage classifier with global and local features 78.89% Grouping
5 Proposed Methodology using 64 bin LBP 68.82% Individual
6 Proposed Methodology using 59 bin LBP 68.96% Individual
Advantages over Existing Methodologies Listed
Paper: C. Pornpanomchai, V. Wongsawangtham, S.
Jeungudomporn, and N. Chatsumpun, “Thai Handwritten
Character Recognition by Genetic Algorithm (THCRGA),” Int. J.
Eng. Technol., vol. 3, no. 2, pp. 148–153, 2011.
• Ours is offline character recognition
• Dataset used is not available
• Genetic Algorithm is slower for 44 classes
• Feature set indices are not mentioned
Advantages over Existing Methodologies Listed
• Paper: P. Phokharatkul, K. Sankhuangaw, S. Somkuarnpanit, S.
Phaiboon, and C. Kimpan, “Off-Line Hand Written Thai Character
Recognition using Ant-Miner Algorithm,” Int. J. Comput. Electr.
Autom. Control Inf. Eng., vol. 8, no. 1, pp. 276–281, 2005.
• Dataset used is not available currently
• Ant Miner was slow when we tried with some matlab code on our dataset
• Grouped only to 5 classes but our proposed methodology used 44 classes
• Offline but the sampling methodology is not available
Advantages over Existing Methodologies Listed
• Paper: I. Methasate, S. Marukatat, S. Sae-Tang, and T.
Theeramunkong, “The feature combination technique for off-line Thai
character recognition system,” in Proceedings of the International
Conference on Document Analysis and Recognition, ICDAR, 2005, vol.
2005, pp. 1006–1009..
• Methodology seems impressive but classifying criteria is not same as
our proposed methodology
• Grouped only to 20 classes but our proposed methodology used 44
classes
• Similarly looking characters false negative is taken as correct
classification
Experiment 2: Additional conditions
17
Variable
Size of shape matrix–5x5, 11x11, 12x12, 13x13, 14x14, 15x15 bits
Results
12x12 > 11x11 > 13x13 > 14x14 > 15x15 > 5x5 bits
Future work
• Train with more datasets and include vowels and intonation
• Try other classifiers e.g. SVM, Neural Network and Deep
Learning
18
References
[1] “Thai Language,” Wikipedia. [Online]. Available: https://en.wikipedia.org/wiki/Thai_language.
[Accessed: 04-Nov-2016].
[2] C. Pornpanomchai, V. Wongsawangtham, S. Jeungudomporn, and N. Chatsumpun, “Thai
Handwritten Character Recognition by Genetic Algorithm (THCRGA),” Int. J. Eng. Technol., vol. 3, no.
2, pp. 148–153, 2011.
[3] P. Phokharatkul, K. Sankhuangaw, S. Somkuarnpanit, S. Phaiboon, and C. Kimpan, “Off-Line
Hand Written Thai Character Recognition using Ant-Miner Algorithm,” Int. J. Comput. Electr. Autom.
Control Inf. Eng., vol. 8, no. 1, pp. 276–281, 2005.
[4] I. Methasate, S. Marukatat, S. Sae-Tang, and T. Theeramunkong, “The feature combination
technique for off-line Thai character recognition system,” in Proceedings of the International
Conference on Document Analysis and Recognition, ICDAR, 2005, vol. 2005, pp. 1006–1009.
[5] F. J. John Joseph and S. Auwatanamongkol, “A crowding multi-objective genetic algorithm for
image parsing,” Neural Comput. Appl., vol. 27, no. 8, pp. 2217–2227, 2016.
[6] T. Ojala, M. Pietikainen, and D. Harwood, “A Comparative Study of Texture Measures with
Classification Based on Feature Distributions,” Pattern Recognit., vol. 29, no. 1, pp. 51–59, 1996.
[7] P. Anantaprayoon, F. J. John Joseph, and S. Marukatat, “Local Feature Based Offline
Handwritten Thai Character Recognition,” in Proceedings of Thai Japan Student ICT Fair, 2016, p. 41. 19

More Related Content

What's hot

Parameters Optimization for Improving ASR Performance in Adverse Real World N...
Parameters Optimization for Improving ASR Performance in Adverse Real World N...Parameters Optimization for Improving ASR Performance in Adverse Real World N...
Parameters Optimization for Improving ASR Performance in Adverse Real World N...
Waqas Tariq
 
Optical Character Recognition System for Urdu (Naskh Font)Using Pattern Match...
Optical Character Recognition System for Urdu (Naskh Font)Using Pattern Match...Optical Character Recognition System for Urdu (Naskh Font)Using Pattern Match...
Optical Character Recognition System for Urdu (Naskh Font)Using Pattern Match...
CSCJournals
 
The Heuristic Extraction Algorithms for Freeman Chain Code of Handwritten Cha...
The Heuristic Extraction Algorithms for Freeman Chain Code of Handwritten Cha...The Heuristic Extraction Algorithms for Freeman Chain Code of Handwritten Cha...
The Heuristic Extraction Algorithms for Freeman Chain Code of Handwritten Cha...
Waqas Tariq
 
offline character recognition for handwritten gujarati text
offline character recognition for handwritten gujarati textoffline character recognition for handwritten gujarati text
offline character recognition for handwritten gujarati text
Bhumika Patel
 
A Dialogue System for Telugu, a Resource-Poor Language
A Dialogue System for Telugu, a Resource-Poor LanguageA Dialogue System for Telugu, a Resource-Poor Language
A Dialogue System for Telugu, a Resource-Poor Language
Sravanthi Mullapudi
 
MACHINE LEARNING ALGORITHMS FOR MYANMAR NEWS CLASSIFICATION
MACHINE LEARNING ALGORITHMS FOR MYANMAR NEWS CLASSIFICATIONMACHINE LEARNING ALGORITHMS FOR MYANMAR NEWS CLASSIFICATION
MACHINE LEARNING ALGORITHMS FOR MYANMAR NEWS CLASSIFICATION
ijnlc
 

What's hot (14)

Text Detection and Recognition: A Review
Text Detection and Recognition: A ReviewText Detection and Recognition: A Review
Text Detection and Recognition: A Review
 
Parameters Optimization for Improving ASR Performance in Adverse Real World N...
Parameters Optimization for Improving ASR Performance in Adverse Real World N...Parameters Optimization for Improving ASR Performance in Adverse Real World N...
Parameters Optimization for Improving ASR Performance in Adverse Real World N...
 
Optical Character Recognition System for Urdu (Naskh Font)Using Pattern Match...
Optical Character Recognition System for Urdu (Naskh Font)Using Pattern Match...Optical Character Recognition System for Urdu (Naskh Font)Using Pattern Match...
Optical Character Recognition System for Urdu (Naskh Font)Using Pattern Match...
 
The Heuristic Extraction Algorithms for Freeman Chain Code of Handwritten Cha...
The Heuristic Extraction Algorithms for Freeman Chain Code of Handwritten Cha...The Heuristic Extraction Algorithms for Freeman Chain Code of Handwritten Cha...
The Heuristic Extraction Algorithms for Freeman Chain Code of Handwritten Cha...
 
HIDDEN MARKOV MODEL BASED NAMED ENTITY RECOGNITION TOOL
HIDDEN MARKOV MODEL BASED NAMED ENTITY RECOGNITION TOOLHIDDEN MARKOV MODEL BASED NAMED ENTITY RECOGNITION TOOL
HIDDEN MARKOV MODEL BASED NAMED ENTITY RECOGNITION TOOL
 
Human Being Character Analysis from Their Social Networking Profiles
Human Being Character Analysis from Their Social Networking ProfilesHuman Being Character Analysis from Their Social Networking Profiles
Human Being Character Analysis from Their Social Networking Profiles
 
offline character recognition for handwritten gujarati text
offline character recognition for handwritten gujarati textoffline character recognition for handwritten gujarati text
offline character recognition for handwritten gujarati text
 
A Dialogue System for Telugu, a Resource-Poor Language
A Dialogue System for Telugu, a Resource-Poor LanguageA Dialogue System for Telugu, a Resource-Poor Language
A Dialogue System for Telugu, a Resource-Poor Language
 
MACHINE LEARNING ALGORITHMS FOR MYANMAR NEWS CLASSIFICATION
MACHINE LEARNING ALGORITHMS FOR MYANMAR NEWS CLASSIFICATIONMACHINE LEARNING ALGORITHMS FOR MYANMAR NEWS CLASSIFICATION
MACHINE LEARNING ALGORITHMS FOR MYANMAR NEWS CLASSIFICATION
 
Topic modeling
Topic modelingTopic modeling
Topic modeling
 
Intrinsic and Extrinsic Evaluations of Word Embeddings
Intrinsic and Extrinsic Evaluations of Word EmbeddingsIntrinsic and Extrinsic Evaluations of Word Embeddings
Intrinsic and Extrinsic Evaluations of Word Embeddings
 
QUESTION ANALYSIS FOR ARABIC QUESTION ANSWERING SYSTEMS
QUESTION ANALYSIS FOR ARABIC QUESTION ANSWERING SYSTEMS QUESTION ANALYSIS FOR ARABIC QUESTION ANSWERING SYSTEMS
QUESTION ANALYSIS FOR ARABIC QUESTION ANSWERING SYSTEMS
 
Method-Level Code Clone Modification using Refactoring Techniques for Clone M...
Method-Level Code Clone Modification using Refactoring Techniques for Clone M...Method-Level Code Clone Modification using Refactoring Techniques for Clone M...
Method-Level Code Clone Modification using Refactoring Techniques for Clone M...
 
A Comprehensive Study On Handwritten Character Recognition System
A Comprehensive Study On Handwritten Character Recognition SystemA Comprehensive Study On Handwritten Character Recognition System
A Comprehensive Study On Handwritten Character Recognition System
 

Similar to Offline Handwritten Thai Character Recognition Using Single Tier Classifier and Local Features

Filtering out improper user accounts from twitter user accounts for discoveri...
Filtering out improper user accounts from twitter user accounts for discoveri...Filtering out improper user accounts from twitter user accounts for discoveri...
Filtering out improper user accounts from twitter user accounts for discoveri...
siramatu-lab
 
Detecting cyberbullying text using the approaches with machine learning model...
Detecting cyberbullying text using the approaches with machine learning model...Detecting cyberbullying text using the approaches with machine learning model...
Detecting cyberbullying text using the approaches with machine learning model...
IAESIJAI
 
OCR-THE 3 LAYERED APPROACH FOR CLASSIFICATION AND IDENTIFICATION OF TELUGU HA...
OCR-THE 3 LAYERED APPROACH FOR CLASSIFICATION AND IDENTIFICATION OF TELUGU HA...OCR-THE 3 LAYERED APPROACH FOR CLASSIFICATION AND IDENTIFICATION OF TELUGU HA...
OCR-THE 3 LAYERED APPROACH FOR CLASSIFICATION AND IDENTIFICATION OF TELUGU HA...
cscpconf
 
OCR-THE 3 LAYERED APPROACH FOR CLASSIFICATION AND IDENTIFICATION OF TELUGU HA...
OCR-THE 3 LAYERED APPROACH FOR CLASSIFICATION AND IDENTIFICATION OF TELUGU HA...OCR-THE 3 LAYERED APPROACH FOR CLASSIFICATION AND IDENTIFICATION OF TELUGU HA...
OCR-THE 3 LAYERED APPROACH FOR CLASSIFICATION AND IDENTIFICATION OF TELUGU HA...
csandit
 
Project report - Bengali digit recongnition using SVM
Project report - Bengali digit recongnition using SVMProject report - Bengali digit recongnition using SVM
Project report - Bengali digit recongnition using SVM
Mohammad Saiful Islam
 
Performance analysis of linkage learning techniques in genetic algorithms
Performance analysis of linkage learning techniques in genetic algorithmsPerformance analysis of linkage learning techniques in genetic algorithms
Performance analysis of linkage learning techniques in genetic algorithms
eSAT Journals
 

Similar to Offline Handwritten Thai Character Recognition Using Single Tier Classifier and Local Features (20)

Character Recognition using Data Mining Technique (Artificial Neural Network)
Character Recognition using Data Mining Technique (Artificial Neural Network)Character Recognition using Data Mining Technique (Artificial Neural Network)
Character Recognition using Data Mining Technique (Artificial Neural Network)
 
Bangla_handwritten_dig1] final proposal .pdf
Bangla_handwritten_dig1] final proposal .pdfBangla_handwritten_dig1] final proposal .pdf
Bangla_handwritten_dig1] final proposal .pdf
 
STRUCTURAL FEATURES FOR RECOGNITION OF HAND WRITTEN KANNADA CHARACTER BASED O...
STRUCTURAL FEATURES FOR RECOGNITION OF HAND WRITTEN KANNADA CHARACTER BASED O...STRUCTURAL FEATURES FOR RECOGNITION OF HAND WRITTEN KANNADA CHARACTER BASED O...
STRUCTURAL FEATURES FOR RECOGNITION OF HAND WRITTEN KANNADA CHARACTER BASED O...
 
STRUCTURAL FEATURES FOR RECOGNITION OF HAND WRITTEN KANNADA CHARACTER BASED O...
STRUCTURAL FEATURES FOR RECOGNITION OF HAND WRITTEN KANNADA CHARACTER BASED O...STRUCTURAL FEATURES FOR RECOGNITION OF HAND WRITTEN KANNADA CHARACTER BASED O...
STRUCTURAL FEATURES FOR RECOGNITION OF HAND WRITTEN KANNADA CHARACTER BASED O...
 
STRUCTURAL FEATURES FOR RECOGNITION OF HAND WRITTEN KANNADA CHARACTER BASED O...
STRUCTURAL FEATURES FOR RECOGNITION OF HAND WRITTEN KANNADA CHARACTER BASED O...STRUCTURAL FEATURES FOR RECOGNITION OF HAND WRITTEN KANNADA CHARACTER BASED O...
STRUCTURAL FEATURES FOR RECOGNITION OF HAND WRITTEN KANNADA CHARACTER BASED O...
 
Pattern Recognition using Artificial Neural Network
Pattern Recognition using Artificial Neural NetworkPattern Recognition using Artificial Neural Network
Pattern Recognition using Artificial Neural Network
 
Survey On Broken and Joint Devanagari Handwritten Characters Recognition Usin...
Survey On Broken and Joint Devanagari Handwritten Characters Recognition Usin...Survey On Broken and Joint Devanagari Handwritten Characters Recognition Usin...
Survey On Broken and Joint Devanagari Handwritten Characters Recognition Usin...
 
A Novel Framework For Numerical Character Recognition With Zoning Distance Fe...
A Novel Framework For Numerical Character Recognition With Zoning Distance Fe...A Novel Framework For Numerical Character Recognition With Zoning Distance Fe...
A Novel Framework For Numerical Character Recognition With Zoning Distance Fe...
 
Filtering out improper user accounts from twitter user accounts for discoveri...
Filtering out improper user accounts from twitter user accounts for discoveri...Filtering out improper user accounts from twitter user accounts for discoveri...
Filtering out improper user accounts from twitter user accounts for discoveri...
 
Detecting cyberbullying text using the approaches with machine learning model...
Detecting cyberbullying text using the approaches with machine learning model...Detecting cyberbullying text using the approaches with machine learning model...
Detecting cyberbullying text using the approaches with machine learning model...
 
An Optical Character Recognition for Handwritten Devanagari Script
An Optical Character Recognition for Handwritten Devanagari ScriptAn Optical Character Recognition for Handwritten Devanagari Script
An Optical Character Recognition for Handwritten Devanagari Script
 
OCR-THE 3 LAYERED APPROACH FOR CLASSIFICATION AND IDENTIFICATION OF TELUGU HA...
OCR-THE 3 LAYERED APPROACH FOR CLASSIFICATION AND IDENTIFICATION OF TELUGU HA...OCR-THE 3 LAYERED APPROACH FOR CLASSIFICATION AND IDENTIFICATION OF TELUGU HA...
OCR-THE 3 LAYERED APPROACH FOR CLASSIFICATION AND IDENTIFICATION OF TELUGU HA...
 
OCR-THE 3 LAYERED APPROACH FOR CLASSIFICATION AND IDENTIFICATION OF TELUGU HA...
OCR-THE 3 LAYERED APPROACH FOR CLASSIFICATION AND IDENTIFICATION OF TELUGU HA...OCR-THE 3 LAYERED APPROACH FOR CLASSIFICATION AND IDENTIFICATION OF TELUGU HA...
OCR-THE 3 LAYERED APPROACH FOR CLASSIFICATION AND IDENTIFICATION OF TELUGU HA...
 
06522405
0652240506522405
06522405
 
Filter Based Approach for Genomic Feature Set Selection (FBA-GFS)
Filter Based Approach for Genomic Feature Set Selection (FBA-GFS)Filter Based Approach for Genomic Feature Set Selection (FBA-GFS)
Filter Based Approach for Genomic Feature Set Selection (FBA-GFS)
 
Filter Based Approach for Genomic Feature Set Selection (FBA-GFS)
Filter Based Approach for Genomic Feature Set Selection (FBA-GFS)Filter Based Approach for Genomic Feature Set Selection (FBA-GFS)
Filter Based Approach for Genomic Feature Set Selection (FBA-GFS)
 
Do characters abuse more than words?
Do characters abuse more than words?Do characters abuse more than words?
Do characters abuse more than words?
 
Project report - Bengali digit recongnition using SVM
Project report - Bengali digit recongnition using SVMProject report - Bengali digit recongnition using SVM
Project report - Bengali digit recongnition using SVM
 
ICSE20_Tao_slides.pptx
ICSE20_Tao_slides.pptxICSE20_Tao_slides.pptx
ICSE20_Tao_slides.pptx
 
Performance analysis of linkage learning techniques in genetic algorithms
Performance analysis of linkage learning techniques in genetic algorithmsPerformance analysis of linkage learning techniques in genetic algorithms
Performance analysis of linkage learning techniques in genetic algorithms
 

More from Ferdin Joe John Joseph PhD

More from Ferdin Joe John Joseph PhD (20)

Invited Talk DGTiCon 2022
Invited Talk DGTiCon 2022Invited Talk DGTiCon 2022
Invited Talk DGTiCon 2022
 
Week 12: Cloud AI- DSA 441 Cloud Computing
Week 12: Cloud AI- DSA 441 Cloud ComputingWeek 12: Cloud AI- DSA 441 Cloud Computing
Week 12: Cloud AI- DSA 441 Cloud Computing
 
Week 11: Cloud Native- DSA 441 Cloud Computing
Week 11: Cloud Native- DSA 441 Cloud ComputingWeek 11: Cloud Native- DSA 441 Cloud Computing
Week 11: Cloud Native- DSA 441 Cloud Computing
 
Week 10: Cloud Security- DSA 441 Cloud Computing
Week 10: Cloud Security- DSA 441 Cloud ComputingWeek 10: Cloud Security- DSA 441 Cloud Computing
Week 10: Cloud Security- DSA 441 Cloud Computing
 
Week 9: Relational Database Service Alibaba Cloud- DSA 441 Cloud Computing
Week 9: Relational Database Service Alibaba Cloud- DSA 441 Cloud ComputingWeek 9: Relational Database Service Alibaba Cloud- DSA 441 Cloud Computing
Week 9: Relational Database Service Alibaba Cloud- DSA 441 Cloud Computing
 
Week 7: Object Storage Service Alibaba Cloud- DSA 441 Cloud Computing
Week 7: Object Storage Service Alibaba Cloud- DSA 441 Cloud ComputingWeek 7: Object Storage Service Alibaba Cloud- DSA 441 Cloud Computing
Week 7: Object Storage Service Alibaba Cloud- DSA 441 Cloud Computing
 
Week 6: Server Load Balancer and Auto Scaling Alibaba Cloud- DSA 441 Cloud Co...
Week 6: Server Load Balancer and Auto Scaling Alibaba Cloud- DSA 441 Cloud Co...Week 6: Server Load Balancer and Auto Scaling Alibaba Cloud- DSA 441 Cloud Co...
Week 6: Server Load Balancer and Auto Scaling Alibaba Cloud- DSA 441 Cloud Co...
 
Week 5: Elastic Compute Service (ECS) with Alibaba Cloud- DSA 441 Cloud Compu...
Week 5: Elastic Compute Service (ECS) with Alibaba Cloud- DSA 441 Cloud Compu...Week 5: Elastic Compute Service (ECS) with Alibaba Cloud- DSA 441 Cloud Compu...
Week 5: Elastic Compute Service (ECS) with Alibaba Cloud- DSA 441 Cloud Compu...
 
Week 4: Big Data and Hadoop in Alibaba Cloud - DSA 441 Cloud Computing
Week 4: Big Data and Hadoop in Alibaba Cloud - DSA 441 Cloud ComputingWeek 4: Big Data and Hadoop in Alibaba Cloud - DSA 441 Cloud Computing
Week 4: Big Data and Hadoop in Alibaba Cloud - DSA 441 Cloud Computing
 
Week 3: Virtual Private Cloud, On Premise, IaaS, PaaS, SaaS - DSA 441 Cloud C...
Week 3: Virtual Private Cloud, On Premise, IaaS, PaaS, SaaS - DSA 441 Cloud C...Week 3: Virtual Private Cloud, On Premise, IaaS, PaaS, SaaS - DSA 441 Cloud C...
Week 3: Virtual Private Cloud, On Premise, IaaS, PaaS, SaaS - DSA 441 Cloud C...
 
Week 2: Virtualization and VM Ware - DSA 441 Cloud Computing
Week 2: Virtualization and VM Ware - DSA 441 Cloud ComputingWeek 2: Virtualization and VM Ware - DSA 441 Cloud Computing
Week 2: Virtualization and VM Ware - DSA 441 Cloud Computing
 
Week 1: Introduction to Cloud Computing - DSA 441 Cloud Computing
Week 1: Introduction to Cloud Computing - DSA 441 Cloud ComputingWeek 1: Introduction to Cloud Computing - DSA 441 Cloud Computing
Week 1: Introduction to Cloud Computing - DSA 441 Cloud Computing
 
Sept 6 2021 BTech Artificial Intelligence and Data Science curriculum
Sept 6 2021 BTech Artificial Intelligence and Data Science curriculumSept 6 2021 BTech Artificial Intelligence and Data Science curriculum
Sept 6 2021 BTech Artificial Intelligence and Data Science curriculum
 
Hadoop in Alibaba Cloud
Hadoop in Alibaba CloudHadoop in Alibaba Cloud
Hadoop in Alibaba Cloud
 
Cloud Computing Essentials in Alibaba Cloud
Cloud Computing Essentials in Alibaba CloudCloud Computing Essentials in Alibaba Cloud
Cloud Computing Essentials in Alibaba Cloud
 
Transforming deep into transformers – a computer vision approach
Transforming deep into transformers – a computer vision approachTransforming deep into transformers – a computer vision approach
Transforming deep into transformers – a computer vision approach
 
Week 11: Programming for Data Analysis
Week 11: Programming for Data AnalysisWeek 11: Programming for Data Analysis
Week 11: Programming for Data Analysis
 
Week 10: Programming for Data Analysis
Week 10: Programming for Data AnalysisWeek 10: Programming for Data Analysis
Week 10: Programming for Data Analysis
 
Week 9: Programming for Data Analysis
Week 9: Programming for Data AnalysisWeek 9: Programming for Data Analysis
Week 9: Programming for Data Analysis
 
Week 8: Programming for Data Analysis
Week 8: Programming for Data AnalysisWeek 8: Programming for Data Analysis
Week 8: Programming for Data Analysis
 

Recently uploaded

原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证
pwgnohujw
 
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
acoha1
 
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
zifhagzkk
 
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
Amil baba
 
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
acoha1
 
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
fztigerwe
 
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
ju0dztxtn
 
obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...
obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...
obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...
yulianti213969
 
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
jk0tkvfv
 
edited gordis ebook sixth edition david d.pdf
edited gordis ebook sixth edition david d.pdfedited gordis ebook sixth edition david d.pdf
edited gordis ebook sixth edition david d.pdf
great91
 

Recently uploaded (20)

Seven tools of quality control.slideshare
Seven tools of quality control.slideshareSeven tools of quality control.slideshare
Seven tools of quality control.slideshare
 
原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证
 
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
 
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
 
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
 
Sensing the Future: Anomaly Detection and Event Prediction in Sensor Networks
Sensing the Future: Anomaly Detection and Event Prediction in Sensor NetworksSensing the Future: Anomaly Detection and Event Prediction in Sensor Networks
Sensing the Future: Anomaly Detection and Event Prediction in Sensor Networks
 
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
 
NOAM AAUG Adobe Summit 2024: Summit Slam Dunks
NOAM AAUG Adobe Summit 2024: Summit Slam DunksNOAM AAUG Adobe Summit 2024: Summit Slam Dunks
NOAM AAUG Adobe Summit 2024: Summit Slam Dunks
 
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
 
MATERI MANAJEMEN OF PENYAKIT TETANUS.ppt
MATERI  MANAJEMEN OF PENYAKIT TETANUS.pptMATERI  MANAJEMEN OF PENYAKIT TETANUS.ppt
MATERI MANAJEMEN OF PENYAKIT TETANUS.ppt
 
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
 
The Significance of Transliteration Enhancing
The Significance of Transliteration EnhancingThe Significance of Transliteration Enhancing
The Significance of Transliteration Enhancing
 
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
 
Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...
Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...
Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...
 
obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...
obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...
obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...
 
Northern New England Tableau User Group (TUG) May 2024
Northern New England Tableau User Group (TUG) May 2024Northern New England Tableau User Group (TUG) May 2024
Northern New England Tableau User Group (TUG) May 2024
 
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
 
Predictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting TechniquesPredictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting Techniques
 
edited gordis ebook sixth edition david d.pdf
edited gordis ebook sixth edition david d.pdfedited gordis ebook sixth edition david d.pdf
edited gordis ebook sixth edition david d.pdf
 
Formulas dax para power bI de microsoft.pdf
Formulas dax para power bI de microsoft.pdfFormulas dax para power bI de microsoft.pdf
Formulas dax para power bI de microsoft.pdf
 

Offline Handwritten Thai Character Recognition Using Single Tier Classifier and Local Features

  • 1. Offline Handwritten Thai Character Recognition Using Single Tier Classifier and Local Features Ferdin Joe John Joseph1 Panatchakorn Anantaprayoon2 2Kamnoetvidya Science Academy, Rayong 1Faculty of Information Technology, Thai – Nichi Institute of Technology, Bangkok
  • 2. Introduction Character Recognition conversion of typewritten or handwritten text into machine-encoded one. http://www.cbnco.com/idsys/id-readers-eagle.php http://cdn.iphonehacks.com/wp- content/uploads/2013/01/mzl.bmcrwhfl.320x480-75.jpg 1
  • 3. Types of Character Recognition Offline Character Recognition https://cs.stanford.edu/people/adityaj/HandwritingRecognition.pdf Online Character Recognition https://nl.pcmag.com/google-1/1723/news/handschrift-app-van-google-nu-beschikbaar
  • 4. Introduction 2 Preprocessing Input image Feature Extraction Collect data features Classifier Start End Classification Class Preprocessing Input image Feature Extraction Classifier and Validation Start End Y N TRAINING TESTING Character Recognition
  • 5. Objective To develop handwritten Thai alphabets recognition methods using Local Binary Pattern (LBP) 3
  • 6. Methodology I. Experiment Design II. Datasets Preparation III. Coding and Testing 4
  • 7. I. Experiment Design: Overall 5 1) Grayscaling, 2) 2D median filter 3) Binarization 4) Complementation 5) Edge detection 6.1) Cropping for LBP calculation, 6.2) Cropping for shape matrices calculation LBP Shape Matrix Classification Class Preprocessing Input image Feature Extraction All images? Start End
  • 8. Preprocessing 6 1) Grayscaling, 2) 2D median filter, 3) Binarization, 4) Complementation, 5) Edge detection, 6.1) Cropping for LBP calculation, 6.2) Cropping for shape matrices calculation
  • 9. Local Binary Pattern: LBP 7 LBP feature
  • 10. Local Binary Pattern: LBP 8 1 image, 1 histogram! Histogram of LBP features
  • 13. The 58 different uniform patterns in (8,R) neighborhood http://www.codeproject.com/KB/graphics/741559/lbp5.png Uniform LBP 22
  • 15. Uniform Rotation Invariant LBP 24 Unifor m Non- uniform (LBP code=9)
  • 18. I. Experiment Design 11 Experiment 2: Additional conditions that help to increase accuracy Variables • Size of shape matrix–25, 121, 144, 169, 196, 225 bits Experiment 1: LBP types that yield the highest accuracy Variables • LBP types–Normal, Rotation invariant, Uniform, Uniform rotation invariant MAIN EXPERIMENT
  • 19. 12 44 Thai alphabets/form Total: 5,148 letters (117 forms) Used: 4,400 letters II. Datasets Preparation
  • 20. II. Datasets Preparation 13 Naming: XX-XXX.jpg Class (00,01,…,44) Sample No. (001,002,…)
  • 22. Query Matching 1 1 0 1 0 0 0 0 1 1 1 1 Tested feature Sample features 1 1 0 1 0 0 1 0 1 2 p1 p2 p3 p4 q1 q2 q3 q4 q1 q2 q3 q4 (𝒑𝒑𝒊𝒊 − 𝒒𝒒𝒊𝒊)𝟐𝟐 : (𝒑𝒑𝒊𝒊 − 𝒒𝒒𝒊𝒊)𝟐𝟐 : 25
  • 23. Query Matching 0 0 0 0 1 1 1 1 Sample features 1 1 0 1 0 0 1 0 3 1 Euclidean distance 1 2 26
  • 24. Results & Conclusion  Experiment 1: LBP types  Experiment 2: Additional conditions 15
  • 25. Experiment 1: LBP types 16 Number Type of LBP / bins Accuracy (%) 1 Normal LBP/ 64 68.05 2 Rotation invariant LBP/ 36 68.89 3 Uniform LBP / 59 68.96 4 Uniform rotation invariant LBP/ 10 67.42
  • 26. Comparison with other Methods S.No Methodology Classification Accuracy Classification basis of characters 1 [3] Ant Miner algorithm 82.7% Grouping 2 [2] Genetic Algorithm NA Grouping 3 [4] Single stage classifier with global features alone 54.61% Grouping 4 [4] Single stage classifier with global and local features 78.89% Grouping 5 Proposed Methodology using 64 bin LBP 68.82% Individual 6 Proposed Methodology using 59 bin LBP 68.96% Individual
  • 27. Advantages over Existing Methodologies Listed Paper: C. Pornpanomchai, V. Wongsawangtham, S. Jeungudomporn, and N. Chatsumpun, “Thai Handwritten Character Recognition by Genetic Algorithm (THCRGA),” Int. J. Eng. Technol., vol. 3, no. 2, pp. 148–153, 2011. • Ours is offline character recognition • Dataset used is not available • Genetic Algorithm is slower for 44 classes • Feature set indices are not mentioned
  • 28. Advantages over Existing Methodologies Listed • Paper: P. Phokharatkul, K. Sankhuangaw, S. Somkuarnpanit, S. Phaiboon, and C. Kimpan, “Off-Line Hand Written Thai Character Recognition using Ant-Miner Algorithm,” Int. J. Comput. Electr. Autom. Control Inf. Eng., vol. 8, no. 1, pp. 276–281, 2005. • Dataset used is not available currently • Ant Miner was slow when we tried with some matlab code on our dataset • Grouped only to 5 classes but our proposed methodology used 44 classes • Offline but the sampling methodology is not available
  • 29. Advantages over Existing Methodologies Listed • Paper: I. Methasate, S. Marukatat, S. Sae-Tang, and T. Theeramunkong, “The feature combination technique for off-line Thai character recognition system,” in Proceedings of the International Conference on Document Analysis and Recognition, ICDAR, 2005, vol. 2005, pp. 1006–1009.. • Methodology seems impressive but classifying criteria is not same as our proposed methodology • Grouped only to 20 classes but our proposed methodology used 44 classes • Similarly looking characters false negative is taken as correct classification
  • 30. Experiment 2: Additional conditions 17 Variable Size of shape matrix–5x5, 11x11, 12x12, 13x13, 14x14, 15x15 bits Results 12x12 > 11x11 > 13x13 > 14x14 > 15x15 > 5x5 bits
  • 31. Future work • Train with more datasets and include vowels and intonation • Try other classifiers e.g. SVM, Neural Network and Deep Learning 18
  • 32. References [1] “Thai Language,” Wikipedia. [Online]. Available: https://en.wikipedia.org/wiki/Thai_language. [Accessed: 04-Nov-2016]. [2] C. Pornpanomchai, V. Wongsawangtham, S. Jeungudomporn, and N. Chatsumpun, “Thai Handwritten Character Recognition by Genetic Algorithm (THCRGA),” Int. J. Eng. Technol., vol. 3, no. 2, pp. 148–153, 2011. [3] P. Phokharatkul, K. Sankhuangaw, S. Somkuarnpanit, S. Phaiboon, and C. Kimpan, “Off-Line Hand Written Thai Character Recognition using Ant-Miner Algorithm,” Int. J. Comput. Electr. Autom. Control Inf. Eng., vol. 8, no. 1, pp. 276–281, 2005. [4] I. Methasate, S. Marukatat, S. Sae-Tang, and T. Theeramunkong, “The feature combination technique for off-line Thai character recognition system,” in Proceedings of the International Conference on Document Analysis and Recognition, ICDAR, 2005, vol. 2005, pp. 1006–1009. [5] F. J. John Joseph and S. Auwatanamongkol, “A crowding multi-objective genetic algorithm for image parsing,” Neural Comput. Appl., vol. 27, no. 8, pp. 2217–2227, 2016. [6] T. Ojala, M. Pietikainen, and D. Harwood, “A Comparative Study of Texture Measures with Classification Based on Feature Distributions,” Pattern Recognit., vol. 29, no. 1, pp. 51–59, 1996. [7] P. Anantaprayoon, F. J. John Joseph, and S. Marukatat, “Local Feature Based Offline Handwritten Thai Character Recognition,” in Proceedings of Thai Japan Student ICT Fair, 2016, p. 41. 19