SlideShare a Scribd company logo
1 of 20
ARABIC OPTICAL CHARACTER
RECOGNITION AND FUTURE
TRENDS
CONTENT
• Introduction
• Challengings
• How it work
• Character Recognition
• Hidden Markova Model (HMM)
• Pre-processing Operations
• LINE SEGMENTATION
• WORD SEGMENTATION
• BASELINE DETECTION
• SEGMENTATION-BASED AOCR
• SEGMENTATION-FREE AOCR
• Typical OCR for Arabic
Language
• Recent Applications and
Trends for Arabic OCR
• Conclusion
INTRODUCTION
ABSTRACT :
Optical Character
Recognition (OCR) is a
computer system designed
to transform images of
typewritten text (typically
captured by a scanner)
into machine-processed
text.
MOTIVATION AND MORE :
• OCR is the machine replication
of human reading
• it has been subject of intensive
research
• Arabic differs from English as it
is written with connected
characters that possess extra
challenging for Arabic compares
to English
CHALLENGINGS
• Arabic differs from English as it is
written with connected characters
that possess
• Each character in Arabic has a
different shape depending on it
position with respect to the word
• , different characters have the same
main shape but they are different in
number of dots and their location
(e.g. Baa, Taa, and Thaa)
• segmentation is a necessary
because incorrectly segmented
characters will cause
misclassifications of characters and
in turn will lead to inaccurate
results
HOW IT WORK :
• Input
• Normalized
• Skew Corrected
• Noise Removed
• Lines Segmented
• Words Segmented
• Baseline Detected
Input Normalized Skew Corrected
Noise Removed Lines Segmented
Words
Segmented
Baseline Detected
CHARACTER RECOGNITION
• techniques have been developed to work with uncommon cases
of isolated characters
• character-based OCR is used to recognize a character in single
character’s image
• For example extracted fifteen geometric and topological
features, such as moments, zero-crossing and end-points, from
the input image as input to Hidden Markova Model (HMM)
• such approach for character recognition is unlikely to be used as
the goal of the OCR, which is developed to process real-life
Arabic text that involves words and lines rather than isolated
characters
HIDDEN MARKOVA MODEL (HMM)
PRE-PROCESSING OPERATIONS
• line segmentation and
word segmentation are
implemented on the
input text image. Given
that the expected input is
normal Arabic text, it is
necessary to implement
word isolation using
word and line
segmentations
• Other pre-processing
methods, such as
normalization and
skewness removal are
also used sometimes
with specific OCR
systems.
LINE SEGMENTATION
• line segmentation is implemented to isolate each text line for further
processing
• two approaches for line segmentation using fixed threshold as separation
between the lines or using horizontal projection
• The difficulty of line segmentation depends mainly on the type of the
document (historical document, newspapers, books and letters) and the
quality of the input document
• various line segmentation methods have been proposed like “ line
segmentation using line-height method that search for starting foreground
pixel and end foreground pixel of each line “ and “segmentation approach
that use the global maximum peak and baseline detection. This approach
segments the input image into columns and each column is then segmented
into rows, the line is then segmented by finding the maximum agreement
between the rows of all the columns”
• Noise is undesired noise removal is required to ensure a correct processing
and generated results Various noise removal algorithms have been proposed
some other filters, such as wiener2 filter can be used for Arabic OCR
WORD SEGMENTATION
• Word segmentation is implemented by conducting virtually
projecting
• word segmentation depends on determines the vertical white spaces
between words which
• word spotting methodology using block-based document description
for word segmentation
• image normalization is implemented to ease the process of feature
extraction and content recognition
• image normalization is implemented in form of binarization process
by which the image of multiple colours or grey scales are converted
into black and white images. The black is usually represented the
text while the white represents the background.
• The binarization process is an easy step that implemented by determine a
threshold value by which the image is binarized
BASELINE DETECTION
• main challenges of the Arabic OCR systems. Arabic characters are
connected to each other with a reference that called “the baseline”
• there are four different cases of characters connectivity with
regarding to the baseline
1. No touching
2. Ascender touching: the characters are connected above the baseline
3. Descender touching: characters connected below baseline and
4. Baseline touching
• various approaches have been proposed. Overall, OCR systems is
divided into two groups
1. segmentation-based OCR .
2. segmentation-free OCR.
SEGMENTATION-BASED AOCR
• deals with character connectivity problem by segmenting the
characters in the word
• Segmenting Arabic words into letters is the most difficult
problem in Arabic script recognition
• Cheung, proposed an OCR system that used a recognition-
based segmentation technique to overcome the limitations in
the classical segmentation approaches. A newly developed
Arabic word segmentation algorithm is also introduced to
separate horizontally overlapping Arabic words/sub-words.
There is also a feedback loop to control the combination of
character fragments for recognition purpose.
THE STRUCTURE OF THE PROPOSED OCR
SYSTEM BY CHEUNG
SEGMENTATION-FREE AOCR
• Most of the existing Arabic OCR is of segmentation-based OCR,
which is a successful approach in Latin typewriting, however,
due to the obvious difficulties and errors resulted from the
character segmentation process in Arabic language, some
techniques have been proposed without segmentation-phase.
• Al-Barhamtoshy and Rashwan proposed an OCR system with
various technical methods, in pre-processing, font resizing,
binarization, de-skewing, denoising, and segmentation, are
used. Experimental results showed that the proposed system
achieves a good performance with less complexity.
THE STRUCTURE OF THE PROPOSED OCR SYSTEM
BY AL-BARHAMTOSHY AND RASHWAN
TYPICAL OCR FOR ARABIC LANGUAGE
• Common pre-processing steps are required for every system in
order to ensure accurate results.
• Variations of the pre-processing steps are tested based on
different realization of the input documents.
• Segmentation-based approaches have lower accuracy and less
capabilities compared to segmentation-free systems.
• Language models, dictionaries and character variations are
required in order to ensure accurate results.
THE STRUCTURE OF THE COMMON OCR
PROCESS
RECENT APPLICATIONS AND TRENDS FOR ARABIC
OCR
• Optical Font Recognition
• FONT AND SIZE RECOGNITION
• Video-Based OCR : Recent systems for text recognition in
Arabic news captions have been proposed. The significant of
such systems is to ease the process of video indexing and
retrieval given that broadcast programs daily received and
stored in big databases.
• HISTORICAL DOCUMENT OCR : OCR is required in order to facilitate
advance indexing and retrieval of the contained information.
CONCLUSION
Mainly, there are six pre-processing steps that are (reorder
according to the change in paper) noise removal normalization,
skew correction, baseline detection line segmentation and word
segmentation, which are used regardless of the developed
system but depend on the input characteristics. obvious
differences between the existing systems are whether it is
segmentation-based or segmentation-free. Inter-process
variations are embodied in the utilized character segmentation,
windowing, and feature extraction and classification algorithms.
ANY QUESTIONS …. NO !
OK … I KNOW

More Related Content

Similar to OCR by Abdullah Ahmed Abu Rtima

Tesseract OCR Engine
Tesseract OCR EngineTesseract OCR Engine
Tesseract OCR Engine
Raghu nath
 
An OCR System for recognition of Urdu text in Nastaliq Font
An OCR System for recognition of Urdu text in Nastaliq FontAn OCR System for recognition of Urdu text in Nastaliq Font
An OCR System for recognition of Urdu text in Nastaliq Font
Dr. Syed Hassan Amin
 
DevanagiriOCR on CELL BROADBAND ENGINE
DevanagiriOCR on CELL BROADBAND ENGINEDevanagiriOCR on CELL BROADBAND ENGINE
DevanagiriOCR on CELL BROADBAND ENGINE
Pridhvi Kodamasimham
 
Pattern_Recognition_via_Character_Recogn.pptx
Pattern_Recognition_via_Character_Recogn.pptxPattern_Recognition_via_Character_Recogn.pptx
Pattern_Recognition_via_Character_Recogn.pptx
EngRSMY2
 

Similar to OCR by Abdullah Ahmed Abu Rtima (20)

Assessment of OCR quality and font identification in historical documents
Assessment of OCR quality and font identification in historical documentsAssessment of OCR quality and font identification in historical documents
Assessment of OCR quality and font identification in historical documents
 
Tesseract OCR Engine
Tesseract OCR EngineTesseract OCR Engine
Tesseract OCR Engine
 
Optical Character Recognition (OCR) based Retrieval
Optical Character Recognition (OCR) based RetrievalOptical Character Recognition (OCR) based Retrieval
Optical Character Recognition (OCR) based Retrieval
 
Design and Description of Feature Extraction Algorithm for Old English Font
Design and Description of Feature Extraction Algorithm for Old English FontDesign and Description of Feature Extraction Algorithm for Old English Font
Design and Description of Feature Extraction Algorithm for Old English Font
 
An OCR System for recognition of Urdu text in Nastaliq Font
An OCR System for recognition of Urdu text in Nastaliq FontAn OCR System for recognition of Urdu text in Nastaliq Font
An OCR System for recognition of Urdu text in Nastaliq Font
 
Ocr 1
Ocr 1Ocr 1
Ocr 1
 
Optical Character Recognition( OCR )
Optical Character Recognition( OCR )Optical Character Recognition( OCR )
Optical Character Recognition( OCR )
 
DevanagiriOCR on CELL BROADBAND ENGINE
DevanagiriOCR on CELL BROADBAND ENGINEDevanagiriOCR on CELL BROADBAND ENGINE
DevanagiriOCR on CELL BROADBAND ENGINE
 
OCR, optical character reader
OCR, optical character readerOCR, optical character reader
OCR, optical character reader
 
Optical Character Recognition
Optical Character RecognitionOptical Character Recognition
Optical Character Recognition
 
Atks (Arabic Toolkit services)
Atks (Arabic Toolkit services)Atks (Arabic Toolkit services)
Atks (Arabic Toolkit services)
 
Introduction
IntroductionIntroduction
Introduction
 
OCV & OCR - A Validation Perspective
OCV & OCR - A Validation PerspectiveOCV & OCR - A Validation Perspective
OCV & OCR - A Validation Perspective
 
A study of feature extraction for Arabic calligraphy characters recognition
A study of feature extraction for Arabic calligraphy characters recognitionA study of feature extraction for Arabic calligraphy characters recognition
A study of feature extraction for Arabic calligraphy characters recognition
 
Offline Omni Font Arabic Optical Text Recognition System using Prolog Classif...
Offline Omni Font Arabic Optical Text Recognition System using Prolog Classif...Offline Omni Font Arabic Optical Text Recognition System using Prolog Classif...
Offline Omni Font Arabic Optical Text Recognition System using Prolog Classif...
 
Pattern_Recognition_via_Character_Recogn.pptx
Pattern_Recognition_via_Character_Recogn.pptxPattern_Recognition_via_Character_Recogn.pptx
Pattern_Recognition_via_Character_Recogn.pptx
 
Datech2014 - Session 3 - PoCoTo - An Open Source System For Efficient Interac...
Datech2014 - Session 3 - PoCoTo - An Open Source System For Efficient Interac...Datech2014 - Session 3 - PoCoTo - An Open Source System For Efficient Interac...
Datech2014 - Session 3 - PoCoTo - An Open Source System For Efficient Interac...
 
OCR Algorithm Accuracy Enhancement
OCR Algorithm Accuracy EnhancementOCR Algorithm Accuracy Enhancement
OCR Algorithm Accuracy Enhancement
 
Detecting text from natural images with Stroke Width Transform
Detecting text from natural images with Stroke Width TransformDetecting text from natural images with Stroke Width Transform
Detecting text from natural images with Stroke Width Transform
 
Code Inspection
Code InspectionCode Inspection
Code Inspection
 

Recently uploaded

Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
Epec Engineered Technologies
 
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills KuwaitKuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
jaanualu31
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdf
Kamal Acharya
 
DeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesDeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakes
MayuraD1
 
Integrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - NeometrixIntegrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - Neometrix
Neometrix_Engineering_Pvt_Ltd
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Kandungan 087776558899
 

Recently uploaded (20)

PE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiesPE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and properties
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
 
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best ServiceTamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
 
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced LoadsFEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
 
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills KuwaitKuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
 
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLEGEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdf
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 
Wadi Rum luxhotel lodge Analysis case study.pptx
Wadi Rum luxhotel lodge Analysis case study.pptxWadi Rum luxhotel lodge Analysis case study.pptx
Wadi Rum luxhotel lodge Analysis case study.pptx
 
A Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityA Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna Municipality
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdf
 
DeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesDeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakes
 
Engineering Drawing focus on projection of planes
Engineering Drawing focus on projection of planesEngineering Drawing focus on projection of planes
Engineering Drawing focus on projection of planes
 
Work-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptxWork-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptx
 
Integrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - NeometrixIntegrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - Neometrix
 
Moment Distribution Method For Btech Civil
Moment Distribution Method For Btech CivilMoment Distribution Method For Btech Civil
Moment Distribution Method For Btech Civil
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
 
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
 

OCR by Abdullah Ahmed Abu Rtima

  • 2. CONTENT • Introduction • Challengings • How it work • Character Recognition • Hidden Markova Model (HMM) • Pre-processing Operations • LINE SEGMENTATION • WORD SEGMENTATION • BASELINE DETECTION • SEGMENTATION-BASED AOCR • SEGMENTATION-FREE AOCR • Typical OCR for Arabic Language • Recent Applications and Trends for Arabic OCR • Conclusion
  • 3. INTRODUCTION ABSTRACT : Optical Character Recognition (OCR) is a computer system designed to transform images of typewritten text (typically captured by a scanner) into machine-processed text. MOTIVATION AND MORE : • OCR is the machine replication of human reading • it has been subject of intensive research • Arabic differs from English as it is written with connected characters that possess extra challenging for Arabic compares to English
  • 4. CHALLENGINGS • Arabic differs from English as it is written with connected characters that possess • Each character in Arabic has a different shape depending on it position with respect to the word • , different characters have the same main shape but they are different in number of dots and their location (e.g. Baa, Taa, and Thaa) • segmentation is a necessary because incorrectly segmented characters will cause misclassifications of characters and in turn will lead to inaccurate results
  • 5. HOW IT WORK : • Input • Normalized • Skew Corrected • Noise Removed • Lines Segmented • Words Segmented • Baseline Detected Input Normalized Skew Corrected Noise Removed Lines Segmented Words Segmented Baseline Detected
  • 6. CHARACTER RECOGNITION • techniques have been developed to work with uncommon cases of isolated characters • character-based OCR is used to recognize a character in single character’s image • For example extracted fifteen geometric and topological features, such as moments, zero-crossing and end-points, from the input image as input to Hidden Markova Model (HMM) • such approach for character recognition is unlikely to be used as the goal of the OCR, which is developed to process real-life Arabic text that involves words and lines rather than isolated characters
  • 8. PRE-PROCESSING OPERATIONS • line segmentation and word segmentation are implemented on the input text image. Given that the expected input is normal Arabic text, it is necessary to implement word isolation using word and line segmentations • Other pre-processing methods, such as normalization and skewness removal are also used sometimes with specific OCR systems.
  • 9. LINE SEGMENTATION • line segmentation is implemented to isolate each text line for further processing • two approaches for line segmentation using fixed threshold as separation between the lines or using horizontal projection • The difficulty of line segmentation depends mainly on the type of the document (historical document, newspapers, books and letters) and the quality of the input document • various line segmentation methods have been proposed like “ line segmentation using line-height method that search for starting foreground pixel and end foreground pixel of each line “ and “segmentation approach that use the global maximum peak and baseline detection. This approach segments the input image into columns and each column is then segmented into rows, the line is then segmented by finding the maximum agreement between the rows of all the columns” • Noise is undesired noise removal is required to ensure a correct processing and generated results Various noise removal algorithms have been proposed some other filters, such as wiener2 filter can be used for Arabic OCR
  • 10. WORD SEGMENTATION • Word segmentation is implemented by conducting virtually projecting • word segmentation depends on determines the vertical white spaces between words which • word spotting methodology using block-based document description for word segmentation • image normalization is implemented to ease the process of feature extraction and content recognition • image normalization is implemented in form of binarization process by which the image of multiple colours or grey scales are converted into black and white images. The black is usually represented the text while the white represents the background. • The binarization process is an easy step that implemented by determine a threshold value by which the image is binarized
  • 11. BASELINE DETECTION • main challenges of the Arabic OCR systems. Arabic characters are connected to each other with a reference that called “the baseline” • there are four different cases of characters connectivity with regarding to the baseline 1. No touching 2. Ascender touching: the characters are connected above the baseline 3. Descender touching: characters connected below baseline and 4. Baseline touching • various approaches have been proposed. Overall, OCR systems is divided into two groups 1. segmentation-based OCR . 2. segmentation-free OCR.
  • 12. SEGMENTATION-BASED AOCR • deals with character connectivity problem by segmenting the characters in the word • Segmenting Arabic words into letters is the most difficult problem in Arabic script recognition • Cheung, proposed an OCR system that used a recognition- based segmentation technique to overcome the limitations in the classical segmentation approaches. A newly developed Arabic word segmentation algorithm is also introduced to separate horizontally overlapping Arabic words/sub-words. There is also a feedback loop to control the combination of character fragments for recognition purpose.
  • 13. THE STRUCTURE OF THE PROPOSED OCR SYSTEM BY CHEUNG
  • 14. SEGMENTATION-FREE AOCR • Most of the existing Arabic OCR is of segmentation-based OCR, which is a successful approach in Latin typewriting, however, due to the obvious difficulties and errors resulted from the character segmentation process in Arabic language, some techniques have been proposed without segmentation-phase. • Al-Barhamtoshy and Rashwan proposed an OCR system with various technical methods, in pre-processing, font resizing, binarization, de-skewing, denoising, and segmentation, are used. Experimental results showed that the proposed system achieves a good performance with less complexity.
  • 15. THE STRUCTURE OF THE PROPOSED OCR SYSTEM BY AL-BARHAMTOSHY AND RASHWAN
  • 16. TYPICAL OCR FOR ARABIC LANGUAGE • Common pre-processing steps are required for every system in order to ensure accurate results. • Variations of the pre-processing steps are tested based on different realization of the input documents. • Segmentation-based approaches have lower accuracy and less capabilities compared to segmentation-free systems. • Language models, dictionaries and character variations are required in order to ensure accurate results.
  • 17. THE STRUCTURE OF THE COMMON OCR PROCESS
  • 18. RECENT APPLICATIONS AND TRENDS FOR ARABIC OCR • Optical Font Recognition • FONT AND SIZE RECOGNITION • Video-Based OCR : Recent systems for text recognition in Arabic news captions have been proposed. The significant of such systems is to ease the process of video indexing and retrieval given that broadcast programs daily received and stored in big databases. • HISTORICAL DOCUMENT OCR : OCR is required in order to facilitate advance indexing and retrieval of the contained information.
  • 19. CONCLUSION Mainly, there are six pre-processing steps that are (reorder according to the change in paper) noise removal normalization, skew correction, baseline detection line segmentation and word segmentation, which are used regardless of the developed system but depend on the input characteristics. obvious differences between the existing systems are whether it is segmentation-based or segmentation-free. Inter-process variations are embodied in the utilized character segmentation, windowing, and feature extraction and classification algorithms.
  • 20. ANY QUESTIONS …. NO ! OK … I KNOW