SlideShare a Scribd company logo
1 of 2
Recognition of Bengali Handwritten Digits Using Convolutional
Neural Network Architectures
Summary : Recognition of Handwritten Digits has always been the pioneer interest in deep
learning. Due to the lacking’s required dataset, in this particular field the research count is
significantly low. NamtaDB dataset, has a collection of Bengali Handwritten Digit dataset,
which contains 85,000 digit’s from 2700 contributors. It is implicitly made confirm that dataset
represent the diversity as it is collected both from children and adults who are from different
regions.
Different types of augmentation on dataset is being applied for this kind of work. As for the
example Rotation, Translation, Blurring, Zoom in, salt pepper noise etc are used.
Rotating Image : Dimensions may not be preserved as previous after rotation.
This rotation takes place with respect to the center. And computes the
Inverse transformation of every pixel. RGB images computed observing
every color plane.
Translation : The movement of digits location along row or column. Shift
The image with respect to x and y coordinates by increasing or decreasing
value of the coordinates.
Blurring : smooth’s the operation, and it is applied on a linear filter on initial
image. The effect is to average out rapid changes in pixel intensity.
Hue saturation Value shifting (HSV) : An alternative representation of RGB
image. HSV color models were created as a more convenient way for us to
specify colors.
Superimpose : It is done to replicate the effect of text written on the back of an
already written page. image is vertically flipped and a weighted sum of two
images is taken.
Some of the augmentations includes affine transformation, Coarse dropout, Addition of noise,
Superimpose, Inversion etc. One of the difficult type of augmentation is to remove Coarse
dropout and affine transformation. Deleting Digit edge, scaling images, rotating, shear mapping,
and sometimes multiple of these effects are combined.
Even if an image is visually okay, it still may not be a good data for a Convolutional Neural
Network. So for the aid, median blur of filter size 9x9 is used. It removes the Gaussian noises
and sharp edges. Some problem may occur like some test data become victim of the blurring, and
the digit can be blurred to a point that is unrecognizable.
Sometimes when images are inverted but the outcome result show, the inversion don’t give any
change in result. The system is able to recognize the data set in background perfectly. It happens
for the cause of using Convolutional layers work on edges and that’s why it smoothly recognizes
the corners of the background or foreground color.
Some dataset from the NamtaDB is mislabeled even after the rigorous checking. Some samples
get distorted which is not easy to recognize for human being. If there is any update from
NamtaDB with minimum number of mislabeled images then handling these distorted images will
be easier. Here, the obtained accuracy rate is 99.3359%. This might be addressed in future work
in broader problems such as license plate recognition, or handwritten character recognition.

More Related Content

What's hot

Matlab noise reduction based on partial-reference, dual-tree complex wavelet...
Matlab  noise reduction based on partial-reference, dual-tree complex wavelet...Matlab  noise reduction based on partial-reference, dual-tree complex wavelet...
Matlab noise reduction based on partial-reference, dual-tree complex wavelet...
Ecway Technologies
 
Computer graphics
Computer graphicsComputer graphics
Computer graphics
bhaveshbunk
 

What's hot (19)

Hog
HogHog
Hog
 
Image colorization
Image colorizationImage colorization
Image colorization
 
Antialiasing
Antialiasing  Antialiasing
Antialiasing
 
Matlab noise reduction based on partial-reference, dual-tree complex wavelet...
Matlab  noise reduction based on partial-reference, dual-tree complex wavelet...Matlab  noise reduction based on partial-reference, dual-tree complex wavelet...
Matlab noise reduction based on partial-reference, dual-tree complex wavelet...
 
07 dimensionality reduction
07 dimensionality reduction07 dimensionality reduction
07 dimensionality reduction
 
[PR12] Generative Models as Distributions of Functions
[PR12] Generative Models as Distributions of Functions[PR12] Generative Models as Distributions of Functions
[PR12] Generative Models as Distributions of Functions
 
Hands-On Machine Learning with Scikit-Learn and TensorFlow - Chapter8
Hands-On Machine Learning with Scikit-Learn and TensorFlow - Chapter8Hands-On Machine Learning with Scikit-Learn and TensorFlow - Chapter8
Hands-On Machine Learning with Scikit-Learn and TensorFlow - Chapter8
 
Random forest algorithm
Random forest algorithmRandom forest algorithm
Random forest algorithm
 
3D Image visualization
3D Image visualization3D Image visualization
3D Image visualization
 
PR-155: Exploring Randomly Wired Neural Networks for Image Recognition
PR-155: Exploring Randomly Wired Neural Networks for Image RecognitionPR-155: Exploring Randomly Wired Neural Networks for Image Recognition
PR-155: Exploring Randomly Wired Neural Networks for Image Recognition
 
Computer graphics
Computer graphicsComputer graphics
Computer graphics
 
[CVPR2020] Simple but effective image enhancement techniques
[CVPR2020] Simple but effective image enhancement techniques[CVPR2020] Simple but effective image enhancement techniques
[CVPR2020] Simple but effective image enhancement techniques
 
Mobile scanner
Mobile scannerMobile scanner
Mobile scanner
 
Lecture 17 Image Enhancement Process in Image Processing
Lecture 17 Image Enhancement Process in Image ProcessingLecture 17 Image Enhancement Process in Image Processing
Lecture 17 Image Enhancement Process in Image Processing
 
Pixel Recursive Super Resolution. Google Brain
 Pixel Recursive Super Resolution.  Google Brain Pixel Recursive Super Resolution.  Google Brain
Pixel Recursive Super Resolution. Google Brain
 
Machine Learning - Introduction to Convolutional Neural Networks
Machine Learning - Introduction to Convolutional Neural NetworksMachine Learning - Introduction to Convolutional Neural Networks
Machine Learning - Introduction to Convolutional Neural Networks
 
Translation Invariance (TI) based Novel Approach for better De-noising of Dig...
Translation Invariance (TI) based Novel Approach for better De-noising of Dig...Translation Invariance (TI) based Novel Approach for better De-noising of Dig...
Translation Invariance (TI) based Novel Approach for better De-noising of Dig...
 
Facial keypoint recognition
Facial keypoint recognitionFacial keypoint recognition
Facial keypoint recognition
 
Ppts21
Ppts21Ppts21
Ppts21
 

Similar to Digital image processing recognition of bengali handwritten digits using convolutional neural network architectures

Defended Data Embedding For Chiseler Avoidance in Visible Cryptography by Usi...
Defended Data Embedding For Chiseler Avoidance in Visible Cryptography by Usi...Defended Data Embedding For Chiseler Avoidance in Visible Cryptography by Usi...
Defended Data Embedding For Chiseler Avoidance in Visible Cryptography by Usi...
IOSR Journals
 
Using A Application For A Desktop Application
Using A Application For A Desktop ApplicationUsing A Application For A Desktop Application
Using A Application For A Desktop Application
Tracy Huang
 
Satellite image contrast enhancement using discrete wavelet transform
Satellite image contrast enhancement using discrete wavelet transformSatellite image contrast enhancement using discrete wavelet transform
Satellite image contrast enhancement using discrete wavelet transform
Harishwar Reddy
 
Image enhancement
Image enhancementImage enhancement
Image enhancement
Ayaelshiwi
 

Similar to Digital image processing recognition of bengali handwritten digits using convolutional neural network architectures (20)

Defended Data Embedding For Chiseler Avoidance in Visible Cryptography by Usi...
Defended Data Embedding For Chiseler Avoidance in Visible Cryptography by Usi...Defended Data Embedding For Chiseler Avoidance in Visible Cryptography by Usi...
Defended Data Embedding For Chiseler Avoidance in Visible Cryptography by Usi...
 
Application of interpolation in CSE
Application of interpolation in CSEApplication of interpolation in CSE
Application of interpolation in CSE
 
Using A Application For A Desktop Application
Using A Application For A Desktop ApplicationUsing A Application For A Desktop Application
Using A Application For A Desktop Application
 
Satellite image contrast enhancement using discrete wavelet transform
Satellite image contrast enhancement using discrete wavelet transformSatellite image contrast enhancement using discrete wavelet transform
Satellite image contrast enhancement using discrete wavelet transform
 
Object recognition
Object recognitionObject recognition
Object recognition
 
IRJET- Low Light Image Enhancement using Convolutional Neural Network
IRJET-  	  Low Light Image Enhancement using Convolutional Neural NetworkIRJET-  	  Low Light Image Enhancement using Convolutional Neural Network
IRJET- Low Light Image Enhancement using Convolutional Neural Network
 
I010634450
I010634450I010634450
I010634450
 
Performance of Efficient Closed-Form Solution to Comprehensive Frontier Exposure
Performance of Efficient Closed-Form Solution to Comprehensive Frontier ExposurePerformance of Efficient Closed-Form Solution to Comprehensive Frontier Exposure
Performance of Efficient Closed-Form Solution to Comprehensive Frontier Exposure
 
A Survey on Implementation of Discrete Wavelet Transform for Image Denoising
A Survey on Implementation of Discrete Wavelet Transform for Image DenoisingA Survey on Implementation of Discrete Wavelet Transform for Image Denoising
A Survey on Implementation of Discrete Wavelet Transform for Image Denoising
 
Image enhancement
Image enhancementImage enhancement
Image enhancement
 
Ik3415621565
Ik3415621565Ik3415621565
Ik3415621565
 
G0352039045
G0352039045G0352039045
G0352039045
 
N043020970100
N043020970100N043020970100
N043020970100
 
SIFT
SIFTSIFT
SIFT
 
D010332630
D010332630D010332630
D010332630
 
Sift detector boosted by adaptive contrast threshold to improve matching robu...
Sift detector boosted by adaptive contrast threshold to improve matching robu...Sift detector boosted by adaptive contrast threshold to improve matching robu...
Sift detector boosted by adaptive contrast threshold to improve matching robu...
 
Sift detector boosted by adaptive contrast threshold to improve matching robu...
Sift detector boosted by adaptive contrast threshold to improve matching robu...Sift detector boosted by adaptive contrast threshold to improve matching robu...
Sift detector boosted by adaptive contrast threshold to improve matching robu...
 
Enhanced Optimization of Edge Detection for High Resolution Images Using Veri...
Enhanced Optimization of Edge Detection for High Resolution Images Using Veri...Enhanced Optimization of Edge Detection for High Resolution Images Using Veri...
Enhanced Optimization of Edge Detection for High Resolution Images Using Veri...
 
image-processing-husseina-ozigi-otaru.ppt
image-processing-husseina-ozigi-otaru.pptimage-processing-husseina-ozigi-otaru.ppt
image-processing-husseina-ozigi-otaru.ppt
 
IRJET- Design of Image Resolution Enhancement by using DWT and SWT
IRJET-  	  Design of Image Resolution Enhancement by using DWT and SWTIRJET-  	  Design of Image Resolution Enhancement by using DWT and SWT
IRJET- Design of Image Resolution Enhancement by using DWT and SWT
 

More from Sunjid Hasan

More from Sunjid Hasan (10)

Software engineering( sms )
Software engineering( sms )Software engineering( sms )
Software engineering( sms )
 
System analysis design of Fire Service & Civil Defence
System analysis design of Fire Service & Civil DefenceSystem analysis design of Fire Service & Civil Defence
System analysis design of Fire Service & Civil Defence
 
Artificial intelligence - python
Artificial intelligence - pythonArtificial intelligence - python
Artificial intelligence - python
 
Artificial intelligence - Prolog
Artificial intelligence - Prolog Artificial intelligence - Prolog
Artificial intelligence - Prolog
 
Cover page sample
Cover page sampleCover page sample
Cover page sample
 
Compiler and symbol table
Compiler and symbol tableCompiler and symbol table
Compiler and symbol table
 
System analysis design of fire service and civil defence
System analysis design of fire service and civil defenceSystem analysis design of fire service and civil defence
System analysis design of fire service and civil defence
 
Online blood sharing application
Online blood  sharing applicationOnline blood  sharing application
Online blood sharing application
 
Messaging application
Messaging applicationMessaging application
Messaging application
 
Estimation for software
Estimation for softwareEstimation for software
Estimation for software
 

Recently uploaded

Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
AnaAcapella
 

Recently uploaded (20)

HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 

Digital image processing recognition of bengali handwritten digits using convolutional neural network architectures

  • 1. Recognition of Bengali Handwritten Digits Using Convolutional Neural Network Architectures Summary : Recognition of Handwritten Digits has always been the pioneer interest in deep learning. Due to the lacking’s required dataset, in this particular field the research count is significantly low. NamtaDB dataset, has a collection of Bengali Handwritten Digit dataset, which contains 85,000 digit’s from 2700 contributors. It is implicitly made confirm that dataset represent the diversity as it is collected both from children and adults who are from different regions. Different types of augmentation on dataset is being applied for this kind of work. As for the example Rotation, Translation, Blurring, Zoom in, salt pepper noise etc are used. Rotating Image : Dimensions may not be preserved as previous after rotation. This rotation takes place with respect to the center. And computes the Inverse transformation of every pixel. RGB images computed observing every color plane. Translation : The movement of digits location along row or column. Shift The image with respect to x and y coordinates by increasing or decreasing value of the coordinates. Blurring : smooth’s the operation, and it is applied on a linear filter on initial image. The effect is to average out rapid changes in pixel intensity. Hue saturation Value shifting (HSV) : An alternative representation of RGB image. HSV color models were created as a more convenient way for us to specify colors. Superimpose : It is done to replicate the effect of text written on the back of an
  • 2. already written page. image is vertically flipped and a weighted sum of two images is taken. Some of the augmentations includes affine transformation, Coarse dropout, Addition of noise, Superimpose, Inversion etc. One of the difficult type of augmentation is to remove Coarse dropout and affine transformation. Deleting Digit edge, scaling images, rotating, shear mapping, and sometimes multiple of these effects are combined. Even if an image is visually okay, it still may not be a good data for a Convolutional Neural Network. So for the aid, median blur of filter size 9x9 is used. It removes the Gaussian noises and sharp edges. Some problem may occur like some test data become victim of the blurring, and the digit can be blurred to a point that is unrecognizable. Sometimes when images are inverted but the outcome result show, the inversion don’t give any change in result. The system is able to recognize the data set in background perfectly. It happens for the cause of using Convolutional layers work on edges and that’s why it smoothly recognizes the corners of the background or foreground color. Some dataset from the NamtaDB is mislabeled even after the rigorous checking. Some samples get distorted which is not easy to recognize for human being. If there is any update from NamtaDB with minimum number of mislabeled images then handling these distorted images will be easier. Here, the obtained accuracy rate is 99.3359%. This might be addressed in future work in broader problems such as license plate recognition, or handwritten character recognition.