This is the Bangla Handwritten Digit Recognition Report. you can see this report for your helping hand.
**Bengali is the world's fifth most spoken language, with 265 million native and non-native speakers accounting for 4% of the global population.
**Despite the large number of Bengali speakers, very little research has been conducted on Bangali handwritten digit recognition.
**The application of the BHwDR system is wide from postal code digit recognition to license plate recognition, digit recognition in cheques in the banking system to exam paper registration number recognition.
Faster Training Algorithms in Neural Network Based Approach For Handwritten T...CSCJournals
Handwritten text and character recognition is a challenging task compared to recognition of handwritten numeral and computer printed text due to its large variety in nature. As practical pattern recognition problems uses bulk data and there is a one step self sufficient deterministic theory to resolve recognition problems by calculating inverse of Hessian Matrix and multiplication the inverse matrix it with first order local gradient vector. But in practical cases when neural network is large the inversing operation of the Hessian Matrix is not manageable and another condition must be satisfied the Hessian Matrix must be positive definite which may not be satishfied. In these cases some repetitive recursive models are taken. In several research work in past decade it was experienced that Neural Network based approach provides most reliable performance in handwritten character and text recognition but recognition performance depends upon some important factors like no of training samples, reliable features and no of features per character, training time, variety of handwriting etc. Important features from different types of handwriting are collected and are fed to the neural network for training. It is true that more no of features increases test efficiency but it takes longer time to converge the error curve. To reduce this training time effectively proper train algorithm should be chosen so that the system provides best train and test efficiency in least possible time that is to provide the system fastest intelligence. We have used several second order conjugate gradient algorithms for training of neural network. We have found that Scaled Conjugate Gradient Algorithm , a second order training algorithm as the fastest for training of neural network for our application. Training using SCG takes minimum time with excellent test efficiency. A scanned handwritten text is taken as input and character level segmentation is done. Some important and reliable features from each character are extracted and used as input to a neural network for training. When the error level reaches into a satisfactory level (10 -12 ) weights are accepted for testing a test script. Finally a lexicon matching algorithm solves the minor misclassification problems.
Course Title CS591-Advance Artificial Intelligence CruzIbarra161
Course Title: CS591-Advance Artificial Intelligence StudentName: Namratha Valle, Malemarpuram Chaitanya
sai, Sasidhar Reddy Vajrala, Nagendra Mokara SEMOID#S02023694
StudentEmail: [email protected] Date:04/20/2021
Violations of academic honesty represent a serious breach of discipline and may be considered grounds for disciplinary action, including dismissal
from the University. The University requires that all assignments submitted to faculty members by students be the work of the individual student
submitting the work. An exception would be group projects assigned by the instructor. (Source: SEMO website)
Advanced Artificial Intelligence Assignment
Graduate project level 2
Abstract
Artificial Intelligence (AI) is a crucial technical technology that is commonly used in today's
society. Deep Learning, in particular, has a variety of uses due to its ability to learn robust
representations from images. A Convolutional Neural Network (CNN) is a Deep Learning
algorithm which commands the input image, assigns significance to numerous aspects/objects in
the image, and can distinguish between them. For image classification, CNN is the most popular
Deep Learning architecture. To get better results, we used various automated processing tasks for
fruit and vegetable images. In comparison to other classification deep learning algorithms, the
amount of pre-processing needed by a CNN model is much lower. Furthermore, the learning
capabilities of Deep Learning architectures can be used to improve sound classification in order
to solve efficiency problems. CNN is used in this project, and layers are created to classify the
sound waves into their various categories.
Introduction
We humans enjoy analyzing items, and everything you can think of can be classified into a
classification or class. It is an everyday issue in business; analysis of parts, installations,
gatherings, and products are necessary for the daily routine. This is the reason why people have
devised procedures such as Machine Learning (ML), Neural Networks (NN), and Deep Learning
(DL), among other calculations, to automate the arrangement period. Deep learning will be one
of them that we will explore. Deep learning is an artificial intelligence (AI) function that
simulates how the human brain processes data and creates patterns to make decisions. The
classification of photographs of fruits and vegetables with the naked eye is very difficult. As a
result, we're using pyTorch to process image datasets with Deep Learning. We're developing a
CNN model for image detection and categorization using these datasets. A custom CNN is
introduced and then compared to a ResNet CNN for the purposes of this study. The oth ...
Using Deep Learning to Find Similar DressesHJ van Veen
Report by Luís Mey ( https://www.linkedin.com/in/lu%C3%ADs-gustavo-bernardo-mey-97b38927/ ) on Udacity Machine Learning Course - Final Project: Use Deep Learning to Find Similar Dresses.
Faster Training Algorithms in Neural Network Based Approach For Handwritten T...CSCJournals
Handwritten text and character recognition is a challenging task compared to recognition of handwritten numeral and computer printed text due to its large variety in nature. As practical pattern recognition problems uses bulk data and there is a one step self sufficient deterministic theory to resolve recognition problems by calculating inverse of Hessian Matrix and multiplication the inverse matrix it with first order local gradient vector. But in practical cases when neural network is large the inversing operation of the Hessian Matrix is not manageable and another condition must be satisfied the Hessian Matrix must be positive definite which may not be satishfied. In these cases some repetitive recursive models are taken. In several research work in past decade it was experienced that Neural Network based approach provides most reliable performance in handwritten character and text recognition but recognition performance depends upon some important factors like no of training samples, reliable features and no of features per character, training time, variety of handwriting etc. Important features from different types of handwriting are collected and are fed to the neural network for training. It is true that more no of features increases test efficiency but it takes longer time to converge the error curve. To reduce this training time effectively proper train algorithm should be chosen so that the system provides best train and test efficiency in least possible time that is to provide the system fastest intelligence. We have used several second order conjugate gradient algorithms for training of neural network. We have found that Scaled Conjugate Gradient Algorithm , a second order training algorithm as the fastest for training of neural network for our application. Training using SCG takes minimum time with excellent test efficiency. A scanned handwritten text is taken as input and character level segmentation is done. Some important and reliable features from each character are extracted and used as input to a neural network for training. When the error level reaches into a satisfactory level (10 -12 ) weights are accepted for testing a test script. Finally a lexicon matching algorithm solves the minor misclassification problems.
Course Title CS591-Advance Artificial Intelligence CruzIbarra161
Course Title: CS591-Advance Artificial Intelligence StudentName: Namratha Valle, Malemarpuram Chaitanya
sai, Sasidhar Reddy Vajrala, Nagendra Mokara SEMOID#S02023694
StudentEmail: [email protected] Date:04/20/2021
Violations of academic honesty represent a serious breach of discipline and may be considered grounds for disciplinary action, including dismissal
from the University. The University requires that all assignments submitted to faculty members by students be the work of the individual student
submitting the work. An exception would be group projects assigned by the instructor. (Source: SEMO website)
Advanced Artificial Intelligence Assignment
Graduate project level 2
Abstract
Artificial Intelligence (AI) is a crucial technical technology that is commonly used in today's
society. Deep Learning, in particular, has a variety of uses due to its ability to learn robust
representations from images. A Convolutional Neural Network (CNN) is a Deep Learning
algorithm which commands the input image, assigns significance to numerous aspects/objects in
the image, and can distinguish between them. For image classification, CNN is the most popular
Deep Learning architecture. To get better results, we used various automated processing tasks for
fruit and vegetable images. In comparison to other classification deep learning algorithms, the
amount of pre-processing needed by a CNN model is much lower. Furthermore, the learning
capabilities of Deep Learning architectures can be used to improve sound classification in order
to solve efficiency problems. CNN is used in this project, and layers are created to classify the
sound waves into their various categories.
Introduction
We humans enjoy analyzing items, and everything you can think of can be classified into a
classification or class. It is an everyday issue in business; analysis of parts, installations,
gatherings, and products are necessary for the daily routine. This is the reason why people have
devised procedures such as Machine Learning (ML), Neural Networks (NN), and Deep Learning
(DL), among other calculations, to automate the arrangement period. Deep learning will be one
of them that we will explore. Deep learning is an artificial intelligence (AI) function that
simulates how the human brain processes data and creates patterns to make decisions. The
classification of photographs of fruits and vegetables with the naked eye is very difficult. As a
result, we're using pyTorch to process image datasets with Deep Learning. We're developing a
CNN model for image detection and categorization using these datasets. A custom CNN is
introduced and then compared to a ResNet CNN for the purposes of this study. The oth ...
Using Deep Learning to Find Similar DressesHJ van Veen
Report by Luís Mey ( https://www.linkedin.com/in/lu%C3%ADs-gustavo-bernardo-mey-97b38927/ ) on Udacity Machine Learning Course - Final Project: Use Deep Learning to Find Similar Dresses.
One-shot learning is an object categorization problem in computer vision. Whereas most machine learning based object categorization algorithms require training on hundreds or thousands of images and very large datasets, one-shot learning aims to learn information about object categories from one, or only a few, training images
Performance Comparison between Pytorch and Mindsporeijdms
Deep learning has been well used in many fields. However, there is a large amount of data when training neural networks, which makes many deep learning frameworks appear to serve deep learning practitioners, providing services that are more convenient to use and perform better. MindSpore and PyTorch are both deep learning frameworks. MindSpore is owned by HUAWEI, while PyTorch is owned by Facebook. Some people think that HUAWEI's MindSpore has better performance than FaceBook's PyTorch, which makes deep learning practitioners confused about the choice between the two. In this paper, we perform analytical and experimental analysis to reveal the comparison of training speed of MIndSpore and PyTorch on a single GPU. To ensure that our survey is as comprehensive as possible, we carefully selected neural networks in 2 main domains, which cover computer vision and natural language processing (NLP). The contribution of this work is twofold. First, we conduct detailed benchmarking experiments on MindSpore and PyTorch to analyze the reasons for their performance differences. This work provides guidance for end users to choose between these two frameworks.
Image Captioning Generator using Deep Machine Learningijtsrd
Technologys scope has evolved into one of the most powerful tools for human development in a variety of fields.AI and machine learning have become one of the most powerful tools for completing tasks quickly and accurately without the need for human intervention. This project demonstrates how deep machine learning can be used to create a caption or a sentence for a given picture. This can be used for visually impaired persons, as well as automobiles for self identification, and for various applications to verify quickly and easily. The Convolutional Neural Network CNN is used to describe the alphabet, and the Long Short Term Memory LSTM is used to organize the right meaningful sentences in this model. The flicker 8k and flicker 30k datasets were used to train this. Sreejith S P | Vijayakumar A "Image Captioning Generator using Deep Machine Learning" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-5 | Issue-4 , June 2021, URL: https://www.ijtsrd.compapers/ijtsrd42344.pdf Paper URL: https://www.ijtsrd.comcomputer-science/artificial-intelligence/42344/image-captioning-generator-using-deep-machine-learning/sreejith-s-p
Fast and robust tracking of multiple faces is receiving increased attention from computer vision researchers as it finds potential applications in many fields like video surveillance and computer mediated video conferencing. Real-time tracking of multiple faces in high resolution videos involve three basic tasks namely initialization, tracking and display. Among these, tracking is quite compute intensive as it involves particle filtering that won’t yield a real time performance if we use a conventional CPU based system alone.
This paper presents a study of the efficiency and performance speedup achieved by applying Graphics Processing Units for Face Recognition Solutions. We explore one of the possibilities of parallelizing and optimizing a well-known Face Recognition algorithm, Principal Component Analysis (PCA) with Eigenfaces. In recent years, the Graphics Processing Units (GPU) has been the subject of extensive research and the computation speed of GPUs has been rapidly increasing.
DATA AUGMENTATION TECHNIQUES AND TRANSFER LEARNING APPROACHES APPLIED TO FACI...ijaia
The face expression is the first thing we pay attention to when we want to understand a person’s state of
mind. Thus, the ability to recognize facial expressions in an automatic way is a very interesting research
field. In this paper, because the small size of available training datasets, we propose a novel data
augmentation technique that improves the performances in the recognition task. We apply geometrical
transformations and build from scratch GAN models able to generate new synthetic images for each
emotion type. Thus, on the augmented datasets we fine tune pretrained convolutional neural networks with
different architectures. To measure the generalization ability of the models, we apply extra-database
protocol approach, namely we train models on the augmented versions of training dataset and test them on
two different databases. The combination of these techniques allows to reach average accuracy values of
the order of 85% for the InceptionResNetV2 model.
Handwriting identification using deep convolutional neural network methodTELKOMNIKA JOURNAL
Handwriting is a unique thing that produced differently for each person. Handwriting has a characteristic that remain the same with single writer, so a handwriting can be used as a variable in biometric systems. Each person have a different form of handwriting style but with a small possibility that same characters have something commons. We propose a handwriting identification method using sentence segmented handwriting forms. Sentence form is used to get more complete handwriting characteristics than using a single characters or words. Dataset used is divided into three categories of images, binary, grayscale, and inverted binary. All datasets have same image with different in color and consist of 100 class. Transfer learning used in this paper are pre-trained model VGG19. Training was conducted in 100 epochs. Highest result is grayscale images with genuince acceptance rate of 92.3% and equal error rate of 7.7%.
CNN FEATURES ARE ALSO GREAT AT UNSUPERVISED CLASSIFICATION cscpconf
This paper aims at providing insight on the transferability of deep CNN features to
unsupervised problems. We study the impact of different pretrained CNN feature extractors on
the problem of image set clustering for object classification as well as fine-grained
classification. We propose a rather straightforward pipeline combining deep-feature extraction
using a CNN pretrained on ImageNet and a classic clustering algorithm to classify sets of
images. This approach is compared to state-of-the-art algorithms in image-clustering and
provides better results. These results strengthen the belief that supervised training of deep CNN
on large datasets, with a large variability of classes, extracts better features than most carefully
designed engineering approaches, even for unsupervised tasks. We also validate our approach
on a robotic application, consisting in sorting and storing objects smartly based on clustering
One-shot learning is an object categorization problem in computer vision. Whereas most machine learning based object categorization algorithms require training on hundreds or thousands of images and very large datasets, one-shot learning aims to learn information about object categories from one, or only a few, training images
Performance Comparison between Pytorch and Mindsporeijdms
Deep learning has been well used in many fields. However, there is a large amount of data when training neural networks, which makes many deep learning frameworks appear to serve deep learning practitioners, providing services that are more convenient to use and perform better. MindSpore and PyTorch are both deep learning frameworks. MindSpore is owned by HUAWEI, while PyTorch is owned by Facebook. Some people think that HUAWEI's MindSpore has better performance than FaceBook's PyTorch, which makes deep learning practitioners confused about the choice between the two. In this paper, we perform analytical and experimental analysis to reveal the comparison of training speed of MIndSpore and PyTorch on a single GPU. To ensure that our survey is as comprehensive as possible, we carefully selected neural networks in 2 main domains, which cover computer vision and natural language processing (NLP). The contribution of this work is twofold. First, we conduct detailed benchmarking experiments on MindSpore and PyTorch to analyze the reasons for their performance differences. This work provides guidance for end users to choose between these two frameworks.
Image Captioning Generator using Deep Machine Learningijtsrd
Technologys scope has evolved into one of the most powerful tools for human development in a variety of fields.AI and machine learning have become one of the most powerful tools for completing tasks quickly and accurately without the need for human intervention. This project demonstrates how deep machine learning can be used to create a caption or a sentence for a given picture. This can be used for visually impaired persons, as well as automobiles for self identification, and for various applications to verify quickly and easily. The Convolutional Neural Network CNN is used to describe the alphabet, and the Long Short Term Memory LSTM is used to organize the right meaningful sentences in this model. The flicker 8k and flicker 30k datasets were used to train this. Sreejith S P | Vijayakumar A "Image Captioning Generator using Deep Machine Learning" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-5 | Issue-4 , June 2021, URL: https://www.ijtsrd.compapers/ijtsrd42344.pdf Paper URL: https://www.ijtsrd.comcomputer-science/artificial-intelligence/42344/image-captioning-generator-using-deep-machine-learning/sreejith-s-p
Fast and robust tracking of multiple faces is receiving increased attention from computer vision researchers as it finds potential applications in many fields like video surveillance and computer mediated video conferencing. Real-time tracking of multiple faces in high resolution videos involve three basic tasks namely initialization, tracking and display. Among these, tracking is quite compute intensive as it involves particle filtering that won’t yield a real time performance if we use a conventional CPU based system alone.
This paper presents a study of the efficiency and performance speedup achieved by applying Graphics Processing Units for Face Recognition Solutions. We explore one of the possibilities of parallelizing and optimizing a well-known Face Recognition algorithm, Principal Component Analysis (PCA) with Eigenfaces. In recent years, the Graphics Processing Units (GPU) has been the subject of extensive research and the computation speed of GPUs has been rapidly increasing.
DATA AUGMENTATION TECHNIQUES AND TRANSFER LEARNING APPROACHES APPLIED TO FACI...ijaia
The face expression is the first thing we pay attention to when we want to understand a person’s state of
mind. Thus, the ability to recognize facial expressions in an automatic way is a very interesting research
field. In this paper, because the small size of available training datasets, we propose a novel data
augmentation technique that improves the performances in the recognition task. We apply geometrical
transformations and build from scratch GAN models able to generate new synthetic images for each
emotion type. Thus, on the augmented datasets we fine tune pretrained convolutional neural networks with
different architectures. To measure the generalization ability of the models, we apply extra-database
protocol approach, namely we train models on the augmented versions of training dataset and test them on
two different databases. The combination of these techniques allows to reach average accuracy values of
the order of 85% for the InceptionResNetV2 model.
Handwriting identification using deep convolutional neural network methodTELKOMNIKA JOURNAL
Handwriting is a unique thing that produced differently for each person. Handwriting has a characteristic that remain the same with single writer, so a handwriting can be used as a variable in biometric systems. Each person have a different form of handwriting style but with a small possibility that same characters have something commons. We propose a handwriting identification method using sentence segmented handwriting forms. Sentence form is used to get more complete handwriting characteristics than using a single characters or words. Dataset used is divided into three categories of images, binary, grayscale, and inverted binary. All datasets have same image with different in color and consist of 100 class. Transfer learning used in this paper are pre-trained model VGG19. Training was conducted in 100 epochs. Highest result is grayscale images with genuince acceptance rate of 92.3% and equal error rate of 7.7%.
CNN FEATURES ARE ALSO GREAT AT UNSUPERVISED CLASSIFICATION cscpconf
This paper aims at providing insight on the transferability of deep CNN features to
unsupervised problems. We study the impact of different pretrained CNN feature extractors on
the problem of image set clustering for object classification as well as fine-grained
classification. We propose a rather straightforward pipeline combining deep-feature extraction
using a CNN pretrained on ImageNet and a classic clustering algorithm to classify sets of
images. This approach is compared to state-of-the-art algorithms in image-clustering and
provides better results. These results strengthen the belief that supervised training of deep CNN
on large datasets, with a large variability of classes, extracts better features than most carefully
designed engineering approaches, even for unsupervised tasks. We also validate our approach
on a robotic application, consisting in sorting and storing objects smartly based on clustering
Project t Proposal Bangla alphabet handwritten recognition using deep learnin...KhondokerAbuNaim
Bangla alphabet handwritten recognition using deep learning is a process of automatically identifying and classifying handwritten Bangla characters using machine learning techniques. This involves training a deep learning model on a large dataset of handwritten Bangla characters to learn the features that distinguish each character from one another. The trained model can then be used to recognize new handwritten Bangla characters and convert them into digital text.
Francesca Gottschalk - How can education support child empowerment.pptxEduSkills OECD
Francesca Gottschalk from the OECD’s Centre for Educational Research and Innovation presents at the Ask an Expert Webinar: How can education support child empowerment?
Acetabularia Information For Class 9 .docxvaibhavrinwa19
Acetabularia acetabulum is a single-celled green alga that in its vegetative state is morphologically differentiated into a basal rhizoid and an axially elongated stalk, which bears whorls of branching hairs. The single diploid nucleus resides in the rhizoid.
A Strategic Approach: GenAI in EducationPeter Windle
Artificial Intelligence (AI) technologies such as Generative AI, Image Generators and Large Language Models have had a dramatic impact on teaching, learning and assessment over the past 18 months. The most immediate threat AI posed was to Academic Integrity with Higher Education Institutes (HEIs) focusing their efforts on combating the use of GenAI in assessment. Guidelines were developed for staff and students, policies put in place too. Innovative educators have forged paths in the use of Generative AI for teaching, learning and assessments leading to pockets of transformation springing up across HEIs, often with little or no top-down guidance, support or direction.
This Gasta posits a strategic approach to integrating AI into HEIs to prepare staff, students and the curriculum for an evolving world and workplace. We will highlight the advantages of working with these technologies beyond the realm of teaching, learning and assessment by considering prompt engineering skills, industry impact, curriculum changes, and the need for staff upskilling. In contrast, not engaging strategically with Generative AI poses risks, including falling behind peers, missed opportunities and failing to ensure our graduates remain employable. The rapid evolution of AI technologies necessitates a proactive and strategic approach if we are to remain relevant.
Operation “Blue Star” is the only event in the history of Independent India where the state went into war with its own people. Even after about 40 years it is not clear if it was culmination of states anger over people of the region, a political game of power or start of dictatorial chapter in the democratic setup.
The people of Punjab felt alienated from main stream due to denial of their just demands during a long democratic struggle since independence. As it happen all over the word, it led to militant struggle with great loss of lives of military, police and civilian personnel. Killing of Indira Gandhi and massacre of innocent Sikhs in Delhi and other India cities was also associated with this movement.
Biological screening of herbal drugs: Introduction and Need for
Phyto-Pharmacological Screening, New Strategies for evaluating
Natural Products, In vitro evaluation techniques for Antioxidants, Antimicrobial and Anticancer drugs. In vivo evaluation techniques
for Anti-inflammatory, Antiulcer, Anticancer, Wound healing, Antidiabetic, Hepatoprotective, Cardio protective, Diuretics and
Antifertility, Toxicity studies as per OECD guidelines
Introduction to AI for Nonprofits with Tapp NetworkTechSoup
Dive into the world of AI! Experts Jon Hill and Tareq Monaur will guide you through AI's role in enhancing nonprofit websites and basic marketing strategies, making it easy to understand and apply.
Instructions for Submissions thorugh G- Classroom.pptxJheel Barad
This presentation provides a briefing on how to upload submissions and documents in Google Classroom. It was prepared as part of an orientation for new Sainik School in-service teacher trainees. As a training officer, my goal is to ensure that you are comfortable and proficient with this essential tool for managing assignments and fostering student engagement.
The Roman Empire A Historical Colossus.pdfkaushalkr1407
The Roman Empire, a vast and enduring power, stands as one of history's most remarkable civilizations, leaving an indelible imprint on the world. It emerged from the Roman Republic, transitioning into an imperial powerhouse under the leadership of Augustus Caesar in 27 BCE. This transformation marked the beginning of an era defined by unprecedented territorial expansion, architectural marvels, and profound cultural influence.
The empire's roots lie in the city of Rome, founded, according to legend, by Romulus in 753 BCE. Over centuries, Rome evolved from a small settlement to a formidable republic, characterized by a complex political system with elected officials and checks on power. However, internal strife, class conflicts, and military ambitions paved the way for the end of the Republic. Julius Caesar’s dictatorship and subsequent assassination in 44 BCE created a power vacuum, leading to a civil war. Octavian, later Augustus, emerged victorious, heralding the Roman Empire’s birth.
Under Augustus, the empire experienced the Pax Romana, a 200-year period of relative peace and stability. Augustus reformed the military, established efficient administrative systems, and initiated grand construction projects. The empire's borders expanded, encompassing territories from Britain to Egypt and from Spain to the Euphrates. Roman legions, renowned for their discipline and engineering prowess, secured and maintained these vast territories, building roads, fortifications, and cities that facilitated control and integration.
The Roman Empire’s society was hierarchical, with a rigid class system. At the top were the patricians, wealthy elites who held significant political power. Below them were the plebeians, free citizens with limited political influence, and the vast numbers of slaves who formed the backbone of the economy. The family unit was central, governed by the paterfamilias, the male head who held absolute authority.
Culturally, the Romans were eclectic, absorbing and adapting elements from the civilizations they encountered, particularly the Greeks. Roman art, literature, and philosophy reflected this synthesis, creating a rich cultural tapestry. Latin, the Roman language, became the lingua franca of the Western world, influencing numerous modern languages.
Roman architecture and engineering achievements were monumental. They perfected the arch, vault, and dome, constructing enduring structures like the Colosseum, Pantheon, and aqueducts. These engineering marvels not only showcased Roman ingenuity but also served practical purposes, from public entertainment to water supply.
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdfTechSoup
In this webinar you will learn how your organization can access TechSoup's wide variety of product discount and donation programs. From hardware to software, we'll give you a tour of the tools available to help your nonprofit with productivity, collaboration, financial management, donor tracking, security, and more.
Palestine last event orientationfvgnh .pptxRaedMohamed3
An EFL lesson about the current events in Palestine. It is intended to be for intermediate students who wish to increase their listening skills through a short lesson in power point.
Unit 8 - Information and Communication Technology (Paper I).pdfThiyagu K
This slides describes the basic concepts of ICT, basics of Email, Emerging Technology and Digital Initiatives in Education. This presentations aligns with the UGC Paper I syllabus.
Unit 8 - Information and Communication Technology (Paper I).pdf
Bangla Handwritten Digit Recognition Report.pdf
1. Bangla Handwritten Digit Recognition Using Deep Learning.
Khondoker Abu Naim (200101103) and Abu Rayhan Mouno (180201118)
Course Code: CSE 4132 Course Title: Artificial Neural Networks and Fuzzy Systems Sessional
Semester: Winter 2023
*Department of Computer Science and Engineering, Bangladesh Army University of Science and Technology (BAUST)
Abstract
Due to the wide range of shapes, sizes, and writing styles, handwritten digit recognition has
always been difficult. For all of the recognition algorithms built in this thesis, the rewarded
method is used. Bangla-digit is the largest dataset. It is a collection for Bengali handwritten
digits. This is a massive dataset with over 3000 images in it. This dataset, however, is incredibly
difficult to work with due to its complexity. The aim of this project is to preprocess images that
can be used to train deep learning models with high accuracy. In this project, various
preprocessing techniques are developed for image processing, with a deep convolutional neural
network (CNN) functioning as the classification algorithm. On the bangla-digit image dataset,
the performance is systematically evaluated of this process. Finally, 93% accuracy is obtained
for bangla-digit dataset.
1. Introduction
Bangla handwritten digit recognition is a classical problem in the field of computer vision.
There are various kinds of practical application of this system such as OCR, postal code
recognition, license plate recognition, bank checks recognition etc. Recognizing Bangla digit
from documents is becoming more important. The unique number of Bangla digits are total
10. So the recognition task is to classify 10 different classes. The critical task of handwritten
digit recognition is recognizing unique handwritten digits. Because every human has his own
writing styles. But our contribution is for the more challenging task. The challenging task is
about getting robust performance and high accuracy for large, unbiased, unprocessed, and
highly augmented “bangla-digit” dataset. The dataset is a combination of ten class datasets
that were gathered from different sources and at different times containing blurring, noise,
rotation, translation, shear, zooming, height/width shift,brightness, contrast, occlusions, and
superimposition. We have not processed all kinds of augmentation of this dataset. We have
processed blur and noisy images mainly. Then our processed image are classified by a deep
convolutional neural network (CNN).
2. Literature Review
Bangla digits were also identified using the local binary pattern. The convolutional neural
network, deep learning approach was used to improve supervised learning and performance.
Handwritten digit recognition was proposed by EMNIST for both balanced and imbalanced
datasets.Researchers used a local binary pattern to identify Bangla digits, achieving more
accuracy. As a result of the recent success of deep learning, especially convolutional neural
network (CNN) for computer vision, many researchers have been encouraged to use the CNN
to recognize handwritten characters and digits as a computer vision challenge.Proposed
handwritten Bangla digit recognition using classification combinations by Dempster–Shafer
(DS) technique.
2. They used the DS technique and MLP classifier for classification on the training dataset of
6000 handwritten samples, as well as threefold cross-validation.
Use this concept in their genetic algorithm-based handwritten digit recognition technique.
Islam and his colleagues suggested Bayanno-Net, in which they used CNN to understand
Bangla handwritten numerals offered a comparative analysis of existing classification
algorithms for recognizing Bengali handwritten numerals. A model based on handwritten
digits was proposed by Shawon et al. An efficient automated design to generate UML
diagram from natural language specifications by Gulia et al; the main aim of their paper is to
focus on the production of activity diagram and sequence diagram through natural language
specifications and on the other hand by applying a conceptual architecture for creating a
human computing environment involving the applicability of deep learning algorithms.
Intelligent human computing: A Deep Learning-Based Approach by the paper.A Bangla
document classification using deep recurrent neural network with BiLSTM by Saifur Rahman
and Partha Chakraborty et al. The collected data were processed for the Bangla text document
and designing the model architecture training data; it was fitted out into the model. A Bangla
documents classification using transformer-based deep learning models by Md Mahbubur
Rahman, Md. They have applied the BERT and ELECTRA models for Bangla text
classification. The main goal of their paper is to identify fake profiles and eliminate the fake
accounts on online social networks.
3. Dataset Description
For recognition and generation, “bangla-digit” dataset has been used which contains 2746
handwritten Bangla digits. This is one of the biggest datasets of handwritten Bangla digits.
This dataset is divided into 10 subfolder.These 2746 data is collected from 30 persons then
use image augmentation. The image are gray-scale and the dimension is 124∗ 124.The
sample images of the “Bangla–diti” dataset are shown in Fig-1.
Fig1:Sample data Image from bangla-digit dataset.
Class Number of Sample
0 274
1 276
2 276
3 276
4 276
5 275
6 275
7 276
8 275
3. 9 267
Table-1: Summarizes the Dataset.
4. Methodology
The experimentation process was conducted using two cloud-based platforms, namely
Google Colab and Kaggle. To run the experiments, we set up the required software packages
and libraries in both environments, including TensorFlow, Keras, and NumPy. We used
TensorFlow version 2.4.0 and Keras version 2.4.3, which were the latest stable versions at the
time of the experiments.
Three main steps are applied to recognize handwritten digits. Among them, feature extraction
step and classification step are combined in one step. So the number of main steps of our digit
recognition is two. One is preprocessing of images,and another is deep CNN model.
Here we include models EfficientNetB0, MobileNet. These models were chosen based on
their proven performance in image classification tasks.
These steps are shown in Fig.
Fig-2: Block diagram of proposed Bangla digit recognition
Fig-2: Propose Model
Baseline model: A baseline model is a simple model that is used as a starting point for
developing more complex models. It is typically the simplest model that can be built for a
given problem, and it serves as a reference point against which other models can be
Input Image
Image Preprocessing
Feature Extration
Image Classification
Detected Digit
4. compared. The purpose of a baseline model is to establish a minimum level of performance
that must be exceeded by more complex models in order to justify their additional
complexity.
For example, in machine learning, a baseline model might be a simple linear regression
model that uses only one or two features to make predictions. This model can be used to
establish a baseline level of prediction accuracy, and more complex models can be developed
and compared to this baseline to determine if they are worth the additional computational cost
and complexity.
Transfer learning: Transfer learning was used to train the pre-trained models on the facial
emotion recognition dataset. In this approach, the pre-trained models were used as feature
extractors, and the final layer(s) of the models were replaced with a new layer(s) that was
trained on the facial emotion recognition dataset.
Transfer learning: Transfer learning was used to train the pre-trained models on the facial
emotion recognition dataset. In this approach, the pre-trained models were used as feature
extractors, and the final layer(s) of the models were replaced with a new layer(s) that was
trained on the facial emotion recognition dataset.
Dataset preprocessing: The collected dataset was preprocessed by resizing the images to a
standard size of 124x124 pixels and then converting them to JPEG format. First of all we
collect the image then we crop the image.
Data Augmentation: Data augmentation was performed using the ImageDataGenerator class
from the keras.preprocessing.image module. The following transformations were applied to
the images:
Rotation: Randomly rotate the image by a certain angle in the range 10.
Width and Height Shift: Randomly shift the image horizontally and vertically by a
fraction of the image size in the range 0.1, 0.2.
Zoom: Randomly zoom into the image by a factor in the range 0.0.
Hyperparameters: Several hyperparameters have been used for model training and
optimization, such as:
Batch Size: This hyperparameter determines the number of samples that are processed
in a single batch. A batch size of 32 has been used.
Size: This hyperparameter defines the size of the input image after resizing. In the
code, the size has been set to 124x124 pixels.
Epochs: This hyperparameter defines the number of times the entire dataset is passed
through the model during training. The number of epochs has been set to 16 and it
varies while fine tuning.
Learning Rate: This hyperparameter determines the step size at which the optimizer
adjusts the model parameters during training. Adam optimizer with a learning rate of
0.001 has been used and also varies while training for different models of different
variations.
Optimizer: This hyperparameter defines the optimization algorithm used to adjust the
weights of the model during training. Adam optimizer has been used.
5. 5. Result and Analysis
The train and test split ratio for our dataset is 10%. For every training dataset accuracy is
calculated. Here, train and test split ratio is 10%. Three outcomes are obtained after 16
epochs for own dataset. Results are given in Table-2. It also tests the processing time of
training. Eight seconds are required per epoch.
Table-2: Result comparison in different mode.
Fig-3: Model Comparison
EfficientNetB0: The EfficientNetB0 model achieved the highest accuracy in transfer
learning, with a score of 95%. The baseline model achieved 32% accuracy. which is
relatively low. However, fine-tuning improved the accuracy to 99.4%, which is quite good
MobileNet:The mobileNet model did not perform as well as the EffNetB0 model. Transfer
learning achieved a moderate accuracy of 92%. However, the baseline model achieved 9%
accuracy, which is relatively low. However, fine-tuning improved the accuracy to 94%.
31
9
99.4
94
95 92
0
20
40
60
80
100
120
EffNetB0 MobileNet
Result Analysis of Model Comparison
Baseline Fine-tuning Transfer learning
6. Fig-4: Graph of model accuracy and loss
6. Stakeholders
The stakeholders for Bangla handwriting recognition for a report could include:
Researchers and developers working in the field of Bangla handwriting recognition
technology.
Users of the technology, such as businesses or organizations that utilize Bangla
handwriting recognition for data entry or analysis.
Government bodies and policy-makers who are interested in promoting the
development and use of Bangla handwriting recognition technology.
Educational institutions and instructors who teach Bangla language and writing, as
handwriting recognition technology could be used in language learning.
Bangla language and culture advocates, who may be interested in preserving and
promoting the use of the Bangla language in the digital age.
7. Issues Encountered
Large image size: The size (in terms of space) of the images used in the dataset was
too large, which made it difficult to work with. As a result, preprocessing techniques
had to be used to reduce the image size to a more manageable level.
Face data augmentation problem.
Variability in handwriting: Bengali script has many complex characters, and each
character can be written in several different ways. This variability makes it difficult to
develop a reliable recognition system that can accurately recognize all variations of
each character.
8. Conclusion, Limitations and Future Recommendations
Despite having lots of variation in the test set our ensemble of residual networks based
approach with Xception architecture performed really well even though the final model has
7. comparatively lower number of parameters and the models were trained with limited
resources.The impacts of using residual network and batch-normalization were prominent to
improve the overall performance of the classifier model. The number of parameters can be
further reduced with more optimized set of parameter selection while introducing more
augmentation can improve overall performance of the model further.
Limitations:
i. Sample size: The sample size used in this research was limited, which may have
affected the statistical power and generalizability of the findings.
ii. Participant Expression Accuracy: The study was limited by the participants' inability
to accurately convey their facial expressions, which may have impacted the reliability
and validity of the data collected.
iii. Data quality: The quality of the data used in this research was dependent on the
accuracy and completeness of the responses provided by the participants, which may
have been affected by recall bias or response.
.
Here are some potential future recommendations:
i. Explore additional pre-trained models: While the current implementation uses a
variety of pre-trained models, there are many other options available. Exploring
additional models could help improve the accuracy of the facial emotion recognition
system.
ii. Improve dataset quality: Collecting a high-quality dataset is crucial for training
accurate machine learning models. To improve the performance of the current
system, more high-quality data could be collected, particularly for underrepresented
classes
References
[1] Chakraborty, Partha, Syeda Surma Jahanapi, and Tanupriya Choudhury. "Bangla
Handwritten Digit Recognition." Cyber Intelligence and Information Retrieval: Proceedings
of CIIR 2021. Springer Singapore, 2022.
[2] Hossain, M. Zahid, M. Ashraful Amin, and Hong Yan. "Rapid feature extraction for
Bangla handwritten digit recognition." 2011 International Conference on Machine Learning
and Cybernetics. Vol. 4. IEEE, 2011.
[3] Fahim Sikder, Md. "Bangla handwritten digit recognition and generation." Proceedings of
International Joint Conference on Computational Intelligence: IJCCI 2018. Springer
Singapore, 2020.
[4] Hoq, Md Nazmul, et al. "A comparative overview of classification algorithm for Bangla
handwritten digit recognition." Proceedings of International Joint Conference on
Computational Intelligence: IJCCI 2018. Springer Singapore, 2020.
[5] Rabby, AKM Shahariar Azad, et al. "Bangla handwritten digit recognition using
convolutional neural network." Emerging Technologies in Data Mining and Information
Security: Proceedings of IEMIS 2018, Volume 1. Springer Singapore, 2019.
8. Appendix
Attainment of Complex Engineering Problem (CP)
S.L. CP No. Attainment Remarks
1. P1: Depth of
Knowledge Required
K3 (Engineering Fundamentals):
K4 (Engineering Specialization):
K5 (Design):
K6 (Technology):
K8 (Research):
2. P2: Range of
Conflicting
Requirements
3. P3: Depth of Analysis
Required
4. P4: Familiarity of
Issues
5. P5: Extent of
Applicable Codes
6. P6: Extent of
Stakeholder
Involvement and
Conflicting
Requirements
7. P7: Interdependence
Mapping of Complex Engineering Activities (CA)
S.L. CA No. Attainment Remarks
1. A1: Range of
resources
2. A2: Level of
interaction
3. A3: Innovation
4. A4: Consequences for
Society and the
Environment
5. A5: Familiarity
Code:
https://www.kaggle.com/code/khondokerabunaim/mobilenetv2transfer