Automatic License Plate Recognition systems aim to provide a solution for detecting, localizing, and recognizing license plate characters from vehicles appearing in video frames. However, deploying such systems in the real world requires real-time performance in low-resource environments. In our paper, we propose a two-stage detection pipeline paired with Vision API that provides real-time inference speed along with consistently accurate detection and recognition performance. We used a haar-cascade classifier as a filter on top of our backbone MobileNet SSDv2 detection model. This reduces inference time by only focusing on high confidence detections and using them for recognition. We also impose a temporal frame separation strategy to distinguish between multiple vehicle license plates in the same clip. Furthermore, there are no publicly available Bangla license plate datasets, for which we created an image dataset and a video dataset containing license plates in the wild. We trained our models on the image dataset and achieved an AP (0.5) score of 86% and tested our pipeline on the video dataset and observed reasonable detection and recognition performance (82.7% detection rate, and 60.8% OCR F1 score) with real-time processing speed (27.2 frames per second).
We develop custom Image Recognition systems for Aerospace and defence applications. Using algorithms like Deep Convolutional Neural Networks and Regional Convolutional Neural Networks.
Our algorithms for Target Recognition and Tracking are designed from the beginning to be run on embedded systems. We target both GPU and FPGA devices.
To Train and Validate our algorithms we developed a process to generate photorealistic 3D environments.
Those 3D Environments are used to produce realistic video streams of the targets in different environmental conditions (lighting, adverse meteorological conditions, camouflage, point-of-view).
The same technology can be used to Train and Test Automotive Vision Systems.
We develop custom Image Recognition systems for Aerospace and defence applications. Using algorithms like Deep Convolutional Neural Networks and Regional Convolutional Neural Networks.
Our algorithms for Target Recognition and Tracking are designed from the beginning to be run on embedded systems. We target both GPU and FPGA devices.
To Train and Validate our algorithms we developed a process to generate photorealistic 3D environments.
Those 3D Environments are used to produce realistic video streams of the targets in different environmental conditions (lighting, adverse meteorological conditions, camouflage, point-of-view).
The same technology can be used to Train and Test Automotive Vision Systems.
For the full video of this presentation, please visit: https://www.edge-ai-vision.com/2022/06/tools-for-creating-next-gen-computer-vision-apps-on-snapdragon-a-presentation-from-qualcomm/
Judd Heape, Vice President of Product Management for Camera, Computer Vision and Video Technology at Qualcomm, presents the “Tools for Creating Next-Gen Computer Vision Apps on Snapdragon” tutorial at the May 2022 Embedded Vision Summit.
The Snapdragon Mobile Platform powers the world’s best smartphones, XR headsets, PCs, wearables, cars and IoT products. Thanks to Snapdragon, these products feature powerful computer vision technologies that you can tap into to build next-gen apps. Inside Snapdragon is a hardware engine dedicated to computer vision–the Engine for Visual Analytics (EVA). EVA hardware acceleration gives developers access to high-performance, low-power computer vision functions to enhance apps that rely on advanced camera or video processing.
The EVA includes a motion processing unit, a feature descriptor unit, a depth estimation unit, a geometric correction unit and an object detection unit. These blocks power high-level functions such as electronic image stabilization, multi-frame HDR, face detection and real-time bokeh. In this presentation, Heape does a deep-dive into EVA’s Software Developer Kit (SDK) and available APIs, such as Optical Flow and Depth from Stereo, and explores how these features can be integrated into your apps.
These are PPT slides presented for the announcement of the result of xDR Challenge 2018. This presentation was given at the special session "A survey on Indoor Localization Competitions" in IPIN 2018.
These are PPT slides presented for the announcement of the result of xDR Challenge 2018. This presentation was given at the special session "A survey on Indoor Localization Competitions" in IPIN 2018.
End-to-End Natural Language Understanding Pipeline for Bangla Conversational ...MD Abdullah Al Nasim
Chatbots are intelligent software built to be used as a replacement for human interaction. Existing studies typically do not provide enough support for low-resource languages like Bangla. Due to the increasing popularity of social media, we can also see the rise of interactions in Bangla transliteration (mostly in English) among the native Bangla speakers. In this paper, we propose a novel approach to build a Bangla chatbot aimed to be used as a business assistant which can communicate in low-resource langauage like Bangla and Bangla Transliteration in English with high confidence consistently. Since annotated data was not available for this purpose, we had to work on the whole machine learning life cycle (data preparation, machine learning modeling, and model deployment) using Rasa Open Source Framework, fastText embeddings, Polyglot embeddings, Flask, and other systems as building blocks. While working with the skewed annotated dataset, we try out different components and pipelines to evaluate which works best and provide possible reasoning behind the observed results. Finally, we present a pipeline for intent classification and entity extraction which achieves reasonable performance (accuracy: 83.02%, precision: 80.82%, recall: 83.02%, F1-score: 80%).
A Soft Computing Based Customer Lifetime Value Classifier for Digital Retail ...MD Abdullah Al Nasim
This era of Big Data and digital marketing has created varied scopes and alternatives to spend investments, in order to maximize profit in business. However, identifying the right target group (TG) to invest the right resources emerges as a critical problem, for both acquisition and retention. Customer lifetime value (CLV) is a metric that may aid in such decisions, the calculation of which seems quite generic in traditional literature. This paper recognizes the potential of a large, user-specific transaction dataset from a retail business operating online and proposes an AI-powered CLV classifier. For this, we exploratorily mine customers’ buying tendencies from their geographic, monetary, chronological, categorical data using PostgreSQL and then, use those as input features to a TensorFlow-coded neural network (NN) in order to predict their profitability in different ranges. We further validate the features using statistical inference, the optimized feature set resulting from which is again passed through another NN, providing more crisp metrics in lesser computation. The training data is prepared by iterative thresholding on customers’ number of days ordered. An SQL-backed recommender, working on maximum buyers’ tendencies, is also proposed to cross-sell and up-sell customers in order to maximize the said CLV. The research indicates ways to filter out factors influencing CLV and endorses the applicability of modern soft computing in devising business-specific solutions by exploiting commonly available users’ databases.
More Related Content
Similar to Real-time Bangla License Plate Recognition System for Low Resource Video-based Applications
For the full video of this presentation, please visit: https://www.edge-ai-vision.com/2022/06/tools-for-creating-next-gen-computer-vision-apps-on-snapdragon-a-presentation-from-qualcomm/
Judd Heape, Vice President of Product Management for Camera, Computer Vision and Video Technology at Qualcomm, presents the “Tools for Creating Next-Gen Computer Vision Apps on Snapdragon” tutorial at the May 2022 Embedded Vision Summit.
The Snapdragon Mobile Platform powers the world’s best smartphones, XR headsets, PCs, wearables, cars and IoT products. Thanks to Snapdragon, these products feature powerful computer vision technologies that you can tap into to build next-gen apps. Inside Snapdragon is a hardware engine dedicated to computer vision–the Engine for Visual Analytics (EVA). EVA hardware acceleration gives developers access to high-performance, low-power computer vision functions to enhance apps that rely on advanced camera or video processing.
The EVA includes a motion processing unit, a feature descriptor unit, a depth estimation unit, a geometric correction unit and an object detection unit. These blocks power high-level functions such as electronic image stabilization, multi-frame HDR, face detection and real-time bokeh. In this presentation, Heape does a deep-dive into EVA’s Software Developer Kit (SDK) and available APIs, such as Optical Flow and Depth from Stereo, and explores how these features can be integrated into your apps.
These are PPT slides presented for the announcement of the result of xDR Challenge 2018. This presentation was given at the special session "A survey on Indoor Localization Competitions" in IPIN 2018.
These are PPT slides presented for the announcement of the result of xDR Challenge 2018. This presentation was given at the special session "A survey on Indoor Localization Competitions" in IPIN 2018.
End-to-End Natural Language Understanding Pipeline for Bangla Conversational ...MD Abdullah Al Nasim
Chatbots are intelligent software built to be used as a replacement for human interaction. Existing studies typically do not provide enough support for low-resource languages like Bangla. Due to the increasing popularity of social media, we can also see the rise of interactions in Bangla transliteration (mostly in English) among the native Bangla speakers. In this paper, we propose a novel approach to build a Bangla chatbot aimed to be used as a business assistant which can communicate in low-resource langauage like Bangla and Bangla Transliteration in English with high confidence consistently. Since annotated data was not available for this purpose, we had to work on the whole machine learning life cycle (data preparation, machine learning modeling, and model deployment) using Rasa Open Source Framework, fastText embeddings, Polyglot embeddings, Flask, and other systems as building blocks. While working with the skewed annotated dataset, we try out different components and pipelines to evaluate which works best and provide possible reasoning behind the observed results. Finally, we present a pipeline for intent classification and entity extraction which achieves reasonable performance (accuracy: 83.02%, precision: 80.82%, recall: 83.02%, F1-score: 80%).
A Soft Computing Based Customer Lifetime Value Classifier for Digital Retail ...MD Abdullah Al Nasim
This era of Big Data and digital marketing has created varied scopes and alternatives to spend investments, in order to maximize profit in business. However, identifying the right target group (TG) to invest the right resources emerges as a critical problem, for both acquisition and retention. Customer lifetime value (CLV) is a metric that may aid in such decisions, the calculation of which seems quite generic in traditional literature. This paper recognizes the potential of a large, user-specific transaction dataset from a retail business operating online and proposes an AI-powered CLV classifier. For this, we exploratorily mine customers’ buying tendencies from their geographic, monetary, chronological, categorical data using PostgreSQL and then, use those as input features to a TensorFlow-coded neural network (NN) in order to predict their profitability in different ranges. We further validate the features using statistical inference, the optimized feature set resulting from which is again passed through another NN, providing more crisp metrics in lesser computation. The training data is prepared by iterative thresholding on customers’ number of days ordered. An SQL-backed recommender, working on maximum buyers’ tendencies, is also proposed to cross-sell and up-sell customers in order to maximize the said CLV. The research indicates ways to filter out factors influencing CLV and endorses the applicability of modern soft computing in devising business-specific solutions by exploiting commonly available users’ databases.
Brain Tumor Segmentation using Enhanced U-Net Model with Empirical AnalysisMD Abdullah Al Nasim
Cancer of the brain is deadly and requires careful surgical segmentation. The brain tumors were segmented using U-Net using a Convolutional Neural Network (CNN). When looking for overlaps of necrotic, edematous, growing, and healthy tissue, it might be hard to get relevant information from the images. The 2D U-Net network was improved and trained with the BraTS datasets to find these four areas. U-Net can set up many encoder and decoder routes that can be used to get information from images that can be used in different ways. To reduce computational time, we use image segmentation to exclude insignificant background details. Experiments on the BraTS datasets show that our proposed model for segmenting brain tumors from MRI (MRI) works well. In this study, we demonstrate that the BraTS datasets for 2017, 2018, 2019, and 2020 do not significantly differ from the BraTS 2019 dataset's attained dice scores of 0.8717 (necrotic), 0.9506 (edema), and 0.9427 (enhancing).
An automated approach for the recognition of bengali license plates presentationMD Abdullah Al Nasim
Automatic Number Plate Recognition (ALPR) is a system for automatically identifying the license plates of any vehicle. This process is important for tracking, ticketing, and any billing system, among other things. With the use of information and communication technology (ICT), all systems are being automated, including the vehicle tracking system. This study proposes a hybrid method for detecting license plates using characters from them. Our captured image information was used for the recognition procedure in Bangladeshi vehicles, which is the topic of this study. Here, for license plate detection, the YOLO model was used where 81\% was correctly predicted. And then, for license plate segmentation, Otsu's Thresholding was used and eventually, for character recognition, the CNN model was applied. This model will allow the vehicle's automated license plate detection system to avoid any misuse.
A data flow diagram (DFD) is a graphical representation of the flow of data through an information system, modeling its process aspects. DFD is often used as a preliminary step to create an overview of the system without going into great detail, which can later be elaborated. DFD can also be used for the visualization of data process.
An entity-relationship diagram (ERD) is a data modeling technique that graphically illustrates an information system’s entities and the relationships between those entities. An ERD is a conceptual and representational model of data used to represent the entity framework infrastructure. DBA and System Analyst used ERD to perform data modeling and explain the diagram to the stakeholders.
Performance analysis of automated brain tumor detection from MR imaging and CT scan using basic image processing techniques based on various hard and soft computing has been performed in our work. Moreover, we applied six traditional classifiers to detect brain tumor in the images. Then we applied CNN for brain tumor detection to include deep learning method in our work. We compared the result of the traditional one having the best accuracy (SVM) with the result of CNN. Furthermore, our work presents a generic method of tumor detection and extraction of its various features.
Nutraceutical market, scope and growth: Herbal drug technologyLokesh Patil
As consumer awareness of health and wellness rises, the nutraceutical market—which includes goods like functional meals, drinks, and dietary supplements that provide health advantages beyond basic nutrition—is growing significantly. As healthcare expenses rise, the population ages, and people want natural and preventative health solutions more and more, this industry is increasing quickly. Further driving market expansion are product formulation innovations and the use of cutting-edge technology for customized nutrition. With its worldwide reach, the nutraceutical industry is expected to keep growing and provide significant chances for research and investment in a number of categories, including vitamins, minerals, probiotics, and herbal supplements.
Seminar of U.V. Spectroscopy by SAMIR PANDASAMIR PANDA
Spectroscopy is a branch of science dealing the study of interaction of electromagnetic radiation with matter.
Ultraviolet-visible spectroscopy refers to absorption spectroscopy or reflect spectroscopy in the UV-VIS spectral region.
Ultraviolet-visible spectroscopy is an analytical method that can measure the amount of light received by the analyte.
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Sérgio Sacani
We characterize the earliest galaxy population in the JADES Origins Field (JOF), the deepest
imaging field observed with JWST. We make use of the ancillary Hubble optical images (5 filters
spanning 0.4−0.9µm) and novel JWST images with 14 filters spanning 0.8−5µm, including 7 mediumband filters, and reaching total exposure times of up to 46 hours per filter. We combine all our data
at > 2.3µm to construct an ultradeep image, reaching as deep as ≈ 31.4 AB mag in the stack and
30.3-31.0 AB mag (5σ, r = 0.1” circular aperture) in individual filters. We measure photometric
redshifts and use robust selection criteria to identify a sample of eight galaxy candidates at redshifts
z = 11.5 − 15. These objects show compact half-light radii of R1/2 ∼ 50 − 200pc, stellar masses of
M⋆ ∼ 107−108M⊙, and star-formation rates of SFR ∼ 0.1−1 M⊙ yr−1
. Our search finds no candidates
at 15 < z < 20, placing upper limits at these redshifts. We develop a forward modeling approach to
infer the properties of the evolving luminosity function without binning in redshift or luminosity that
marginalizes over the photometric redshift uncertainty of our candidate galaxies and incorporates the
impact of non-detections. We find a z = 12 luminosity function in good agreement with prior results,
and that the luminosity function normalization and UV luminosity density decline by a factor of ∼ 2.5
from z = 12 to z = 14. We discuss the possible implications of our results in the context of theoretical
models for evolution of the dark matter halo mass function.
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...Scintica Instrumentation
Intravital microscopy (IVM) is a powerful tool utilized to study cellular behavior over time and space in vivo. Much of our understanding of cell biology has been accomplished using various in vitro and ex vivo methods; however, these studies do not necessarily reflect the natural dynamics of biological processes. Unlike traditional cell culture or fixed tissue imaging, IVM allows for the ultra-fast high-resolution imaging of cellular processes over time and space and were studied in its natural environment. Real-time visualization of biological processes in the context of an intact organism helps maintain physiological relevance and provide insights into the progression of disease, response to treatments or developmental processes.
In this webinar we give an overview of advanced applications of the IVM system in preclinical research. IVIM technology is a provider of all-in-one intravital microscopy systems and solutions optimized for in vivo imaging of live animal models at sub-micron resolution. The system’s unique features and user-friendly software enables researchers to probe fast dynamic biological processes such as immune cell tracking, cell-cell interaction as well as vascularization and tumor metastasis with exceptional detail. This webinar will also give an overview of IVM being utilized in drug development, offering a view into the intricate interaction between drugs/nanoparticles and tissues in vivo and allows for the evaluation of therapeutic intervention in a variety of tissues and organs. This interdisciplinary collaboration continues to drive the advancements of novel therapeutic strategies.
Richard's aventures in two entangled wonderlandsRichard Gill
Since the loophole-free Bell experiments of 2020 and the Nobel prizes in physics of 2022, critics of Bell's work have retreated to the fortress of super-determinism. Now, super-determinism is a derogatory word - it just means "determinism". Palmer, Hance and Hossenfelder argue that quantum mechanics and determinism are not incompatible, using a sophisticated mathematical construction based on a subtle thinning of allowed states and measurements in quantum mechanics, such that what is left appears to make Bell's argument fail, without altering the empirical predictions of quantum mechanics. I think however that it is a smoke screen, and the slogan "lost in math" comes to my mind. I will discuss some other recent disproofs of Bell's theorem using the language of causality based on causal graphs. Causal thinking is also central to law and justice. I will mention surprising connections to my work on serial killer nurse cases, in particular the Dutch case of Lucia de Berk and the current UK case of Lucy Letby.
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...Wasswaderrick3
In this book, we use conservation of energy techniques on a fluid element to derive the Modified Bernoulli equation of flow with viscous or friction effects. We derive the general equation of flow/ velocity and then from this we derive the Pouiselle flow equation, the transition flow equation and the turbulent flow equation. In the situations where there are no viscous effects , the equation reduces to the Bernoulli equation. From experimental results, we are able to include other terms in the Bernoulli equation. We also look at cases where pressure gradients exist. We use the Modified Bernoulli equation to derive equations of flow rate for pipes of different cross sectional areas connected together. We also extend our techniques of energy conservation to a sphere falling in a viscous medium under the effect of gravity. We demonstrate Stokes equation of terminal velocity and turbulent flow equation. We look at a way of calculating the time taken for a body to fall in a viscous medium. We also look at the general equation of terminal velocity.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.Sérgio Sacani
The return of a sample of near-surface atmosphere from Mars would facilitate answers to several first-order science questions surrounding the formation and evolution of the planet. One of the important aspects of terrestrial planet formation in general is the role that primary atmospheres played in influencing the chemistry and structure of the planets and their antecedents. Studies of the martian atmosphere can be used to investigate the role of a primary atmosphere in its history. Atmosphere samples would also inform our understanding of the near-surface chemistry of the planet, and ultimately the prospects for life. High-precision isotopic analyses of constituent gases are needed to address these questions, requiring that the analyses are made on returned samples rather than in situ.
BLOOD AND BLOOD COMPONENT- introduction to blood physiology
Real-time Bangla License Plate Recognition System for Low Resource Video-based Applications
1. Real-time Bangla License Plate Recognition System
for Low Resource Video-based Applications
Alif Ashrafee
alifashrafee@iut-dhaka.edu
Islamic University of Technology
Akib Mohammed Khan
akibmohammed@iut-dhaka.edu
Islamic University of Technology
Mohammad Sabik Irbaz
sabikirbaz@iut-dhaka.edu
Pioneer Alpha Ltd.
MD Abdullah Al Nasim
nasim.abdullah@ieee.org
Pioneer Alpha Ltd.
1
2. Winter Conference on Applications
of Computer Vision 2022
Real-World Surveillance:
Applications and Challenges Workshop
(RWS @ WACV2022)
2
3. Introduction
● Automatic License Plate Recognition (ALPR) systems aim to detect, localize,
and recognize license plate characters from vehicles appearing in images or
video frames.
● There are no publicly available Bangla datasets for training such systems.
● Our main focus is to provide comprehensive image and video datasets of
Bangla license plates in-the-wild.
● We aim to provide a system for real time inference to recognizing license
plates from video frames in low resource environments.
3
5. Motivation
5
● Due to the large volume of vehicles in metropolitan areas of Bangladesh, it is difficult
to find a vacant parking slot due to the lack of optimized parking systems.
● There are no publicly available datasets for Bangla License plates to be trained or
evaluated on.
● No Bangla ALPR system provides real-time performance using low resource.
6. Problem
Statement
6
Given a video feed of cars and
motorcycles having Bangla license
plates, can we teach the machine to
identify and recognize license plates?
7. 7
Training for
object detection
models are
computationally
expensive
No benchmark
datasets are
available for
Bangla license
plate videos
Separating
temporally
distinct
instances of
vehicles from
video
Running an
object detection
model in CPU in
real-time
Research Challenges
1 2 3 4
9. Cascade Classifier [Viola et. al., 2001]
9
● Trained using positive and negative
examples.
● Consists of multiple stages, where each
stage is an ensemble of weak classifiers.
● Divide the image into sub-windows.
● The weak classifiers are used to discard
the negative subwindows as fast as
possible.
● Detect all the positive instances to
achieve low false-positive rates.
10. MobileNet SSDv2 [Chiu et. al., 2020]
10
● Single Shot object detection (SSD) model based on MobileNet-v2[1].
● Lightweight and can be applied with limited computational resources.
● Includes feature pyramid network
11. Two stage detection with Temporal
Redundancy [Laroca et. al., 2018]
11
● Detection in two stages, vehicle detection and then LP detection.
● Temporal redundancy information through the union of all frames
belonging to the same vehicle.
12. YOLO vs RCNN vs SSD [Adarsh et. al., 2020]
12
● SSD provides an optimal solution for both accuracy and speed.
● YOLO is fast but not very accurate, Faster RCNN is more accurate but slower.
14. License Plate Image Dataset
14
➔ We created our own dataset of 1000 images.
➔ With different distances, angles and lighting conditions around Dhaka City
captured using varying models of smartphones.
➔ Converted to 800 x 800 pixels.
➔ We manually annotated them by drawing bounding boxes around the license
plate regions for training our detection models.
16. License Plate Video Dataset
16
➔ We collected 79 video clips containing 98 license plates from different types
of vehicles using crowdsourcing.
➔ Each clip includes single or multiple vehicles from around different districts
of Bangladesh.
➔ Converted all of the videos to a resolution of 480 x 480 pixels.
➔ Each video contains an average of 254 frames and the frame rate of each
video was 24 frames per second (FPS).
➔ Videos were taken in a variety of locations, such as inside garages, car
parks, lanes, and highways.
17. License Plate Video Dataset (cont.)
➔ Since videos were taken from several different districts, there is a good
distribution of the authorized characters in our dataset.
➔ Videos include plate variations, artifacts on plates, occlusions, illumination
changes, and unclear characters.
➔ We annotated the content of each plate: the ground truth of each character
present in the plate, and the number of license plates present in each video.
17
23. Evaluation Metrics
23
Mean Average Precision (mAP) for Training
detection model
● mAP score is calculated by taking the mean
AP over all classes and/or overall IoU
thresholds.
● AP is the area under the precision-recall curve.
● Popular metric for evaluating detection
models.
24. 24
Evaluation Metrics (cont.)
Levenshtein Distance for Character Recognition [2]
● Measures the difference between two strings based on the number of
insertions, deletions, and substitutions that have to be performed on the
target string in order to match it with the reference string.
Cₑᵣᵣₒᵣ = Total character error
Cₐₙₙ = # characters in the aligned
sequence
CH
= # characters in hypothesis string
CR
= # characters in reference string
25. Experimental Setup
25
● We trained the MobileNet SSDv2 model using a Tesla T4 GPU on Google Colaboratory.
● For testing, all the experiments were conducted on a single thread of an Intel Core
i5-7200U mobile CPU with 8GB of RAM and video resized to 480*480 pixels.
Image size 300 * 300
Batch Size 24
IOU Threshold 0.35
Learning Rate 0.004
Epochs Run 50
26. Result Analysis (Quantitative)
26
Training Results
Model AP AP50 AP75 APs APm APl
MobileNet
SSDv2
0.47 0.86 0.47 0.13 0.46 0.55
AP: Average Precision AP50: AP at IOU thresh = 0.5 AP75: AP at IOU thresh = 0.75
APs: AP for small objects APm: AP for medium objects APl: AP for large objects
29. 29
29
License plates that are missed by Cascade + SSDv2
but successfully detected by standalone SSDv2
Favorable for Vision OCR Unfavorable for Vision OCR
30. 30
Standalone SSDv2 OCR Predictions
that are not favorable for Vision API
Cascade + SSDv2 OCR predictions
that are favorable for Vision API
31. Conclusion and Future Work
● We provide two Bangla license plate image and video datasets.
● We propose a robust system that can detect license plates from video
footage in low resource environment.
● Consists of a method for individually storing temporally separate instances
of different vehicles appearing in the same video.
● We plan to deploy our system in various real-life environments and verify its
day-to-day performance.
● We also plan to build our own Bangla license plate OCR system that can
improve the recognition performance.
31
32. References
[1] Sandler, Mark, et al. "Mobilenetv2: Inverted residuals and linear bottlenecks." Proceedings of
the IEEE conference on computer vision and pattern recognition. 2018.
[2] Karpinski, Romain, Devashish Lohani, and Abdel Belaid. "Metrics for Complete Evaluation of
OCR Performance." IPCV'18-The 22nd Int'l Conf on Image Processing, Computer Vision, & Pattern
Recognition. 2018.
32