SlideShare a Scribd company logo
1 of 33
Real-time Bangla License Plate Recognition System
for Low Resource Video-based Applications
Alif Ashrafee
alifashrafee@iut-dhaka.edu
Islamic University of Technology
Akib Mohammed Khan
akibmohammed@iut-dhaka.edu
Islamic University of Technology
Mohammad Sabik Irbaz
sabikirbaz@iut-dhaka.edu
Pioneer Alpha Ltd.
MD Abdullah Al Nasim
nasim.abdullah@ieee.org
Pioneer Alpha Ltd.
1
Winter Conference on Applications
of Computer Vision 2022
Real-World Surveillance:
Applications and Challenges Workshop
(RWS @ WACV2022)
2
Introduction
● Automatic License Plate Recognition (ALPR) systems aim to detect, localize,
and recognize license plate characters from vehicles appearing in images or
video frames.
● There are no publicly available Bangla datasets for training such systems.
● Our main focus is to provide comprehensive image and video datasets of
Bangla license plates in-the-wild.
● We aim to provide a system for real time inference to recognizing license
plates from video frames in low resource environments.
3
4
Video Frames
Cropped
License Plate
WB 06 F 5977
License Plate
Characters
Example of an ALPR system for video feed
Motivation
5
● Due to the large volume of vehicles in metropolitan areas of Bangladesh, it is difficult
to find a vacant parking slot due to the lack of optimized parking systems.
● There are no publicly available datasets for Bangla License plates to be trained or
evaluated on.
● No Bangla ALPR system provides real-time performance using low resource.
Problem
Statement
6
Given a video feed of cars and
motorcycles having Bangla license
plates, can we teach the machine to
identify and recognize license plates?
7
Training for
object detection
models are
computationally
expensive
No benchmark
datasets are
available for
Bangla license
plate videos
Separating
temporally
distinct
instances of
vehicles from
video
Running an
object detection
model in CPU in
real-time
Research Challenges
1 2 3 4
Related Works
8
Cascade Classifier [Viola et. al., 2001]
9
● Trained using positive and negative
examples.
● Consists of multiple stages, where each
stage is an ensemble of weak classifiers.
● Divide the image into sub-windows.
● The weak classifiers are used to discard
the negative subwindows as fast as
possible.
● Detect all the positive instances to
achieve low false-positive rates.
MobileNet SSDv2 [Chiu et. al., 2020]
10
● Single Shot object detection (SSD) model based on MobileNet-v2[1].
● Lightweight and can be applied with limited computational resources.
● Includes feature pyramid network
Two stage detection with Temporal
Redundancy [Laroca et. al., 2018]
11
● Detection in two stages, vehicle detection and then LP detection.
● Temporal redundancy information through the union of all frames
belonging to the same vehicle.
YOLO vs RCNN vs SSD [Adarsh et. al., 2020]
12
● SSD provides an optimal solution for both accuracy and speed.
● YOLO is fast but not very accurate, Faster RCNN is more accurate but slower.
Datasets
13
License Plate Image Dataset
14
➔ We created our own dataset of 1000 images.
➔ With different distances, angles and lighting conditions around Dhaka City
captured using varying models of smartphones.
➔ Converted to 800 x 800 pixels.
➔ We manually annotated them by drawing bounding boxes around the license
plate regions for training our detection models.
15
License Plate Video Dataset
16
➔ We collected 79 video clips containing 98 license plates from different types
of vehicles using crowdsourcing.
➔ Each clip includes single or multiple vehicles from around different districts
of Bangladesh.
➔ Converted all of the videos to a resolution of 480 x 480 pixels.
➔ Each video contains an average of 254 frames and the frame rate of each
video was 24 frames per second (FPS).
➔ Videos were taken in a variety of locations, such as inside garages, car
parks, lanes, and highways.
License Plate Video Dataset (cont.)
➔ Since videos were taken from several different districts, there is a good
distribution of the authorized characters in our dataset.
➔ Videos include plate variations, artifacts on plates, occlusions, illumination
changes, and unclear characters.
➔ We annotated the content of each plate: the ground truth of each character
present in the plate, and the number of license plates present in each video.
17
18
Proposed Method
19
20
Four Stage Pipeline
21
Overall Pipeline for Real-time interaction
Experimental Analysis
22
Evaluation Metrics
23
Mean Average Precision (mAP) for Training
detection model
● mAP score is calculated by taking the mean
AP over all classes and/or overall IoU
thresholds.
● AP is the area under the precision-recall curve.
● Popular metric for evaluating detection
models.
24
Evaluation Metrics (cont.)
Levenshtein Distance for Character Recognition [2]
● Measures the difference between two strings based on the number of
insertions, deletions, and substitutions that have to be performed on the
target string in order to match it with the reference string.
Cₑᵣᵣₒᵣ = Total character error
Cₐₙₙ = # characters in the aligned
sequence
CH
= # characters in hypothesis string
CR
= # characters in reference string
Experimental Setup
25
● We trained the MobileNet SSDv2 model using a Tesla T4 GPU on Google Colaboratory.
● For testing, all the experiments were conducted on a single thread of an Intel Core
i5-7200U mobile CPU with 8GB of RAM and video resized to 480*480 pixels.
Image size 300 * 300
Batch Size 24
IOU Threshold 0.35
Learning Rate 0.004
Epochs Run 50
Result Analysis (Quantitative)
26
Training Results
Model AP AP50 AP75 APs APm APl
MobileNet
SSDv2
0.47 0.86 0.47 0.13 0.46 0.55
AP: Average Precision AP50: AP at IOU thresh = 0.5 AP75: AP at IOU thresh = 0.75
APs: AP for small objects APm: AP for medium objects APl: AP for large objects
Result Analysis (Quantitative) (cont.)
Ablation Study
27
Pipeline Precision(%) Recall(%) F1 Score (%)
Detection
Rate(%)
FPS
YoloV3 Tiny 60.2 55.6 57.1 75.5 11.1
Cascade +
YoloV3 Tiny
56.3 51.6 53.1 69.4 27.1
SSDv2 66.1 58.4 60.5 98.0 17.7
Cascade +
SSDv2
63.6 59.3 60.8 82.7 27.2
Result Analysis (Qualitative)
28
29
29
License plates that are missed by Cascade + SSDv2
but successfully detected by standalone SSDv2
Favorable for Vision OCR Unfavorable for Vision OCR
30
Standalone SSDv2 OCR Predictions
that are not favorable for Vision API
Cascade + SSDv2 OCR predictions
that are favorable for Vision API
Conclusion and Future Work
● We provide two Bangla license plate image and video datasets.
● We propose a robust system that can detect license plates from video
footage in low resource environment.
● Consists of a method for individually storing temporally separate instances
of different vehicles appearing in the same video.
● We plan to deploy our system in various real-life environments and verify its
day-to-day performance.
● We also plan to build our own Bangla license plate OCR system that can
improve the recognition performance.
31
References
[1] Sandler, Mark, et al. "Mobilenetv2: Inverted residuals and linear bottlenecks." Proceedings of
the IEEE conference on computer vision and pattern recognition. 2018.
[2] Karpinski, Romain, Devashish Lohani, and Abdel Belaid. "Metrics for Complete Evaluation of
OCR Performance." IPCV'18-The 22nd Int'l Conf on Image Processing, Computer Vision, & Pattern
Recognition. 2018.
32
Thank you!
33

More Related Content

Similar to Real-time Bangla License Plate Recognition System for Low Resource Video-based Applications

Different Methodologies for Indian License Plate Detection
Different Methodologies for Indian License Plate DetectionDifferent Methodologies for Indian License Plate Detection
Different Methodologies for Indian License Plate DetectionIRJET Journal
 
IRJET- Drivers Stupor Scrutinizing System
IRJET-  	  Drivers Stupor Scrutinizing SystemIRJET-  	  Drivers Stupor Scrutinizing System
IRJET- Drivers Stupor Scrutinizing SystemIRJET Journal
 
Vision-Based Motorcycle Crash Detection and Reporting Using Deep Learning
Vision-Based Motorcycle Crash Detection and Reporting Using Deep LearningVision-Based Motorcycle Crash Detection and Reporting Using Deep Learning
Vision-Based Motorcycle Crash Detection and Reporting Using Deep LearningIRJET Journal
 
Automatic Detection of Unexpected Accidents Monitoring Conditions in Tunnels
Automatic Detection of Unexpected Accidents Monitoring Conditions in TunnelsAutomatic Detection of Unexpected Accidents Monitoring Conditions in Tunnels
Automatic Detection of Unexpected Accidents Monitoring Conditions in TunnelsIRJET Journal
 
IRJET- Robust and Fast Detection of Moving Vechiles in Aerial Videos usin...
IRJET-  	  Robust and Fast Detection of Moving Vechiles in Aerial Videos usin...IRJET-  	  Robust and Fast Detection of Moving Vechiles in Aerial Videos usin...
IRJET- Robust and Fast Detection of Moving Vechiles in Aerial Videos usin...IRJET Journal
 
“Tools for Creating Next-Gen Computer Vision Apps on Snapdragon,” a Presentat...
“Tools for Creating Next-Gen Computer Vision Apps on Snapdragon,” a Presentat...“Tools for Creating Next-Gen Computer Vision Apps on Snapdragon,” a Presentat...
“Tools for Creating Next-Gen Computer Vision Apps on Snapdragon,” a Presentat...Edge AI and Vision Alliance
 
Visual pattern recognition in robotics
Visual pattern recognition in roboticsVisual pattern recognition in robotics
Visual pattern recognition in roboticsIAEME Publication
 
Real-Time Automated Overspeeding Detection and Identification System
Real-Time Automated Overspeeding Detection and Identification SystemReal-Time Automated Overspeeding Detection and Identification System
Real-Time Automated Overspeeding Detection and Identification SystemIRJET Journal
 
Review On Different Feature Extraction Algorithms
Review On Different Feature Extraction AlgorithmsReview On Different Feature Extraction Algorithms
Review On Different Feature Extraction AlgorithmsIRJET Journal
 
IRJET- Dynamic Traffic Management System
IRJET- Dynamic Traffic Management SystemIRJET- Dynamic Traffic Management System
IRJET- Dynamic Traffic Management SystemIRJET Journal
 
Object Detection for Autonomous Cars using AI/ML
Object Detection for Autonomous Cars using AI/MLObject Detection for Autonomous Cars using AI/ML
Object Detection for Autonomous Cars using AI/MLIRJET Journal
 
Raspberry Pi Vehicles Number Plate Recognition
Raspberry Pi Vehicles Number Plate RecognitionRaspberry Pi Vehicles Number Plate Recognition
Raspberry Pi Vehicles Number Plate RecognitionIRJET Journal
 
IRJET - Traffic Density Estimation by Counting Vehicles using Aggregate Chann...
IRJET - Traffic Density Estimation by Counting Vehicles using Aggregate Chann...IRJET - Traffic Density Estimation by Counting Vehicles using Aggregate Chann...
IRJET - Traffic Density Estimation by Counting Vehicles using Aggregate Chann...IRJET Journal
 
IRJET - License Plate Recognition
IRJET - License Plate RecognitionIRJET - License Plate Recognition
IRJET - License Plate RecognitionIRJET Journal
 
IRJET- Automatic Number Plate Recognition using CCA Algorithm and RANSAC ...
IRJET-  	  Automatic Number Plate Recognition using CCA Algorithm and RANSAC ...IRJET-  	  Automatic Number Plate Recognition using CCA Algorithm and RANSAC ...
IRJET- Automatic Number Plate Recognition using CCA Algorithm and RANSAC ...IRJET Journal
 
Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous C...
Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous C...Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous C...
Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous C...Lionel Briand
 
IoT-based Autonomously Driven Vehicle by using Machine Learning & Image Proce...
IoT-based Autonomously Driven Vehicle by using Machine Learning & Image Proce...IoT-based Autonomously Driven Vehicle by using Machine Learning & Image Proce...
IoT-based Autonomously Driven Vehicle by using Machine Learning & Image Proce...IRJET Journal
 
Announcement of Results of xDR Challenge 2018
Announcement of Results of xDR Challenge 2018 Announcement of Results of xDR Challenge 2018
Announcement of Results of xDR Challenge 2018 Ryosuke Ichikari
 
Announcement of Results of xDR Challenge 2018
Announcement of Results of xDR Challenge 2018 Announcement of Results of xDR Challenge 2018
Announcement of Results of xDR Challenge 2018 Ryosuke Ichikari
 
Visual pattern recognition in robotics
Visual pattern recognition in roboticsVisual pattern recognition in robotics
Visual pattern recognition in roboticsIAEME Publication
 

Similar to Real-time Bangla License Plate Recognition System for Low Resource Video-based Applications (20)

Different Methodologies for Indian License Plate Detection
Different Methodologies for Indian License Plate DetectionDifferent Methodologies for Indian License Plate Detection
Different Methodologies for Indian License Plate Detection
 
IRJET- Drivers Stupor Scrutinizing System
IRJET-  	  Drivers Stupor Scrutinizing SystemIRJET-  	  Drivers Stupor Scrutinizing System
IRJET- Drivers Stupor Scrutinizing System
 
Vision-Based Motorcycle Crash Detection and Reporting Using Deep Learning
Vision-Based Motorcycle Crash Detection and Reporting Using Deep LearningVision-Based Motorcycle Crash Detection and Reporting Using Deep Learning
Vision-Based Motorcycle Crash Detection and Reporting Using Deep Learning
 
Automatic Detection of Unexpected Accidents Monitoring Conditions in Tunnels
Automatic Detection of Unexpected Accidents Monitoring Conditions in TunnelsAutomatic Detection of Unexpected Accidents Monitoring Conditions in Tunnels
Automatic Detection of Unexpected Accidents Monitoring Conditions in Tunnels
 
IRJET- Robust and Fast Detection of Moving Vechiles in Aerial Videos usin...
IRJET-  	  Robust and Fast Detection of Moving Vechiles in Aerial Videos usin...IRJET-  	  Robust and Fast Detection of Moving Vechiles in Aerial Videos usin...
IRJET- Robust and Fast Detection of Moving Vechiles in Aerial Videos usin...
 
“Tools for Creating Next-Gen Computer Vision Apps on Snapdragon,” a Presentat...
“Tools for Creating Next-Gen Computer Vision Apps on Snapdragon,” a Presentat...“Tools for Creating Next-Gen Computer Vision Apps on Snapdragon,” a Presentat...
“Tools for Creating Next-Gen Computer Vision Apps on Snapdragon,” a Presentat...
 
Visual pattern recognition in robotics
Visual pattern recognition in roboticsVisual pattern recognition in robotics
Visual pattern recognition in robotics
 
Real-Time Automated Overspeeding Detection and Identification System
Real-Time Automated Overspeeding Detection and Identification SystemReal-Time Automated Overspeeding Detection and Identification System
Real-Time Automated Overspeeding Detection and Identification System
 
Review On Different Feature Extraction Algorithms
Review On Different Feature Extraction AlgorithmsReview On Different Feature Extraction Algorithms
Review On Different Feature Extraction Algorithms
 
IRJET- Dynamic Traffic Management System
IRJET- Dynamic Traffic Management SystemIRJET- Dynamic Traffic Management System
IRJET- Dynamic Traffic Management System
 
Object Detection for Autonomous Cars using AI/ML
Object Detection for Autonomous Cars using AI/MLObject Detection for Autonomous Cars using AI/ML
Object Detection for Autonomous Cars using AI/ML
 
Raspberry Pi Vehicles Number Plate Recognition
Raspberry Pi Vehicles Number Plate RecognitionRaspberry Pi Vehicles Number Plate Recognition
Raspberry Pi Vehicles Number Plate Recognition
 
IRJET - Traffic Density Estimation by Counting Vehicles using Aggregate Chann...
IRJET - Traffic Density Estimation by Counting Vehicles using Aggregate Chann...IRJET - Traffic Density Estimation by Counting Vehicles using Aggregate Chann...
IRJET - Traffic Density Estimation by Counting Vehicles using Aggregate Chann...
 
IRJET - License Plate Recognition
IRJET - License Plate RecognitionIRJET - License Plate Recognition
IRJET - License Plate Recognition
 
IRJET- Automatic Number Plate Recognition using CCA Algorithm and RANSAC ...
IRJET-  	  Automatic Number Plate Recognition using CCA Algorithm and RANSAC ...IRJET-  	  Automatic Number Plate Recognition using CCA Algorithm and RANSAC ...
IRJET- Automatic Number Plate Recognition using CCA Algorithm and RANSAC ...
 
Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous C...
Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous C...Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous C...
Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous C...
 
IoT-based Autonomously Driven Vehicle by using Machine Learning & Image Proce...
IoT-based Autonomously Driven Vehicle by using Machine Learning & Image Proce...IoT-based Autonomously Driven Vehicle by using Machine Learning & Image Proce...
IoT-based Autonomously Driven Vehicle by using Machine Learning & Image Proce...
 
Announcement of Results of xDR Challenge 2018
Announcement of Results of xDR Challenge 2018 Announcement of Results of xDR Challenge 2018
Announcement of Results of xDR Challenge 2018
 
Announcement of Results of xDR Challenge 2018
Announcement of Results of xDR Challenge 2018 Announcement of Results of xDR Challenge 2018
Announcement of Results of xDR Challenge 2018
 
Visual pattern recognition in robotics
Visual pattern recognition in roboticsVisual pattern recognition in robotics
Visual pattern recognition in robotics
 

More from MD Abdullah Al Nasim

End-to-End Natural Language Understanding Pipeline for Bangla Conversational ...
End-to-End Natural Language Understanding Pipeline for Bangla Conversational ...End-to-End Natural Language Understanding Pipeline for Bangla Conversational ...
End-to-End Natural Language Understanding Pipeline for Bangla Conversational ...MD Abdullah Al Nasim
 
A Soft Computing Based Customer Lifetime Value Classifier for Digital Retail ...
A Soft Computing Based Customer Lifetime Value Classifier for Digital Retail ...A Soft Computing Based Customer Lifetime Value Classifier for Digital Retail ...
A Soft Computing Based Customer Lifetime Value Classifier for Digital Retail ...MD Abdullah Al Nasim
 
Brain Tumor Segmentation using Enhanced U-Net Model with Empirical Analysis
Brain Tumor Segmentation using Enhanced U-Net Model with Empirical AnalysisBrain Tumor Segmentation using Enhanced U-Net Model with Empirical Analysis
Brain Tumor Segmentation using Enhanced U-Net Model with Empirical AnalysisMD Abdullah Al Nasim
 
An automated approach for the recognition of bengali license plates presentation
An automated approach for the recognition of bengali license plates presentationAn automated approach for the recognition of bengali license plates presentation
An automated approach for the recognition of bengali license plates presentationMD Abdullah Al Nasim
 
Brain tumor detection using convolutional neural network
Brain tumor detection using convolutional neural network Brain tumor detection using convolutional neural network
Brain tumor detection using convolutional neural network MD Abdullah Al Nasim
 

More from MD Abdullah Al Nasim (9)

End-to-End Natural Language Understanding Pipeline for Bangla Conversational ...
End-to-End Natural Language Understanding Pipeline for Bangla Conversational ...End-to-End Natural Language Understanding Pipeline for Bangla Conversational ...
End-to-End Natural Language Understanding Pipeline for Bangla Conversational ...
 
A Soft Computing Based Customer Lifetime Value Classifier for Digital Retail ...
A Soft Computing Based Customer Lifetime Value Classifier for Digital Retail ...A Soft Computing Based Customer Lifetime Value Classifier for Digital Retail ...
A Soft Computing Based Customer Lifetime Value Classifier for Digital Retail ...
 
Brain Tumor Segmentation using Enhanced U-Net Model with Empirical Analysis
Brain Tumor Segmentation using Enhanced U-Net Model with Empirical AnalysisBrain Tumor Segmentation using Enhanced U-Net Model with Empirical Analysis
Brain Tumor Segmentation using Enhanced U-Net Model with Empirical Analysis
 
An automated approach for the recognition of bengali license plates presentation
An automated approach for the recognition of bengali license plates presentationAn automated approach for the recognition of bengali license plates presentation
An automated approach for the recognition of bengali license plates presentation
 
Microsoft Imagine cup 2019
Microsoft Imagine cup 2019Microsoft Imagine cup 2019
Microsoft Imagine cup 2019
 
Interview - Amar iSchool
Interview - Amar iSchoolInterview - Amar iSchool
Interview - Amar iSchool
 
Data Flow Diagram - Amar iSchool
Data Flow Diagram - Amar iSchoolData Flow Diagram - Amar iSchool
Data Flow Diagram - Amar iSchool
 
Erd Amar iSchool
Erd Amar iSchoolErd Amar iSchool
Erd Amar iSchool
 
Brain tumor detection using convolutional neural network
Brain tumor detection using convolutional neural network Brain tumor detection using convolutional neural network
Brain tumor detection using convolutional neural network
 

Recently uploaded

SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfnehabiju2046
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Patrick Diehl
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |aasikanpl
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxyaramohamed343013
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...jana861314
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfSwapnil Therkar
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 sciencefloriejanemacaya1
 

Recently uploaded (20)

SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdf
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docx
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 science
 

Real-time Bangla License Plate Recognition System for Low Resource Video-based Applications

  • 1. Real-time Bangla License Plate Recognition System for Low Resource Video-based Applications Alif Ashrafee alifashrafee@iut-dhaka.edu Islamic University of Technology Akib Mohammed Khan akibmohammed@iut-dhaka.edu Islamic University of Technology Mohammad Sabik Irbaz sabikirbaz@iut-dhaka.edu Pioneer Alpha Ltd. MD Abdullah Al Nasim nasim.abdullah@ieee.org Pioneer Alpha Ltd. 1
  • 2. Winter Conference on Applications of Computer Vision 2022 Real-World Surveillance: Applications and Challenges Workshop (RWS @ WACV2022) 2
  • 3. Introduction ● Automatic License Plate Recognition (ALPR) systems aim to detect, localize, and recognize license plate characters from vehicles appearing in images or video frames. ● There are no publicly available Bangla datasets for training such systems. ● Our main focus is to provide comprehensive image and video datasets of Bangla license plates in-the-wild. ● We aim to provide a system for real time inference to recognizing license plates from video frames in low resource environments. 3
  • 4. 4 Video Frames Cropped License Plate WB 06 F 5977 License Plate Characters Example of an ALPR system for video feed
  • 5. Motivation 5 ● Due to the large volume of vehicles in metropolitan areas of Bangladesh, it is difficult to find a vacant parking slot due to the lack of optimized parking systems. ● There are no publicly available datasets for Bangla License plates to be trained or evaluated on. ● No Bangla ALPR system provides real-time performance using low resource.
  • 6. Problem Statement 6 Given a video feed of cars and motorcycles having Bangla license plates, can we teach the machine to identify and recognize license plates?
  • 7. 7 Training for object detection models are computationally expensive No benchmark datasets are available for Bangla license plate videos Separating temporally distinct instances of vehicles from video Running an object detection model in CPU in real-time Research Challenges 1 2 3 4
  • 9. Cascade Classifier [Viola et. al., 2001] 9 ● Trained using positive and negative examples. ● Consists of multiple stages, where each stage is an ensemble of weak classifiers. ● Divide the image into sub-windows. ● The weak classifiers are used to discard the negative subwindows as fast as possible. ● Detect all the positive instances to achieve low false-positive rates.
  • 10. MobileNet SSDv2 [Chiu et. al., 2020] 10 ● Single Shot object detection (SSD) model based on MobileNet-v2[1]. ● Lightweight and can be applied with limited computational resources. ● Includes feature pyramid network
  • 11. Two stage detection with Temporal Redundancy [Laroca et. al., 2018] 11 ● Detection in two stages, vehicle detection and then LP detection. ● Temporal redundancy information through the union of all frames belonging to the same vehicle.
  • 12. YOLO vs RCNN vs SSD [Adarsh et. al., 2020] 12 ● SSD provides an optimal solution for both accuracy and speed. ● YOLO is fast but not very accurate, Faster RCNN is more accurate but slower.
  • 14. License Plate Image Dataset 14 ➔ We created our own dataset of 1000 images. ➔ With different distances, angles and lighting conditions around Dhaka City captured using varying models of smartphones. ➔ Converted to 800 x 800 pixels. ➔ We manually annotated them by drawing bounding boxes around the license plate regions for training our detection models.
  • 15. 15
  • 16. License Plate Video Dataset 16 ➔ We collected 79 video clips containing 98 license plates from different types of vehicles using crowdsourcing. ➔ Each clip includes single or multiple vehicles from around different districts of Bangladesh. ➔ Converted all of the videos to a resolution of 480 x 480 pixels. ➔ Each video contains an average of 254 frames and the frame rate of each video was 24 frames per second (FPS). ➔ Videos were taken in a variety of locations, such as inside garages, car parks, lanes, and highways.
  • 17. License Plate Video Dataset (cont.) ➔ Since videos were taken from several different districts, there is a good distribution of the authorized characters in our dataset. ➔ Videos include plate variations, artifacts on plates, occlusions, illumination changes, and unclear characters. ➔ We annotated the content of each plate: the ground truth of each character present in the plate, and the number of license plates present in each video. 17
  • 18. 18
  • 21. 21 Overall Pipeline for Real-time interaction
  • 23. Evaluation Metrics 23 Mean Average Precision (mAP) for Training detection model ● mAP score is calculated by taking the mean AP over all classes and/or overall IoU thresholds. ● AP is the area under the precision-recall curve. ● Popular metric for evaluating detection models.
  • 24. 24 Evaluation Metrics (cont.) Levenshtein Distance for Character Recognition [2] ● Measures the difference between two strings based on the number of insertions, deletions, and substitutions that have to be performed on the target string in order to match it with the reference string. Cₑᵣᵣₒᵣ = Total character error Cₐₙₙ = # characters in the aligned sequence CH = # characters in hypothesis string CR = # characters in reference string
  • 25. Experimental Setup 25 ● We trained the MobileNet SSDv2 model using a Tesla T4 GPU on Google Colaboratory. ● For testing, all the experiments were conducted on a single thread of an Intel Core i5-7200U mobile CPU with 8GB of RAM and video resized to 480*480 pixels. Image size 300 * 300 Batch Size 24 IOU Threshold 0.35 Learning Rate 0.004 Epochs Run 50
  • 26. Result Analysis (Quantitative) 26 Training Results Model AP AP50 AP75 APs APm APl MobileNet SSDv2 0.47 0.86 0.47 0.13 0.46 0.55 AP: Average Precision AP50: AP at IOU thresh = 0.5 AP75: AP at IOU thresh = 0.75 APs: AP for small objects APm: AP for medium objects APl: AP for large objects
  • 27. Result Analysis (Quantitative) (cont.) Ablation Study 27 Pipeline Precision(%) Recall(%) F1 Score (%) Detection Rate(%) FPS YoloV3 Tiny 60.2 55.6 57.1 75.5 11.1 Cascade + YoloV3 Tiny 56.3 51.6 53.1 69.4 27.1 SSDv2 66.1 58.4 60.5 98.0 17.7 Cascade + SSDv2 63.6 59.3 60.8 82.7 27.2
  • 29. 29 29 License plates that are missed by Cascade + SSDv2 but successfully detected by standalone SSDv2 Favorable for Vision OCR Unfavorable for Vision OCR
  • 30. 30 Standalone SSDv2 OCR Predictions that are not favorable for Vision API Cascade + SSDv2 OCR predictions that are favorable for Vision API
  • 31. Conclusion and Future Work ● We provide two Bangla license plate image and video datasets. ● We propose a robust system that can detect license plates from video footage in low resource environment. ● Consists of a method for individually storing temporally separate instances of different vehicles appearing in the same video. ● We plan to deploy our system in various real-life environments and verify its day-to-day performance. ● We also plan to build our own Bangla license plate OCR system that can improve the recognition performance. 31
  • 32. References [1] Sandler, Mark, et al. "Mobilenetv2: Inverted residuals and linear bottlenecks." Proceedings of the IEEE conference on computer vision and pattern recognition. 2018. [2] Karpinski, Romain, Devashish Lohani, and Abdel Belaid. "Metrics for Complete Evaluation of OCR Performance." IPCV'18-The 22nd Int'l Conf on Image Processing, Computer Vision, & Pattern Recognition. 2018. 32