SlideShare a Scribd company logo
1 of 24
Download to read offline
Combining CNNs and Conventional
Algorithms for Low-Compute
Vision:
A Case Study in the Garage
Nathan Kopp
Chamberlain Group
September 2020
© 2020 Chamberlain Group
Intro
The Question:
Are CNNs always the best choice for every computer vision problem?
The Idea:
Sometimes simpler algorithms are the better choice.
Simpler algorithms can be guided by CNNs to overcome limitations.
Simplify the problem and leverage constraints to utilize simpler algorithms.
The Story:
Creating the Smart Garage (Chapter 1)
2
© 2020 Chamberlain Group
Company Background
3
© 2020 Chamberlain Group
Creating the Smart Garage
New flagship garage door opener with camera
How do we make this product smarter?
• Know when vehicles leave and arrive.
• Know whether a specific vehicle is in the garage.
4
Vehicle Presence
&
Vehicle Re-Identification
© 2020 Chamberlain Group
CNN Strengths and
Weaknesses
© 2020 Chamberlain Group
CNN Strengths and Weaknesses
Strengths
• Generalize to unseen circumstances
• Can be trained for various tasks
• State-of-the-art results
Weaknesses
• Requires large training dataset
• High compute requirements
• High memory requirements
6
© 2020 Chamberlain Group
CNN-Only Solutions: Vehicle Presence
• Accuracy issues
• Need custom dataset to
improve accuracy
7
Object Detection (YOLO, SSD) Object Tracking
• Analyze video = high cost
© 2020 Chamberlain Group
CNN-Only Solutions: Vehicle Identification
• Accuracy Issues
• Mix-up similar-looking cars
• Datasets trained for fully-visible
vehicles
8
Make / Model Classification License Plate Recognition
• Plate not visible when parked
• Analyze video = high cost
• Blur, lighting, compression
White
Tesla
Model 3
Blue
Subaru
Forrester
Pairwise Similarity
• No truly-free pre-trained
models
• No truly free datasets
• Need a large training
dataset
© 2020 Chamberlain Group
More Problems: Cost Constraints
• Existing cameras have relatively low compute:
• Single core low-cost ARM
• Low memory
• Minimal cloud expenses allowed
9
© 2020 Chamberlain Group
Simplifying the Problem
© 2020 Chamberlain Group
Simplifying the Question
11
Raw Video
Process
with CNN
Rich
Metadata
Interpret
Metadata
Goal
Rich metadata is fun, but
we still must SIMPLIFY to
solve the real problem.
Vehicle Presence
&
Vehicle Re-Identification
Blue
Subaru
ForresterWhite
Tesla
Model 3
Plate: [ABC123] Plate: [XYZ987]
© 2020 Chamberlain Group
Simplifying the Question
12
Simplify the
question
Raw Video
Process with
Simple
Algorithm
Simple
Metadata
Goal
Raw Video
Process
with CNN
Rich
Metadata
Interpret
Metadata
Goal
Simplifying EARLIER in the
process can reduce compute.
Try to simplify the question,
instead of simplifying the
answer.
Vehicle Presence
&
Vehicle Re-Identification
© 2020 Chamberlain Group
Find Useful Constraints
Stuck with “bad” constraints? Look for “useful” constraints!
• Fixed camera
• Limited number of vehicles
• Fixed vehicle location (region of interest)
• Personal habits of parking location
• Temporal consistency of the background (When did you last clean your garage?)
13
Less generalization is needed!
© 2020 Chamberlain Group
Our Hybrid Solution
© 2020 Chamberlain Group
Find an Appropriate Simple Algorithm
• HOG lowers the spatial resolution: built in flexibility
• Features can be translated, to some degree
• Add a custom “attention mask”
• Easy to compare features
• Implement ”training mode” to automatically gather images
representing known conditions
15
Tesla Model 3 Present
Tesla Model 3 Absent
Selected Algorithm: Modified Histogram of Oriented Gradients (HOG)
© 2020 Chamberlain Group
The “Fast Scene Matcher”
• Train Mode
1. Gather images of known conditions
2. Extract features
3. Build attention masks
4. Process features into library
• Run Mode
1. Gather image of unknown condition
2. Extract features
3. Compare to library
4. Find best match, or no match
16
© 2020 Chamberlain Group
Combined Meta-Algorithm
Train Mode
• Multiple CNNs (PoseNet, YOLO, etc.)
• Object tracking, ROIs, line crossing
• License plate recognition
• Leverage user input to fix mistakes
Output:
• Processed “known-condition” images
17
Run Mode
• HOG-based “Fast Scene Matcher”
Output:
(for each parking spot)
• Specific vehicle present
• OR empty
• OR indeterminate
Train Mode Run Mode Retrain Mode
“Ask
for
Help”
© 2020 Chamberlain Group
“Guide-Follower” Design Pattern
18
Guide
Follower
Informs
• Expensive
• Slow
• Generalized
• Infrequent
• Specialized
• Frequent
• Inexpensive
• Fast
Input Output
Ask for
Help
© 2020 Chamberlain Group
Results
© 2020 Chamberlain Group
Results – Single-Frame Run Mode
Metric YOLO-COCO
256x256 1
YOLO-VEH
256x256 1
YOLO-VEH
416x416 2
FSM-HOG-4 4 FSM-HOG-7 5
Detect Any Vehicle Present 41.5% 45.5% 57.8% 86.1% 93.0%
Detect Vehicle Absent 99.4% 98.7% 99.4% 99.7% 99.4%
Re-Identify Known Vehicle n/a n/a n/a 89.2% 95.9%
Recognize Unknown Vehicle n/a n/a n/a 63.6% 80.4%
Recognize Moved Camera Position n/a n/a n/a 37.5% 29.2%
Cloud cost per 1,000,000 frames $7.38 $7.38 $19.50 $0.39 $0.39
Edge compute on existing hardware n/a n/a n/a 2 FPS 2 FPS
20
1) 256x256 YOLOv2; trained on MS COCO + VOC datasets; threshold 0.3; cost based on AWS g3s.xlarge instance size
2) 256x256 YOLOv2; trained on custom vehicle dataset; threshold 0.35; cost based on AWS g3s.xlarge instance size
3) 416x416 YOLOv2; trained on custom vehicle dataset; threshold 0.35; cost based on AWS g3s.xlarge instance size
4) Unoptimized OpenCV C++ code; 4 example images per known condition; cost based on AWS m4.xlarge instance size
5) Same as (4), but with up to 7 example images per known-vehicle.
© 2020 Chamberlain Group
Hybrid Solution: Combined Meta-Algorithm
21
“Inference” on the Edge“Train” in the Cloud
Guide
Follower
on already-deployed devices!
© 2020 Chamberlain Group
Take-Aways
1. Simplify the problem first.
• Identify your core goal.
• Simplify the question, instead of simplifying the answer.
• Leverage useful constraints.
2. Sometimes simpler algorithms are the better choice.
• Simplifying the problem allow use of a less generalized algorithm.
• Simpler algorithms can be guided by CNNs to overcome limitations.
• It is possible to preserve most of the “magic” of deep learning, at a lower cost.
22
© 2020 Chamberlain Group
Q & A
© 2020 Chamberlain Group
Resources
Chamberlain
https://www.chamberlain.com/
YOLOv2
https://pjreddie.com/darknet/yolo/
Histogram of Oriented Gradients
https://en.wikipedia.org/wiki/Histogram_of_oriented_gradients
2020 Embedded Vision Summit
“Combining CNNs and Conventional
Algorithms for Low-Compute Vision: A
Case Study in the Garage”
Nathan Kopp
1:00pm, September 17, 2020
24

More Related Content

Similar to “Combining CNNs and Conventional Algorithms for Low-Compute Vision: A Case Study in the Garage,” a Presentation from the Chamberlain Group

Machine learning for newbies
Machine learning for newbiesMachine learning for newbies
Machine learning for newbiesAndrew Nikishaev
 
Kostiantyn Yelisavenko "Mastering Macro Benchmarking in .NET"
Kostiantyn Yelisavenko "Mastering Macro Benchmarking in .NET"Kostiantyn Yelisavenko "Mastering Macro Benchmarking in .NET"
Kostiantyn Yelisavenko "Mastering Macro Benchmarking in .NET"LogeekNightUkraine
 
“Tools for Creating Next-Gen Computer Vision Apps on Snapdragon,” a Presentat...
“Tools for Creating Next-Gen Computer Vision Apps on Snapdragon,” a Presentat...“Tools for Creating Next-Gen Computer Vision Apps on Snapdragon,” a Presentat...
“Tools for Creating Next-Gen Computer Vision Apps on Snapdragon,” a Presentat...Edge AI and Vision Alliance
 
Driver drowsiness detection
Driver drowsiness detectionDriver drowsiness detection
Driver drowsiness detectionConnecting Point
 
Recognition of highway workzones for reliable autonomous driving
Recognition of highway workzones for reliable autonomous drivingRecognition of highway workzones for reliable autonomous driving
Recognition of highway workzones for reliable autonomous drivingdinukj
 
self driving car nvidia whitepaper
self driving car nvidia whitepaperself driving car nvidia whitepaper
self driving car nvidia whitepaperOlivier Eeckhoutte
 
"Computer-vision-based 360-degree Video Systems: Architectures, Algorithms an...
"Computer-vision-based 360-degree Video Systems: Architectures, Algorithms an..."Computer-vision-based 360-degree Video Systems: Architectures, Algorithms an...
"Computer-vision-based 360-degree Video Systems: Architectures, Algorithms an...Edge AI and Vision Alliance
 
Next Generation Workshop Car Diagnostics at BMW Powered by Apache Spark with ...
Next Generation Workshop Car Diagnostics at BMW Powered by Apache Spark with ...Next Generation Workshop Car Diagnostics at BMW Powered by Apache Spark with ...
Next Generation Workshop Car Diagnostics at BMW Powered by Apache Spark with ...Databricks
 
Presentation - Vision
Presentation - VisionPresentation - Vision
Presentation - VisionJaweria Khan
 
Prototyping presentation
Prototyping presentationPrototyping presentation
Prototyping presentationIbtihaj Uddin
 
IRJET- Front View Identification of Vehicles by using Machine Learning Te...
IRJET-  	  Front View Identification of Vehicles by using Machine Learning Te...IRJET-  	  Front View Identification of Vehicles by using Machine Learning Te...
IRJET- Front View Identification of Vehicles by using Machine Learning Te...IRJET Journal
 
THE CHALLENGES OF IMAGE QUALITY IN THE RACE FOR AUTONOMOUS VEHICLES
THE CHALLENGES OF IMAGE QUALITY IN THE RACE FOR AUTONOMOUS VEHICLESTHE CHALLENGES OF IMAGE QUALITY IN THE RACE FOR AUTONOMOUS VEHICLES
THE CHALLENGES OF IMAGE QUALITY IN THE RACE FOR AUTONOMOUS VEHICLESwle-ss
 
“Data Collection in the Wild,” a Presentation from BMW Group
“Data Collection in the Wild,” a Presentation from BMW Group“Data Collection in the Wild,” a Presentation from BMW Group
“Data Collection in the Wild,” a Presentation from BMW GroupEdge AI and Vision Alliance
 

Similar to “Combining CNNs and Conventional Algorithms for Low-Compute Vision: A Case Study in the Garage,” a Presentation from the Chamberlain Group (20)

Mobileye Project
Mobileye ProjectMobileye Project
Mobileye Project
 
Machine learning for newbies
Machine learning for newbiesMachine learning for newbies
Machine learning for newbies
 
Kostiantyn Yelisavenko "Mastering Macro Benchmarking in .NET"
Kostiantyn Yelisavenko "Mastering Macro Benchmarking in .NET"Kostiantyn Yelisavenko "Mastering Macro Benchmarking in .NET"
Kostiantyn Yelisavenko "Mastering Macro Benchmarking in .NET"
 
Final_presentation_ITS
Final_presentation_ITSFinal_presentation_ITS
Final_presentation_ITS
 
“Tools for Creating Next-Gen Computer Vision Apps on Snapdragon,” a Presentat...
“Tools for Creating Next-Gen Computer Vision Apps on Snapdragon,” a Presentat...“Tools for Creating Next-Gen Computer Vision Apps on Snapdragon,” a Presentat...
“Tools for Creating Next-Gen Computer Vision Apps on Snapdragon,” a Presentat...
 
Driver drowsiness detection
Driver drowsiness detectionDriver drowsiness detection
Driver drowsiness detection
 
Recognition of highway workzones for reliable autonomous driving
Recognition of highway workzones for reliable autonomous drivingRecognition of highway workzones for reliable autonomous driving
Recognition of highway workzones for reliable autonomous driving
 
self driving car nvidia whitepaper
self driving car nvidia whitepaperself driving car nvidia whitepaper
self driving car nvidia whitepaper
 
"Computer-vision-based 360-degree Video Systems: Architectures, Algorithms an...
"Computer-vision-based 360-degree Video Systems: Architectures, Algorithms an..."Computer-vision-based 360-degree Video Systems: Architectures, Algorithms an...
"Computer-vision-based 360-degree Video Systems: Architectures, Algorithms an...
 
Next Generation Workshop Car Diagnostics at BMW Powered by Apache Spark with ...
Next Generation Workshop Car Diagnostics at BMW Powered by Apache Spark with ...Next Generation Workshop Car Diagnostics at BMW Powered by Apache Spark with ...
Next Generation Workshop Car Diagnostics at BMW Powered by Apache Spark with ...
 
Presentation - Vision
Presentation - VisionPresentation - Vision
Presentation - Vision
 
Prototyping presentation
Prototyping presentationPrototyping presentation
Prototyping presentation
 
IRJET- Front View Identification of Vehicles by using Machine Learning Te...
IRJET-  	  Front View Identification of Vehicles by using Machine Learning Te...IRJET-  	  Front View Identification of Vehicles by using Machine Learning Te...
IRJET- Front View Identification of Vehicles by using Machine Learning Te...
 
Dave
DaveDave
Dave
 
Drone ppt
Drone pptDrone ppt
Drone ppt
 
THE CHALLENGES OF IMAGE QUALITY IN THE RACE FOR AUTONOMOUS VEHICLES
THE CHALLENGES OF IMAGE QUALITY IN THE RACE FOR AUTONOMOUS VEHICLESTHE CHALLENGES OF IMAGE QUALITY IN THE RACE FOR AUTONOMOUS VEHICLES
THE CHALLENGES OF IMAGE QUALITY IN THE RACE FOR AUTONOMOUS VEHICLES
 
Surveon Video Analysis Camera Series
Surveon Video Analysis Camera SeriesSurveon Video Analysis Camera Series
Surveon Video Analysis Camera Series
 
Trucking technology
Trucking technologyTrucking technology
Trucking technology
 
Cars.com Journey to AWS Cloud
Cars.com Journey to AWS CloudCars.com Journey to AWS Cloud
Cars.com Journey to AWS Cloud
 
“Data Collection in the Wild,” a Presentation from BMW Group
“Data Collection in the Wild,” a Presentation from BMW Group“Data Collection in the Wild,” a Presentation from BMW Group
“Data Collection in the Wild,” a Presentation from BMW Group
 

More from Edge AI and Vision Alliance

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...Edge AI and Vision Alliance
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...Edge AI and Vision Alliance
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...Edge AI and Vision Alliance
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...Edge AI and Vision Alliance
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...Edge AI and Vision Alliance
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...Edge AI and Vision Alliance
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...Edge AI and Vision Alliance
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsightsEdge AI and Vision Alliance
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...Edge AI and Vision Alliance
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...Edge AI and Vision Alliance
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...Edge AI and Vision Alliance
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...Edge AI and Vision Alliance
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...Edge AI and Vision Alliance
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...Edge AI and Vision Alliance
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...Edge AI and Vision Alliance
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from SamsaraEdge AI and Vision Alliance
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...Edge AI and Vision Alliance
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...Edge AI and Vision Alliance
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...Edge AI and Vision Alliance
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...Edge AI and Vision Alliance
 

More from Edge AI and Vision Alliance (20)

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
 

Recently uploaded

Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfngoud9212
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfjimielynbastida
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 

Recently uploaded (20)

Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdf
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdf
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 

“Combining CNNs and Conventional Algorithms for Low-Compute Vision: A Case Study in the Garage,” a Presentation from the Chamberlain Group

  • 1. Combining CNNs and Conventional Algorithms for Low-Compute Vision: A Case Study in the Garage Nathan Kopp Chamberlain Group September 2020
  • 2. © 2020 Chamberlain Group Intro The Question: Are CNNs always the best choice for every computer vision problem? The Idea: Sometimes simpler algorithms are the better choice. Simpler algorithms can be guided by CNNs to overcome limitations. Simplify the problem and leverage constraints to utilize simpler algorithms. The Story: Creating the Smart Garage (Chapter 1) 2
  • 3. © 2020 Chamberlain Group Company Background 3
  • 4. © 2020 Chamberlain Group Creating the Smart Garage New flagship garage door opener with camera How do we make this product smarter? • Know when vehicles leave and arrive. • Know whether a specific vehicle is in the garage. 4 Vehicle Presence & Vehicle Re-Identification
  • 5. © 2020 Chamberlain Group CNN Strengths and Weaknesses
  • 6. © 2020 Chamberlain Group CNN Strengths and Weaknesses Strengths • Generalize to unseen circumstances • Can be trained for various tasks • State-of-the-art results Weaknesses • Requires large training dataset • High compute requirements • High memory requirements 6
  • 7. © 2020 Chamberlain Group CNN-Only Solutions: Vehicle Presence • Accuracy issues • Need custom dataset to improve accuracy 7 Object Detection (YOLO, SSD) Object Tracking • Analyze video = high cost
  • 8. © 2020 Chamberlain Group CNN-Only Solutions: Vehicle Identification • Accuracy Issues • Mix-up similar-looking cars • Datasets trained for fully-visible vehicles 8 Make / Model Classification License Plate Recognition • Plate not visible when parked • Analyze video = high cost • Blur, lighting, compression White Tesla Model 3 Blue Subaru Forrester Pairwise Similarity • No truly-free pre-trained models • No truly free datasets • Need a large training dataset
  • 9. © 2020 Chamberlain Group More Problems: Cost Constraints • Existing cameras have relatively low compute: • Single core low-cost ARM • Low memory • Minimal cloud expenses allowed 9
  • 10. © 2020 Chamberlain Group Simplifying the Problem
  • 11. © 2020 Chamberlain Group Simplifying the Question 11 Raw Video Process with CNN Rich Metadata Interpret Metadata Goal Rich metadata is fun, but we still must SIMPLIFY to solve the real problem. Vehicle Presence & Vehicle Re-Identification Blue Subaru ForresterWhite Tesla Model 3 Plate: [ABC123] Plate: [XYZ987]
  • 12. © 2020 Chamberlain Group Simplifying the Question 12 Simplify the question Raw Video Process with Simple Algorithm Simple Metadata Goal Raw Video Process with CNN Rich Metadata Interpret Metadata Goal Simplifying EARLIER in the process can reduce compute. Try to simplify the question, instead of simplifying the answer. Vehicle Presence & Vehicle Re-Identification
  • 13. © 2020 Chamberlain Group Find Useful Constraints Stuck with “bad” constraints? Look for “useful” constraints! • Fixed camera • Limited number of vehicles • Fixed vehicle location (region of interest) • Personal habits of parking location • Temporal consistency of the background (When did you last clean your garage?) 13 Less generalization is needed!
  • 14. © 2020 Chamberlain Group Our Hybrid Solution
  • 15. © 2020 Chamberlain Group Find an Appropriate Simple Algorithm • HOG lowers the spatial resolution: built in flexibility • Features can be translated, to some degree • Add a custom “attention mask” • Easy to compare features • Implement ”training mode” to automatically gather images representing known conditions 15 Tesla Model 3 Present Tesla Model 3 Absent Selected Algorithm: Modified Histogram of Oriented Gradients (HOG)
  • 16. © 2020 Chamberlain Group The “Fast Scene Matcher” • Train Mode 1. Gather images of known conditions 2. Extract features 3. Build attention masks 4. Process features into library • Run Mode 1. Gather image of unknown condition 2. Extract features 3. Compare to library 4. Find best match, or no match 16
  • 17. © 2020 Chamberlain Group Combined Meta-Algorithm Train Mode • Multiple CNNs (PoseNet, YOLO, etc.) • Object tracking, ROIs, line crossing • License plate recognition • Leverage user input to fix mistakes Output: • Processed “known-condition” images 17 Run Mode • HOG-based “Fast Scene Matcher” Output: (for each parking spot) • Specific vehicle present • OR empty • OR indeterminate Train Mode Run Mode Retrain Mode “Ask for Help”
  • 18. © 2020 Chamberlain Group “Guide-Follower” Design Pattern 18 Guide Follower Informs • Expensive • Slow • Generalized • Infrequent • Specialized • Frequent • Inexpensive • Fast Input Output Ask for Help
  • 19. © 2020 Chamberlain Group Results
  • 20. © 2020 Chamberlain Group Results – Single-Frame Run Mode Metric YOLO-COCO 256x256 1 YOLO-VEH 256x256 1 YOLO-VEH 416x416 2 FSM-HOG-4 4 FSM-HOG-7 5 Detect Any Vehicle Present 41.5% 45.5% 57.8% 86.1% 93.0% Detect Vehicle Absent 99.4% 98.7% 99.4% 99.7% 99.4% Re-Identify Known Vehicle n/a n/a n/a 89.2% 95.9% Recognize Unknown Vehicle n/a n/a n/a 63.6% 80.4% Recognize Moved Camera Position n/a n/a n/a 37.5% 29.2% Cloud cost per 1,000,000 frames $7.38 $7.38 $19.50 $0.39 $0.39 Edge compute on existing hardware n/a n/a n/a 2 FPS 2 FPS 20 1) 256x256 YOLOv2; trained on MS COCO + VOC datasets; threshold 0.3; cost based on AWS g3s.xlarge instance size 2) 256x256 YOLOv2; trained on custom vehicle dataset; threshold 0.35; cost based on AWS g3s.xlarge instance size 3) 416x416 YOLOv2; trained on custom vehicle dataset; threshold 0.35; cost based on AWS g3s.xlarge instance size 4) Unoptimized OpenCV C++ code; 4 example images per known condition; cost based on AWS m4.xlarge instance size 5) Same as (4), but with up to 7 example images per known-vehicle.
  • 21. © 2020 Chamberlain Group Hybrid Solution: Combined Meta-Algorithm 21 “Inference” on the Edge“Train” in the Cloud Guide Follower on already-deployed devices!
  • 22. © 2020 Chamberlain Group Take-Aways 1. Simplify the problem first. • Identify your core goal. • Simplify the question, instead of simplifying the answer. • Leverage useful constraints. 2. Sometimes simpler algorithms are the better choice. • Simplifying the problem allow use of a less generalized algorithm. • Simpler algorithms can be guided by CNNs to overcome limitations. • It is possible to preserve most of the “magic” of deep learning, at a lower cost. 22
  • 23. © 2020 Chamberlain Group Q & A
  • 24. © 2020 Chamberlain Group Resources Chamberlain https://www.chamberlain.com/ YOLOv2 https://pjreddie.com/darknet/yolo/ Histogram of Oriented Gradients https://en.wikipedia.org/wiki/Histogram_of_oriented_gradients 2020 Embedded Vision Summit “Combining CNNs and Conventional Algorithms for Low-Compute Vision: A Case Study in the Garage” Nathan Kopp 1:00pm, September 17, 2020 24