This document provides an overview of object detection using convolutional neural networks (CNNs). It discusses why CNNs are well-suited for object detection, defines object detection, and describes several popular CNN-based object detection algorithms including R-CNN, Fast R-CNN, Faster R-CNN, and YOLO. It also covers important object detection concepts like region proposals, sliding windows, IoU for evaluating localization accuracy, and NMS for removing overlapping detections. Open-source resources for implementing these algorithms are also provided.
2. Agenda
Why CNNs?
What is a CNN?
Object Detection: Definition
Sliding Windows Detection
Region Proposals
R-CNN
Fast R-CNN
Faster R-CNN
YOLO
IoU
NMS
Open-Source Resources
Variables of object detection
Next Steps
4. What is a CNN?
[Figure: a 3x3 filter slides over the input image; applying many filters yields activation maps. That's it! A full convolutional layer: a representation of the image.]
https://analyticsindiamag.com/convolutional-neural-network-image-classification-overview/
5. Object Detection: Definition
Input: RGB image
Output (from a CNN): list of objects
For each object:
1. Category label (person, car, cat, …)
2. Bounding box: position (x, y), width, height
What is in the image and where is it?
6. Sliding Windows Detection
A CNN classifier runs on each window crop: "Is there a car in this window?" It outputs 1 (car) or 0 (no car).
Issue? Huge computational cost.
OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks https://arxiv.org/pdf/1312.6229v4.pdf
8. R-CNN: Region-Based CNN
[Figure: ~2K proposed regions are warped into fixed 224x224 image regions before being fed to the CNN.]
Rich feature hierarchies for accurate object detection and semantic segmentation.
Method:
1. Run selective search to get ~2K regions.
2. Resize (warp) regions to 224x224
3. Run regions independently through a CNN.
4. Classify each region's features with linear SVMs (FC layers).
What if regions do not exactly match the object?
Solution: the CNN should learn to output a transformation of the bounding box, adjusting the proposal box to fit the object.
Caveat: CNNs share weights!
Issues?
1. Very slow! Run ~2k forward passes per image.
2. Region proposals come from selective search; there is no learning at that stage.
9. Fast R-CNN
Idea: swap the order of the CNN with the warping.
Method:
1. Feed the input image into a CNN and compute feature maps.
2. Project the selective-search region proposals onto the feature maps and "crop" the corresponding features.
3. Warp (resize) the cropped features.
4. Feed warped features into a small “Per-region” network (e.g., FC layers).
5. Output bounding boxes with classification scores.
10. Faster R-CNN
Idea: use a neural network (Region Proposal Network) instead of the selective search algorithm for region proposals.
Method:
1. Feed the input image into the backbone network to get image features.
2. Pass image features to RPN to get region proposals.
3. Warp (resize) the cropped features.
4. Feed warped features into a small “Per-region” network (e.g., FC layers).
5. Output bounding boxes with classification scores.
11. YOLO: You Only Look Once
You only look once: Unified, real-time object detection
SSD: Single-Shot MultiBox Detector
Idea: use one giant CNN to go from the input image to a tensor of scores.
Eliminates the need for region proposals.
12. YOLO: You Only Look Once
[Figure: YOLO architecture. A 448x448 input image passes through a single CNN to produce the output tensor.]
S × S × (B * 5 + C)
B is the number of template bounding boxes
Template boxes (B = 4)
13. Evaluating object localization: IoU
IoU (Intersection over Union) is used to measure the overlap between two bounding boxes.
https://pyimagesearch.com/2016/11/07/intersection-over-union-iou-for-object-detection/
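The overlap computation can be sketched in a few lines of Python (the (x1, y1, x2, y2) corner format is an assumption; coordinate conventions vary between libraries):

```python
def iou(box_a, box_b):
    """Intersection over Union of two axis-aligned boxes (x1, y1, x2, y2)."""
    # Corners of the intersection rectangle
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0
```

Identical boxes give 1.0; disjoint boxes give 0.0.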
14. Non-max Suppression (NMS)
Ensures that each object is detected only once.
A solution for overlapping boxes.
Method:
Given a set of predictions (scores and boxes), each prediction consisting of a confidence p_c and a box (b_x, b_y, b_h, b_w).
(Greedy implementation)
1. Discard all boxes with p_c ≤ 0.6
2. While there are any remaining boxes:
• Pick the box with the largest p_c as a prediction.
• Discard all remaining boxes with IoU ≥ 0.7 with the chosen box.
https://towardsdatascience.com/non-maximum-suppression-nms-93ce178e177c
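The greedy method on this slide translates almost directly into Python (a self-contained sketch: the 0.6 and 0.7 thresholds follow the slide and are hyperparameters; boxes are assumed to be (x1, y1, x2, y2) corners):

```python
def iou(box_a, box_b):
    """IoU of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    union = ((box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
             + (box_b[2] - box_b[0]) * (box_b[3] - box_b[1]) - inter)
    return inter / union if union > 0 else 0.0

def nms(boxes, scores, p_c_thresh=0.6, iou_thresh=0.7):
    """Greedy non-max suppression: keep the highest-scoring box,
    drop everything that overlaps it too much, repeat."""
    # 1. Discard all boxes with p_c <= threshold
    remaining = [(s, b) for s, b in zip(scores, boxes) if s > p_c_thresh]
    remaining.sort(key=lambda sb: sb[0], reverse=True)
    kept = []
    # 2. While there are remaining boxes: pick the largest p_c ...
    while remaining:
        _, best = remaining.pop(0)
        kept.append(best)
        # ... and discard boxes with IoU >= threshold with the chosen box
        remaining = [(s, b) for s, b in remaining if iou(b, best) < iou_thresh]
    return kept
```

Production implementations (e.g., torchvision's `nms`) vectorize this loop, but the logic is the same.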
Progression of object detection on the PASCAL VOC dataset: up until 2012, performance had almost plateaued, but in 2013, when deep ConvNets began to be used for object detection, performance shot up quickly.
Challenges:
Multiple outputs: need to output a variable number of objects per image.
Multiple types of output: need to predict "what" (category label) as well as "where" (bounding box).
Large images: classification works at 224x224, but detection needs higher resolution, often ~800x600.
Objects can appear at different sizes and aspect ratios in the same image.
The hope is that some window will contain the car, so the ConvNet can detect it.
Disadvantage: slow.
- You take many crops of the image and feed each of them independently through a CNN.
We try different window sizes. Using a larger stride reduces the number of crops, but the coarser granularity can hurt performance.
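A small sketch makes the crop count concrete (image size, window size, and strides here are illustrative, not from the slides):

```python
def sliding_windows(img_h, img_w, win, stride):
    """Yield (x, y, w, h) crops of a fixed-size window over the image."""
    for y in range(0, img_h - win + 1, stride):
        for x in range(0, img_w - win + 1, stride):
            yield (x, y, win, win)

# A dense scan of an 800x600 image with a 224x224 window:
n_dense = sum(1 for _ in sliding_windows(600, 800, 224, 8))    # 3504 crops
n_coarse = sum(1 for _ in sliding_windows(600, 800, 224, 32))  # 228 crops
```

Each crop means one CNN forward pass, which is where the huge cost comes from; a 4x larger stride cuts the count roughly 15x here, at the price of coarser localization.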
Convolutional implementation:
- Pass the original image through a CNN once and make predictions for all crops at the same time.
Rather than covering all possible boxes in the image, we can reduce the number of boxes by smartly selecting a small set of those crops.
Often based on heuristics: e.g. look for “blob-like” image regions.
Relatively fast to run; e.g. Selective Search gives 2000 region proposals in a few seconds on CPU.
The idea of region proposals was the stepping-stone of many advanced object detectors such as R-CNN.
One of the most influential papers in deep learning.
All the ConvNets share weights: it would be infeasible to train 2,000 independent CNNs, and it wouldn't make sense anyway, since they all share the same optimization goal of performing well on image regions.
The CNN is trained on image regions (it is given a batch of image regions at training time), sampled both from the same image and across different images. For each region, it outputs a classification score and a bounding box.
Steps:
Run selective search => gives us ~2K proposed regions (which can have different sizes and aspect ratios).
Warp (affine image warping) all regions to a fixed size (224x224; this size is a hyperparameter).
Run regions independently through a CNN, which outputs a classification over C+1 categories (C defined classes plus 1 background class for regions containing no object).
Why warp? Region proposals are of different size and aspect ratio.
The width/height transform is a log-space scale transform.
At test time: if the classification score is above a chosen threshold, we output the box; otherwise we don't. For example, if you don't care about the categories, output the 10 boxes with the lowest background scores.
Problem: what if region proposals do not exactly match up with the objects in the image? We still need to output a bounding box. Solution: the CNN outputs a transformation that converts the region-proposal box into the final output box containing the object; i.e., we modify the region-proposal bounding box to fit the object.
We don't feed the location of the box into the CNN, because we want the prediction to be invariant to the location of the object in the image.
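The transform described above can be sketched in a few lines (the (center_x, center_y, w, h) box format and the function name are assumptions for illustration):

```python
import math

def apply_bbox_deltas(proposal, deltas):
    """Turn a region proposal into a final box using predicted deltas
    (t_x, t_y, t_w, t_h). Center shifts are scaled by the proposal size,
    so the prediction is invariant to where the object sits in the image;
    width/height use a log-space scale so they always stay positive."""
    px, py, pw, ph = proposal
    tx, ty, tw, th = deltas
    bx = px + pw * tx          # shift center by a fraction of box width
    by = py + ph * ty          # shift center by a fraction of box height
    bw = pw * math.exp(tw)     # log-space width scaling
    bh = ph * math.exp(th)     # log-space height scaling
    return (bx, by, bw, bh)
```

Zero deltas leave the proposal unchanged; t_w = log 2 doubles the width.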
Fast R-CNN is about 10x faster than R-CNN.
Per-region networks are usually part of the backbone network. E.g., if AlexNet, the backbone would be the conv layers and the per-region network would be the FC layers at the end.
Cropping features: RoI Pool. Idea: project region proposals extracted from the input image onto the corresponding feature maps. Because the feature maps come from a CNN, each point in the feature maps corresponds to a location in the input image.
Then snap the projected region to grid cells, divide it into a 2x2 grid of equal subregions, and max-pool within each subregion.
Region features always have the same size even if input regions have different sizes!
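A toy NumPy version of this snap-and-pool step (single-channel feature map and integer RoI coordinates assumed; real implementations work on multi-channel maps):

```python
import numpy as np

def roi_pool(feature_map, roi, out_size=2):
    """Crop an RoI from a 2D feature map, snap it to the grid, divide it
    into out_size x out_size subregions, and max-pool each subregion."""
    x1, y1, x2, y2 = [int(round(v)) for v in roi]   # snap to grid cells
    region = feature_map[y1:y2, x1:x2]
    h, w = region.shape
    # Subregion boundaries: roughly equal splits of the cropped region
    ys = np.linspace(0, h, out_size + 1).astype(int)
    xs = np.linspace(0, w, out_size + 1).astype(int)
    out = np.zeros((out_size, out_size))
    for i in range(out_size):
        for j in range(out_size):
            sub = region[ys[i]:ys[i + 1], xs[j]:xs[j + 1]]
            out[i, j] = sub.max() if sub.size else 0.0
    return out
```

The output is always out_size x out_size, even when input regions have different sizes, which is exactly the property the per-region network needs.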
Run image into backbone network to get image features.
Pass image features to the region proposal network to get region proposals. (rest is the same as before at this point)
Crop and resize
Per-region network
Bbox and class scores.
Faster R-CNN is about 10x faster than Fast R-CNN.
Figure 2: The Model. Our system models detection as a regression problem. It divides the image into an S × S grid and for each grid cell predicts B bounding boxes, confidence for those boxes, and C class probabilities. These predictions are encoded as an S × S × (B ∗ 5 + C) tensor.
An input image is divided into multiple grid cells.
For each grid, a vector is generated through the CNN which contains the classification score and the bounding box information.
YOLO uses the idea of anchor boxes: multiple template boxes, covering the different sizes and shapes of objects one might encounter in the dataset, are used during training. The CNN outputs a prediction for each of these template boxes, and each prediction is matched to one of the available boxes.
SxS is the number of grid cells
B is the number of bounding boxes. We multiply by 5 because for each box we output x, y (center), width, height, and a confidence score (the IoU between the predicted box and the ground-truth box).
C is the number of class scores.
Within each grid cell:
Regression from each of the B boxes to a final box with 5 numbers (dx, dy, dh, dw, confidence)
Predict scores for each class (including background).
The confidence is the measure of how much we are sure the object matches this particular template box.
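These dimensions can be written down directly. As a sketch (YOLO v1's published values S=7, B=2, C=20 give the well-known 7x7x30 tensor; the slide's B=4 is another choice, and `split_cell_vector` is an illustrative helper, not from the paper):

```python
def yolo_output_shape(S, B, C):
    """YOLO v1 output tensor shape: an S x S grid of cells, each predicting
    B boxes of 5 numbers (x, y, w, h, confidence) plus C class scores."""
    return (S, S, B * 5 + C)

def split_cell_vector(cell, B, C):
    """Split one grid cell's vector into its B box predictions
    and its C class scores."""
    boxes = [tuple(cell[b * 5:(b + 1) * 5]) for b in range(B)]
    class_scores = cell[B * 5:]
    return boxes, class_scores
```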
IoU is the Jaccard similarity (Jaccard index); in object detection, it is called IoU.
A mechanism to compare two bounding boxes to evaluate our predictions.
In practice above 0.7 is good and 0.9 is as good as it can ever get.
> 0.5: decent
> 0.7: good
> 0.9: excellent
Algorithms usually output multiple detections of the same object, which means we have multiple overlapping boxes.
If an object is detected multiple times, NMS is used to choose the best box. NMS is a method to ensure that each object is detected only once.
1. Get rid of low-probability predictions