Faster rcnn

•Download as PPTX, PDF•

13 likes•11,768 views

The document summarizes the faster R-CNN object detection model. It introduces the Region Proposal Network (RPN) layer that predicts bounding boxes and classifies objects in one pass of the convolutional layers, making it faster than R-CNN and fast R-CNN models. It also discusses the training procedure involving initial training of the RPN, then training the full model in stages to balance the losses. Test results show faster R-CNN achieves real-time speeds while maintaining high accuracy compared to previous models.

Science

Faster R-CNN: Towards
Real-Time Object detection
Microsoft Research, NIPS2015
Presentor: Andy Tsai

Key Idea: Region Proposal Net (RPN) layer
Conv
Layers
Predict BBoxes Classify objects
conv5

Testing Time: faster R-CNN
Conv
Layers
Predict BBoxes Classify objects
conv5
1*ConvTime
1*SmallNet 1000*
FcTime

Testing Time: R-CNN
Conv
Layers
conv5
Object Proposal
…………….
Object Proposal
1000*ConvTime
1000*FcTime

Testing Time: fast R-CNN
Conv
Layers
conv5
Object Proposal
…………….
Object Proposal
1*ConvTime
1000*FcTime
ROI pooling

Fast, Accurate Object Detection
- fastest region proposal method: Edge Boxes [4fps, 1000 proposal]
- Testing stage
Model Time
Edge boxes + R-CNN 0.25 sec + 1000*ConvTime + 1000*FcTime
Edge boxes + fast R-CNN 0.25 sec + 1*ConvTime + 1000*FcTime
faster R-CNN 1*ConvTime + 1000*FcTime

RPN: Loss Function
- 2 class Softmax cross entropy loss
- Discriminative training:
- pi* = 1 if IoU > 0.7
- pi* = 0 if IoU < 0.3
- otherwise, do not contribute to loss

RPN: Loss Function
only positive sample contribute to reg. Loss

How to train faster R-CNN ?
- balance sampling ( neg. vs pos. = 1:1 )
- joint training is almost impossible ( conv update alternating )
- 4 step training !

How to train faster R-CNN ?
1. train RPN with ImageNet pre-trained model
2. train fast R-CNN using proposal generated by 1. [ no params. sharing ]
3. use conv trained by 2. to initialize model, fix shared conv, update RPN
4. fix shared conv, fine-tune fc

Result - MAP
Using VGG ConvNet, fast R-CNN vs faster R-CNN

Result - Time
VOC 2007, fast R-CNN vs faster R-CNN
70.0%
73.2%
59.9%

Code available at gitHub
Matlab(faster-rcnn) & python(py-faster-rcnn) version

What's hot

Object Detection Using R-CNN Deep Learning FrameworkNader Karimi

Intro to Object Detection with SSDThomas Delteil

You only look once: Unified, real-time object detection (UPC Reading Group)Universitat Politècnica de Catalunya

Object Detection using Deep Neural NetworksUsman Qayyum

Mask-RCNN for Instance SegmentationDat Nguyen

Single Shot Multibox DetectorNamHyuk Ahn

You only look once (YOLO) : unified real time object detectionEntrepreneur / Startup

SSD: Single Shot MultiBox Detector (UPC Reading Group)Universitat Politècnica de Catalunya

Viola-Jones Object DetectionVenugopal Boddu

Yolo v2 ai_tech_20190421穗碧陳

Yol ov2Bang Tsui Liou

[PR12] You Only Look Once (YOLO): Unified Real-Time Object DetectionTaegyun Jeon

Introduction to object detectionBrodmann17

PR-132: SSD: Single Shot MultiBox DetectorJinwon Lee

Yolov5 Hochschule Bonn-Rhein-Sieg

PR-207: YOLOv3: An Incremental ImprovementJinwon Lee

Object Pose EstimationArithmer Inc.

Pr057 mask rcnnTaeoh Kim

Object detection - RCNNs vs RetinanetRishabh Indoria

Introduction to Convolutional Neural NetworksHannes Hapke

What's hot (20)

Object Detection Using R-CNN Deep Learning Framework

Intro to Object Detection with SSD

You only look once: Unified, real-time object detection (UPC Reading Group)

Object Detection using Deep Neural Networks

Mask-RCNN for Instance Segmentation

Single Shot Multibox Detector

You only look once (YOLO) : unified real time object detection

SSD: Single Shot MultiBox Detector (UPC Reading Group)

Viola-Jones Object Detection

Yolo v2 ai_tech_20190421

Yol ov2

[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection

Introduction to object detection

PR-132: SSD: Single Shot MultiBox Detector

Yolov5

PR-207: YOLOv3: An Incremental Improvement

Object Pose Estimation

Pr057 mask rcnn

Object detection - RCNNs vs Retinanet

Introduction to Convolutional Neural Networks

Similar to Faster rcnn

Object Detection - Míriam Bellver - UPC Barcelona 2018Universitat Politècnica de Catalunya

D3L4-objects.pdfssusere945ae

Deep Learning for Computer Vision: Object Detection (UPC 2016)Universitat Politècnica de Catalunya

Auro tripathy - Localizing with CNNsAuro Tripathy

Detectionsimplyinsimple

On-the-fly Visual Category Search in Web-scale Image CollectionsKen Chatfield

Object Detection (D2L5 Insight@DCU Machine Learning Workshop 2017)Universitat Politècnica de Catalunya

150807 Fast R-CNNJunho Cho

Comparative Study of Object Detection AlgorithmsIRJET Journal

Understanding Rbmguestdfba847

Improving region based CNN object detector using bayesian optimizationAmgad Muhammad

R-FCN : object detection via region-based fully convolutional networksEntrepreneur / Startup

Similar to Faster rcnn (12)

Object Detection - Míriam Bellver - UPC Barcelona 2018

D3L4-objects.pdf

Deep Learning for Computer Vision: Object Detection (UPC 2016)

Auro tripathy - Localizing with CNNs

Detection

On-the-fly Visual Category Search in Web-scale Image Collections

Object Detection (D2L5 Insight@DCU Machine Learning Workshop 2017)

150807 Fast R-CNN

Comparative Study of Object Detection Algorithms

Understanding Rbm

Improving region based CNN object detector using bayesian optimization

R-FCN : object detection via region-based fully convolutional networks

Recently uploaded

Mammalian Pineal Body Structure and Also FunctionsYOGESH DOGRA

National Biodiversity protection initiatives and Convention on Biological Di...PABOLU TEJASREE

Richard's entangled aventures in wonderlandRichard Gill

Shuaib Y-basedComprehensive mahmudj.pptxMdAbuRayhan16

Gliese 12 b, a temperate Earth-sized planet at 12 parsecs discovered with TES...Sérgio Sacani

SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSELF-EXPLANATORY

Jet reorientation in central galaxies of clusters and groups: insights from V...Sérgio Sacani

THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.Sérgio Sacani

GBSN - Biochemistry (Unit 5) Chemistry of LipidsAreesha Ahmad

Anemia_ different types_causes_ conditionsmuralinath2

Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...NathanBaughman3

Lab report on liquid viscosity of glycerinossaicprecious19

The ASGCT Annual Meeting was packed with exciting progress in the field advan...Health Advances

word2vec, node2vec, graph2vec, X2vec: Towards a Theory of Vector Embeddings o...Subhajit Sahu

Aerodynamics. flippatterncn5tm5ttnj6nmnynypptsreddyrahul

THYROID-PARATHYROID medical surgical nursingJocelyn Atis

NuGOweek 2024 Ghent - programme - final versionpablovgd

SAMPLING.pptx for analystical chemistry sample techniquesrodneykiptoo8

Pests of Green Manures_Bionomics_IPM_Dr.UPR.pdfPirithiRaju

Constraints on Neutrino Natal Kicks from Black-Hole Binary VFTS 243Sérgio Sacani

Recently uploaded (20)

Mammalian Pineal Body Structure and Also Functions

National Biodiversity protection initiatives and Convention on Biological Di...

Richard's entangled aventures in wonderland

Shuaib Y-basedComprehensive mahmudj.pptx

Gliese 12 b, a temperate Earth-sized planet at 12 parsecs discovered with TES...

SCHIZOPHRENIA Disorder/ Brain Disorder.pdf

Jet reorientation in central galaxies of clusters and groups: insights from V...

THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.

GBSN - Biochemistry (Unit 5) Chemistry of Lipids

Anemia_ different types_causes_ conditions

Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...

Lab report on liquid viscosity of glycerin

The ASGCT Annual Meeting was packed with exciting progress in the field advan...

word2vec, node2vec, graph2vec, X2vec: Towards a Theory of Vector Embeddings o...

Aerodynamics. flippatterncn5tm5ttnj6nmnynyppt

THYROID-PARATHYROID medical surgical nursing

NuGOweek 2024 Ghent - programme - final version

SAMPLING.pptx for analystical chemistry sample techniques

Pests of Green Manures_Bionomics_IPM_Dr.UPR.pdf

Constraints on Neutrino Natal Kicks from Black-Hole Binary VFTS 243

Faster rcnn

1. Faster R-CNN: Towards Real-Time Object detection Microsoft Research, NIPS2015 Presentor: Andy Tsai

2. Key Idea: Region Proposal Net (RPN) layer Conv Layers Predict BBoxes Classify objects conv5

3. Testing Time: faster R-CNN Conv Layers Predict BBoxes Classify objects conv5 1*ConvTime 1*SmallNet 1000* FcTime

4. Testing Time: R-CNN Conv Layers conv5 Object Proposal ……………. Object Proposal 1000*ConvTime 1000*FcTime

5. Testing Time: fast R-CNN Conv Layers conv5 Object Proposal ……………. Object Proposal 1*ConvTime 1000*FcTime ROI pooling

6. Fast, Accurate Object Detection - fastest region proposal method: Edge Boxes [4fps, 1000 proposal] - Testing stage Model Time Edge boxes + R-CNN 0.25 sec + 1000*ConvTime + 1000*FcTime Edge boxes + fast R-CNN 0.25 sec + 1*ConvTime + 1000*FcTime faster R-CNN 1*ConvTime + 1000*FcTime

7. RPN layer

8. RPN layer

9. RPN layer Anchor regression

10. RPN: Loss Function - 2 class Softmax cross entropy loss - Discriminative training: - pi* = 1 if IoU > 0.7 - pi* = 0 if IoU < 0.3 - otherwise, do not contribute to loss

11. RPN: Loss Function only positive sample contribute to reg. Loss

12. How to train faster R-CNN ? - balance sampling ( neg. vs pos. = 1:1 ) - joint training is almost impossible ( conv update alternating ) - 4 step training !

13. How to train faster R-CNN ? 1. train RPN with ImageNet pre-trained model 2. train fast R-CNN using proposal generated by 1. [ no params. sharing ] 3. use conv trained by 2. to initialize model, fix shared conv, update RPN 4. fix shared conv, fine-tune fc

14. Result - MAP - VOC2007, ZF ConvNet

15. Result - MAP Using VGG ConvNet, fast R-CNN vs faster R-CNN

16. Result - Time VOC 2007, fast R-CNN vs faster R-CNN 70.0% 73.2% 59.9%

17. Official Leader Board Score

18. Code available at gitHub Matlab(faster-rcnn) & python(py-faster-rcnn) version

19. Thank you :)

Editor's Notes

為何不用fc ? 因為這樣的sliding window會有很多個，每個要用同一組weight
為何不用fc ? 因為這樣的sliding window會有很多個，每個要用同一組weight 會提出幾個proposal呢 ? k*你在conv5 feature map 上可以滑幾次, tipically, about 6k proposal

Faster rcnn

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Faster rcnn

Similar to Faster rcnn (12)

More from 捷恩蔡

More from 捷恩蔡 (8)

Recently uploaded

Recently uploaded (20)

Faster rcnn

Editor's Notes

Faster rcnn

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Faster rcnn

Similar to Faster rcnn (12)

More from 捷恩 蔡

More from 捷恩 蔡 (8)

Recently uploaded

Recently uploaded (20)

Faster rcnn

Editor's Notes

More from 捷恩蔡

More from 捷恩蔡 (8)