SlideShare a Scribd company logo
1 of 21
Mask R-CNN
CM Seminar 2017.09.01
Jaehyun Jun
Biointelligence Laboratory
Interdisciplinary Program of Neuro Science, Seoul National Univertisy
http://bi.snu.ac.kr
Overview
 Task
 object detection: classify objects and localize using bounding box
 instance segmentation: classify each pixel into a fixed set of
categories
© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr 2
Main Idea
 Goal: develop a comparably enabling framework for
instance segmentation
 Extension of Faster R-CNN
 RoIPool -> RoIAlign
 decouple mask and class prediction
3© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
Related Work
 R-CNN (Region-based CNN)
 Bounding box: Selective Search -> AlexNet -> linear regression
 Classification: Selective Search -> AlexNet -> SVM
 Faster R-CNN
 Bounding box: Region Proposal Network (RPN)
 Bounding box + Classification
: extract feature using RoIPool
 RoIPool: large negative effect on
predicting pixel-accurate mask
 misalign problem
4© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
History
 R-CNN -> SPP-net
 SPP-net -> Fast R-CNN
5© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
History
 Fast R-CNN -> Faster R-CNN
6© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
 Faster R-CNN -> Mask R-CNN
Related Work
 ResNeXt
 increasing cardinality is more effective than depth or width of
networks
 ResNeXt works better than ResNet having same number of
parameters
7© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
Mask R-CNN
 Two stage
1. Bounding box: Region Proposal Network (RPN)
2. Class and Box offset
 Output
 binary mask
 one for each classes
 Loss
 L = Lcls + Lbox + Lmask
 Lmask: average binary cross-entropy loss
 positive RoIs
8© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
Region Proposal Networks
1. Propose k reference boxes on sliding-window
2. Map sliding-window into lower-dimensional feature
3.1. feed on box-regression layer
3.2. feed on box-classification layer
9© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
RoIPooling
10© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
RoIPooling - Differentiable
11© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
RoIPooling - SGD step
12© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
 Region-wise sampling to make mini-batches
RoIAlign
 RoIPool: misalignment problem
 RoIAlign: use decimal points as boundary size & apply
bilinear interpolation (e.g. [x/16] -> x/16)
13© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
Architecture
 Backbone: feature extraction
 ResNet-50 or ResNet-101
 ResNeXt-50 or ResNeXt-101
 Feature Pyramid Network (FPN)
 C4 or C5 (convolutional layer)
 RoIAlign: aligning the extracted features with the input
 quantization X -> bilinear interpolation
 [x/16] -> x/16
 Head: bounding-box recognition & mask prediction
 ResNet-C4 -> 9-layer ‘res5’
 FPN -> res5 + filter
14© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
Dataset
 MS COCO
 Object Instance Annotations
 Object Keypoint Annotations
15© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
Experiment
 Mask R-CNN vs. FCIS+++
 no artifacts
16© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
Experiment
 RoIPool vs. RoIWarp vs. RoIAlign
17© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
Experiment
 Mask R-CNN vs. Faster R-CNN
 Improve 3~6%
18© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
Experiment
 Human Pose Estimation (Keypoint Detection)
19© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
Experiment
 Cityscapes
 1st on Instance Level Semantic
Labeling Task
20© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
Q & A

More Related Content

What's hot

Introduction to object detection
Introduction to object detectionIntroduction to object detection
Introduction to object detectionBrodmann17
 
CNN Attention Networks
CNN Attention NetworksCNN Attention Networks
CNN Attention NetworksTaeoh Kim
 
Deep Learning in Computer Vision
Deep Learning in Computer VisionDeep Learning in Computer Vision
Deep Learning in Computer VisionSungjoon Choi
 
Object Detection using Deep Neural Networks
Object Detection using Deep Neural NetworksObject Detection using Deep Neural Networks
Object Detection using Deep Neural NetworksUsman Qayyum
 
ViT (Vision Transformer) Review [CDM]
ViT (Vision Transformer) Review [CDM]ViT (Vision Transformer) Review [CDM]
ViT (Vision Transformer) Review [CDM]Dongmin Choi
 
Convolutional neural network
Convolutional neural network Convolutional neural network
Convolutional neural network Yan Xu
 
Mask-RCNN for Instance Segmentation
Mask-RCNN for Instance SegmentationMask-RCNN for Instance Segmentation
Mask-RCNN for Instance SegmentationDat Nguyen
 
PR-217: EfficientDet: Scalable and Efficient Object Detection
PR-217: EfficientDet: Scalable and Efficient Object DetectionPR-217: EfficientDet: Scalable and Efficient Object Detection
PR-217: EfficientDet: Scalable and Efficient Object DetectionJinwon Lee
 
Object Detection Methods using Deep Learning
Object Detection Methods using Deep LearningObject Detection Methods using Deep Learning
Object Detection Methods using Deep LearningSungjoon Choi
 
Semantic segmentation with Convolutional Neural Network Approaches
Semantic segmentation with Convolutional Neural Network ApproachesSemantic segmentation with Convolutional Neural Network Approaches
Semantic segmentation with Convolutional Neural Network ApproachesFellowship at Vodafone FutureLab
 
Object Detection and Recognition
Object Detection and Recognition Object Detection and Recognition
Object Detection and Recognition Intel Nervana
 
CNN Machine learning DeepLearning
CNN Machine learning DeepLearningCNN Machine learning DeepLearning
CNN Machine learning DeepLearningAbhishek Sharma
 
Yolo v2 ai_tech_20190421
Yolo v2 ai_tech_20190421Yolo v2 ai_tech_20190421
Yolo v2 ai_tech_20190421穗碧 陳
 
Deep learning based object detection basics
Deep learning based object detection basicsDeep learning based object detection basics
Deep learning based object detection basicsBrodmann17
 
1시간만에 GAN(Generative Adversarial Network) 완전 정복하기
1시간만에 GAN(Generative Adversarial Network) 완전 정복하기1시간만에 GAN(Generative Adversarial Network) 완전 정복하기
1시간만에 GAN(Generative Adversarial Network) 완전 정복하기NAVER Engineering
 

What's hot (20)

Introduction to object detection
Introduction to object detectionIntroduction to object detection
Introduction to object detection
 
CNN Attention Networks
CNN Attention NetworksCNN Attention Networks
CNN Attention Networks
 
Deep Learning in Computer Vision
Deep Learning in Computer VisionDeep Learning in Computer Vision
Deep Learning in Computer Vision
 
Object Detection using Deep Neural Networks
Object Detection using Deep Neural NetworksObject Detection using Deep Neural Networks
Object Detection using Deep Neural Networks
 
ViT (Vision Transformer) Review [CDM]
ViT (Vision Transformer) Review [CDM]ViT (Vision Transformer) Review [CDM]
ViT (Vision Transformer) Review [CDM]
 
Dcgan
DcganDcgan
Dcgan
 
Convolutional neural network
Convolutional neural network Convolutional neural network
Convolutional neural network
 
Mask-RCNN for Instance Segmentation
Mask-RCNN for Instance SegmentationMask-RCNN for Instance Segmentation
Mask-RCNN for Instance Segmentation
 
PR-217: EfficientDet: Scalable and Efficient Object Detection
PR-217: EfficientDet: Scalable and Efficient Object DetectionPR-217: EfficientDet: Scalable and Efficient Object Detection
PR-217: EfficientDet: Scalable and Efficient Object Detection
 
Object Detection Methods using Deep Learning
Object Detection Methods using Deep LearningObject Detection Methods using Deep Learning
Object Detection Methods using Deep Learning
 
Semantic segmentation with Convolutional Neural Network Approaches
Semantic segmentation with Convolutional Neural Network ApproachesSemantic segmentation with Convolutional Neural Network Approaches
Semantic segmentation with Convolutional Neural Network Approaches
 
Yolov5
Yolov5 Yolov5
Yolov5
 
Object Detection and Recognition
Object Detection and Recognition Object Detection and Recognition
Object Detection and Recognition
 
CNN Machine learning DeepLearning
CNN Machine learning DeepLearningCNN Machine learning DeepLearning
CNN Machine learning DeepLearning
 
Yolo v2 ai_tech_20190421
Yolo v2 ai_tech_20190421Yolo v2 ai_tech_20190421
Yolo v2 ai_tech_20190421
 
Deep learning based object detection basics
Deep learning based object detection basicsDeep learning based object detection basics
Deep learning based object detection basics
 
1시간만에 GAN(Generative Adversarial Network) 완전 정복하기
1시간만에 GAN(Generative Adversarial Network) 완전 정복하기1시간만에 GAN(Generative Adversarial Network) 완전 정복하기
1시간만에 GAN(Generative Adversarial Network) 완전 정복하기
 
Deep Learning for Video: Action Recognition (UPC 2018)
Deep Learning for Video: Action Recognition (UPC 2018)Deep Learning for Video: Action Recognition (UPC 2018)
Deep Learning for Video: Action Recognition (UPC 2018)
 
Densenet CNN
Densenet CNNDensenet CNN
Densenet CNN
 
Faster rcnn
Faster rcnnFaster rcnn
Faster rcnn
 

Similar to Mask R-CNN

Object Single Frame Using YOLO Model
Object Single Frame Using YOLO ModelObject Single Frame Using YOLO Model
Object Single Frame Using YOLO ModelIRJET Journal
 
Deep galaxy classification of galaxies based on deep convolutional neural ne...
Deep galaxy  classification of galaxies based on deep convolutional neural ne...Deep galaxy  classification of galaxies based on deep convolutional neural ne...
Deep galaxy classification of galaxies based on deep convolutional neural ne...Aboul Ella Hassanien
 
Recognition and Detection of Real-Time Objects Using Unified Network of Faste...
Recognition and Detection of Real-Time Objects Using Unified Network of Faste...Recognition and Detection of Real-Time Objects Using Unified Network of Faste...
Recognition and Detection of Real-Time Objects Using Unified Network of Faste...dbpublications
 
CNNs: from the Basics to Recent Advances
CNNs: from the Basics to Recent AdvancesCNNs: from the Basics to Recent Advances
CNNs: from the Basics to Recent AdvancesDmytro Mishkin
 
Garbage Classification Using Deep Learning Techniques
Garbage Classification Using Deep Learning TechniquesGarbage Classification Using Deep Learning Techniques
Garbage Classification Using Deep Learning TechniquesIRJET Journal
 
[Paper] DetectoRS for Object Detection
[Paper] DetectoRS for Object Detection[Paper] DetectoRS for Object Detection
[Paper] DetectoRS for Object DetectionSusang Kim
 
Recent Progress on Object Detection_20170331
Recent Progress on Object Detection_20170331Recent Progress on Object Detection_20170331
Recent Progress on Object Detection_20170331Jihong Kang
 
Remote Sensing IEEE 2015 Projects
Remote Sensing IEEE 2015 ProjectsRemote Sensing IEEE 2015 Projects
Remote Sensing IEEE 2015 ProjectsVijay Karan
 
SULI poster
SULI posterSULI poster
SULI posterTing Chi
 
最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に - 最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に - Hiroshi Fukui
 
IJSRED-V2I5P40
IJSRED-V2I5P40IJSRED-V2I5P40
IJSRED-V2I5P40IJSRED
 
Remote Sensing IEEE 2015 Projects
Remote Sensing IEEE 2015 ProjectsRemote Sensing IEEE 2015 Projects
Remote Sensing IEEE 2015 ProjectsVijay Karan
 
A Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detectionA Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detectionvivatechijri
 
Deep Learning - Optimization Basic
Deep Learning - Optimization BasicDeep Learning - Optimization Basic
Deep Learning - Optimization BasicJaehyun Jun
 
IRJET- Weakly Supervised Object Detection by using Fast R-CNN
IRJET- Weakly Supervised Object Detection by using Fast R-CNNIRJET- Weakly Supervised Object Detection by using Fast R-CNN
IRJET- Weakly Supervised Object Detection by using Fast R-CNNIRJET Journal
 
(Research Note) Delving deeper into convolutional neural networks for camera ...
(Research Note) Delving deeper into convolutional neural networks for camera ...(Research Note) Delving deeper into convolutional neural networks for camera ...
(Research Note) Delving deeper into convolutional neural networks for camera ...Jacky Liu
 
A Review on Color Recognition using Deep Learning and Different Image Segment...
A Review on Color Recognition using Deep Learning and Different Image Segment...A Review on Color Recognition using Deep Learning and Different Image Segment...
A Review on Color Recognition using Deep Learning and Different Image Segment...IRJET Journal
 
IRJET- Extension to Visual Information Narrator using Neural Network
IRJET- Extension to Visual Information Narrator using Neural NetworkIRJET- Extension to Visual Information Narrator using Neural Network
IRJET- Extension to Visual Information Narrator using Neural NetworkIRJET Journal
 

Similar to Mask R-CNN (20)

Object Single Frame Using YOLO Model
Object Single Frame Using YOLO ModelObject Single Frame Using YOLO Model
Object Single Frame Using YOLO Model
 
Deep galaxy classification of galaxies based on deep convolutional neural ne...
Deep galaxy  classification of galaxies based on deep convolutional neural ne...Deep galaxy  classification of galaxies based on deep convolutional neural ne...
Deep galaxy classification of galaxies based on deep convolutional neural ne...
 
Recognition and Detection of Real-Time Objects Using Unified Network of Faste...
Recognition and Detection of Real-Time Objects Using Unified Network of Faste...Recognition and Detection of Real-Time Objects Using Unified Network of Faste...
Recognition and Detection of Real-Time Objects Using Unified Network of Faste...
 
CNNs: from the Basics to Recent Advances
CNNs: from the Basics to Recent AdvancesCNNs: from the Basics to Recent Advances
CNNs: from the Basics to Recent Advances
 
Garbage Classification Using Deep Learning Techniques
Garbage Classification Using Deep Learning TechniquesGarbage Classification Using Deep Learning Techniques
Garbage Classification Using Deep Learning Techniques
 
PointNet
PointNetPointNet
PointNet
 
[Paper] DetectoRS for Object Detection
[Paper] DetectoRS for Object Detection[Paper] DetectoRS for Object Detection
[Paper] DetectoRS for Object Detection
 
Recent Progress on Object Detection_20170331
Recent Progress on Object Detection_20170331Recent Progress on Object Detection_20170331
Recent Progress on Object Detection_20170331
 
Remote Sensing IEEE 2015 Projects
Remote Sensing IEEE 2015 ProjectsRemote Sensing IEEE 2015 Projects
Remote Sensing IEEE 2015 Projects
 
SULI poster
SULI posterSULI poster
SULI poster
 
最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に - 最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に -
 
IJSRED-V2I5P40
IJSRED-V2I5P40IJSRED-V2I5P40
IJSRED-V2I5P40
 
Resume_optics_Gupta Roy
Resume_optics_Gupta RoyResume_optics_Gupta Roy
Resume_optics_Gupta Roy
 
Remote Sensing IEEE 2015 Projects
Remote Sensing IEEE 2015 ProjectsRemote Sensing IEEE 2015 Projects
Remote Sensing IEEE 2015 Projects
 
A Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detectionA Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detection
 
Deep Learning - Optimization Basic
Deep Learning - Optimization BasicDeep Learning - Optimization Basic
Deep Learning - Optimization Basic
 
IRJET- Weakly Supervised Object Detection by using Fast R-CNN
IRJET- Weakly Supervised Object Detection by using Fast R-CNNIRJET- Weakly Supervised Object Detection by using Fast R-CNN
IRJET- Weakly Supervised Object Detection by using Fast R-CNN
 
(Research Note) Delving deeper into convolutional neural networks for camera ...
(Research Note) Delving deeper into convolutional neural networks for camera ...(Research Note) Delving deeper into convolutional neural networks for camera ...
(Research Note) Delving deeper into convolutional neural networks for camera ...
 
A Review on Color Recognition using Deep Learning and Different Image Segment...
A Review on Color Recognition using Deep Learning and Different Image Segment...A Review on Color Recognition using Deep Learning and Different Image Segment...
A Review on Color Recognition using Deep Learning and Different Image Segment...
 
IRJET- Extension to Visual Information Narrator using Neural Network
IRJET- Extension to Visual Information Narrator using Neural NetworkIRJET- Extension to Visual Information Narrator using Neural Network
IRJET- Extension to Visual Information Narrator using Neural Network
 

Recently uploaded

Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingrknatarajan
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...RajaP95
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations120cr0395
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college projectTonystark477637
 

Recently uploaded (20)

Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 

Mask R-CNN

  • 1. Mask R-CNN CM Seminar 2017.09.01 Jaehyun Jun Biointelligence Laboratory Interdisciplinary Program of Neuro Science, Seoul National Univertisy http://bi.snu.ac.kr
  • 2. Overview  Task  object detection: classify objects and localize using bounding box  instance segmentation: classify each pixel into a fixed set of categories © 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr 2
  • 3. Main Idea  Goal: develop a comparably enabling framework for instance segmentation  Extension of Faster R-CNN  RoIPool -> RoIAlign  decouple mask and class prediction 3© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
  • 4. Related Work  R-CNN (Region-based CNN)  Bounding box: Selective Search -> AlexNet -> linear regression  Classification: Selective Search -> AlexNet -> SVM  Faster R-CNN  Bounding box: Region Proposal Network (RPN)  Bounding box + Classification : extract feature using RoIPool  RoIPool: large negative effect on predicting pixel-accurate mask  misalign problem 4© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
  • 5. History  R-CNN -> SPP-net  SPP-net -> Fast R-CNN 5© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
  • 6. History  Fast R-CNN -> Faster R-CNN 6© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr  Faster R-CNN -> Mask R-CNN
  • 7. Related Work  ResNeXt  increasing cardinality is more effective than depth or width of networks  ResNeXt works better than ResNet having same number of parameters 7© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
  • 8. Mask R-CNN  Two stage 1. Bounding box: Region Proposal Network (RPN) 2. Class and Box offset  Output  binary mask  one for each classes  Loss  L = Lcls + Lbox + Lmask  Lmask: average binary cross-entropy loss  positive RoIs 8© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
  • 9. Region Proposal Networks 1. Propose k reference boxes on sliding-window 2. Map sliding-window into lower-dimensional feature 3.1. feed on box-regression layer 3.2. feed on box-classification layer 9© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
  • 10. RoIPooling 10© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
  • 11. RoIPooling - Differentiable 11© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
  • 12. RoIPooling - SGD step 12© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr  Region-wise sampling to make mini-batches
  • 13. RoIAlign  RoIPool: misalignment problem  RoIAlign: use decimal points as boundary size & apply bilinear interpolation (e.g. [x/16] -> x/16) 13© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
  • 14. Architecture  Backbone: feature extraction  ResNet-50 or ResNet-101  ResNeXt-50 or ResNeXt-101  Feature Pyramid Network (FPN)  C4 or C5 (convolutional layer)  RoIAlign: aligning the extracted features with the input  quantization X -> bilinear interpolation  [x/16] -> x/16  Head: bounding-box recognition & mask prediction  ResNet-C4 -> 9-layer ‘res5’  FPN -> res5 + filter 14© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
  • 15. Dataset  MS COCO  Object Instance Annotations  Object Keypoint Annotations 15© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
  • 16. Experiment  Mask R-CNN vs. FCIS+++  no artifacts 16© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
  • 17. Experiment  RoIPool vs. RoIWarp vs. RoIAlign 17© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
  • 18. Experiment  Mask R-CNN vs. Faster R-CNN  Improve 3~6% 18© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
  • 19. Experiment  Human Pose Estimation (Keypoint Detection) 19© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
  • 20. Experiment  Cityscapes  1st on Instance Level Semantic Labeling Task 20© 2017, SNU Biointelligence Lab., http://bi.snu.ac.kr
  • 21. Q & A

Editor's Notes

  1. 2016 City scape dataset - Instance Level Semantic Labeling Task 1등
  2. R-CNN은 모든 object 에 대해서 별개의 network로 feature map을 뽑기 때문에 중복된 연산이 많이 일어남 -> RoIPool: 전체 이미지에 대해서 하나의 network로 feature map을 뽑고 object에 해당하는 feature map을 추출하여 사용
  3. mask와 class prediction을 분리시키는 것이 핵심
  4. RoIPool 에서 [x/16] 을 사용한 이유? bilinear interpolation는 어디에 어떻게 사용되고 사용하는 이유?
  5. AP50, 75 의 의미? IoU threshold 라고 하는데 overlap된 영역이 50% 75% 넘으면 처리하지 않는다 정도의 내용인지…