SlideShare a Scribd company logo
1 of 27
Download to read offline
김 성 철
Contents
1. Introduction
2. Panoptic Segmentation
3. Human Consistency Study
4. Machine Performance Baselines
5. Future of Panoptic Segmentation
2
Introduction & Related Work
• Introduction
• Semantic Segmentation
• Stuff (such as grass, sky, road, ..)
• Simply assign a class label to each pixel in an image
(treat thing classes as stuff)
• FCN으로 object instance를 나누기 힘듬
• Instance Segmentation
• Things (such as people, animals, tools, ..)
• Detect each object and delineate it with a bbox or mask
• Region-based method로 overlapping을 피하기 힘듬
3
Introduction & Related Work
• Introduction
• Panoptic Segmentation
• Encompass both stuff and thing classes
→ Panoptic : including every thing visible in one view
• Use a simple but general output format
→ 각 pixel은 semantic label과 instance id를 할당받음
• Introduce a uniform evaluation metric
→ Semantic, Instance 모두 적용 가능한 Panoptic Quality
4
Panoptic Segmentation
• Format
• Task format
• Semantic classes ℒ ≔ {0, … , ℒ − 1}
• Each pixel 𝑖 is mapped to a pair 𝒍𝒊, 𝒛𝒊 ∈ 𝓛 × ℕ
• 𝑙𝑖 : the semantic class of pixel 𝑖
• 𝑧𝑖 : its instance id. Group pixels of the same class into distinct segments
• 애매하거나 out-of-class pixel들은 void label로 배정
→ 모든 pixel이 semantic label을 갖는 건 아님
5
Panoptic Segmentation
• Format
• Stuff and thing labels
• Semantic label set은 subset ℒ 𝑆𝑡와 ℒ 𝑇ℎ로 구성
• ℒ = ℒ 𝑆𝑡
∪ ℒ 𝑇ℎ
, ℒ 𝑆𝑡
∩ ℒ 𝑇ℎ
= ∅
• Pixel이 𝑙𝑖 ∈ ℒ 𝑆𝑡
로 레이블 될 때, 𝑧𝑖는 관계 없음
→ 모든 stuff classes는 같은 instance를 갖기 때문 (e.g., the same sky)
• 모든 pixel이 같은 𝒍𝒊, 𝒛𝒊 일 때 (𝑙𝑖 ∈ ℒ 𝑇ℎ), 같은 instance에 속함 (e.g., the same car)
• Single instance에 속하는 경우 같은 𝒍𝒊, 𝒛𝒊 을 가짐
6
Panoptic Segmentation
• Format
• Relationship to semantic segmentation
• Panoptic segmentation은 semantic segmentation의 strict generalization
• 각 pixel에 semantic label을 할당하는 것은 동일, thing class가 들어가는 순간 달라짐
• Relationship to instance segmentation
• Instance segmentation은 overlapping segment 발생
→ Panoptic segmentation은 각 pixel에 semantic label과 instance id 하나씩 할당
→ Overlapping 발생 X
7
Panoptic Segmentation
• Metric
• 기존 방법들은 semantic or instance segmentation에 특화
→ stuff와 thing을 동시에 측정 불가
• Stuff와 thing을 동시에 측정할 수 있는 metric 제시
• Completeness : Treat stuff and thing classes in a uniform way
• Interpretability : A metric with identifiable meaning
• Simplicity : Simple to define and implement
→ Panoptic Quality (PQ)
8
Panoptic Segmentation
• Metric
• Segment Matching
• 𝐼𝑜𝑈 𝑝𝑖, 𝑔 =
|𝑝 𝑖∩𝑔|
|𝑝 𝑖∪𝑔|
≤
𝑝 𝑖∩𝑔
𝑔
𝑓𝑜𝑟 𝑖 ∈ 1, 2
• 𝑝1 ∩ 𝑝2 = ∅, |𝑝𝑖 ∪ 𝑔| ≥ |𝑔|
• 𝐼𝑜𝑈 𝑝1, 𝑔 + 𝐼𝑜𝑈 𝑝2, 𝑔 ≤
𝑝1∩𝑔 + 𝑝2∩𝑔
𝑔
≤ 1
• 𝑝1 ∩ 𝑔 + 𝑝2 ∩ 𝑔 ≤ g
• Simple and Interpretable!
9
Panoptic Segmentation
• Metric
• PQ Computation
• Class별로 PQ 계산 후 평균 → class imbalance에 insensitive하게 만듬
• 𝑷𝑸 =
σ 𝒑,𝒈 ∈𝑻𝑷 𝑰𝒐𝑼(𝒑,𝒈)
𝑻𝑷 +
𝟏
𝟐
𝑭𝑷 +
𝟏
𝟐
𝑭𝑵
•
1
|𝑇𝑃|
σ 𝑝,𝑔 ∈𝑇𝑃 𝐼𝑜𝑈(𝑝, 𝑔) : the average 𝐼𝑜𝑈 of matched segments
•
1
2
𝐹𝑃 +
1
2
𝐹𝑁 : to penalize segments without matches
• All segments receive equal importance regardless of their area!
10
Panoptic Segmentation
• Metric
• PQ Computation
• 𝑷𝑸 =
σ 𝒑,𝒈 ∈𝑻𝑷 𝑰𝒐𝑼(𝒑,𝒈)
𝑻𝑷 +
𝟏
𝟐
𝑭𝑷 +
𝟏
𝟐
𝑭𝑵
=
σ 𝒑,𝒈 ∈𝑻𝑷 𝑰𝒐𝑼(𝒑,𝒈)
𝑻𝑷
×
|𝑻𝑷|
𝑻𝑷 +
𝟏
𝟐
𝑭𝑷 +
𝟏
𝟐
𝑭𝑵
(𝑺𝑸 × 𝑹𝑸)
•
σ 𝑝,𝑔 ∈𝑇𝑃 𝐼𝑜𝑈(𝑝,𝑔)
𝑇𝑃
: segmentation quality (SQ)  the average 𝐼𝑜𝑈 of matched segments
•
|𝑇𝑃|
𝑇𝑃 +
1
2
𝐹𝑃 +
1
2
𝐹𝑁
: recognition quality (RQ)  𝐹1 score과 흡사
11
Panoptic Segmentation
• Metric
• PQ Computation
• Void labels in the ground truth
• Out of class pixels / ambiguous or unknown pixels → 평가하지 않음
• Ground truth에서 void labels인 pixel들은 제외하고 𝐼𝑜𝑈 계산
• Void pixel의 일부가 포함된 unmatched predicted segments는 제거 & FP로 처리X
• Group labels
• Cityscapes, COCO는 instance id 대신 group label을 사용
• Matching 중에는 group region 사용 X
• Matching 이후, matching threshold를 넘는 same class의 일부를 포함하는 12
Panoptic Segmentation
• Metric
• Comparison to Existing Metrics
• Semantic segmentation metrics
• 𝐼𝑜𝑈 → pixel output/labels 기반 계산 & object-level labels 무시
• Instance segmentation metrics
• Average Precision : Confidence score 필요 → semantic segmentation에 적합하지 않음
• Panoptic quality
• Treat all classes (stuff and things) in a uniform way
• SQ, RQ로 decomposing 할 수 있지만, PQ가 semantic과 instance의 metric은 아님
→ 오히려 SQ와 RQ가 각각 semantic과 instance의 metric 13
Panoptic Segmentation
• Datasets
• Cityscapes
• 5000 images (2975 / 500 / 1525)
• 19 classes among which 8 have instance-level
• ADE20k
• 25k images (20k / 2k / 3k)
• 100 thing and 50 stuff classes in the 2017 Places Challenge
• Mapillary Vistas
• 25k street-view images (18k / 2k / 5k)
• 28 stuff and 37 thing classes in ‘research edition’
14
Human Consistency Study
• Human annotation
• 하나를 ground truth, 다른 하나를 prediction으로 설정 후 PQ 측정
• Human consistency
15
Human Consistency Study
• Stuff 𝒗𝒔. Things
• 모든 stuff and things classes가 고르게 분포
• Small 𝒗𝒔. large objects
• Small 25% | middle 50% | largest 25%
16
Human Consistency Study
• 𝑰𝒐𝑼 threshold
• SQ 𝒗𝒔. RQ balance
• 𝑅𝑄 𝛼 =
|𝑇𝑃|
𝑇𝑃 +𝛼 𝐹𝑃 +𝛼|𝐹𝑁|
(default 𝛼 = 0.5)
• default 𝛼 가 SQ와 RQ 사이에 균형을 맞춰 줌
• 상황에 따라 변경해주면 될 듯
17
Machine Performance Baseline
• 세 가지 관점
• How do heuristic combinations of top-performing instance and semantic
segmentation systems perform on panoptic segmentation?
• How does PQ compare to existing metrics like 𝐴𝑃 and 𝐼𝑜𝑈?
• How do the machine results compare to the human results that we
presented previously?
18
Machine Performance Baseline
• Algorithms and data
• 각 dataset에서 algorithm output을 얻음
• Cityscapes : masks generated by PSPNet and Mask R-CNN
• ADE20k : the winners of both the semantic and instance seg. in the 2017 Places
Challenge
• Vistas : 주최측으로 부터 1k testset에 대한 semantic & instance seg. 결과를 제공받음
19
Machine Performance Baseline
• Instance segmentation
• Overlapping segment를 얻음 → 해결
• Confidence score 순으로 sorting
→ 낮은 score는 제거
• 가장 confident한 것부터 시작해서 반복
• Remove pixels which have been assigned to previous segments
• Segment의 충분한 부분이 남아있으면 non-overlapping, 그렇지 않으면 전체 segment 제거
20
Machine Performance Baseline
• Semantic segmentation
• No overlapping → 바로 PQ 계산 가능
• Multi-scale : skip connection (?)
21
Machine Performance Baseline
• Panoptic segmentation
• Instance + Semantic for resolving any overlap between thing and stuff
22
Machine Performance Baseline
• Human 𝒗𝒔. machine panoptic segmentation
23
Future of Panoptic Segmentation
• Stuff와 thing을 동시에 설명하는 end-to-end model
• Instance segmentation에서 non-overlapping을 위한 실험들이 진행됨
• Some form of higher-level ‘reasoning’ may be beneficial
• Extend learnable NMS to PS
24
Future of Panoptic Segmentation
• Stuff와 thing을 동시에 설명하는 end-to-end model
• Instance segmentation에서 non-overlapping을 위한 실험들이 진행됨
• Some form of higher-level ‘reasoning’ may be beneficial
• Extend learnable NMS to PS
25
UPSNet (in CVPR 2019)
26
감 사 합 니 다
27

More Related Content

What's hot

StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery
StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery
StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery ivaderivader
 
Introduction to object detection
Introduction to object detectionIntroduction to object detection
Introduction to object detectionBrodmann17
 
Image segmentation with deep learning
Image segmentation with deep learningImage segmentation with deep learning
Image segmentation with deep learningAntonio Rueda-Toicen
 
Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
Semantic Segmentation - Fully Convolutional Networks for Semantic SegmentationSemantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation岳華 杜
 
Object detection with deep learning
Object detection with deep learningObject detection with deep learning
Object detection with deep learningSushant Shrivastava
 
Object Detection Using R-CNN Deep Learning Framework
Object Detection Using R-CNN Deep Learning FrameworkObject Detection Using R-CNN Deep Learning Framework
Object Detection Using R-CNN Deep Learning FrameworkNader Karimi
 
Tutorial on Object Detection (Faster R-CNN)
Tutorial on Object Detection (Faster R-CNN)Tutorial on Object Detection (Faster R-CNN)
Tutorial on Object Detection (Faster R-CNN)Hwa Pyung Kim
 
Faster R-CNN: Towards real-time object detection with region proposal network...
Faster R-CNN: Towards real-time object detection with region proposal network...Faster R-CNN: Towards real-time object detection with region proposal network...
Faster R-CNN: Towards real-time object detection with region proposal network...Universitat Politècnica de Catalunya
 
#10 pydata warsaw object detection with dn ns
#10   pydata warsaw object detection with dn ns#10   pydata warsaw object detection with dn ns
#10 pydata warsaw object detection with dn nsAndrew Brozek
 
morphological image processing
morphological image processingmorphological image processing
morphological image processingAnubhav Kumar
 
PR-284: End-to-End Object Detection with Transformers(DETR)
PR-284: End-to-End Object Detection with Transformers(DETR)PR-284: End-to-End Object Detection with Transformers(DETR)
PR-284: End-to-End Object Detection with Transformers(DETR)Jinwon Lee
 
210523 swin transformer v1.5
210523 swin transformer v1.5210523 swin transformer v1.5
210523 swin transformer v1.5taeseon ryu
 
Deeplabv1, v2, v3, v3+
Deeplabv1, v2, v3, v3+Deeplabv1, v2, v3, v3+
Deeplabv1, v2, v3, v3+Sungchul Kim
 
PR-132: SSD: Single Shot MultiBox Detector
PR-132: SSD: Single Shot MultiBox DetectorPR-132: SSD: Single Shot MultiBox Detector
PR-132: SSD: Single Shot MultiBox DetectorJinwon Lee
 
Image Object Detection Pipeline
Image Object Detection PipelineImage Object Detection Pipeline
Image Object Detection PipelineAbhinav Dadhich
 

What's hot (20)

YOLO v1
YOLO v1YOLO v1
YOLO v1
 
StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery
StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery
StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery
 
Introduction to object detection
Introduction to object detectionIntroduction to object detection
Introduction to object detection
 
Image segmentation with deep learning
Image segmentation with deep learningImage segmentation with deep learning
Image segmentation with deep learning
 
You only look once
You only look onceYou only look once
You only look once
 
Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
Semantic Segmentation - Fully Convolutional Networks for Semantic SegmentationSemantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
 
Object detection with deep learning
Object detection with deep learningObject detection with deep learning
Object detection with deep learning
 
Object Detection Using R-CNN Deep Learning Framework
Object Detection Using R-CNN Deep Learning FrameworkObject Detection Using R-CNN Deep Learning Framework
Object Detection Using R-CNN Deep Learning Framework
 
Tutorial on Object Detection (Faster R-CNN)
Tutorial on Object Detection (Faster R-CNN)Tutorial on Object Detection (Faster R-CNN)
Tutorial on Object Detection (Faster R-CNN)
 
Faster R-CNN: Towards real-time object detection with region proposal network...
Faster R-CNN: Towards real-time object detection with region proposal network...Faster R-CNN: Towards real-time object detection with region proposal network...
Faster R-CNN: Towards real-time object detection with region proposal network...
 
#10 pydata warsaw object detection with dn ns
#10   pydata warsaw object detection with dn ns#10   pydata warsaw object detection with dn ns
#10 pydata warsaw object detection with dn ns
 
morphological image processing
morphological image processingmorphological image processing
morphological image processing
 
PR-284: End-to-End Object Detection with Transformers(DETR)
PR-284: End-to-End Object Detection with Transformers(DETR)PR-284: End-to-End Object Detection with Transformers(DETR)
PR-284: End-to-End Object Detection with Transformers(DETR)
 
210523 swin transformer v1.5
210523 swin transformer v1.5210523 swin transformer v1.5
210523 swin transformer v1.5
 
Deeplabv1, v2, v3, v3+
Deeplabv1, v2, v3, v3+Deeplabv1, v2, v3, v3+
Deeplabv1, v2, v3, v3+
 
Faster rcnn
Faster rcnnFaster rcnn
Faster rcnn
 
PR-132: SSD: Single Shot MultiBox Detector
PR-132: SSD: Single Shot MultiBox DetectorPR-132: SSD: Single Shot MultiBox Detector
PR-132: SSD: Single Shot MultiBox Detector
 
Yol ov2
Yol ov2Yol ov2
Yol ov2
 
Image Object Detection Pipeline
Image Object Detection PipelineImage Object Detection Pipeline
Image Object Detection Pipeline
 
Density based clustering
Density based clusteringDensity based clustering
Density based clustering
 

Similar to Panoptic Segmentation

A Unified Framework for Computer Vision Tasks: (Conditional) Generative Model...
A Unified Framework for Computer Vision Tasks: (Conditional) Generative Model...A Unified Framework for Computer Vision Tasks: (Conditional) Generative Model...
A Unified Framework for Computer Vision Tasks: (Conditional) Generative Model...Sangwoo Mo
 
Computer Vision descriptors
Computer Vision descriptorsComputer Vision descriptors
Computer Vision descriptorsWael Badawy
 
Evolving a Medical Image Similarity Search
Evolving a Medical Image Similarity SearchEvolving a Medical Image Similarity Search
Evolving a Medical Image Similarity SearchSujit Pal
 
Multi-class Classification on Riemannian Manifolds for Video Surveillance
Multi-class Classification on Riemannian Manifolds for Video SurveillanceMulti-class Classification on Riemannian Manifolds for Video Surveillance
Multi-class Classification on Riemannian Manifolds for Video SurveillanceDiego Tosato
 
FCIS - Fully Instance-aware Semantic Segmentation -
FCIS - Fully Instance-aware Semantic Segmentation -FCIS - Fully Instance-aware Semantic Segmentation -
FCIS - Fully Instance-aware Semantic Segmentation -晋吾 北川
 
Viola-Jones Object Detection
Viola-Jones Object DetectionViola-Jones Object Detection
Viola-Jones Object DetectionVenugopal Boddu
 
The Machine Learning Workflow with Azure
The Machine Learning Workflow with AzureThe Machine Learning Workflow with Azure
The Machine Learning Workflow with AzureIvo Andreev
 
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)Universitat Politècnica de Catalunya
 
Video Object Segmentation in Videos
Video Object Segmentation in VideosVideo Object Segmentation in Videos
Video Object Segmentation in VideosNAVER Engineering
 
Feature recognition and classification
Feature recognition and classificationFeature recognition and classification
Feature recognition and classificationSooraz Sresta
 
Salient KeypointSelection for Object Representation
Salient KeypointSelection for Object RepresentationSalient KeypointSelection for Object Representation
Salient KeypointSelection for Object RepresentationPrerana Mukherjee
 
“Introduction to Semantic Segmentation,” a Presentation from Au-Zone Technolo...
“Introduction to Semantic Segmentation,” a Presentation from Au-Zone Technolo...“Introduction to Semantic Segmentation,” a Presentation from Au-Zone Technolo...
“Introduction to Semantic Segmentation,” a Presentation from Au-Zone Technolo...Edge AI and Vision Alliance
 
Online video object segmentation via convolutional trident network
Online video object segmentation via convolutional trident networkOnline video object segmentation via convolutional trident network
Online video object segmentation via convolutional trident networkNAVER Engineering
 
Deep Learning Models for Question Answering
Deep Learning Models for Question AnsweringDeep Learning Models for Question Answering
Deep Learning Models for Question AnsweringSujit Pal
 
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3Dr. Aparna Varde
 
[DL輪読会]PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection
[DL輪読会]PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection[DL輪読会]PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection
[DL輪読会]PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object DetectionDeep Learning JP
 

Similar to Panoptic Segmentation (20)

A Unified Framework for Computer Vision Tasks: (Conditional) Generative Model...
A Unified Framework for Computer Vision Tasks: (Conditional) Generative Model...A Unified Framework for Computer Vision Tasks: (Conditional) Generative Model...
A Unified Framework for Computer Vision Tasks: (Conditional) Generative Model...
 
Computer Vision descriptors
Computer Vision descriptorsComputer Vision descriptors
Computer Vision descriptors
 
Evolving a Medical Image Similarity Search
Evolving a Medical Image Similarity SearchEvolving a Medical Image Similarity Search
Evolving a Medical Image Similarity Search
 
lecture_16_jiajun.pdf
lecture_16_jiajun.pdflecture_16_jiajun.pdf
lecture_16_jiajun.pdf
 
Multi-class Classification on Riemannian Manifolds for Video Surveillance
Multi-class Classification on Riemannian Manifolds for Video SurveillanceMulti-class Classification on Riemannian Manifolds for Video Surveillance
Multi-class Classification on Riemannian Manifolds for Video Surveillance
 
FCIS - Fully Instance-aware Semantic Segmentation -
FCIS - Fully Instance-aware Semantic Segmentation -FCIS - Fully Instance-aware Semantic Segmentation -
FCIS - Fully Instance-aware Semantic Segmentation -
 
Viola-Jones Object Detection
Viola-Jones Object DetectionViola-Jones Object Detection
Viola-Jones Object Detection
 
The Machine Learning Workflow with Azure
The Machine Learning Workflow with AzureThe Machine Learning Workflow with Azure
The Machine Learning Workflow with Azure
 
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
 
Face detection system design seminar
Face detection system design seminarFace detection system design seminar
Face detection system design seminar
 
Video Object Segmentation in Videos
Video Object Segmentation in VideosVideo Object Segmentation in Videos
Video Object Segmentation in Videos
 
Feature recognition and classification
Feature recognition and classificationFeature recognition and classification
Feature recognition and classification
 
Object Recognition
Object RecognitionObject Recognition
Object Recognition
 
Salient KeypointSelection for Object Representation
Salient KeypointSelection for Object RepresentationSalient KeypointSelection for Object Representation
Salient KeypointSelection for Object Representation
 
“Introduction to Semantic Segmentation,” a Presentation from Au-Zone Technolo...
“Introduction to Semantic Segmentation,” a Presentation from Au-Zone Technolo...“Introduction to Semantic Segmentation,” a Presentation from Au-Zone Technolo...
“Introduction to Semantic Segmentation,” a Presentation from Au-Zone Technolo...
 
Machine Learning with R
Machine Learning with RMachine Learning with R
Machine Learning with R
 
Online video object segmentation via convolutional trident network
Online video object segmentation via convolutional trident networkOnline video object segmentation via convolutional trident network
Online video object segmentation via convolutional trident network
 
Deep Learning Models for Question Answering
Deep Learning Models for Question AnsweringDeep Learning Models for Question Answering
Deep Learning Models for Question Answering
 
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
 
[DL輪読会]PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection
[DL輪読会]PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection[DL輪読会]PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection
[DL輪読会]PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection
 

More from Sungchul Kim

PR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision
PR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo SupervisionPR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision
PR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo SupervisionSungchul Kim
 
Revisiting the Calibration of Modern Neural Networks
Revisiting the Calibration of Modern Neural NetworksRevisiting the Calibration of Modern Neural Networks
Revisiting the Calibration of Modern Neural NetworksSungchul Kim
 
Emerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision TransformersEmerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision TransformersSungchul Kim
 
PR-305: Exploring Simple Siamese Representation Learning
PR-305: Exploring Simple Siamese Representation LearningPR-305: Exploring Simple Siamese Representation Learning
PR-305: Exploring Simple Siamese Representation LearningSungchul Kim
 
Score based Generative Modeling through Stochastic Differential Equations
Score based Generative Modeling through Stochastic Differential EquationsScore based Generative Modeling through Stochastic Differential Equations
Score based Generative Modeling through Stochastic Differential EquationsSungchul Kim
 
Exploring Simple Siamese Representation Learning
Exploring Simple Siamese Representation LearningExploring Simple Siamese Representation Learning
Exploring Simple Siamese Representation LearningSungchul Kim
 
Revisiting the Sibling Head in Object Detector
Revisiting the Sibling Head in Object DetectorRevisiting the Sibling Head in Object Detector
Revisiting the Sibling Head in Object DetectorSungchul Kim
 
Do Wide and Deep Networks Learn the Same Things: Uncovering How Neural Networ...
Do Wide and Deep Networks Learn the Same Things: Uncovering How Neural Networ...Do Wide and Deep Networks Learn the Same Things: Uncovering How Neural Networ...
Do Wide and Deep Networks Learn the Same Things: Uncovering How Neural Networ...Sungchul Kim
 
Going Deeper with Convolutions
Going Deeper with ConvolutionsGoing Deeper with Convolutions
Going Deeper with ConvolutionsSungchul Kim
 
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based LocalizationGrad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based LocalizationSungchul Kim
 
Generalized Intersection over Union: A Metric and A Loss for Bounding Box Reg...
Generalized Intersection over Union: A Metric and A Loss for Bounding Box Reg...Generalized Intersection over Union: A Metric and A Loss for Bounding Box Reg...
Generalized Intersection over Union: A Metric and A Loss for Bounding Box Reg...Sungchul Kim
 
On the Variance of the Adaptive Learning Rate and Beyond
On the Variance of the Adaptive Learning Rate and BeyondOn the Variance of the Adaptive Learning Rate and Beyond
On the Variance of the Adaptive Learning Rate and BeyondSungchul Kim
 
A Benchmark for Interpretability Methods in Deep Neural Networks
A Benchmark for Interpretability Methods in Deep Neural NetworksA Benchmark for Interpretability Methods in Deep Neural Networks
A Benchmark for Interpretability Methods in Deep Neural NetworksSungchul Kim
 
KDGAN: Knowledge Distillation with Generative Adversarial Networks
KDGAN: Knowledge Distillation with Generative Adversarial NetworksKDGAN: Knowledge Distillation with Generative Adversarial Networks
KDGAN: Knowledge Distillation with Generative Adversarial NetworksSungchul Kim
 
Designing Network Design Spaces
Designing Network Design SpacesDesigning Network Design Spaces
Designing Network Design SpacesSungchul Kim
 
Search to Distill: Pearls are Everywhere but not the Eyes
Search to Distill: Pearls are Everywhere but not the EyesSearch to Distill: Pearls are Everywhere but not the Eyes
Search to Distill: Pearls are Everywhere but not the EyesSungchul Kim
 
Supervised Constrastive Learning
Supervised Constrastive LearningSupervised Constrastive Learning
Supervised Constrastive LearningSungchul Kim
 
Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Bootstrap Your Own Latent: A New Approach to Self-Supervised LearningBootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Bootstrap Your Own Latent: A New Approach to Self-Supervised LearningSungchul Kim
 
FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stoch...
FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stoch...FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stoch...
FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stoch...Sungchul Kim
 
Regularizing Class-wise Predictions via Self-knowledge Distillation
Regularizing Class-wise Predictions via Self-knowledge DistillationRegularizing Class-wise Predictions via Self-knowledge Distillation
Regularizing Class-wise Predictions via Self-knowledge DistillationSungchul Kim
 

More from Sungchul Kim (20)

PR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision
PR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo SupervisionPR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision
PR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision
 
Revisiting the Calibration of Modern Neural Networks
Revisiting the Calibration of Modern Neural NetworksRevisiting the Calibration of Modern Neural Networks
Revisiting the Calibration of Modern Neural Networks
 
Emerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision TransformersEmerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision Transformers
 
PR-305: Exploring Simple Siamese Representation Learning
PR-305: Exploring Simple Siamese Representation LearningPR-305: Exploring Simple Siamese Representation Learning
PR-305: Exploring Simple Siamese Representation Learning
 
Score based Generative Modeling through Stochastic Differential Equations
Score based Generative Modeling through Stochastic Differential EquationsScore based Generative Modeling through Stochastic Differential Equations
Score based Generative Modeling through Stochastic Differential Equations
 
Exploring Simple Siamese Representation Learning
Exploring Simple Siamese Representation LearningExploring Simple Siamese Representation Learning
Exploring Simple Siamese Representation Learning
 
Revisiting the Sibling Head in Object Detector
Revisiting the Sibling Head in Object DetectorRevisiting the Sibling Head in Object Detector
Revisiting the Sibling Head in Object Detector
 
Do Wide and Deep Networks Learn the Same Things: Uncovering How Neural Networ...
Do Wide and Deep Networks Learn the Same Things: Uncovering How Neural Networ...Do Wide and Deep Networks Learn the Same Things: Uncovering How Neural Networ...
Do Wide and Deep Networks Learn the Same Things: Uncovering How Neural Networ...
 
Going Deeper with Convolutions
Going Deeper with ConvolutionsGoing Deeper with Convolutions
Going Deeper with Convolutions
 
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based LocalizationGrad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
 
Generalized Intersection over Union: A Metric and A Loss for Bounding Box Reg...
Generalized Intersection over Union: A Metric and A Loss for Bounding Box Reg...Generalized Intersection over Union: A Metric and A Loss for Bounding Box Reg...
Generalized Intersection over Union: A Metric and A Loss for Bounding Box Reg...
 
On the Variance of the Adaptive Learning Rate and Beyond
On the Variance of the Adaptive Learning Rate and BeyondOn the Variance of the Adaptive Learning Rate and Beyond
On the Variance of the Adaptive Learning Rate and Beyond
 
A Benchmark for Interpretability Methods in Deep Neural Networks
A Benchmark for Interpretability Methods in Deep Neural NetworksA Benchmark for Interpretability Methods in Deep Neural Networks
A Benchmark for Interpretability Methods in Deep Neural Networks
 
KDGAN: Knowledge Distillation with Generative Adversarial Networks
KDGAN: Knowledge Distillation with Generative Adversarial NetworksKDGAN: Knowledge Distillation with Generative Adversarial Networks
KDGAN: Knowledge Distillation with Generative Adversarial Networks
 
Designing Network Design Spaces
Designing Network Design SpacesDesigning Network Design Spaces
Designing Network Design Spaces
 
Search to Distill: Pearls are Everywhere but not the Eyes
Search to Distill: Pearls are Everywhere but not the EyesSearch to Distill: Pearls are Everywhere but not the Eyes
Search to Distill: Pearls are Everywhere but not the Eyes
 
Supervised Constrastive Learning
Supervised Constrastive LearningSupervised Constrastive Learning
Supervised Constrastive Learning
 
Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Bootstrap Your Own Latent: A New Approach to Self-Supervised LearningBootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
 
FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stoch...
FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stoch...FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stoch...
FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stoch...
 
Regularizing Class-wise Predictions via Self-knowledge Distillation
Regularizing Class-wise Predictions via Self-knowledge DistillationRegularizing Class-wise Predictions via Self-knowledge Distillation
Regularizing Class-wise Predictions via Self-knowledge Distillation
 

Recently uploaded

KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...ranjana rawat
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations120cr0395
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...RajaP95
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingrknatarajan
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 

Recently uploaded (20)

KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 

Panoptic Segmentation

  • 2. Contents 1. Introduction 2. Panoptic Segmentation 3. Human Consistency Study 4. Machine Performance Baselines 5. Future of Panoptic Segmentation 2
  • 3. Introduction & Related Work • Introduction • Semantic Segmentation • Stuff (such as grass, sky, road, ..) • Simply assign a class label to each pixel in an image (treat thing classes as stuff) • FCN으로 object instance를 나누기 힘듬 • Instance Segmentation • Things (such as people, animals, tools, ..) • Detect each object and delineate it with a bbox or mask • Region-based method로 overlapping을 피하기 힘듬 3
  • 4. Introduction & Related Work • Introduction • Panoptic Segmentation • Encompass both stuff and thing classes → Panoptic : including every thing visible in one view • Use a simple but general output format → 각 pixel은 semantic label과 instance id를 할당받음 • Introduce a uniform evaluation metric → Semantic, Instance 모두 적용 가능한 Panoptic Quality 4
  • 5. Panoptic Segmentation • Format • Task format • Semantic classes ℒ ≔ {0, … , ℒ − 1} • Each pixel 𝑖 is mapped to a pair 𝒍𝒊, 𝒛𝒊 ∈ 𝓛 × ℕ • 𝑙𝑖 : the semantic class of pixel 𝑖 • 𝑧𝑖 : its instance id. Group pixels of the same class into distinct segments • 애매하거나 out-of-class pixel들은 void label로 배정 → 모든 pixel이 semantic label을 갖는 건 아님 5
  • 6. Panoptic Segmentation • Format • Stuff and thing labels • Semantic label set은 subset ℒ 𝑆𝑡와 ℒ 𝑇ℎ로 구성 • ℒ = ℒ 𝑆𝑡 ∪ ℒ 𝑇ℎ , ℒ 𝑆𝑡 ∩ ℒ 𝑇ℎ = ∅ • Pixel이 𝑙𝑖 ∈ ℒ 𝑆𝑡 로 레이블 될 때, 𝑧𝑖는 관계 없음 → 모든 stuff classes는 같은 instance를 갖기 때문 (e.g., the same sky) • 모든 pixel이 같은 𝒍𝒊, 𝒛𝒊 일 때 (𝑙𝑖 ∈ ℒ 𝑇ℎ), 같은 instance에 속함 (e.g., the same car) • Single instance에 속하는 경우 같은 𝒍𝒊, 𝒛𝒊 을 가짐 6
  • 7. Panoptic Segmentation • Format • Relationship to semantic segmentation • Panoptic segmentation은 semantic segmentation의 strict generalization • 각 pixel에 semantic label을 할당하는 것은 동일, thing class가 들어가는 순간 달라짐 • Relationship to instance segmentation • Instance segmentation은 overlapping segment 발생 → Panoptic segmentation은 각 pixel에 semantic label과 instance id 하나씩 할당 → Overlapping 발생 X 7
  • 8. Panoptic Segmentation • Metric • 기존 방법들은 semantic or instance segmentation에 특화 → stuff와 thing을 동시에 측정 불가 • Stuff와 thing을 동시에 측정할 수 있는 metric 제시 • Completeness : Treat stuff and thing classes in a uniform way • Interpretability : A metric with identifiable meaning • Simplicity : Simple to define and implement → Panoptic Quality (PQ) 8
  • 9. Panoptic Segmentation • Metric • Segment Matching • 𝐼𝑜𝑈 𝑝𝑖, 𝑔 = |𝑝 𝑖∩𝑔| |𝑝 𝑖∪𝑔| ≤ 𝑝 𝑖∩𝑔 𝑔 𝑓𝑜𝑟 𝑖 ∈ 1, 2 • 𝑝1 ∩ 𝑝2 = ∅, |𝑝𝑖 ∪ 𝑔| ≥ |𝑔| • 𝐼𝑜𝑈 𝑝1, 𝑔 + 𝐼𝑜𝑈 𝑝2, 𝑔 ≤ 𝑝1∩𝑔 + 𝑝2∩𝑔 𝑔 ≤ 1 • 𝑝1 ∩ 𝑔 + 𝑝2 ∩ 𝑔 ≤ g • Simple and Interpretable! 9
  • 10. Panoptic Segmentation • Metric • PQ Computation • Class별로 PQ 계산 후 평균 → class imbalance에 insensitive하게 만듬 • 𝑷𝑸 = σ 𝒑,𝒈 ∈𝑻𝑷 𝑰𝒐𝑼(𝒑,𝒈) 𝑻𝑷 + 𝟏 𝟐 𝑭𝑷 + 𝟏 𝟐 𝑭𝑵 • 1 |𝑇𝑃| σ 𝑝,𝑔 ∈𝑇𝑃 𝐼𝑜𝑈(𝑝, 𝑔) : the average 𝐼𝑜𝑈 of matched segments • 1 2 𝐹𝑃 + 1 2 𝐹𝑁 : to penalize segments without matches • All segments receive equal importance regardless of their area! 10
  • 11. Panoptic Segmentation • Metric • PQ Computation • 𝑷𝑸 = σ 𝒑,𝒈 ∈𝑻𝑷 𝑰𝒐𝑼(𝒑,𝒈) 𝑻𝑷 + 𝟏 𝟐 𝑭𝑷 + 𝟏 𝟐 𝑭𝑵 = σ 𝒑,𝒈 ∈𝑻𝑷 𝑰𝒐𝑼(𝒑,𝒈) 𝑻𝑷 × |𝑻𝑷| 𝑻𝑷 + 𝟏 𝟐 𝑭𝑷 + 𝟏 𝟐 𝑭𝑵 (𝑺𝑸 × 𝑹𝑸) • σ 𝑝,𝑔 ∈𝑇𝑃 𝐼𝑜𝑈(𝑝,𝑔) 𝑇𝑃 : segmentation quality (SQ)  the average 𝐼𝑜𝑈 of matched segments • |𝑇𝑃| 𝑇𝑃 + 1 2 𝐹𝑃 + 1 2 𝐹𝑁 : recognition quality (RQ)  𝐹1 score과 흡사 11
  • 12. Panoptic Segmentation • Metric • PQ Computation • Void labels in the ground truth • Out of class pixels / ambiguous or unknown pixels → 평가하지 않음 • Ground truth에서 void labels인 pixel들은 제외하고 𝐼𝑜𝑈 계산 • Void pixel의 일부가 포함된 unmatched predicted segments는 제거 & FP로 처리X • Group labels • Cityscapes, COCO는 instance id 대신 group label을 사용 • Matching 중에는 group region 사용 X • Matching 이후, matching threshold를 넘는 same class의 일부를 포함하는 12
  • 13. Panoptic Segmentation • Metric • Comparison to Existing Metrics • Semantic segmentation metrics • 𝐼𝑜𝑈 → pixel output/labels 기반 계산 & object-level labels 무시 • Instance segmentation metrics • Average Precision : Confidence score 필요 → semantic segmentation에 적합하지 않음 • Panoptic quality • Treat all classes (stuff and things) in a uniform way • SQ, RQ로 decomposing 할 수 있지만, PQ가 semantic과 instance의 metric은 아님 → 오히려 SQ와 RQ가 각각 semantic과 instance의 metric 13
  • 14. Panoptic Segmentation • Datasets • Cityscapes • 5000 images (2975 / 500 / 1525) • 19 classes among which 8 have instance-level • ADE20k • 25k images (20k / 2k / 3k) • 100 thing and 50 stuff classes in the 2017 Places Challenge • Mapillary Vistas • 25k street-view images (18k / 2k / 5k) • 28 stuff and 37 thing classes in ‘research edition’ 14
  • 15. Human Consistency Study • Human annotation • 하나를 ground truth, 다른 하나를 prediction으로 설정 후 PQ 측정 • Human consistency 15
  • 16. Human Consistency Study • Stuff 𝒗𝒔. Things • 모든 stuff and things classes가 고르게 분포 • Small 𝒗𝒔. large objects • Small 25% | middle 50% | largest 25% 16
  • 17. Human Consistency Study • 𝑰𝒐𝑼 threshold • SQ 𝒗𝒔. RQ balance • 𝑅𝑄 𝛼 = |𝑇𝑃| 𝑇𝑃 +𝛼 𝐹𝑃 +𝛼|𝐹𝑁| (default 𝛼 = 0.5) • default 𝛼 가 SQ와 RQ 사이에 균형을 맞춰 줌 • 상황에 따라 변경해주면 될 듯 17
  • 18. Machine Performance Baseline • 세 가지 관점 • How do heuristic combinations of top-performing instance and semantic segmentation systems perform on panoptic segmentation? • How does PQ compare to existing metrics like 𝐴𝑃 and 𝐼𝑜𝑈? • How do the machine results compare to the human results that we presented previously? 18
  • 19. Machine Performance Baseline • Algorithms and data • 각 dataset에서 algorithm output을 얻음 • Cityscapes : masks generated by PSPNet and Mask R-CNN • ADE20k : the winners of both the semantic and instance seg. in the 2017 Places Challenge • Vistas : 주최측으로 부터 1k testset에 대한 semantic & instance seg. 결과를 제공받음 19
  • 20. Machine Performance Baseline • Instance segmentation • Overlapping segment를 얻음 → 해결 • Confidence score 순으로 sorting → 낮은 score는 제거 • 가장 confident한 것부터 시작해서 반복 • Remove pixels which have been assigned to previous segments • Segment의 충분한 부분이 남아있으면 non-overlapping, 그렇지 않으면 전체 segment 제거 20
  • 21. Machine Performance Baseline • Semantic segmentation • No overlapping → 바로 PQ 계산 가능 • Multi-scale : skip connection (?) 21
  • 22. Machine Performance Baseline • Panoptic segmentation • Instance + Semantic for resolving any overlap between thing and stuff 22
  • 23. Machine Performance Baseline • Human 𝒗𝒔. machine panoptic segmentation 23
  • 24. Future of Panoptic Segmentation • Stuff와 thing을 동시에 설명하는 end-to-end model • Instance segmentation에서 non-overlapping을 위한 실험들이 진행됨 • Some form of higher-level ‘reasoning’ may be beneficial • Extend learnable NMS to PS 24
  • 25. Future of Panoptic Segmentation • Stuff와 thing을 동시에 설명하는 end-to-end model • Instance segmentation에서 non-overlapping을 위한 실험들이 진행됨 • Some form of higher-level ‘reasoning’ may be beneficial • Extend learnable NMS to PS 25
  • 26. UPSNet (in CVPR 2019) 26
  • 27. 감 사 합 니 다 27