SlideShare a Scribd company logo
1 of 16
DA-CIL: Towards Domain Adaptive Class-Incremental
3D Object Detection
Ziyuan Zhao1,2, Mingxi Xu1,3, Peisheng Qian1, Ramanpreet Singh Pahwa1,2, and Richard Chang1
Presenter: Zhao Ziyuan
1 Institute for Infocomm Research (I2R), A*STAR, Singapore
2 Artificial Intelligence Analytics & Informatics (AI3), A*STAR, Singapore
3 Nanyang Technological University, Singapore
Paper ID: 916
2
Introduction – Catastrophic Forgetting & Domain Shift
- Deep learning-based 3D object detection has received considerable attention in 3D point clouds.
- Catastrophic forgetting: the performance of DL models on old classes tends to decrease substantially
when trained on novel classes.
- Domain Shift: DL models trained on one domain (i.e., source domain) always suffer tremendous
performance degradation when evaluated on another domain (i.e., target domain).
3
Motivation – Domain Adaptive Class-incremental Learning
- Existing CIL and UDA methods are designed for addressing different challenges separately, ignoring
that the catastrophic forgetting and domain shift problems concurrently exist in 3D object detection
on point clouds of real-world environments.
- We aim to leverage labeled old classes on the source domain and labeled new classes on the
target domain to close the domain shift and adapt to new classes without forgetting old classes on the
target domain in DA-CIL.
4
Method – Transformation-consistent Meta-hallucination
- We proposed a new CIL paradigm to enable Domain Adaptive Class-Incremental Learning for 3D object
detection.
- Dual-domain copy-paste (CP) address both in-domain data scarcity and cross-domain distribution
shift.
- Dual-teacher knowledge distillation (KD) enforces multi-level consistency regularization (MLC).
5
Method (1) – Dual-Domain Copy-Paste
- To relieve data scarcity and reduce the domain gap at the data level, we extensively leverage copy-
paste (CP) augmentation techniques for creating cross-domain and in-domain point clouds.
6
Method (2) – Multi-Level Consistency
- To facilitate the dual-teacher knowledge transfer, we propose multi-level consistency regularization from
two aspects.
- Statistics-Level (SC) Consistency ( Batch Normalization parameters alignment)
- Bounding Box-Level Consistency ( center-, class and size-level consistency loss)
Na Zhao, Tat-Seng Chua, and Gim Hee Lee. Sess: Self-ensembling semi-supervised 3d object detection. CVPR, pp.11079–11087, 2020.
7
Method (3) – Dual-Teacher Training
- Cross-domain Teacher: The student model learns the underlying knowledge in base classes from a
cross-domain teacher via distillation loss (square of Euclidean distance between the classification
logits of different bounding boxes).
- In-domain Teacher: Meanwhile, the EMA in-domain teacher helps student model capture structure and
semantic invariant information in objects with consistency loss
- The mixed labels (pseudo base + real novel) are transformed by the same augmentation step that is
applied on the augmented source domain to compute a supervised loss with the backbone VoteNet
Charles R Qi, Or Litany, Kaiming He, and Leonidas J Guibas. Deep hough voting for 3d object detection in point clouds. CVPR, pp. 9277–9286, 2019.
8
Experimental Results
- Dataset
- We evaluate our method on 3D object detection datasets, ScanNet (source) and SUN RGB-D
(target).
- 5 categories (bathtub, bed, bookshelf, chair, desk) were selected as base classes.
- 5 additional categories (dresser, nightstand, sofa, table, toilet) in SUN RGB-D were selected as
novel classes in the target domain.
ScanNet dataset SUN RGB-D dataset
9
Experimental Results (1) – Comparison with different methods
- For class-incremental learning, we evaluated 2 naive transfer learning baselines Freeze-and-add,
and Fine-tune. We also used SDCoT as a strong baseline for CIL.
- For unsupervised domain adaptation, We implemented one recent popular self-ensembling method,
mean teacher (MT).
- Our method outperforms many methods by a clear margin in CIL under domain shift.
Na Zhao, Tat-Seng Chua, and Gim Hee Lee. Sess: Self-ensembling semi-supervised 3d object detection. CVPR, pp.11079–11087, 2020.
10
Experimental Results (2) – Comparison with different augmentation techniques
- We implemented two popular augmentation techniques for comparison, Mix3D and CutMix.
- Mix3D: combines 2 point clouds into a mixed point cloud via concatenation, which results in
excessive overlaps in the mixed point clouds
- CutMix: randomly replaces patches of point clouds with patches from other point clouds, which
unnecessarily introduces extra context from the source domain.
- Our method outperforms the two augmentation techniques.
11
Experimental Results (3) – Ablation Analysis
- Without cross-domain copy-paste, the model achieves a lower result in base classes, which proves
the effectiveness of cross-domain copy-paste augmentation in base class recognition.
- Similarly, it can be inferred from results without in-domain augmentation that in-domain copy-paste
can enhance the model performance in novel classes.
- Moreover, statistics-level consistency provides around 0.8% performance increment in all classes.
12
Experimental Results (4) – Qualitative Comparison (good cases)
- Our method can accurately detect both
old and novel classes in the target
domain, overcoming the domain shift in
old classes.
13
Experimental Results (4) – Qualitative Comparison (failure cases)
- Fail to detect the bookshelf in the point clouds.
The failure is due to the loss of the geometric
structure of the bookshelf in the point cloud data.
- The desk object is misclassified as a table,
which is likely due to geometric similarities
between the 2 classes.
- The bounding box of the desk object deviates
from the ground-truth, which can be attributed to
the large size and irregular shape of the object.
14
Conclusions
- We identify and explore a novel domain adaptive class-incremental learning paradigm for 3D object
detection.
- We propose a novel 3D object detection framework, in which we design a novel dual-domain copy-
paste augmentation method to address both in-domain data scarcity and cross-domain distribution shift.
- We further enhance the dual-teacher knowledge distillation with multi-level consistency between
different domains.
- We achieve superior performance in extensive experiments.
15
Conclusions
- We identify and explore a novel domain adaptive class-incremental learning paradigm for 3D object
detection.
- We propose a novel 3D object detection framework, in which we design a novel dual-domain copy-
paste augmentation method to address both in-domain data scarcity and cross-domain distribution shift.
- We further enhance the dual-teacher knowledge distillation with multi-level consistency between
different domains.
- We achieve superior performance in extensive experiments.
Thanks
Zhao_Ziyuan@i2r.a-star.edu.sg
https://jacobzhaoziyuan.github.io/

More Related Content

What's hot

[IJCAI 2023 - Poster] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network...
[IJCAI 2023 - Poster] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network...[IJCAI 2023 - Poster] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network...
[IJCAI 2023 - Poster] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network...Ziyuan Zhao
 
A Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detectionA Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detectionvivatechijri
 
Photo-realistic Single Image Super-resolution using a Generative Adversarial ...
Photo-realistic Single Image Super-resolution using a Generative Adversarial ...Photo-realistic Single Image Super-resolution using a Generative Adversarial ...
Photo-realistic Single Image Super-resolution using a Generative Adversarial ...Hansol Kang
 
Semantic segmentation with Convolutional Neural Network Approaches
Semantic segmentation with Convolutional Neural Network ApproachesSemantic segmentation with Convolutional Neural Network Approaches
Semantic segmentation with Convolutional Neural Network ApproachesFellowship at Vodafone FutureLab
 
Petals Count and Flower Image Classification using computer vision
Petals Count and Flower Image Classification using computer visionPetals Count and Flower Image Classification using computer vision
Petals Count and Flower Image Classification using computer visionIndranil Datta
 
Image Classification using deep learning
Image Classification using deep learning Image Classification using deep learning
Image Classification using deep learning Asma-AH
 
Linear discriminant analysis
Linear discriminant analysisLinear discriminant analysis
Linear discriminant analysisBangalore
 
Artificial Neural Network Lecture 6- Associative Memories & Discrete Hopfield...
Artificial Neural Network Lecture 6- Associative Memories & Discrete Hopfield...Artificial Neural Network Lecture 6- Associative Memories & Discrete Hopfield...
Artificial Neural Network Lecture 6- Associative Memories & Discrete Hopfield...Mohammed Bennamoun
 
Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
Semantic Segmentation - Fully Convolutional Networks for Semantic SegmentationSemantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation岳華 杜
 
U-Net: Convolutional Networks for Biomedical Image Segmentation
U-Net: Convolutional Networks for Biomedical Image SegmentationU-Net: Convolutional Networks for Biomedical Image Segmentation
U-Net: Convolutional Networks for Biomedical Image Segmentationfake can
 
Lung Cancer Prediction using Image Classification
Lung Cancer Prediction using Image ClassificationLung Cancer Prediction using Image Classification
Lung Cancer Prediction using Image ClassificationShreshth Saxena
 
[ICIP 2022] MMGL: Multi-Scale Multi-View Global-Local Contrastive learning fo...
[ICIP 2022] MMGL: Multi-Scale Multi-View Global-Local Contrastive learning fo...[ICIP 2022] MMGL: Multi-Scale Multi-View Global-Local Contrastive learning fo...
[ICIP 2022] MMGL: Multi-Scale Multi-View Global-Local Contrastive learning fo...Ziyuan Zhao
 
Batch normalization presentation
Batch normalization presentationBatch normalization presentation
Batch normalization presentationOwin Will
 
ViT (Vision Transformer) Review [CDM]
ViT (Vision Transformer) Review [CDM]ViT (Vision Transformer) Review [CDM]
ViT (Vision Transformer) Review [CDM]Dongmin Choi
 
UNetEliyaLaialy (2).pptx
UNetEliyaLaialy (2).pptxUNetEliyaLaialy (2).pptx
UNetEliyaLaialy (2).pptxNoorUlHaq47
 
3.5 model based clustering
3.5 model based clustering3.5 model based clustering
3.5 model based clusteringKrish_ver2
 
A Beginner's Guide to Monocular Depth Estimation
A Beginner's Guide to Monocular Depth EstimationA Beginner's Guide to Monocular Depth Estimation
A Beginner's Guide to Monocular Depth EstimationRyo Takahashi
 

What's hot (20)

[IJCAI 2023 - Poster] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network...
[IJCAI 2023 - Poster] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network...[IJCAI 2023 - Poster] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network...
[IJCAI 2023 - Poster] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network...
 
A Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detectionA Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detection
 
SSD: Single Shot MultiBox Detector (UPC Reading Group)
SSD: Single Shot MultiBox Detector (UPC Reading Group)SSD: Single Shot MultiBox Detector (UPC Reading Group)
SSD: Single Shot MultiBox Detector (UPC Reading Group)
 
Photo-realistic Single Image Super-resolution using a Generative Adversarial ...
Photo-realistic Single Image Super-resolution using a Generative Adversarial ...Photo-realistic Single Image Super-resolution using a Generative Adversarial ...
Photo-realistic Single Image Super-resolution using a Generative Adversarial ...
 
Semantic segmentation with Convolutional Neural Network Approaches
Semantic segmentation with Convolutional Neural Network ApproachesSemantic segmentation with Convolutional Neural Network Approaches
Semantic segmentation with Convolutional Neural Network Approaches
 
Unsupervised learning networks
Unsupervised learning networksUnsupervised learning networks
Unsupervised learning networks
 
Lect12 graph mining
Lect12 graph miningLect12 graph mining
Lect12 graph mining
 
Petals Count and Flower Image Classification using computer vision
Petals Count and Flower Image Classification using computer visionPetals Count and Flower Image Classification using computer vision
Petals Count and Flower Image Classification using computer vision
 
Image Classification using deep learning
Image Classification using deep learning Image Classification using deep learning
Image Classification using deep learning
 
Linear discriminant analysis
Linear discriminant analysisLinear discriminant analysis
Linear discriminant analysis
 
Artificial Neural Network Lecture 6- Associative Memories & Discrete Hopfield...
Artificial Neural Network Lecture 6- Associative Memories & Discrete Hopfield...Artificial Neural Network Lecture 6- Associative Memories & Discrete Hopfield...
Artificial Neural Network Lecture 6- Associative Memories & Discrete Hopfield...
 
Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
Semantic Segmentation - Fully Convolutional Networks for Semantic SegmentationSemantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
 
U-Net: Convolutional Networks for Biomedical Image Segmentation
U-Net: Convolutional Networks for Biomedical Image SegmentationU-Net: Convolutional Networks for Biomedical Image Segmentation
U-Net: Convolutional Networks for Biomedical Image Segmentation
 
Lung Cancer Prediction using Image Classification
Lung Cancer Prediction using Image ClassificationLung Cancer Prediction using Image Classification
Lung Cancer Prediction using Image Classification
 
[ICIP 2022] MMGL: Multi-Scale Multi-View Global-Local Contrastive learning fo...
[ICIP 2022] MMGL: Multi-Scale Multi-View Global-Local Contrastive learning fo...[ICIP 2022] MMGL: Multi-Scale Multi-View Global-Local Contrastive learning fo...
[ICIP 2022] MMGL: Multi-Scale Multi-View Global-Local Contrastive learning fo...
 
Batch normalization presentation
Batch normalization presentationBatch normalization presentation
Batch normalization presentation
 
ViT (Vision Transformer) Review [CDM]
ViT (Vision Transformer) Review [CDM]ViT (Vision Transformer) Review [CDM]
ViT (Vision Transformer) Review [CDM]
 
UNetEliyaLaialy (2).pptx
UNetEliyaLaialy (2).pptxUNetEliyaLaialy (2).pptx
UNetEliyaLaialy (2).pptx
 
3.5 model based clustering
3.5 model based clustering3.5 model based clustering
3.5 model based clustering
 
A Beginner's Guide to Monocular Depth Estimation
A Beginner's Guide to Monocular Depth EstimationA Beginner's Guide to Monocular Depth Estimation
A Beginner's Guide to Monocular Depth Estimation
 

Similar to [BMVC 2022] DA-CIL: Towards Domain Adaptive Class-Incremental 3D Object Detection

GeoAI: A Model-Agnostic Meta-Ensemble Zero-Shot Learning Method for Hyperspec...
GeoAI: A Model-Agnostic Meta-Ensemble Zero-Shot Learning Method for Hyperspec...GeoAI: A Model-Agnostic Meta-Ensemble Zero-Shot Learning Method for Hyperspec...
GeoAI: A Model-Agnostic Meta-Ensemble Zero-Shot Learning Method for Hyperspec...Konstantinos Demertzis
 
Transfer Learning and Domain Adaptation (DLAI D5L2 2017 UPC Deep Learning for...
Transfer Learning and Domain Adaptation (DLAI D5L2 2017 UPC Deep Learning for...Transfer Learning and Domain Adaptation (DLAI D5L2 2017 UPC Deep Learning for...
Transfer Learning and Domain Adaptation (DLAI D5L2 2017 UPC Deep Learning for...Universitat Politècnica de Catalunya
 
Transfer Learning and Domain Adaptation - Ramon Morros - UPC Barcelona 2018
Transfer Learning and Domain Adaptation - Ramon Morros - UPC Barcelona 2018Transfer Learning and Domain Adaptation - Ramon Morros - UPC Barcelona 2018
Transfer Learning and Domain Adaptation - Ramon Morros - UPC Barcelona 2018Universitat Politècnica de Catalunya
 
Machine learning clustering
Machine learning clusteringMachine learning clustering
Machine learning clusteringNadeem Oozeer
 
Few shot learning/ one shot learning/ machine learning
Few shot learning/ one shot learning/ machine learningFew shot learning/ one shot learning/ machine learning
Few shot learning/ one shot learning/ machine learningﺁﺻﻒ ﻋﻠﯽ ﻣﯿﺮ
 
K means Clustering
K means ClusteringK means Clustering
K means ClusteringEdureka!
 
Could a Data Science Program use Data Science Insights?
Could a Data Science Program use Data Science Insights?Could a Data Science Program use Data Science Insights?
Could a Data Science Program use Data Science Insights?Zachary Thomas
 
Recent Progress on Object Detection_20170331
Recent Progress on Object Detection_20170331Recent Progress on Object Detection_20170331
Recent Progress on Object Detection_20170331Jihong Kang
 
CrossScenarioTransferPersonReidentification_finalManuscript
CrossScenarioTransferPersonReidentification_finalManuscriptCrossScenarioTransferPersonReidentification_finalManuscript
CrossScenarioTransferPersonReidentification_finalManuscriptXiaojuan (Kathleen) WANG
 
Analysis on Domain Adaptation based on different papers
Analysis on Domain Adaptation based on different papersAnalysis on Domain Adaptation based on different papers
Analysis on Domain Adaptation based on different papersharshavardhan814108
 
Deep Semi-supervised Learning methods
Deep Semi-supervised Learning methodsDeep Semi-supervised Learning methods
Deep Semi-supervised Learning methodsPrincy Joy
 
ON DISTRIBUTED FUZZY DECISION TREES FOR BIG DATA
 ON DISTRIBUTED FUZZY DECISION TREES FOR BIG DATA ON DISTRIBUTED FUZZY DECISION TREES FOR BIG DATA
ON DISTRIBUTED FUZZY DECISION TREES FOR BIG DATANexgen Technology
 
Shift-Robust Node Classification via Graph Adversarial Clustering Neurips 202...
Shift-Robust Node Classification via Graph Adversarial Clustering Neurips 202...Shift-Robust Node Classification via Graph Adversarial Clustering Neurips 202...
Shift-Robust Node Classification via Graph Adversarial Clustering Neurips 202...ssuser2624f71
 
Relational knowledge distillation
Relational knowledge distillationRelational knowledge distillation
Relational knowledge distillationNAVER Engineering
 
Transfer Learning and Domain Adaptation (D2L3 2017 UPC Deep Learning for Comp...
Transfer Learning and Domain Adaptation (D2L3 2017 UPC Deep Learning for Comp...Transfer Learning and Domain Adaptation (D2L3 2017 UPC Deep Learning for Comp...
Transfer Learning and Domain Adaptation (D2L3 2017 UPC Deep Learning for Comp...Universitat Politècnica de Catalunya
 
Sim-to-Real Transfer in Deep Reinforcement Learning
Sim-to-Real Transfer in Deep Reinforcement LearningSim-to-Real Transfer in Deep Reinforcement Learning
Sim-to-Real Transfer in Deep Reinforcement Learningatulshah16
 

Similar to [BMVC 2022] DA-CIL: Towards Domain Adaptive Class-Incremental 3D Object Detection (20)

GeoAI: A Model-Agnostic Meta-Ensemble Zero-Shot Learning Method for Hyperspec...
GeoAI: A Model-Agnostic Meta-Ensemble Zero-Shot Learning Method for Hyperspec...GeoAI: A Model-Agnostic Meta-Ensemble Zero-Shot Learning Method for Hyperspec...
GeoAI: A Model-Agnostic Meta-Ensemble Zero-Shot Learning Method for Hyperspec...
 
Transfer Learning and Domain Adaptation (DLAI D5L2 2017 UPC Deep Learning for...
Transfer Learning and Domain Adaptation (DLAI D5L2 2017 UPC Deep Learning for...Transfer Learning and Domain Adaptation (DLAI D5L2 2017 UPC Deep Learning for...
Transfer Learning and Domain Adaptation (DLAI D5L2 2017 UPC Deep Learning for...
 
Transfer Learning and Domain Adaptation - Ramon Morros - UPC Barcelona 2018
Transfer Learning and Domain Adaptation - Ramon Morros - UPC Barcelona 2018Transfer Learning and Domain Adaptation - Ramon Morros - UPC Barcelona 2018
Transfer Learning and Domain Adaptation - Ramon Morros - UPC Barcelona 2018
 
Machine learning clustering
Machine learning clusteringMachine learning clustering
Machine learning clustering
 
Machine learning clustering
Machine learning clusteringMachine learning clustering
Machine learning clustering
 
ClusetrigBasic.ppt
ClusetrigBasic.pptClusetrigBasic.ppt
ClusetrigBasic.ppt
 
Few shot learning/ one shot learning/ machine learning
Few shot learning/ one shot learning/ machine learningFew shot learning/ one shot learning/ machine learning
Few shot learning/ one shot learning/ machine learning
 
K means Clustering
K means ClusteringK means Clustering
K means Clustering
 
Could a Data Science Program use Data Science Insights?
Could a Data Science Program use Data Science Insights?Could a Data Science Program use Data Science Insights?
Could a Data Science Program use Data Science Insights?
 
Recent Progress on Object Detection_20170331
Recent Progress on Object Detection_20170331Recent Progress on Object Detection_20170331
Recent Progress on Object Detection_20170331
 
ME Synopsis
ME SynopsisME Synopsis
ME Synopsis
 
CrossScenarioTransferPersonReidentification_finalManuscript
CrossScenarioTransferPersonReidentification_finalManuscriptCrossScenarioTransferPersonReidentification_finalManuscript
CrossScenarioTransferPersonReidentification_finalManuscript
 
Analysis on Domain Adaptation based on different papers
Analysis on Domain Adaptation based on different papersAnalysis on Domain Adaptation based on different papers
Analysis on Domain Adaptation based on different papers
 
PointNet
PointNetPointNet
PointNet
 
Deep Semi-supervised Learning methods
Deep Semi-supervised Learning methodsDeep Semi-supervised Learning methods
Deep Semi-supervised Learning methods
 
ON DISTRIBUTED FUZZY DECISION TREES FOR BIG DATA
 ON DISTRIBUTED FUZZY DECISION TREES FOR BIG DATA ON DISTRIBUTED FUZZY DECISION TREES FOR BIG DATA
ON DISTRIBUTED FUZZY DECISION TREES FOR BIG DATA
 
Shift-Robust Node Classification via Graph Adversarial Clustering Neurips 202...
Shift-Robust Node Classification via Graph Adversarial Clustering Neurips 202...Shift-Robust Node Classification via Graph Adversarial Clustering Neurips 202...
Shift-Robust Node Classification via Graph Adversarial Clustering Neurips 202...
 
Relational knowledge distillation
Relational knowledge distillationRelational knowledge distillation
Relational knowledge distillation
 
Transfer Learning and Domain Adaptation (D2L3 2017 UPC Deep Learning for Comp...
Transfer Learning and Domain Adaptation (D2L3 2017 UPC Deep Learning for Comp...Transfer Learning and Domain Adaptation (D2L3 2017 UPC Deep Learning for Comp...
Transfer Learning and Domain Adaptation (D2L3 2017 UPC Deep Learning for Comp...
 
Sim-to-Real Transfer in Deep Reinforcement Learning
Sim-to-Real Transfer in Deep Reinforcement LearningSim-to-Real Transfer in Deep Reinforcement Learning
Sim-to-Real Transfer in Deep Reinforcement Learning
 

More from Ziyuan Zhao

[IJCAI 2023] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network for Effi...
[IJCAI 2023] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network for Effi...[IJCAI 2023] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network for Effi...
[IJCAI 2023] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network for Effi...Ziyuan Zhao
 
[IAIM 2023 - Poster] Label-efficient Generalizable Deep Learning for Medical...
[IAIM 2023 - Poster] Label-efficient Generalizable Deep Learning  for Medical...[IAIM 2023 - Poster] Label-efficient Generalizable Deep Learning  for Medical...
[IAIM 2023 - Poster] Label-efficient Generalizable Deep Learning for Medical...Ziyuan Zhao
 
[BMVC 2022 - Spotlight] DA-CIL: Towards Domain Adaptive Class-Incremental 3D ...
[BMVC 2022 - Spotlight] DA-CIL: Towards Domain Adaptive Class-Incremental 3D ...[BMVC 2022 - Spotlight] DA-CIL: Towards Domain Adaptive Class-Incremental 3D ...
[BMVC 2022 - Spotlight] DA-CIL: Towards Domain Adaptive Class-Incremental 3D ...Ziyuan Zhao
 
[ICIP 2022] ACT-NET: Asymmetric Co-Teacher Network for Semi-Supervised Memory...
[ICIP 2022] ACT-NET: Asymmetric Co-Teacher Network for Semi-Supervised Memory...[ICIP 2022] ACT-NET: Asymmetric Co-Teacher Network for Semi-Supervised Memory...
[ICIP 2022] ACT-NET: Asymmetric Co-Teacher Network for Semi-Supervised Memory...Ziyuan Zhao
 
[ICIP 2022 - Poster] MMGL: Multi-Scale Multi-View Global-Local Contrastive le...
[ICIP 2022 - Poster] MMGL: Multi-Scale Multi-View Global-Local Contrastive le...[ICIP 2022 - Poster] MMGL: Multi-Scale Multi-View Global-Local Contrastive le...
[ICIP 2022 - Poster] MMGL: Multi-Scale Multi-View Global-Local Contrastive le...Ziyuan Zhao
 
[MICCAI 2022] Meta-hallucinator: Towards Few-Shot Cross-Modality Cardiac Imag...
[MICCAI 2022] Meta-hallucinator: Towards Few-Shot Cross-Modality Cardiac Imag...[MICCAI 2022] Meta-hallucinator: Towards Few-Shot Cross-Modality Cardiac Imag...
[MICCAI 2022] Meta-hallucinator: Towards Few-Shot Cross-Modality Cardiac Imag...Ziyuan Zhao
 
[EMBC 2022] Self-supervised Assisted Active Learning for Skin Lesion Segmenta...
[EMBC 2022] Self-supervised Assisted Active Learning for Skin Lesion Segmenta...[EMBC 2022] Self-supervised Assisted Active Learning for Skin Lesion Segmenta...
[EMBC 2022] Self-supervised Assisted Active Learning for Skin Lesion Segmenta...Ziyuan Zhao
 
[ICME 2022] Adaptive Mean-Residue Loss for Robust Facial Age Estimation
[ICME 2022] Adaptive Mean-Residue Loss for Robust Facial Age Estimation[ICME 2022] Adaptive Mean-Residue Loss for Robust Facial Age Estimation
[ICME 2022] Adaptive Mean-Residue Loss for Robust Facial Age EstimationZiyuan Zhao
 
[MICCAI 2021] MT-UDA: Towards unsupervised cross-modality medical image segme...
[MICCAI 2021] MT-UDA: Towards unsupervised cross-modality medical image segme...[MICCAI 2021] MT-UDA: Towards unsupervised cross-modality medical image segme...
[MICCAI 2021] MT-UDA: Towards unsupervised cross-modality medical image segme...Ziyuan Zhao
 
[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...
[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...
[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...Ziyuan Zhao
 
[EMBC 2021] Multi Slice Dense Sparse Learning for Efficient Liver and Tumor S...
[EMBC 2021] Multi Slice Dense Sparse Learning for Efficient Liver and Tumor S...[EMBC 2021] Multi Slice Dense Sparse Learning for Efficient Liver and Tumor S...
[EMBC 2021] Multi Slice Dense Sparse Learning for Efficient Liver and Tumor S...Ziyuan Zhao
 
[ICIP 2020] SEA-Net: Squeeze-and-Excitation Attention Net for Diabetic Retino...
[ICIP 2020] SEA-Net: Squeeze-and-Excitation Attention Net for Diabetic Retino...[ICIP 2020] SEA-Net: Squeeze-and-Excitation Attention Net for Diabetic Retino...
[ICIP 2020] SEA-Net: Squeeze-and-Excitation Attention Net for Diabetic Retino...Ziyuan Zhao
 

More from Ziyuan Zhao (12)

[IJCAI 2023] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network for Effi...
[IJCAI 2023] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network for Effi...[IJCAI 2023] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network for Effi...
[IJCAI 2023] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network for Effi...
 
[IAIM 2023 - Poster] Label-efficient Generalizable Deep Learning for Medical...
[IAIM 2023 - Poster] Label-efficient Generalizable Deep Learning  for Medical...[IAIM 2023 - Poster] Label-efficient Generalizable Deep Learning  for Medical...
[IAIM 2023 - Poster] Label-efficient Generalizable Deep Learning for Medical...
 
[BMVC 2022 - Spotlight] DA-CIL: Towards Domain Adaptive Class-Incremental 3D ...
[BMVC 2022 - Spotlight] DA-CIL: Towards Domain Adaptive Class-Incremental 3D ...[BMVC 2022 - Spotlight] DA-CIL: Towards Domain Adaptive Class-Incremental 3D ...
[BMVC 2022 - Spotlight] DA-CIL: Towards Domain Adaptive Class-Incremental 3D ...
 
[ICIP 2022] ACT-NET: Asymmetric Co-Teacher Network for Semi-Supervised Memory...
[ICIP 2022] ACT-NET: Asymmetric Co-Teacher Network for Semi-Supervised Memory...[ICIP 2022] ACT-NET: Asymmetric Co-Teacher Network for Semi-Supervised Memory...
[ICIP 2022] ACT-NET: Asymmetric Co-Teacher Network for Semi-Supervised Memory...
 
[ICIP 2022 - Poster] MMGL: Multi-Scale Multi-View Global-Local Contrastive le...
[ICIP 2022 - Poster] MMGL: Multi-Scale Multi-View Global-Local Contrastive le...[ICIP 2022 - Poster] MMGL: Multi-Scale Multi-View Global-Local Contrastive le...
[ICIP 2022 - Poster] MMGL: Multi-Scale Multi-View Global-Local Contrastive le...
 
[MICCAI 2022] Meta-hallucinator: Towards Few-Shot Cross-Modality Cardiac Imag...
[MICCAI 2022] Meta-hallucinator: Towards Few-Shot Cross-Modality Cardiac Imag...[MICCAI 2022] Meta-hallucinator: Towards Few-Shot Cross-Modality Cardiac Imag...
[MICCAI 2022] Meta-hallucinator: Towards Few-Shot Cross-Modality Cardiac Imag...
 
[EMBC 2022] Self-supervised Assisted Active Learning for Skin Lesion Segmenta...
[EMBC 2022] Self-supervised Assisted Active Learning for Skin Lesion Segmenta...[EMBC 2022] Self-supervised Assisted Active Learning for Skin Lesion Segmenta...
[EMBC 2022] Self-supervised Assisted Active Learning for Skin Lesion Segmenta...
 
[ICME 2022] Adaptive Mean-Residue Loss for Robust Facial Age Estimation
[ICME 2022] Adaptive Mean-Residue Loss for Robust Facial Age Estimation[ICME 2022] Adaptive Mean-Residue Loss for Robust Facial Age Estimation
[ICME 2022] Adaptive Mean-Residue Loss for Robust Facial Age Estimation
 
[MICCAI 2021] MT-UDA: Towards unsupervised cross-modality medical image segme...
[MICCAI 2021] MT-UDA: Towards unsupervised cross-modality medical image segme...[MICCAI 2021] MT-UDA: Towards unsupervised cross-modality medical image segme...
[MICCAI 2021] MT-UDA: Towards unsupervised cross-modality medical image segme...
 
[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...
[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...
[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...
 
[EMBC 2021] Multi Slice Dense Sparse Learning for Efficient Liver and Tumor S...
[EMBC 2021] Multi Slice Dense Sparse Learning for Efficient Liver and Tumor S...[EMBC 2021] Multi Slice Dense Sparse Learning for Efficient Liver and Tumor S...
[EMBC 2021] Multi Slice Dense Sparse Learning for Efficient Liver and Tumor S...
 
[ICIP 2020] SEA-Net: Squeeze-and-Excitation Attention Net for Diabetic Retino...
[ICIP 2020] SEA-Net: Squeeze-and-Excitation Attention Net for Diabetic Retino...[ICIP 2020] SEA-Net: Squeeze-and-Excitation Attention Net for Diabetic Retino...
[ICIP 2020] SEA-Net: Squeeze-and-Excitation Attention Net for Diabetic Retino...
 

Recently uploaded

Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 

Recently uploaded (20)

Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 

[BMVC 2022] DA-CIL: Towards Domain Adaptive Class-Incremental 3D Object Detection

  • 1. DA-CIL: Towards Domain Adaptive Class-Incremental 3D Object Detection Ziyuan Zhao1,2, Mingxi Xu1,3, Peisheng Qian1, Ramanpreet Singh Pahwa1,2, and Richard Chang1 Presenter: Zhao Ziyuan 1 Institute for Infocomm Research (I2R), A*STAR, Singapore 2 Artificial Intelligence Analytics & Informatics (AI3), A*STAR, Singapore 3 Nanyang Technological University, Singapore Paper ID: 916
  • 2. 2 Introduction – Catastrophic Forgetting & Domain Shift - Deep learning-based 3D object detection has received considerable attention in 3D point clouds. - Catastrophic forgetting: the performance of DL models on old classes tends to decrease substantially when trained on novel classes. - Domain Shift: DL models trained on one domain (i.e., source domain) always suffer tremendous performance degradation when evaluated on another domain (i.e., target domain).
  • 3. 3 Motivation – Domain Adaptive Class-incremental Learning - Existing CIL and UDA methods are designed for addressing different challenges separately, ignoring that the catastrophic forgetting and domain shift problems concurrently exist in 3D object detection on point clouds of real-world environments. - We aim to leverage labeled old classes on the source domain and labeled new classes on the target domain to close the domain shift and adapt to new classes without forgetting old classes on the target domain in DA-CIL.
  • 4. 4 Method – Transformation-consistent Meta-hallucination - We proposed a new CIL paradigm to enable Domain Adaptive Class-Incremental Learning for 3D object detection. - Dual-domain copy-paste (CP) address both in-domain data scarcity and cross-domain distribution shift. - Dual-teacher knowledge distillation (KD) enforces multi-level consistency regularization (MLC).
  • 5. 5 Method (1) – Dual-Domain Copy-Paste - To relieve data scarcity and reduce the domain gap at the data level, we extensively leverage copy- paste (CP) augmentation techniques for creating cross-domain and in-domain point clouds.
  • 6. 6 Method (2) – Multi-Level Consistency - To facilitate the dual-teacher knowledge transfer, we propose multi-level consistency regularization from two aspects. - Statistics-Level (SC) Consistency ( Batch Normalization parameters alignment) - Bounding Box-Level Consistency ( center-, class and size-level consistency loss) Na Zhao, Tat-Seng Chua, and Gim Hee Lee. Sess: Self-ensembling semi-supervised 3d object detection. CVPR, pp.11079–11087, 2020.
  • 7. 7 Method (3) – Dual-Teacher Training - Cross-domain Teacher: The student model learns the underlying knowledge in base classes from a cross-domain teacher via distillation loss (square of Euclidean distance between the classification logits of different bounding boxes). - In-domain Teacher: Meanwhile, the EMA in-domain teacher helps student model capture structure and semantic invariant information in objects with consistency loss - The mixed labels (pseudo base + real novel) are transformed by the same augmentation step that is applied on the augmented source domain to compute a supervised loss with the backbone VoteNet Charles R Qi, Or Litany, Kaiming He, and Leonidas J Guibas. Deep hough voting for 3d object detection in point clouds. CVPR, pp. 9277–9286, 2019.
  • 8. 8 Experimental Results - Dataset - We evaluate our method on 3D object detection datasets, ScanNet (source) and SUN RGB-D (target). - 5 categories (bathtub, bed, bookshelf, chair, desk) were selected as base classes. - 5 additional categories (dresser, nightstand, sofa, table, toilet) in SUN RGB-D were selected as novel classes in the target domain. ScanNet dataset SUN RGB-D dataset
  • 9. 9 Experimental Results (1) – Comparison with different methods - For class-incremental learning, we evaluated 2 naive transfer learning baselines Freeze-and-add, and Fine-tune. We also used SDCoT as a strong baseline for CIL. - For unsupervised domain adaptation, We implemented one recent popular self-ensembling method, mean teacher (MT). - Our method outperforms many methods by a clear margin in CIL under domain shift. Na Zhao, Tat-Seng Chua, and Gim Hee Lee. Sess: Self-ensembling semi-supervised 3d object detection. CVPR, pp.11079–11087, 2020.
  • 10. 10 Experimental Results (2) – Comparison with different augmentation techniques - We implemented two popular augmentation techniques for comparison, Mix3D and CutMix. - Mix3D: combines 2 point clouds into a mixed point cloud via concatenation, which results in excessive overlaps in the mixed point clouds - CutMix: randomly replaces patches of point clouds with patches from other point clouds, which unnecessarily introduces extra context from the source domain. - Our method outperforms the two augmentation techniques.
  • 11. 11 Experimental Results (3) – Ablation Analysis - Without cross-domain copy-paste, the model achieves a lower result in base classes, which proves the effectiveness of cross-domain copy-paste augmentation in base class recognition. - Similarly, it can be inferred from results without in-domain augmentation that in-domain copy-paste can enhance the model performance in novel classes. - Moreover, statistics-level consistency provides around 0.8% performance increment in all classes.
  • 12. 12 Experimental Results (4) – Qualitative Comparison (good cases) - Our method can accurately detect both old and novel classes in the target domain, overcoming the domain shift in old classes.
  • 13. 13 Experimental Results (4) – Qualitative Comparison (failure cases) - Fail to detect the bookshelf in the point clouds. The failure is due to the loss of the geometric structure of the bookshelf in the point cloud data. - The desk object is misclassified as a table, which is likely due to geometric similarities between the 2 classes. - The bounding box of the desk object deviates from the ground-truth, which can be attributed to the large size and irregular shape of the object.
  • 14. 14 Conclusions - We identify and explore a novel domain adaptive class-incremental learning paradigm for 3D object detection. - We propose a novel 3D object detection framework, in which we design a novel dual-domain copy- paste augmentation method to address both in-domain data scarcity and cross-domain distribution shift. - We further enhance the dual-teacher knowledge distillation with multi-level consistency between different domains. - We achieve superior performance in extensive experiments.
  • 15. 15 Conclusions - We identify and explore a novel domain adaptive class-incremental learning paradigm for 3D object detection. - We propose a novel 3D object detection framework, in which we design a novel dual-domain copy- paste augmentation method to address both in-domain data scarcity and cross-domain distribution shift. - We further enhance the dual-teacher knowledge distillation with multi-level consistency between different domains. - We achieve superior performance in extensive experiments.

Editor's Notes

  1. Hello everyone. Welcome to my presentation. My name is Zhao Ziyuan. Here I am presenting our paper titled DA-CIL: Towards Domain Adaptive Class-Incremental 3D Object Detection.
  2. Deep learning-based 3D object detection has received considerable attention in 3D point clouds. But there are still many challenges in the real scenario. Firstly, the catastrophic forgetting problem seriously limits model performance in class incremental learning (CIL). Secondly, as shown in the figure, there are distribution gaps between two different datasets, and the distribution gap is part of the domain gap. So, in the presence of the domain shift, deep learning models trained on one domain always suffer tremendous performance degradation when evaluated on another domain.
  3. Existing CIL and UDA methods address different challenges separately, ignoring that the catastrophic forgetting and domain shift concurrently exist in 3D object detection on point clouds of real-world environments. And we aim to leverage labeled old classes on the source domain and labeled new classes on the target domain to close the domain shift and adapt to new classes without forgetting old classes on the target domain in DA-CIL.
  4. As fig3 shows, our DA-CIL framework contains two steps: First, cross-domain intermediate point clouds and in-domain augmented point clouds are generated by dual-domain copy-paste augmentation. Then, a dual-teacher network is constructed to distill knowledge from these domains for domain adaptive class-incremental learning.
  5. As shown in fig 4, our dual-domain copy paste first extracts objects from the source domain to populate the ground-truth database. Then objects are copy-pasted into point clouds from the source domain and the target domain to generate the augmented domain and the intermediate domain, respectively.
  6. To facilitate the dual-teacher knowledge transfer, we propose multi-level consistency regularization from two aspects: Statistics-Level Consistency and Bounding Box-Level Consistency. Statistics-Level Consistency is implemented in the feedforward process and Bounding Box-Level Consistency is calculated as above equations.
  7. Read slide
  8. We evaluate our method on 3D object detection datasets, ScanNet and SUN RGB-D. We set 5 categories as base classes and addition 5 categories in SUN RGB-D as novel classes. The above two figures present the gap between the two datasets intuitively.
  9. We evaluated 2 naive transfer learning baselines Freeze-and-add, and Fine-tune, one recent popular self-ensembling method, Mean Teacher and SDCoT, the co-teaching architecture, as a strong baseline for class-incremental learning. As table 1 shows, our method outperforms many methods by a clear margin in CIL under domain shift.
  10. Read slide
  11. To evaluate the effectiveness of different components in our method for CIL under Domain Shift, we perform ablation studies by removing components in our approach. +slide
  12. As illustrated in left figure, our method is able to accurately detect both old and novel classes in the target domain, overcoming the domain shift in old classes. Moreover, our model consistently outperforms the baseline SDCoT in both scenarios.
  13. Read
  14. Read slide
  15. Read slide