SlideShare a Scribd company logo
MMGL: Multi-Scale Multi-View Global-Local Contrastive learning for
Semi-supervised Cardiac Image Segmentation
Ziyuan Zhao, Jinxuan Hu, Zeng Zeng, Xulei Yang, Peisheng Qian,
Bharadwaj Veeravalli, Cuntai Guan
I2R, A*STAR, Singapore
Nanyang Technological University, Singapore
National University of Singapore, Singapore
ID: 1311
Introduction – Cardiac Image Segmentation
- Cardiovascular disease is one of the most serious threats to human health globally, and automatic
cardiac substructure segmentation can help disease diagnosis and treatment planning.
- Deep learning has been widely used for medical image segmentation
Cardiac Substructure segmentation[1]
[1] Hu, Xinrong , et al. "Semi-supervised Contrastive Learning for Label-efficient Medical Image Segmentation." MICCAI, 2021.
Challenge – Annotation Scarcity
- DCNNs are data-hungry and require large amounts of well-annotated data.
- Annotation process is expensive, laborious, time-consuming, and requires expert knowledge, leading to
label scarcity.
- In this regard, many not-so-supervised methods like contrastive learning have been proposed to reduce
annotation efforts.
Solution - Contrastive Learning
- Intuition: Push original and augmented images (positive pairs) closer and push original and negative
images (negative pairs) away by minimizing contrastive loss.
- Limitation: Mostly used for image-level global representations.
Basic intuition behind contrastive learning paradigm[2]
[2] Jaiswal A , Babu A R , Zadeh M Z , et al. A Survey on Contrastive Self-supervised Learning[J]. 2020.
Related work - Semi-Supervised contrastive learning
Pixel-level local representation:
- use a decoder to extract local representations from the feature map
- use labels to form more accurate contrastive pairs (positive pairs & negative pairs)
[1] Hu, Xinrong , et al. "Semi-supervised Contrastive Learning for Label-efficient Medical Image Segmentation." MICCAI, 2021.
Existing contrastive learning methods for cardiac image segmentation[1]
Motivation
- To realize the 2D segmentation of cardiac substructures via contrastive learning methodology.
- Existing work only applied contrastive learning on the last layer of encoder or decoder, ignoring the rich
multi-scale information for robust representation learning.
- Volumetric information is still under-explored for contrastive learning due to computational complexity.
Method - Overview
- Model Architecture: 4 components
- Multi-scale Multi-view Global-Local contrastive learning framework (MMGL)
Method (1) - Multi-view Co-training
- Different three views from the transaxial plane(x-axis), sagittal plane(y-axis) and coronal plane(z-axis)
have related spatial information (Volumetric information).
- Main view: transaxial view
- Auxiliary views: sagittal view and coronal view. The auxiliary views can add more information.
Method (2) - Multi-scale Global Unsupervised Contrastive Learning
- To extract image-level representation:
- Global unsupervised contrastive loss of the e-th layer:
- Multi-scale global unsupervised contrastive loss:
Method (3) - Multi-scale Local Supervised Contrastive Learning
- To extract pixel-level representation:
- Local supervised contrastive loss for a feature map 𝑓𝑙 :
- Multi-scale local supervised contrastive loss:
Method (4) - Multi-scale Deeply Supervised Learning
- To utilize multi-scale information in fine-tune stage:
- Deeply supervised loss:
Dataset - MM-WHS(Multi-Modality Whole Heart Segmentation)
- The expert labels for MM-WHS consist of 7 cardiac substructures: left ventricle (LV), left atrium (LA),
right ventricle (RV), right atrium (RA), myocardium (MYO), ascending aorta (AA), and pulmonary artery
(PA).
- 20 CT 3D volumes from multiple sites. Training set, validation set, and testing set : (2 : 1 : 1)
Experimental Results
- Comparison with existing methods:
- Random: randomly initialize the model without any pretraining method
- Global: only apply the global unsupervised contrastive learning
- Global+Local: apply unsupervised global and local contrastive learning
- SemiContrast: apply global unsupervised learning and local supervised learning
- MMGL has better performance than other methods and demonstrates the effectiveness of multi-scale
features for contrastive learning with multi-view images.
Ablation Studies
- Ablation experiments demonstrate the effectiveness of the four components since every component has
increased the performance.
Conclusions
- We propose a multi-scale multi-view global and local contrastive learning strategy to solve the problem
of label scarcity in medical image segmentation tasks.
- Add multi-view information in the pre-training stage as auxiliary information without additional
annotations.
- Employed a multi-scale method in the pre-training stage to explicitly leverage the information in the
different hidden layers from both the encoder and decoder parts of the network.
- Injected deep supervision in the fine-tuning stage.
Thanks
Zhao_Ziyuan@i2r.a-star.edu.sg
https://jacobzhaoziyuan.github.io/

More Related Content

What's hot

Brain Tumor Segmentation using Enhanced U-Net Model with Empirical Analysis
Brain Tumor Segmentation using Enhanced U-Net Model with Empirical AnalysisBrain Tumor Segmentation using Enhanced U-Net Model with Empirical Analysis
Brain Tumor Segmentation using Enhanced U-Net Model with Empirical Analysis
MD Abdullah Al Nasim
 
Substructure Search Face-off
Substructure Search Face-offSubstructure Search Face-off
Substructure Search Face-off
NextMove Software
 
SECURE Trial
SECURE TrialSECURE Trial
Data Mining with WEKA WEKA
Data Mining with WEKA WEKAData Mining with WEKA WEKA
Data Mining with WEKA WEKAbutest
 
Explainable deep learning with applications in Healthcare By Sunil Kumar Vupp...
Explainable deep learning with applications in Healthcare By Sunil Kumar Vupp...Explainable deep learning with applications in Healthcare By Sunil Kumar Vupp...
Explainable deep learning with applications in Healthcare By Sunil Kumar Vupp...
Analytics India Magazine
 
Diabetes Data Science
Diabetes Data ScienceDiabetes Data Science
Diabetes Data Science
Philip Bourne
 
Overview of Machine Learning for Molecules and Materials Workshop @ NIPS2017
Overview of Machine Learning for Molecules and Materials Workshop @ NIPS2017Overview of Machine Learning for Molecules and Materials Workshop @ NIPS2017
Overview of Machine Learning for Molecules and Materials Workshop @ NIPS2017
Kenta Oono
 
Single image haze removal
Single image haze removalSingle image haze removal
Single image haze removal
MohsinGhazi2
 
Semantic segmentation with Convolutional Neural Network Approaches
Semantic segmentation with Convolutional Neural Network ApproachesSemantic segmentation with Convolutional Neural Network Approaches
Semantic segmentation with Convolutional Neural Network Approaches
Fellowship at Vodafone FutureLab
 
Introduction to Grad-CAM (complete version)
Introduction to Grad-CAM (complete version)Introduction to Grad-CAM (complete version)
Introduction to Grad-CAM (complete version)
Hsing-chuan Hsieh
 
Machine Learning and Reasoning for Drug Discovery
Machine Learning and Reasoning for Drug DiscoveryMachine Learning and Reasoning for Drug Discovery
Machine Learning and Reasoning for Drug Discovery
Deakin University
 
Capter10 cluster basic : Han & Kamber
Capter10 cluster basic : Han & KamberCapter10 cluster basic : Han & Kamber
Capter10 cluster basic : Han & Kamber
Houw Liong The
 
Classification
ClassificationClassification
Classification
Datamining Tools
 
Presentation3.pdf
Presentation3.pdfPresentation3.pdf
Presentation3.pdf
sanjaysriram9
 
brain tumor ppt.pptx
brain tumor ppt.pptxbrain tumor ppt.pptx
brain tumor ppt.pptx
AdityaSingh728086
 
ROAD SIGN DETECTION USING CONVOLUTIONAL NEURAL NETWORK (CNN)
ROAD SIGN DETECTION USING CONVOLUTIONAL NEURAL NETWORK (CNN)ROAD SIGN DETECTION USING CONVOLUTIONAL NEURAL NETWORK (CNN)
ROAD SIGN DETECTION USING CONVOLUTIONAL NEURAL NETWORK (CNN)
IRJET Journal
 
Brain Tumor Segmentation in MRI Images
Brain Tumor Segmentation in MRI ImagesBrain Tumor Segmentation in MRI Images
Brain Tumor Segmentation in MRI Images
IJRAT
 
Introduction to Grad-CAM (short version)
Introduction to Grad-CAM (short version)Introduction to Grad-CAM (short version)
Introduction to Grad-CAM (short version)
Hsing-chuan Hsieh
 
Expectation maximization
Expectation maximizationExpectation maximization
Expectation maximization
LALAOUIBENCHERIFSIDI
 

What's hot (20)

Brain Tumor Segmentation using Enhanced U-Net Model with Empirical Analysis
Brain Tumor Segmentation using Enhanced U-Net Model with Empirical AnalysisBrain Tumor Segmentation using Enhanced U-Net Model with Empirical Analysis
Brain Tumor Segmentation using Enhanced U-Net Model with Empirical Analysis
 
Substructure Search Face-off
Substructure Search Face-offSubstructure Search Face-off
Substructure Search Face-off
 
SECURE Trial
SECURE TrialSECURE Trial
SECURE Trial
 
Data Mining with WEKA WEKA
Data Mining with WEKA WEKAData Mining with WEKA WEKA
Data Mining with WEKA WEKA
 
Explainable deep learning with applications in Healthcare By Sunil Kumar Vupp...
Explainable deep learning with applications in Healthcare By Sunil Kumar Vupp...Explainable deep learning with applications in Healthcare By Sunil Kumar Vupp...
Explainable deep learning with applications in Healthcare By Sunil Kumar Vupp...
 
Diabetes Data Science
Diabetes Data ScienceDiabetes Data Science
Diabetes Data Science
 
Overview of Machine Learning for Molecules and Materials Workshop @ NIPS2017
Overview of Machine Learning for Molecules and Materials Workshop @ NIPS2017Overview of Machine Learning for Molecules and Materials Workshop @ NIPS2017
Overview of Machine Learning for Molecules and Materials Workshop @ NIPS2017
 
Single image haze removal
Single image haze removalSingle image haze removal
Single image haze removal
 
Semantic segmentation with Convolutional Neural Network Approaches
Semantic segmentation with Convolutional Neural Network ApproachesSemantic segmentation with Convolutional Neural Network Approaches
Semantic segmentation with Convolutional Neural Network Approaches
 
Introduction to Grad-CAM (complete version)
Introduction to Grad-CAM (complete version)Introduction to Grad-CAM (complete version)
Introduction to Grad-CAM (complete version)
 
Machine Learning and Reasoning for Drug Discovery
Machine Learning and Reasoning for Drug DiscoveryMachine Learning and Reasoning for Drug Discovery
Machine Learning and Reasoning for Drug Discovery
 
Capter10 cluster basic : Han & Kamber
Capter10 cluster basic : Han & KamberCapter10 cluster basic : Han & Kamber
Capter10 cluster basic : Han & Kamber
 
Classification
ClassificationClassification
Classification
 
Presentation3.pdf
Presentation3.pdfPresentation3.pdf
Presentation3.pdf
 
brain tumor ppt.pptx
brain tumor ppt.pptxbrain tumor ppt.pptx
brain tumor ppt.pptx
 
ROAD SIGN DETECTION USING CONVOLUTIONAL NEURAL NETWORK (CNN)
ROAD SIGN DETECTION USING CONVOLUTIONAL NEURAL NETWORK (CNN)ROAD SIGN DETECTION USING CONVOLUTIONAL NEURAL NETWORK (CNN)
ROAD SIGN DETECTION USING CONVOLUTIONAL NEURAL NETWORK (CNN)
 
Hmm viterbi
Hmm viterbiHmm viterbi
Hmm viterbi
 
Brain Tumor Segmentation in MRI Images
Brain Tumor Segmentation in MRI ImagesBrain Tumor Segmentation in MRI Images
Brain Tumor Segmentation in MRI Images
 
Introduction to Grad-CAM (short version)
Introduction to Grad-CAM (short version)Introduction to Grad-CAM (short version)
Introduction to Grad-CAM (short version)
 
Expectation maximization
Expectation maximizationExpectation maximization
Expectation maximization
 

Similar to [ICIP 2022] MMGL: Multi-Scale Multi-View Global-Local Contrastive learning for Semi-supervised Cardiac Image Segmentation

[ICIP 2022 - Poster] MMGL: Multi-Scale Multi-View Global-Local Contrastive le...
[ICIP 2022 - Poster] MMGL: Multi-Scale Multi-View Global-Local Contrastive le...[ICIP 2022 - Poster] MMGL: Multi-Scale Multi-View Global-Local Contrastive le...
[ICIP 2022 - Poster] MMGL: Multi-Scale Multi-View Global-Local Contrastive le...
Ziyuan Zhao
 
TL-DTC_Michael Ravoo.pdf
TL-DTC_Michael Ravoo.pdfTL-DTC_Michael Ravoo.pdf
TL-DTC_Michael Ravoo.pdf
FlorinSpainhour
 
Deep Learning-based Fully Automated Detection and Quantification of Acute Inf...
Deep Learning-based Fully Automated Detection and Quantification of Acute Inf...Deep Learning-based Fully Automated Detection and Quantification of Acute Inf...
Deep Learning-based Fully Automated Detection and Quantification of Acute Inf...
Seunghyun Hwang
 
An annotation sparsification strategy for 3D medical image segmentation via r...
An annotation sparsification strategy for 3D medical image segmentation via r...An annotation sparsification strategy for 3D medical image segmentation via r...
An annotation sparsification strategy for 3D medical image segmentation via r...
Seunghyun Hwang
 
An automatic heart disease prediction using cluster-based bidirectional LSTM ...
An automatic heart disease prediction using cluster-based bidirectional LSTM ...An automatic heart disease prediction using cluster-based bidirectional LSTM ...
An automatic heart disease prediction using cluster-based bidirectional LSTM ...
BASMAJUMAASALEHALMOH
 
IRJET- A System to Detect Heart Failure using Deep Learning Techniques
IRJET- A System to Detect Heart Failure using Deep Learning TechniquesIRJET- A System to Detect Heart Failure using Deep Learning Techniques
IRJET- A System to Detect Heart Failure using Deep Learning Techniques
IRJET Journal
 
Predicting heart failure using a wrapper-based feature selection
Predicting heart failure using a wrapper-based feature selectionPredicting heart failure using a wrapper-based feature selection
Predicting heart failure using a wrapper-based feature selection
nooriasukmaningtyas
 
Brain Tumor Extraction from T1- Weighted MRI using Co-clustering and Level Se...
Brain Tumor Extraction from T1- Weighted MRI using Co-clustering and Level Se...Brain Tumor Extraction from T1- Weighted MRI using Co-clustering and Level Se...
Brain Tumor Extraction from T1- Weighted MRI using Co-clustering and Level Se...
CSCJournals
 
A review deep learning for medical image segmentation using multi modality fu...
A review deep learning for medical image segmentation using multi modality fu...A review deep learning for medical image segmentation using multi modality fu...
A review deep learning for medical image segmentation using multi modality fu...
Aykut DİKER
 
Bata-Unet: Deep Learning Model for Liver Segmentation
Bata-Unet: Deep Learning Model for Liver SegmentationBata-Unet: Deep Learning Model for Liver Segmentation
Bata-Unet: Deep Learning Model for Liver Segmentation
sipij
 
BATA-UNET: DEEP LEARNING MODEL FOR LIVER SEGMENTATION
BATA-UNET: DEEP LEARNING MODEL FOR LIVER SEGMENTATIONBATA-UNET: DEEP LEARNING MODEL FOR LIVER SEGMENTATION
BATA-UNET: DEEP LEARNING MODEL FOR LIVER SEGMENTATION
sipij
 
A Novel Efficient Medical Image Segmentation Methodology
A Novel Efficient Medical Image Segmentation MethodologyA Novel Efficient Medical Image Segmentation Methodology
A Novel Efficient Medical Image Segmentation Methodology
aciijournal
 
Augmentation of Multimodal 3D Magnetic Resonance Imaging using Generative Adv...
Augmentation of Multimodal 3D Magnetic Resonance Imaging using Generative Adv...Augmentation of Multimodal 3D Magnetic Resonance Imaging using Generative Adv...
Augmentation of Multimodal 3D Magnetic Resonance Imaging using Generative Adv...
IRJET Journal
 
Retinal Blood Vessels Exudates Classification For Detection Of Hemmorages Tha...
Retinal Blood Vessels Exudates Classification For Detection Of Hemmorages Tha...Retinal Blood Vessels Exudates Classification For Detection Of Hemmorages Tha...
Retinal Blood Vessels Exudates Classification For Detection Of Hemmorages Tha...
IJSRED
 
Hybrid CNN and LSTM Network For Heart Disease Prediction
Hybrid CNN and LSTM Network For Heart Disease PredictionHybrid CNN and LSTM Network For Heart Disease Prediction
Hybrid CNN and LSTM Network For Heart Disease Prediction
BASMAJUMAASALEHALMOH
 
Diagnosis of rheumatoid arthritis using an ensemble learning approach
Diagnosis of rheumatoid arthritis using an ensemble learning approachDiagnosis of rheumatoid arthritis using an ensemble learning approach
Diagnosis of rheumatoid arthritis using an ensemble learning approach
csandit
 
DIAGNOSIS OF RHEUMATOID ARTHRITIS USING AN ENSEMBLE LEARNING APPROACH
DIAGNOSIS OF RHEUMATOID ARTHRITIS USING AN ENSEMBLE LEARNING APPROACH DIAGNOSIS OF RHEUMATOID ARTHRITIS USING AN ENSEMBLE LEARNING APPROACH
DIAGNOSIS OF RHEUMATOID ARTHRITIS USING AN ENSEMBLE LEARNING APPROACH
cscpconf
 
Ultrasound image segmentation through deep learning based improvised U-Net
Ultrasound image segmentation through deep learning based  improvised U-NetUltrasound image segmentation through deep learning based  improvised U-Net
Ultrasound image segmentation through deep learning based improvised U-Net
nooriasukmaningtyas
 
Neuroengineering.pdf
Neuroengineering.pdfNeuroengineering.pdf
Neuroengineering.pdf
Hyunjun10
 
Mri image registration based segmentation framework for whole heart
Mri image registration based segmentation framework for whole heartMri image registration based segmentation framework for whole heart
Mri image registration based segmentation framework for whole heart
eSAT Publishing House
 

Similar to [ICIP 2022] MMGL: Multi-Scale Multi-View Global-Local Contrastive learning for Semi-supervised Cardiac Image Segmentation (20)

[ICIP 2022 - Poster] MMGL: Multi-Scale Multi-View Global-Local Contrastive le...
[ICIP 2022 - Poster] MMGL: Multi-Scale Multi-View Global-Local Contrastive le...[ICIP 2022 - Poster] MMGL: Multi-Scale Multi-View Global-Local Contrastive le...
[ICIP 2022 - Poster] MMGL: Multi-Scale Multi-View Global-Local Contrastive le...
 
TL-DTC_Michael Ravoo.pdf
TL-DTC_Michael Ravoo.pdfTL-DTC_Michael Ravoo.pdf
TL-DTC_Michael Ravoo.pdf
 
Deep Learning-based Fully Automated Detection and Quantification of Acute Inf...
Deep Learning-based Fully Automated Detection and Quantification of Acute Inf...Deep Learning-based Fully Automated Detection and Quantification of Acute Inf...
Deep Learning-based Fully Automated Detection and Quantification of Acute Inf...
 
An annotation sparsification strategy for 3D medical image segmentation via r...
An annotation sparsification strategy for 3D medical image segmentation via r...An annotation sparsification strategy for 3D medical image segmentation via r...
An annotation sparsification strategy for 3D medical image segmentation via r...
 
An automatic heart disease prediction using cluster-based bidirectional LSTM ...
An automatic heart disease prediction using cluster-based bidirectional LSTM ...An automatic heart disease prediction using cluster-based bidirectional LSTM ...
An automatic heart disease prediction using cluster-based bidirectional LSTM ...
 
IRJET- A System to Detect Heart Failure using Deep Learning Techniques
IRJET- A System to Detect Heart Failure using Deep Learning TechniquesIRJET- A System to Detect Heart Failure using Deep Learning Techniques
IRJET- A System to Detect Heart Failure using Deep Learning Techniques
 
Predicting heart failure using a wrapper-based feature selection
Predicting heart failure using a wrapper-based feature selectionPredicting heart failure using a wrapper-based feature selection
Predicting heart failure using a wrapper-based feature selection
 
Brain Tumor Extraction from T1- Weighted MRI using Co-clustering and Level Se...
Brain Tumor Extraction from T1- Weighted MRI using Co-clustering and Level Se...Brain Tumor Extraction from T1- Weighted MRI using Co-clustering and Level Se...
Brain Tumor Extraction from T1- Weighted MRI using Co-clustering and Level Se...
 
A review deep learning for medical image segmentation using multi modality fu...
A review deep learning for medical image segmentation using multi modality fu...A review deep learning for medical image segmentation using multi modality fu...
A review deep learning for medical image segmentation using multi modality fu...
 
Bata-Unet: Deep Learning Model for Liver Segmentation
Bata-Unet: Deep Learning Model for Liver SegmentationBata-Unet: Deep Learning Model for Liver Segmentation
Bata-Unet: Deep Learning Model for Liver Segmentation
 
BATA-UNET: DEEP LEARNING MODEL FOR LIVER SEGMENTATION
BATA-UNET: DEEP LEARNING MODEL FOR LIVER SEGMENTATIONBATA-UNET: DEEP LEARNING MODEL FOR LIVER SEGMENTATION
BATA-UNET: DEEP LEARNING MODEL FOR LIVER SEGMENTATION
 
A Novel Efficient Medical Image Segmentation Methodology
A Novel Efficient Medical Image Segmentation MethodologyA Novel Efficient Medical Image Segmentation Methodology
A Novel Efficient Medical Image Segmentation Methodology
 
Augmentation of Multimodal 3D Magnetic Resonance Imaging using Generative Adv...
Augmentation of Multimodal 3D Magnetic Resonance Imaging using Generative Adv...Augmentation of Multimodal 3D Magnetic Resonance Imaging using Generative Adv...
Augmentation of Multimodal 3D Magnetic Resonance Imaging using Generative Adv...
 
Retinal Blood Vessels Exudates Classification For Detection Of Hemmorages Tha...
Retinal Blood Vessels Exudates Classification For Detection Of Hemmorages Tha...Retinal Blood Vessels Exudates Classification For Detection Of Hemmorages Tha...
Retinal Blood Vessels Exudates Classification For Detection Of Hemmorages Tha...
 
Hybrid CNN and LSTM Network For Heart Disease Prediction
Hybrid CNN and LSTM Network For Heart Disease PredictionHybrid CNN and LSTM Network For Heart Disease Prediction
Hybrid CNN and LSTM Network For Heart Disease Prediction
 
Diagnosis of rheumatoid arthritis using an ensemble learning approach
Diagnosis of rheumatoid arthritis using an ensemble learning approachDiagnosis of rheumatoid arthritis using an ensemble learning approach
Diagnosis of rheumatoid arthritis using an ensemble learning approach
 
DIAGNOSIS OF RHEUMATOID ARTHRITIS USING AN ENSEMBLE LEARNING APPROACH
DIAGNOSIS OF RHEUMATOID ARTHRITIS USING AN ENSEMBLE LEARNING APPROACH DIAGNOSIS OF RHEUMATOID ARTHRITIS USING AN ENSEMBLE LEARNING APPROACH
DIAGNOSIS OF RHEUMATOID ARTHRITIS USING AN ENSEMBLE LEARNING APPROACH
 
Ultrasound image segmentation through deep learning based improvised U-Net
Ultrasound image segmentation through deep learning based  improvised U-NetUltrasound image segmentation through deep learning based  improvised U-Net
Ultrasound image segmentation through deep learning based improvised U-Net
 
Neuroengineering.pdf
Neuroengineering.pdfNeuroengineering.pdf
Neuroengineering.pdf
 
Mri image registration based segmentation framework for whole heart
Mri image registration based segmentation framework for whole heartMri image registration based segmentation framework for whole heart
Mri image registration based segmentation framework for whole heart
 

More from Ziyuan Zhao

[IJCAI 2023] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network for Effi...
[IJCAI 2023] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network for Effi...[IJCAI 2023] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network for Effi...
[IJCAI 2023] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network for Effi...
Ziyuan Zhao
 
[IJCAI 2023 - Poster] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network...
[IJCAI 2023 - Poster] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network...[IJCAI 2023 - Poster] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network...
[IJCAI 2023 - Poster] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network...
Ziyuan Zhao
 
[IAIM 2023 - Poster] Label-efficient Generalizable Deep Learning for Medical...
[IAIM 2023 - Poster] Label-efficient Generalizable Deep Learning  for Medical...[IAIM 2023 - Poster] Label-efficient Generalizable Deep Learning  for Medical...
[IAIM 2023 - Poster] Label-efficient Generalizable Deep Learning for Medical...
Ziyuan Zhao
 
[BMVC 2022 - Spotlight] DA-CIL: Towards Domain Adaptive Class-Incremental 3D ...
[BMVC 2022 - Spotlight] DA-CIL: Towards Domain Adaptive Class-Incremental 3D ...[BMVC 2022 - Spotlight] DA-CIL: Towards Domain Adaptive Class-Incremental 3D ...
[BMVC 2022 - Spotlight] DA-CIL: Towards Domain Adaptive Class-Incremental 3D ...
Ziyuan Zhao
 
[BMVC 2022] DA-CIL: Towards Domain Adaptive Class-Incremental 3D Object Detec...
[BMVC 2022] DA-CIL: Towards Domain Adaptive Class-Incremental 3D Object Detec...[BMVC 2022] DA-CIL: Towards Domain Adaptive Class-Incremental 3D Object Detec...
[BMVC 2022] DA-CIL: Towards Domain Adaptive Class-Incremental 3D Object Detec...
Ziyuan Zhao
 
[ICME 2022] Adaptive Mean-Residue Loss for Robust Facial Age Estimation
[ICME 2022] Adaptive Mean-Residue Loss for Robust Facial Age Estimation[ICME 2022] Adaptive Mean-Residue Loss for Robust Facial Age Estimation
[ICME 2022] Adaptive Mean-Residue Loss for Robust Facial Age Estimation
Ziyuan Zhao
 
[MICCAI 2021] MT-UDA: Towards unsupervised cross-modality medical image segme...
[MICCAI 2021] MT-UDA: Towards unsupervised cross-modality medical image segme...[MICCAI 2021] MT-UDA: Towards unsupervised cross-modality medical image segme...
[MICCAI 2021] MT-UDA: Towards unsupervised cross-modality medical image segme...
Ziyuan Zhao
 
[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...
[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...
[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...
Ziyuan Zhao
 
[EMBC 2021] Multi Slice Dense Sparse Learning for Efficient Liver and Tumor S...
[EMBC 2021] Multi Slice Dense Sparse Learning for Efficient Liver and Tumor S...[EMBC 2021] Multi Slice Dense Sparse Learning for Efficient Liver and Tumor S...
[EMBC 2021] Multi Slice Dense Sparse Learning for Efficient Liver and Tumor S...
Ziyuan Zhao
 
[ICIP 2020] SEA-Net: Squeeze-and-Excitation Attention Net for Diabetic Retino...
[ICIP 2020] SEA-Net: Squeeze-and-Excitation Attention Net for Diabetic Retino...[ICIP 2020] SEA-Net: Squeeze-and-Excitation Attention Net for Diabetic Retino...
[ICIP 2020] SEA-Net: Squeeze-and-Excitation Attention Net for Diabetic Retino...
Ziyuan Zhao
 
[MICCAI 2021 - Poster] MT-UDA: Towards unsupervised cross-modality medical im...
[MICCAI 2021 - Poster] MT-UDA: Towards unsupervised cross-modality medical im...[MICCAI 2021 - Poster] MT-UDA: Towards unsupervised cross-modality medical im...
[MICCAI 2021 - Poster] MT-UDA: Towards unsupervised cross-modality medical im...
Ziyuan Zhao
 

More from Ziyuan Zhao (11)

[IJCAI 2023] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network for Effi...
[IJCAI 2023] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network for Effi...[IJCAI 2023] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network for Effi...
[IJCAI 2023] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network for Effi...
 
[IJCAI 2023 - Poster] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network...
[IJCAI 2023 - Poster] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network...[IJCAI 2023 - Poster] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network...
[IJCAI 2023 - Poster] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network...
 
[IAIM 2023 - Poster] Label-efficient Generalizable Deep Learning for Medical...
[IAIM 2023 - Poster] Label-efficient Generalizable Deep Learning  for Medical...[IAIM 2023 - Poster] Label-efficient Generalizable Deep Learning  for Medical...
[IAIM 2023 - Poster] Label-efficient Generalizable Deep Learning for Medical...
 
[BMVC 2022 - Spotlight] DA-CIL: Towards Domain Adaptive Class-Incremental 3D ...
[BMVC 2022 - Spotlight] DA-CIL: Towards Domain Adaptive Class-Incremental 3D ...[BMVC 2022 - Spotlight] DA-CIL: Towards Domain Adaptive Class-Incremental 3D ...
[BMVC 2022 - Spotlight] DA-CIL: Towards Domain Adaptive Class-Incremental 3D ...
 
[BMVC 2022] DA-CIL: Towards Domain Adaptive Class-Incremental 3D Object Detec...
[BMVC 2022] DA-CIL: Towards Domain Adaptive Class-Incremental 3D Object Detec...[BMVC 2022] DA-CIL: Towards Domain Adaptive Class-Incremental 3D Object Detec...
[BMVC 2022] DA-CIL: Towards Domain Adaptive Class-Incremental 3D Object Detec...
 
[ICME 2022] Adaptive Mean-Residue Loss for Robust Facial Age Estimation
[ICME 2022] Adaptive Mean-Residue Loss for Robust Facial Age Estimation[ICME 2022] Adaptive Mean-Residue Loss for Robust Facial Age Estimation
[ICME 2022] Adaptive Mean-Residue Loss for Robust Facial Age Estimation
 
[MICCAI 2021] MT-UDA: Towards unsupervised cross-modality medical image segme...
[MICCAI 2021] MT-UDA: Towards unsupervised cross-modality medical image segme...[MICCAI 2021] MT-UDA: Towards unsupervised cross-modality medical image segme...
[MICCAI 2021] MT-UDA: Towards unsupervised cross-modality medical image segme...
 
[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...
[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...
[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...
 
[EMBC 2021] Multi Slice Dense Sparse Learning for Efficient Liver and Tumor S...
[EMBC 2021] Multi Slice Dense Sparse Learning for Efficient Liver and Tumor S...[EMBC 2021] Multi Slice Dense Sparse Learning for Efficient Liver and Tumor S...
[EMBC 2021] Multi Slice Dense Sparse Learning for Efficient Liver and Tumor S...
 
[ICIP 2020] SEA-Net: Squeeze-and-Excitation Attention Net for Diabetic Retino...
[ICIP 2020] SEA-Net: Squeeze-and-Excitation Attention Net for Diabetic Retino...[ICIP 2020] SEA-Net: Squeeze-and-Excitation Attention Net for Diabetic Retino...
[ICIP 2020] SEA-Net: Squeeze-and-Excitation Attention Net for Diabetic Retino...
 
[MICCAI 2021 - Poster] MT-UDA: Towards unsupervised cross-modality medical im...
[MICCAI 2021 - Poster] MT-UDA: Towards unsupervised cross-modality medical im...[MICCAI 2021 - Poster] MT-UDA: Towards unsupervised cross-modality medical im...
[MICCAI 2021 - Poster] MT-UDA: Towards unsupervised cross-modality medical im...
 

Recently uploaded

Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
DianaGray10
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Nexer Digital
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
RinaMondal9
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
ThomasParaiso2
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 

Recently uploaded (20)

Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 

[ICIP 2022] MMGL: Multi-Scale Multi-View Global-Local Contrastive learning for Semi-supervised Cardiac Image Segmentation

  • 1. MMGL: Multi-Scale Multi-View Global-Local Contrastive learning for Semi-supervised Cardiac Image Segmentation Ziyuan Zhao, Jinxuan Hu, Zeng Zeng, Xulei Yang, Peisheng Qian, Bharadwaj Veeravalli, Cuntai Guan I2R, A*STAR, Singapore Nanyang Technological University, Singapore National University of Singapore, Singapore ID: 1311
  • 2. Introduction – Cardiac Image Segmentation - Cardiovascular disease is one of the most serious threats to human health globally, and automatic cardiac substructure segmentation can help disease diagnosis and treatment planning. - Deep learning has been widely used for medical image segmentation Cardiac Substructure segmentation[1] [1] Hu, Xinrong , et al. "Semi-supervised Contrastive Learning for Label-efficient Medical Image Segmentation." MICCAI, 2021.
  • 3. Challenge – Annotation Scarcity - DCNNs are data-hungry and require large amounts of well-annotated data. - Annotation process is expensive, laborious, time-consuming, and requires expert knowledge, leading to label scarcity. - In this regard, many not-so-supervised methods like contrastive learning have been proposed to reduce annotation efforts.
  • 4. Solution - Contrastive Learning - Intuition: Push original and augmented images (positive pairs) closer and push original and negative images (negative pairs) away by minimizing contrastive loss. - Limitation: Mostly used for image-level global representations. Basic intuition behind contrastive learning paradigm[2] [2] Jaiswal A , Babu A R , Zadeh M Z , et al. A Survey on Contrastive Self-supervised Learning[J]. 2020.
  • 5. Related work - Semi-Supervised contrastive learning Pixel-level local representation: - use a decoder to extract local representations from the feature map - use labels to form more accurate contrastive pairs (positive pairs & negative pairs) [1] Hu, Xinrong , et al. "Semi-supervised Contrastive Learning for Label-efficient Medical Image Segmentation." MICCAI, 2021. Existing contrastive learning methods for cardiac image segmentation[1]
  • 6. Motivation - To realize the 2D segmentation of cardiac substructures via contrastive learning methodology. - Existing work only applied contrastive learning on the last layer of encoder or decoder, ignoring the rich multi-scale information for robust representation learning. - Volumetric information is still under-explored for contrastive learning due to computational complexity.
  • 7. Method - Overview - Model Architecture: 4 components - Multi-scale Multi-view Global-Local contrastive learning framework (MMGL)
  • 8. Method (1) - Multi-view Co-training - Different three views from the transaxial plane(x-axis), sagittal plane(y-axis) and coronal plane(z-axis) have related spatial information (Volumetric information). - Main view: transaxial view - Auxiliary views: sagittal view and coronal view. The auxiliary views can add more information.
  • 9. Method (2) - Multi-scale Global Unsupervised Contrastive Learning - To extract image-level representation: - Global unsupervised contrastive loss of the e-th layer: - Multi-scale global unsupervised contrastive loss:
  • 10. Method (3) - Multi-scale Local Supervised Contrastive Learning - To extract pixel-level representation: - Local supervised contrastive loss for a feature map 𝑓𝑙 : - Multi-scale local supervised contrastive loss:
  • 11. Method (4) - Multi-scale Deeply Supervised Learning - To utilize multi-scale information in fine-tune stage: - Deeply supervised loss:
  • 12. Dataset - MM-WHS(Multi-Modality Whole Heart Segmentation) - The expert labels for MM-WHS consist of 7 cardiac substructures: left ventricle (LV), left atrium (LA), right ventricle (RV), right atrium (RA), myocardium (MYO), ascending aorta (AA), and pulmonary artery (PA). - 20 CT 3D volumes from multiple sites. Training set, validation set, and testing set : (2 : 1 : 1)
  • 13. Experimental Results - Comparison with existing methods: - Random: randomly initialize the model without any pretraining method - Global: only apply the global unsupervised contrastive learning - Global+Local: apply unsupervised global and local contrastive learning - SemiContrast: apply global unsupervised learning and local supervised learning - MMGL has better performance than other methods and demonstrates the effectiveness of multi-scale features for contrastive learning with multi-view images.
  • 14. Ablation Studies - Ablation experiments demonstrate the effectiveness of the four components since every component has increased the performance.
  • 15. Conclusions - We propose a multi-scale multi-view global and local contrastive learning strategy to solve the problem of label scarcity in medical image segmentation tasks. - Add multi-view information in the pre-training stage as auxiliary information without additional annotations. - Employed a multi-scale method in the pre-training stage to explicitly leverage the information in the different hidden layers from both the encoder and decoder parts of the network. - Injected deep supervision in the fine-tuning stage.

Editor's Notes

  1. Welcome to my presentation. My name is Zhao Ziyuan. Here I am presenting Multi-Scale Multi-View Global-Local Contrastive learning for Semi-supervised Cardiac Image Segmentation. MMGL for short.
  2. According to World Health Organization, heart diseases are the leading cause of death. Cardiac image segmentation is a crucial step in many clinical applications related to heart diseases. So, automatic segmentation of cardiac substructures is very meaningful, can help release the workload of doctors and facilitate diagnosis and treatment planning. Recently, deep convolutional neural networks have achieved markable progress in medical image segmentation.
  3. However, deep learning methods are usually data-hungry. In practice, when we conduct medical image segmentation tasks, the main challenge we met is the scarcity of annotation. Therefore, we need to use some not-so-supervised label-efficient methods to reduce annotation efforts and tackle this challenge.
  4. Contrastive learning is one of those promising and popular methods. The intuition behind contrastive learning is minimizing defined contrastive loss to pull similar representations closer while pushing the dissimilar ones further away. So following this intuition, when we design our model, we mainly focus on the design of the contrastive loss and focus on how to define the similar pairs and the dissimilar pairs. It is worth noting that contrastive learning as a variant of self-supervised learning is just a pre-training strategy. That means though we do not need labels in the pre-training stage, we still need some labels in the downstream task in the fine-tuning stage to complete the segmentation. Contrastive learning also has one limitation, it is usually used for extracting image-level global representations for classification tasks. While some works have been working on extracting pixel-level local representations. The most recent one is semi-supervised contrastive learning.
  5. For global unsupervised contrastive learning, they only used the encoder to extract image-level features in the pretraining stage without any labels. However, for semi-supervised contrastive learning, they added the decoder to extract pixel-level local features and they used a small portion of labeled data to decide the positive and negative contrastive pairs and thus extracted more accurate local features.
  6. However, the above-mentioned related work has some limitations. Firstly, all of them only applied contrastive learning on the last layer of the encoder or decoder, ignoring the rich multi-scale information that can be extracted from different layers of the network. Secondly, the volumetric information between these 2D slices is ignored due to computational complexity and can be further explored.
  7. So in our work, we propose a Multi-scale Multi-view Global-Local semi-supervised contrastive learning framework. We fully take advantage of multi-view co-training and multi-scale learning in both the pre-train and fine-tune stages of the segmentation workflow. For the pretraining stage, we first design a multi-scale global unsupervised contrastive learning module to extract global features from different views and encoder layers. Next, we develop a multi-scale local supervised contrastive learning scheme by adding the decoder layers on top of the encoder to extract multi-scale local features from different views. In the fine-tuning stage, we train the segmentation model with multi-scale deeply supervised learning with a few labels.
  8. There are three imaging planes of the heart. Transaxial plane, sagittal plane, and coronal plane. They have related spatial information. So, in our work, we set the transaxial plane as our main view which is the target view for segmentation and we set the other two views as the auxiliary views for more information. When we implement this module, we just simply add the auxiliary views’ images into the training set in the pre-training stage to form the multi-view images.
  9. For Multi-scale Global Unsupervised Contrastive Learning, we aim to extract image-level global representations from different layers of the network. The encoder part of the UNet is often used to extract global representations. To form contrastive pairs, we forward a batch of multi-view inputs through two augmentation pipelines to get an augmentation set. Then, to extract multi-scale global representations, we add projection heads after each block of the encoder, enhancing the representation ability of extracted features. Then, we calculate the global unsupervised loss at different layers. To get multi-scale global unsupervised contrastive loss, we simply sum up global contrastive losses from different levels. The λ is the balancing weight.
  10. Multi-scale global unsupervised contrastive learning only learns the image-level features. But, for the segmentation task, we need pixel-level information. So, to learn the dense pixel-level features, we introduce the local supervised contrastive loss. By adding label information, we can form positive and negative pairs more precisely. The local supervised contrastive loss is calculated as in the equation which is a variation of the global loss. Ω contains the total points in the feature map. P contains the positive points of the feature map that share the same annotation and N contains the negative points that have different annotations. Similar to the multi-scale global loss, to get multi-scale local supervised contrastive loss, we simply sum up local contrastive losses from different levels. The λ is the balancing weight.
  11. In the last fine-tune stage, we adopt the multi-scale deeply supervision strategy with a small portion of labelled data to utilize multi-scale information in different levels and we only trained with the main view images. The deeply supervised loss is formulated here.
  12. In this paper, the dataset we use is the public dataset provided in Multi-Modality Whole Heart Segmentation Challenge, MMWHS for short. The expert labels for MM-WHS consist of 7 cardiac substructures as listed here. For the experiment, we utilize the given 20 CT 3D volumes to train and evaluate our model The dataset is split into a training set, validation set, and testing set with a ratio of (2: 1: 1).
  13. Compared to the random baseline, all different contrastive learning methods can improve the performance against the label scarcity problem. The semi-Contrast method obtains better performance than others since it leverages label information for contrastive learning. We also observe that the proposed method has better performance than other methods with the same portion of labeled data, demonstrating the effectiveness of multi-scale features for contrastive learning with multi-view images. Our MMGL model achieves up to 84.9% in Dice score when training with 40% labeled data, which is the highest performance. We can also observe that all combinations of weights outperform other contrastive learning methods.
  14. To further analyze the effectiveness of different key components of MMGL, we also perform a comprehensive ablation analysis, we repeat 4 times to get the mean and standard deviation. We start by the random baseline and achieve 72.3% in Dice with 20% labeled data. By adding deeply supervised learning, the metric is improved to 73.8%. By adding multi-scale global learning, the metric is improved to 77.3%. When applying multi-view images, the metric is enhanced by 3.8%. Finally, MMGL increases the performance to 82.8% by adding the multi-scale local learning. In total, the proposed method improves the performance by 10.5%. These experiments demonstrate the effectiveness of the four components.
  15. In conclusion, we propose a multi-scale multi-view global and local contrastive learning strategy to solve the problem of label scarcity in medical image segmentation tasks. First, we added multi-view information in the pre-training stage as auxiliary information without additional annotations. Second, we employed a multi-scale method in the pre-training stage to explicitly leverage the information in the different hidden layers from both the encoder and decoder parts of the network. Thirdly, we injected deep supervision in the fine-tuning stage. In the ablation study, we prove that all our components can improve the performance of contrastive self-supervised medical image segmentation.
  16. Thanks for listening to my presentation.