[ICIP 2022] MMGL: Multi-Scale Multi-View Global-Local Contrastive learning for Semi-supervised Cardiac Image Segmentation

MMGL: Multi-Scale Multi-View Global-Local Contrastive learning for
Semi-supervised Cardiac Image Segmentation
Ziyuan Zhao, Jinxuan Hu, Zeng Zeng, Xulei Yang, Peisheng Qian,
Bharadwaj Veeravalli, Cuntai Guan
I2R, A*STAR, Singapore
Nanyang Technological University, Singapore
National University of Singapore, Singapore
ID: 1311

Introduction – Cardiac Image Segmentation
- Cardiovascular disease is one of the most serious threats to human health globally, and automatic
cardiac substructure segmentation can help disease diagnosis and treatment planning.
- Deep learning has been widely used for medical image segmentation
Cardiac Substructure segmentation[1]
[1] Hu, Xinrong , et al. "Semi-supervised Contrastive Learning for Label-efficient Medical Image Segmentation." MICCAI, 2021.

Challenge – Annotation Scarcity
- DCNNs are data-hungry and require large amounts of well-annotated data.
- Annotation process is expensive, laborious, time-consuming, and requires expert knowledge, leading to
label scarcity.
- In this regard, many not-so-supervised methods like contrastive learning have been proposed to reduce
annotation efforts.

Solution - Contrastive Learning
- Intuition: Push original and augmented images (positive pairs) closer and push original and negative
images (negative pairs) away by minimizing contrastive loss.
- Limitation: Mostly used for image-level global representations.
Basic intuition behind contrastive learning paradigm[2]
[2] Jaiswal A , Babu A R , Zadeh M Z , et al. A Survey on Contrastive Self-supervised Learning[J]. 2020.

Related work - Semi-Supervised contrastive learning
Pixel-level local representation:
- use a decoder to extract local representations from the feature map
- use labels to form more accurate contrastive pairs (positive pairs & negative pairs)
[1] Hu, Xinrong , et al. "Semi-supervised Contrastive Learning for Label-efficient Medical Image Segmentation." MICCAI, 2021.
Existing contrastive learning methods for cardiac image segmentation[1]

Motivation
- To realize the 2D segmentation of cardiac substructures via contrastive learning methodology.
- Existing work only applied contrastive learning on the last layer of encoder or decoder, ignoring the rich
multi-scale information for robust representation learning.
- Volumetric information is still under-explored for contrastive learning due to computational complexity.

Method - Overview
- Model Architecture: 4 components
- Multi-scale Multi-view Global-Local contrastive learning framework (MMGL)

Method (1) - Multi-view Co-training
- Different three views from the transaxial plane(x-axis), sagittal plane(y-axis) and coronal plane(z-axis)
have related spatial information (Volumetric information).
- Main view: transaxial view
- Auxiliary views: sagittal view and coronal view. The auxiliary views can add more information.

Method (2) - Multi-scale Global Unsupervised Contrastive Learning
- To extract image-level representation:
- Global unsupervised contrastive loss of the e-th layer:
- Multi-scale global unsupervised contrastive loss:

Method (3) - Multi-scale Local Supervised Contrastive Learning
- To extract pixel-level representation:
- Local supervised contrastive loss for a feature map 𝑓𝑙 :
- Multi-scale local supervised contrastive loss:

Method (4) - Multi-scale Deeply Supervised Learning
- To utilize multi-scale information in fine-tune stage:
- Deeply supervised loss:

Dataset - MM-WHS(Multi-Modality Whole Heart Segmentation)
- The expert labels for MM-WHS consist of 7 cardiac substructures: left ventricle (LV), left atrium (LA),
right ventricle (RV), right atrium (RA), myocardium (MYO), ascending aorta (AA), and pulmonary artery
(PA).
- 20 CT 3D volumes from multiple sites. Training set, validation set, and testing set : (2 : 1 : 1)

Experimental Results
- Comparison with existing methods:
- Random: randomly initialize the model without any pretraining method
- Global: only apply the global unsupervised contrastive learning
- Global+Local: apply unsupervised global and local contrastive learning
- SemiContrast: apply global unsupervised learning and local supervised learning
- MMGL has better performance than other methods and demonstrates the effectiveness of multi-scale
features for contrastive learning with multi-view images.

Ablation Studies
- Ablation experiments demonstrate the effectiveness of the four components since every component has
increased the performance.

Conclusions
- We propose a multi-scale multi-view global and local contrastive learning strategy to solve the problem
of label scarcity in medical image segmentation tasks.
- Add multi-view information in the pre-training stage as auxiliary information without additional
annotations.
- Employed a multi-scale method in the pre-training stage to explicitly leverage the information in the
different hidden layers from both the encoder and decoder parts of the network.
- Injected deep supervision in the fine-tuning stage.

Thanks
Zhao_Ziyuan@i2r.a-star.edu.sg
https://jacobzhaoziyuan.github.io/

[ICIP 2022] MMGL: Multi-Scale Multi-View Global-Local Contrastive learning for Semi-supervised Cardiac Image Segmentation

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to [ICIP 2022] MMGL: Multi-Scale Multi-View Global-Local Contrastive learning for Semi-supervised Cardiac Image Segmentation

Similar to [ICIP 2022] MMGL: Multi-Scale Multi-View Global-Local Contrastive learning for Semi-supervised Cardiac Image Segmentation (20)

More from Ziyuan Zhao

More from Ziyuan Zhao (11)

Recently uploaded

Recently uploaded (20)

[ICIP 2022] MMGL: Multi-Scale Multi-View Global-Local Contrastive learning for Semi-supervised Cardiac Image Segmentation

Editor's Notes