SlideShare a Scribd company logo
1 of 24
MEDICAL IMAGE DATA AUGMENTATION
USING GANS
Dr. Hazrat Ali
Umea University, Sweden
27 May 2020
Hazrat Ali, MT-FoU, Umea University 1
ACKNOWLEDGMENTS AND CREDITS
The slides are based on Jerry Wei et al.,. Figures are
copied from the paper for demonstration purpose.
Copyrights are held by authors/publishers.
[1] Jerry Wei, et al., Generative Image Translation for Data Augmentation in Colorectal Histopathology
Images, Machine Learning for Health Workshop at NeurIPS 2019.
Hazrat Ali, MT-FoU, Umea University 2
TO TRAIN DEEP LEARNING MODELS
 Expectation
Positive
(Disease)
Normal
(No disease)
 Reality
Positive
(Disease)
Normal
(No disease)
Hazrat Ali, MT-FoU, Umea University 3
TO TRAIN DEEP LEARNING MODELS
 Expectation
Positive
(Disease)
Normal
(No disease)
 Reality
Positive
(Disease)
Normal
(No disease)
P
Normal
(No disease)
Hazrat Ali, MT-FoU, Umea University 4
TO TRAIN DEEP LEARNING MODELS
We need large datasets
Hazrat Ali, MT-FoU, Umea University 5
For rare diseases, training a deep learning model
can be very challenging
Increase the data by augmentation techniques
DataAugmentation
Rotation
Flip
Color jittering
Hazrat Ali, MT-FoU, Umea University 6
USING GENERATIVE MODELS
 Generate completely new images
 Hypothesis: The generation may work better if we can use data from one
class to generate data of the other class.
Hazrat Ali, MT-FoU, Umea University 7
USING GENERATIVE MODELS
 Generate completely new images
 Hypothesis: The generation may work better if we can use data from one
class to generate data of the other class
 Consider it a transformation
 And not base the generation on random noise input.
X Y
Hazrat Ali, MT-FoU, Umea University 8
USING GENERATIVE MODELS: CASE STUDY
 Normal colonic mucosa images to adenomatous preneoplastic
polyps images
 Use cycleGAN
 X: Normal Colonic Mucosa Images (Many)
 Y: Tubular ademona or sessile serrated adenoma (Few)
X Y
Hazrat Ali, MT-FoU, Umea University 9
MODEL DESIGN
Figure 1: Process for generating synthetic histopathology images of rare colorectal polyp classes. Path-Rank-Filter (i-ii)
enhances the adenomatous features in generated images by filtering the training data for CycleGAN for only images
with strong adenomatous features. Image: Jerry Wei et al., 2019 Hazrat Ali, MT-FoU, Umea University 10
KEY POINTS
 Filtering through Resnet
 GANs for transformation
 Evaluation through 4 pathologists
 Evaluations through resnet classifier (Can the augmented data improve the
classifier training?)
Hazrat Ali, MT-FoU, Umea University 11
KEY POINTS
 For training of GAN, use images with more prominent features
 Text from Jerry Wei et al.,
Hazrat Ali, MT-FoU, Umea University 12
DATA DISTRIBUTION
Normal vs Adenomatous images
 HP: Hyperplastic polp (Benign)
 NO: Normal colonic mucosa (Benign)
 TVA: Tubulovillous/villous Adenoma
 TA: Tubular Adenoma
 SSA: Sessile Serrated Adenoma
Hazrat Ali, MT-FoU, Umea University 13
DATA DISTRIBUTION
Normal vs Adenomatous images
 HP: Hyperplastic polp (Benign)
 NO: Normal colonic mucosa (Benign)
 TVA: Tubulovillous/villous Adenoma
 TA: Tubular Adenoma (only 15%)
 SSA: Sessile Serrated Adenoma (only 3%)
Hazrat Ali, MT-FoU, Umea University 14
USING GENERATIVE MODELS: CASE STUDY
 Normal colonic mucosa images to adenomatous preneoplastic
polyps images
 Use cycleGAN
 X: Normal Colonic Mucosa Images (Many)
 Y: Tubular ademona or sessile serrated adenoma (Few)
X Y
Hazrat Ali, MT-FoU, Umea University 15
USING GENERATIVE MODELS: CASE STUDY
 Normal colonic mucosa images to adenomatous preneoplastic
polyps images
 Use cycleGAN
 X: Normal Colonic Mucosa Images (9054 images)
 Y: Tubular ademona or sessile serrated adenoma (9054 images)
X Y
Hazrat Ali, MT-FoU, Umea University 16
EVALUATIONS
Hazrat Ali, MT-FoU, Umea University 17
Figure 3: CycleGAN’s generated images for different values of alpha. For instance, alpha =1/4 means
that the top 25% of images with the highest output probabilities from a ResNet were used to train
CycleGAN. TA: tubular adenoma, TVA: tubulovillous/villous adenoma, SSA: sessile serrated
adenoma. For TA and TVA, adenomatous features were enhanced at smaller values. Jerry Wei et al.,
EVALUATIONS
Evaluation by 4 pathologists – Tubular Adenoma Images
 Present 100 real and 100 generated (fake) images, p < 0.05 is statically significant
Hazrat Ali, MT-FoU, Umea University 18
Imagefrom:JerryWeietal.,
No. of pathologists
correctly predicting it
EVALUATIONS
Evaluation by 4 pathologists – Sessile Serrated Adenoma Images
 Present 100 real and 100 generated (fake) images, p < 0.05 is statically significant
Hazrat Ali, MT-FoU, Umea University 19
Imagefrom:JerryWeietal.,
EVALUATION
Does it improve the machine learning model?
 Train a Resnet classifier with and without generated data
Hazrat Ali, MT-FoU, Umea University 20
ResNet
X
Contains
augmented
data
EVALUATION
Hazrat Ali, MT-FoU, Umea University 21
Figure 5: A: AUCs of ResNets trained on real images with synthetic images from different generative models
given as additional training data. B: AUCs of ResNets trained without real images and with synthetic images
from different generative models as the only available training data. In both experiments, the ResNet that was
trained with CycleGAN’s synthetic images had the highest AUC.
Imagefrom:JerryWeietal.,
GENERATIVE ADVERSARIAL NETWORKS
 GAN
 G1(xa) = xb
Hazrat Ali, MT-FoU, Umea University 22
Imagefrom:XinYietal.,
GENERATIVE ADVERSARIAL NETWORKS
 CycleGAN
 G1(xa) = xb
 G2(xb) = xa
Hazrat Ali, MT-FoU, Umea University 23
Imagefrom:XinYietal.,
USEFUL RESOURCES
 Xin Yi et al., Generative Adversarial Network in Medical Imaging: A Review,
https://arxiv.org/abs/1809.07294
 Jerry Wei, et al., Generative Image Translation for Data Augmentation in Colorectal Histopathology
Images, Machine Learning for Health Workshop at NeurIPS 2019
 J. Zhu et al., "Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks,"
2017 IEEE International Conference on Computer Vision (ICCV), Venice, 2017, pp. 2242-2251.
 Talha Iqbal, Hazrat Ali, “Generative Adversarial Network for Medical Images (MI-GAN)”, Journal of
Medical Systems, vol 42, November 2018
 The GAN zoo: https://github.com/hindupuravinash/the-gan-zoo
Hazrat Ali, MT-FoU, Umea University 24

More Related Content

Similar to Medical Image Data Augmentation using GANs

18F-FDG PET/MRI VS PET/CT IN DETECTION OF BONE METASTASIS IN BREAST CANCER PA...
18F-FDG PET/MRI VS PET/CT IN DETECTION OF BONE METASTASIS IN BREAST CANCER PA...18F-FDG PET/MRI VS PET/CT IN DETECTION OF BONE METASTASIS IN BREAST CANCER PA...
18F-FDG PET/MRI VS PET/CT IN DETECTION OF BONE METASTASIS IN BREAST CANCER PA...Elgha Parambi
 
Benefit vs[1][1][2][1]. risk in children's exposure to radiation for medical ...
Benefit vs[1][1][2][1]. risk in children's exposure to radiation for medical ...Benefit vs[1][1][2][1]. risk in children's exposure to radiation for medical ...
Benefit vs[1][1][2][1]. risk in children's exposure to radiation for medical ...Prunariu Ludmila
 
Effectiveness of Via and Vili as A Screening Tool for Cervical Intra Epitheli...
Effectiveness of Via and Vili as A Screening Tool for Cervical Intra Epitheli...Effectiveness of Via and Vili as A Screening Tool for Cervical Intra Epitheli...
Effectiveness of Via and Vili as A Screening Tool for Cervical Intra Epitheli...iosrjce
 
1.4 Holger Woehrle
1.4 Holger Woehrle1.4 Holger Woehrle
1.4 Holger Woehrlebrnmomentum
 
High fidelity simulation for healthcare education iii
High fidelity simulation for healthcare education iiiHigh fidelity simulation for healthcare education iii
High fidelity simulation for healthcare education iiiHelen Wood
 
High fidelity simulation for healthcare education iii
High fidelity simulation for healthcare education iiiHigh fidelity simulation for healthcare education iii
High fidelity simulation for healthcare education iiiHelen Wood
 
ICCV 2019 REVIEW [CDM]
ICCV 2019 REVIEW [CDM]ICCV 2019 REVIEW [CDM]
ICCV 2019 REVIEW [CDM]Dongmin Choi
 
COVID and Causal Inference -- NAS CATS 6/2020
COVID and Causal Inference -- NAS CATS 6/2020COVID and Causal Inference -- NAS CATS 6/2020
COVID and Causal Inference -- NAS CATS 6/2020Ellie Murray
 
Rapid Response Team Essay.docx
Rapid Response Team Essay.docxRapid Response Team Essay.docx
Rapid Response Team Essay.docxwrite22
 
BDW16 London - Mishal Patel, NHS - Modernising Routine Breast Cancer Using Bi...
BDW16 London - Mishal Patel, NHS - Modernising Routine Breast Cancer Using Bi...BDW16 London - Mishal Patel, NHS - Modernising Routine Breast Cancer Using Bi...
BDW16 London - Mishal Patel, NHS - Modernising Routine Breast Cancer Using Bi...Big Data Week
 

Similar to Medical Image Data Augmentation using GANs (15)

18F-FDG PET/MRI VS PET/CT IN DETECTION OF BONE METASTASIS IN BREAST CANCER PA...
18F-FDG PET/MRI VS PET/CT IN DETECTION OF BONE METASTASIS IN BREAST CANCER PA...18F-FDG PET/MRI VS PET/CT IN DETECTION OF BONE METASTASIS IN BREAST CANCER PA...
18F-FDG PET/MRI VS PET/CT IN DETECTION OF BONE METASTASIS IN BREAST CANCER PA...
 
Benefit vs[1][1][2][1]. risk in children's exposure to radiation for medical ...
Benefit vs[1][1][2][1]. risk in children's exposure to radiation for medical ...Benefit vs[1][1][2][1]. risk in children's exposure to radiation for medical ...
Benefit vs[1][1][2][1]. risk in children's exposure to radiation for medical ...
 
Effectiveness of Via and Vili as A Screening Tool for Cervical Intra Epitheli...
Effectiveness of Via and Vili as A Screening Tool for Cervical Intra Epitheli...Effectiveness of Via and Vili as A Screening Tool for Cervical Intra Epitheli...
Effectiveness of Via and Vili as A Screening Tool for Cervical Intra Epitheli...
 
1.4 Holger Woehrle
1.4 Holger Woehrle1.4 Holger Woehrle
1.4 Holger Woehrle
 
The Lachman Test
The Lachman TestThe Lachman Test
The Lachman Test
 
Identifying and mitigating bias in machine learning, by Ruta Binkyte
Identifying and mitigating bias in machine learning, by Ruta BinkyteIdentifying and mitigating bias in machine learning, by Ruta Binkyte
Identifying and mitigating bias in machine learning, by Ruta Binkyte
 
Dubrowski final 1
Dubrowski final 1Dubrowski final 1
Dubrowski final 1
 
High fidelity simulation for healthcare education iii
High fidelity simulation for healthcare education iiiHigh fidelity simulation for healthcare education iii
High fidelity simulation for healthcare education iii
 
High fidelity simulation for healthcare education iii
High fidelity simulation for healthcare education iiiHigh fidelity simulation for healthcare education iii
High fidelity simulation for healthcare education iii
 
ICCV 2019 REVIEW [CDM]
ICCV 2019 REVIEW [CDM]ICCV 2019 REVIEW [CDM]
ICCV 2019 REVIEW [CDM]
 
COVID and Causal Inference -- NAS CATS 6/2020
COVID and Causal Inference -- NAS CATS 6/2020COVID and Causal Inference -- NAS CATS 6/2020
COVID and Causal Inference -- NAS CATS 6/2020
 
AI in eHealth
AI in eHealthAI in eHealth
AI in eHealth
 
11 am sim
11 am sim11 am sim
11 am sim
 
Rapid Response Team Essay.docx
Rapid Response Team Essay.docxRapid Response Team Essay.docx
Rapid Response Team Essay.docx
 
BDW16 London - Mishal Patel, NHS - Modernising Routine Breast Cancer Using Bi...
BDW16 London - Mishal Patel, NHS - Modernising Routine Breast Cancer Using Bi...BDW16 London - Mishal Patel, NHS - Modernising Routine Breast Cancer Using Bi...
BDW16 London - Mishal Patel, NHS - Modernising Routine Breast Cancer Using Bi...
 

Recently uploaded

Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 

Recently uploaded (20)

Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 

Medical Image Data Augmentation using GANs

  • 1. MEDICAL IMAGE DATA AUGMENTATION USING GANS Dr. Hazrat Ali Umea University, Sweden 27 May 2020 Hazrat Ali, MT-FoU, Umea University 1
  • 2. ACKNOWLEDGMENTS AND CREDITS The slides are based on Jerry Wei et al.,. Figures are copied from the paper for demonstration purpose. Copyrights are held by authors/publishers. [1] Jerry Wei, et al., Generative Image Translation for Data Augmentation in Colorectal Histopathology Images, Machine Learning for Health Workshop at NeurIPS 2019. Hazrat Ali, MT-FoU, Umea University 2
  • 3. TO TRAIN DEEP LEARNING MODELS  Expectation Positive (Disease) Normal (No disease)  Reality Positive (Disease) Normal (No disease) Hazrat Ali, MT-FoU, Umea University 3
  • 4. TO TRAIN DEEP LEARNING MODELS  Expectation Positive (Disease) Normal (No disease)  Reality Positive (Disease) Normal (No disease) P Normal (No disease) Hazrat Ali, MT-FoU, Umea University 4
  • 5. TO TRAIN DEEP LEARNING MODELS We need large datasets Hazrat Ali, MT-FoU, Umea University 5
  • 6. For rare diseases, training a deep learning model can be very challenging Increase the data by augmentation techniques DataAugmentation Rotation Flip Color jittering Hazrat Ali, MT-FoU, Umea University 6
  • 7. USING GENERATIVE MODELS  Generate completely new images  Hypothesis: The generation may work better if we can use data from one class to generate data of the other class. Hazrat Ali, MT-FoU, Umea University 7
  • 8. USING GENERATIVE MODELS  Generate completely new images  Hypothesis: The generation may work better if we can use data from one class to generate data of the other class  Consider it a transformation  And not base the generation on random noise input. X Y Hazrat Ali, MT-FoU, Umea University 8
  • 9. USING GENERATIVE MODELS: CASE STUDY  Normal colonic mucosa images to adenomatous preneoplastic polyps images  Use cycleGAN  X: Normal Colonic Mucosa Images (Many)  Y: Tubular ademona or sessile serrated adenoma (Few) X Y Hazrat Ali, MT-FoU, Umea University 9
  • 10. MODEL DESIGN Figure 1: Process for generating synthetic histopathology images of rare colorectal polyp classes. Path-Rank-Filter (i-ii) enhances the adenomatous features in generated images by filtering the training data for CycleGAN for only images with strong adenomatous features. Image: Jerry Wei et al., 2019 Hazrat Ali, MT-FoU, Umea University 10
  • 11. KEY POINTS  Filtering through Resnet  GANs for transformation  Evaluation through 4 pathologists  Evaluations through resnet classifier (Can the augmented data improve the classifier training?) Hazrat Ali, MT-FoU, Umea University 11
  • 12. KEY POINTS  For training of GAN, use images with more prominent features  Text from Jerry Wei et al., Hazrat Ali, MT-FoU, Umea University 12
  • 13. DATA DISTRIBUTION Normal vs Adenomatous images  HP: Hyperplastic polp (Benign)  NO: Normal colonic mucosa (Benign)  TVA: Tubulovillous/villous Adenoma  TA: Tubular Adenoma  SSA: Sessile Serrated Adenoma Hazrat Ali, MT-FoU, Umea University 13
  • 14. DATA DISTRIBUTION Normal vs Adenomatous images  HP: Hyperplastic polp (Benign)  NO: Normal colonic mucosa (Benign)  TVA: Tubulovillous/villous Adenoma  TA: Tubular Adenoma (only 15%)  SSA: Sessile Serrated Adenoma (only 3%) Hazrat Ali, MT-FoU, Umea University 14
  • 15. USING GENERATIVE MODELS: CASE STUDY  Normal colonic mucosa images to adenomatous preneoplastic polyps images  Use cycleGAN  X: Normal Colonic Mucosa Images (Many)  Y: Tubular ademona or sessile serrated adenoma (Few) X Y Hazrat Ali, MT-FoU, Umea University 15
  • 16. USING GENERATIVE MODELS: CASE STUDY  Normal colonic mucosa images to adenomatous preneoplastic polyps images  Use cycleGAN  X: Normal Colonic Mucosa Images (9054 images)  Y: Tubular ademona or sessile serrated adenoma (9054 images) X Y Hazrat Ali, MT-FoU, Umea University 16
  • 17. EVALUATIONS Hazrat Ali, MT-FoU, Umea University 17 Figure 3: CycleGAN’s generated images for different values of alpha. For instance, alpha =1/4 means that the top 25% of images with the highest output probabilities from a ResNet were used to train CycleGAN. TA: tubular adenoma, TVA: tubulovillous/villous adenoma, SSA: sessile serrated adenoma. For TA and TVA, adenomatous features were enhanced at smaller values. Jerry Wei et al.,
  • 18. EVALUATIONS Evaluation by 4 pathologists – Tubular Adenoma Images  Present 100 real and 100 generated (fake) images, p < 0.05 is statically significant Hazrat Ali, MT-FoU, Umea University 18 Imagefrom:JerryWeietal., No. of pathologists correctly predicting it
  • 19. EVALUATIONS Evaluation by 4 pathologists – Sessile Serrated Adenoma Images  Present 100 real and 100 generated (fake) images, p < 0.05 is statically significant Hazrat Ali, MT-FoU, Umea University 19 Imagefrom:JerryWeietal.,
  • 20. EVALUATION Does it improve the machine learning model?  Train a Resnet classifier with and without generated data Hazrat Ali, MT-FoU, Umea University 20 ResNet X Contains augmented data
  • 21. EVALUATION Hazrat Ali, MT-FoU, Umea University 21 Figure 5: A: AUCs of ResNets trained on real images with synthetic images from different generative models given as additional training data. B: AUCs of ResNets trained without real images and with synthetic images from different generative models as the only available training data. In both experiments, the ResNet that was trained with CycleGAN’s synthetic images had the highest AUC. Imagefrom:JerryWeietal.,
  • 22. GENERATIVE ADVERSARIAL NETWORKS  GAN  G1(xa) = xb Hazrat Ali, MT-FoU, Umea University 22 Imagefrom:XinYietal.,
  • 23. GENERATIVE ADVERSARIAL NETWORKS  CycleGAN  G1(xa) = xb  G2(xb) = xa Hazrat Ali, MT-FoU, Umea University 23 Imagefrom:XinYietal.,
  • 24. USEFUL RESOURCES  Xin Yi et al., Generative Adversarial Network in Medical Imaging: A Review, https://arxiv.org/abs/1809.07294  Jerry Wei, et al., Generative Image Translation for Data Augmentation in Colorectal Histopathology Images, Machine Learning for Health Workshop at NeurIPS 2019  J. Zhu et al., "Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks," 2017 IEEE International Conference on Computer Vision (ICCV), Venice, 2017, pp. 2242-2251.  Talha Iqbal, Hazrat Ali, “Generative Adversarial Network for Medical Images (MI-GAN)”, Journal of Medical Systems, vol 42, November 2018  The GAN zoo: https://github.com/hindupuravinash/the-gan-zoo Hazrat Ali, MT-FoU, Umea University 24