Practical aspects of medical image ai for hospital (IRB course)

Medical Image AI:
Progress and Challenges
Sean Yu, Data Scientist

• Basis concept of AI and common applications
• Experiences of developing AI applications in aetherAI
• Challenges & Impacts in medical AI
Outlines

What is Deep
Learning ?
A.I.
Ex. Expert System
Machine
Learning
Ex. Logistic regression
Representation
Learning
Ex. Shallow
autoencoder
Deep
Learning
Ex. MLPs
What is A.I.?
Source: Ian Goodfellow et al. Deep Learning
1956 Dartmouth AI Workshop

Deep Learning Applications in
Computer Vision

Safety & Security - umboCV
Security：Face Recognition
Video Editing

Entertainment – drone tracking
Source
Entertainment (AR) - Octi

Misc：Inverse Cooking Recipe
Retail：Amazon Go
Intelligence Agriculture

Generative Adversarial Network in Medical Imaging: A Review, Xin Yi et.al (2018)
Synthetic data through
generative model

Some computer vision researches/applications in medical
image analysis
Mammographic
Mass Classification
Diabetic
Retinopathy
Detection
Breast Cancer
Metastasis
Detection
Airway
Segmentation of
Chest CT Image
Lung Nodule
Detection
Bone Suppression
in X-Ray Image
Skin Disease
Classification
https://arxiv.org/abs/1702.05747

• Ideal World
• 𝑓 𝑑𝑖𝑠𝑒𝑎𝑠𝑒 = 𝑟𝑎𝑑𝑖𝑜𝑔𝑟𝑎𝑝ℎ𝑜𝑓𝑑𝑖𝑠𝑒𝑎𝑠𝑒 , 𝑓 ∶ 𝑡𝑟𝑢𝑡ℎ 𝑓𝑢𝑛𝑐𝑡𝑖𝑜𝑛
• 𝑔 𝑟𝑎𝑑𝑖𝑜𝑔𝑟𝑎𝑝ℎ𝑜𝑓𝑑𝑖𝑠𝑒𝑎𝑠𝑒 = 𝑑𝑖𝑠𝑒𝑎𝑠𝑒 , 𝑔 ∶ 𝑖𝑛𝑣𝑒𝑟𝑠𝑒 𝑡𝑟𝑢𝑡ℎ 𝑓𝑢𝑛𝑐𝑡𝑖𝑜𝑛
How does deep learning work?

• Ideal World
• Human Approximation
• ℎ 𝑟𝑎𝑑𝑖𝑜𝑔𝑟𝑎𝑝ℎ = 𝑑𝑖𝑠𝑒𝑎𝑠𝑒

• Ideal World
• Human Approximation
• ℎ 𝑟𝑎𝑑𝑖𝑜𝑔𝑟𝑎𝑝ℎ = 𝑑𝑖𝑠𝑒𝑎𝑠𝑒
• Machine learning
• 𝑚 𝑟𝑎𝑑𝑖𝑜𝑔𝑟𝑎𝑝ℎ = 𝑑𝑖𝑠𝑒𝑎𝑠𝑒
• m begins in random state
• m is update by error backpropagation

• Classification
• Detection
• Segmentation
General rules for evaluating different tasks under different
scenarios

• Classification
• Accuracy may be biased when data
imbalanced
• F1-score or AUC will be better.
• Detection
• Segmentation
Normal Event Guessing
Accuracy
5 1 83%
50 1 98%
5000 1 99.98%
scenarios

• Classification
• Detection
• mean Average Precision (mAP)
• Evaluate precision under different
thresholds across classes
• Segmentation
scenarios

• Classification
• Detection
• Segmentation
• Intersection-of-Union
scenarios

How Does Deep Learning Differ From Traditional Image Processing
Method
or ?

Method
• Traditional Image Processing
• Cell contour segmentation
through thresholding
• Nuclear segmentation through
thresholding
• Calculating N/C ratio
• Set a threshold for
classification
or ?

• Deep Learning
• Feed deep neural
network 5000 images of
lymphocyte and 5000
images of segmented
neutrophils
Method
or ?
thresholding
classification

Deep Learning Performance Benchmarks
image source: cs231n

Our Experiences of Medical Image AI

The Process of Making a Medical Image AI
Collecting Data &
Get Annotated Labels
Data Cleaning
Deploy
UI/UX
Identify Goals
Build and Train Model
Evaluate model
performance
Visualization & Error
analysis

• How did we come up with the project
• Challenges
Several use cases to discuss

How to Pickup Projects/Issues
What kinds of problems
can ML solve today

How to Pickup Projects/Issues
What kinds of problems
can ML solve today
Anything you are
writing rules for today!

• Deep Learning
• Feed deep neural
network 5000 images of
lymphocyte and 5000
images of segmented
neutrophils
Method
or ?
thresholding
classification
These are
R U L E S !!

Before we dive into case studies …
What is Digital Pathology?

Case1
Nasopharyngeal carcinoma
screening
Credit: cparrarojas
100,000
pixels
80,000 pixels
512 pixels
512 pixels

• Background
• Adoption of digital pathology is slow because benefit is not clear
• AI-powered diagnostic support may be the key
• Goal
• To train deep neural networks to recognize cancer cells in nasal
biopsy
Nasopharyngeal carcinoma screening

• Hospital : Chang Gung Memorial Hospital
• Physicians : ~6 physicians involved
• 720 whole slide images
• Digital slide scanner
• aetherAI cloud platform for image viewing and annotation
• Compute hardware for deep neural network training
• 8 * Geforce 1080 Ti
Resource required

Annotation for digital pathology : > 1 hour/slide

Two-Level AI Model for
Cancer Detection on
Whole Slide Image
Patch-level model (>10M Patches)
Background, Benign, Cancer
Performance
- Accuracy: 98%
- AUC: 0.99
Slide-level model
260 Training, 100 Testing
Performance
- Accuracy: 97%
- AUC: 0.98
Benign or NPC ?
Ground Truth : Cancer, Normal Tissue
Shadowed area : Cancer predicted by AI

• Lymphoid tissue recognized as cancer
Error Analysis
background
benign
npc

Iteration of the model
Real Image Prediction
by Model V1
Prediction
by Model V2

AI-Assisted Digital Pathology – Lesion Highlighting

How AI is Helping Pathologists with Challenging Cases

Integration with our platform：
AI-Powered Target Navigation
(Quick Examination)

Digital Pathology AI Modules
97% Accuracy
Integration with our platform：
AI-Powered Calculation of Tumor Purity
(Quantification)
Tumor Purity : 80%
Region of Interest (1)

• Annotation is very time-consuming
• Error analysis requires deep understanding of histopathology
• Integration with workflow requires careful design
• Image quality differences among institutes
Main Challenges

Image quality differences among institutes

Image quality differences among institutes
Before retraining
After retraining

Case2
Differential Counting of Bone
Marrow Smear
Source: LVIS

• Background:
• Differential counting of bone marrow smear is cumbersome
• Experts are in shortage, detailed counting often not performed
• Materials and Methods:
• Bone marrow smear slides from 1000 patients at NTUH
• Images taken at 1000X
• 500,000 cells to be annotated
• Mask-RCNN based cell detection and classification model
Differential Counting of Bone Marrow Smear

Methodology
• Mask R-CNN (state-of-the-art on most of instance segmentation
tasks)

Effect of Dataset Size on Precision
Class number of
data before
1/27
Precision
before 1/27
Plasma Cell 2,398 0.8
Segmented-neutrophil 2,635 0.47
Mature-limphocyte 4,087 0.37
Neutrophilic-band 1,226 0.25
Orthochromatic-
erythroblast
1,158 0.59
Eosinophils-and-
precursors
895 0.0
Polychromatophilic-
erythroblast
2,352 0.1

Effect of Dataset Size on Precision
More Data, More Benefit!
Class number of
data before
1/27
Precision
before 1/27
New labeled number of
data on 3/6
Precision on 3/6
Plasma Cell 2,398 0.8 3,566 (+48.7%) 0.93 (+16.3%)
Segmented-neutrophil 2,635 0.47 4,693 (+78.1%) 1.0 (+112.8%)
Mature-limphocyte 4,087 0.37 5,344 (+30.8%) 0.71 (+91.9%)
Neutrophilic-band 1,226 0.25 2,636 (+115%) 0.66 (+164%)
Orthochromatic-
erythroblast
1,158 0.59 1,600 (+38.2%) 0.66 (+11.9%)
Eosinophils-and-
precursors
895 0.0 1,229 (+37.3%) 0.5 (+50%)
Polychromatophilic-
erythroblast
2,352 0.1 4,502 (+91.4%) 0.3 (+200%)

• Annotation is very difficult ( and time-consuming )
• Computation load (20M pixels, large model)
• Unexpected imaging conditions when landing
• Integration of workflow
Main Challenges

• Model of microscopes
• Lightness
• Lens
• Exposure
• …
• Staining quality across
different institutes
• Different habits between
algorithms and human
• Counting in crowded
views
The landing challenge

The full scope
Slide
Microscope
Raw image
Machine
Learning
Algorithm
AI results

The full scope
Slide
Microscope
Raw image
Machine
Learning
Algorithm
AI results
- Which views are best
- Lens focusing
- Lens switching
…

Other Challenges of Medical Image AI

Variation is too much and
annotation is too time-consuming
>1 hr/slide

>1 hr/slide
Variation is too much and
annotation is too time-consuming

• Input size: 10000 x 10000 x 3 (RGB)
• Model : Residual Networks
• Training set : 780 images (357 NPC, 423 Benign)
• Validation set size: 68 images (32 NPC, 36 Benign)
• Hardware : QuantaGrid D52G nodes on Taiwania 2
Supercomputer, 8 Tesla V100(32gb) and 768 Gb system memory
per node
• With batch size = 1, 360 Gb system memory is used for training
through Unified Memory
• Each update takes 2.5 minutes.
Using images of entire specimen to train CNN
a.k.a. the no-fuss approach

Comparison of the two approaches
Patch-level model
No-fuss model
Classificatio
n probability
Grad-
CAM
output

Slide-Level Prediction Testset Performance
True vs False Positive
Precision-Recall
No-fuss
model
Two-stage
model
True
Positive
Precision
Recall False Positive
True
Positive
Precision

Result - Multi Nodes
● Up to 274X speedup is achieved by running 32 nodes, compared to non-
optimized 1-GPU training.
● Iteration of the model： from few weeks  few days

Impact of AI on Medical Imaging

Impact of AI On Medical Image Analysis
• Laborious manual tasks may be taken over by AI
• Results will be available faster, 24 hours / 7 days
• Improved and constant quality of medical image analysis
• Provide supportive medical services to rural areas -- universal
access to expert diagnosis
• Medical care will be improved with new insights brought by AI

數位病理 AI – 實現精準醫療
Thank you
有其他問題，歡迎提出
seanyu@aetherai.com

Practical aspects of medical image ai for hospital (IRB course)

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Practical aspects of medical image ai for hospital (IRB course)

Similar to Practical aspects of medical image ai for hospital (IRB course) (20)

More from Sean Yu

More from Sean Yu (9)

Recently uploaded

Recently uploaded (20)

Practical aspects of medical image ai for hospital (IRB course)

Editor's Notes