SlideShare a Scribd company logo
1 of 51
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicerDIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Preparing Digital Slides for
Machine Learning Experiments
Preparing Digital Slides for
Machine Learning Experiments
Dima Lituiev, PhD
University of California, San Francisco
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Acknowledgments
 Bakar Computational Health
Sciences Institute, UCSF
 Dexter Hadley
 Sung Jik Cha
 UC Berkeley
 Ryan Chen
 UCSF Pathology
 Zoltan Laszik
 Dejan Dobi
 Aaron Chin
 Eliah Shamir
 Yunn-Yi Chen
 UCSF Radiation Oncology
 Catherine Park
 Vasant Kearney
 Stathis Gennatas
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
This talk is right for you if…
You'd like to learn how to apply deep learning to
your pathology data
You have digitized slides (or use public datasets)
You have some experience in coding
(or have colleagues who do it for you)
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Educational Goals
 [know and reason about] categories of machine learning and
computer vision tasks applied to digital pathology
 [be able to] choose which format to use
depending on your task
 [be able to] prepare digital slides for classification,
segmentation, and object detection using open-source tools in
Python
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Please see the Github repository
for summary
github.com/DSLituiev/slideslicer
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Motivation & Intro
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Why?
Improve pathology diagnostics
Recognize, outline, or count morphological
structures and pathological changes in digital
slides
Train machine learning algorithms
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Machine Learning for Digital Pathology
Learning from
pathology notes:
Natural Language Processing
Learning from slides:
Computer Vision
https://www.pinterest.ch/pin/504684701962620102/www.wikipedia.org
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
fibroadenoma
atypical lobular hyperplasia
calcifications
DCIS
LCIS
invasive breast cancer
Using NLP to mine pathology labels
Right breast, core needle
biopsy:
1. Focal atypical lobular
hyperplasia.
2. Hyalinized fibroadenoma
with associated
microcalcifications.
Dx: Set of labels:
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
fibroadenoma
atypical lobular hyperplasia
calcifications
DCIS
LCIS
invasive breast cancer
Using NLP to mine pathology labels
Right breast, core needle
biopsy:
1. Focal atypical lobular
hyperplasia.
2. Hyalinized fibroadenoma
with associated
microcalcifications.
Dx: Set of labels:
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
fibroadenoma
atypical lobular hyperplasia
calcifications
DCIS
LCIS
invasive breast cancer
Using NLP to mine pathology labels
Right breast, core needle
biopsy:
1. Focal atypical lobular
hyperplasia.
2. Hyalinized fibroadenoma
with associated
microcalcifications.
Dx: Set of labels:
Tools:
• tokenizers
• medical ontologies (e.g. UMSL)
• approaches: bag-of-words, n-grams, sequential
• classifiers: FastText, GBM, SVM, Logistic Regression
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Using NLP to mine pathology labels
Right breast, core needle
biopsy:
1. Focal atypical lobular
hyperplasia.
2. Hyalinized fibroadenoma
with associated
microcalcifications.
Dx:
fibroadenoma
atypical lobular hyperplasia
calcifications
DCIS
LCIS
invasive breast cancer
Set of labels:
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Computer Vision Tasks in Digital Pathology
Classification
assign
a categorical label to
each image
acute
rejection
normal
Regression
EGFR: 70
assign
a numeric value
to each image
One label per-slide
 Whole-slide images don't fit
into regular 2018 AD GPU
memory, thus image needs
to be fed in small patches
 Signal predictive of the
target is often concentrated
in small areas
 Requires weak supervision
techniques to guess from
which patch the signal is
coming
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Computer Vision Tasks in Digital Pathology
Segmentation,
Object detection
detect structural
elements
Potentially multiple
contours per slide
Classification
assign
a categorical label to
each image
acute
rejection
normal
Regression
EGFR: 70
assign
a numeric value
to each image
One label per-slide
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Semantic
Segmentation
Object
Detection and
Localization
Image / Patch
Classification
glomerulus
tubuli
assign
a categorical label to
each image
provide bounding
boxes and labels of
contained objects
provide pixel-level
labels
Computer Vision Tasks in Digital Pathology
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Manual slide annotation
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Manual Annotation (SVS format)
Screenshot: Aperio
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Manual Annotation
Screenshot: Aperio
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Manual Annotation
Screenshot: Aperio
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Annotations are stored as an XML file
Screenshot:
annotation XML file
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Image preprocessing
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Why do we need to preprocess slides?
 Whole-slide images don't fit into GPU memory (as of 2018AD)
 Slide images have to be chunked into smaller pieces
 Image annotations (contours) have to be sliced in same way
 Whole-slide imaging is very sparse
(tissue occupies only 5 – 10% of the slide for needle biopsy)
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Technical Tasks
loc: 7 800; 11 485
size: 1024 x 1024
loc: 0; 0
size:256 x 256
 Tissue vs background?
 How to sample it
efficiently?
 How to handle ROIs?
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Step 1: Know what your ML model needs
 Which file format? (png, jpeg, tiff etc)
 What dimensions? (399x399, 256x256, variable dimensions)
 What file/folder structure:
 image folder per each class (classification)
 paired images and masks (segmentation)
 MS-COCO format (object detection)
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Step 2: choose tools
XPath -- working with XML
shapely -- intersecting contours
opencv -- general purpose classical CV
PIL -- light-weight Python CV toolbox
 -- reading slides
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Data Preparation with slideslicer
 Reading annotated digital pathology slides
 Automated annotation of tissue vs background
 Splitting ~300Mb slides into smaller patches
suitable for training machine learning algorithms
 Extras:
slide de-identification
dataset splitting (train, test, val)
resizing/subsampling
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
 Read SVS with OpenSlide
 Read annotation
 Save annotations as a json file
Step 3: Reading slides and annotations
XPath
>>> fnsvs = "some_pathology_slide.svs"
>>> slide = openslide.OpenSlide(fnsvs)
>>> rreader = RoiReader(fnsvs)
>>> rreader.save('my_annotation.json')
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Step 3: Reading slides and annotations
 Inspect annotations as a pandas table:
>>> rreader.df
id name area length
1 infl 1729228.5 8163.4
2 open glom 406998.5 2475.8
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
 Inspect annotations as a pandas table:
Step 3: Reading slides and annotations
>>> rreader.df
id name area length
1 infl 1729228.5 8163.4
2 open glom 406998.5 2475.8
>>> rreader.plot(labels=False)
>>> plt.legend(loc='center left',
bbox_to_anchor=(1, 0.5))
 Visualize ROIs:
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Step 4: Segment tissue vs background
>>> rreader = RoiReader(fnsvs,
threshold_tissue=True,
save=True)
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Step 4: Segment tissue vs background
>>> rreader = RoiReader(fnsvs,
threshold_tissue=True,
save=True)
NB: tissue pieces
with no annotations
in them are discarded
by default
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Step 5: Sample patches
 Challenge: most of the slide is blank
(no tissue)
 Need to select only points that contain tissue
(& maybe very few blank patches)
 Naïve sampling produces many empty patches and is costly
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Optimized point sampling
for needle biopsy
Finding the tightest bounding box for efficient sampling
with opencv and shapely packages
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Optimized point sampling
>>> sample_points(contour,
n_points=1000,
# spacing=512,
mode='uniform_random')
# mode='grid')
Timing: O(n)
70 μs/point
700 ms/10,000 points
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Step 5: Read an arbitrary patch with ROIs
Read ROIs from XML
>>> rreader = RoiReader(xml_filename)
>>> points = sample_points(contour,1000)
>>> xc, yc = points[0]
>>> fig, ax, region, rois = 
rreader.plot_patch(xc, yc, 1024,
subsample=4)
Read and visualize a patch
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Step 5: Read an arbitrary patch with ROIs
>>> region = rreader.read_patch(
xc, yc, 1024, scale=4)
Read an image patch
Read matching ROIs for the patch
>>> patch_rois = 
rreader.get_patch_rois(
xc, yc, 1024, scale=4,
cocorle=True, translate=True)
source patch
size=10242 pix
down-sample the patch by
a factor of 4
(resulting size is 2562 pix)
translate coordinates
so that upper left
corner is (x=0, y=0)
produce an
MS-COCO
RLE encoding
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Step 5: Read an arbitrary patch with ROIs
 Read matching ROIs for the patch
>>> patch_rois = 
rreader.get_patch_rois(
xc, yc, 1024, scale=4,
cocorle=True, translate=True)
 Convert ROIs to MS-COCO formatted dictionary or JSON
>>> patch_rois.to_dict()
>>> patch_rois.to_json()
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Tiling and Subsampling
Slice a ~40x40K image into ingestible bites
Subsample image patches and ROIs
$ python3 sample_from_slide.py 
--target-side 1024 
--data-root "$OUTPUT_DIR" 
"$XML"
$ DATADIR="/data/data_1024/all"
$ FACTOR=2
$ python3 subsample.py $DATADIR $FACTOR
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Region of Interest (ROI) formats
Contour Vertices
One-hot mask
Integer mask
Run-length encoding (RLE) mask
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
ROI formats
Channels of a One-Hot Binary Mask
Original Vertices Integer Mask
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Run-length encoding (RLE)
+10
+60
+30
10, 60, 30+ …
RLE
Count # of pixels between ROI boundaries in a flattened image
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Run-length encoding (RLE)
+10
+60
+30
10, 60, 50, 60, 20, …
+60
RLE
+20
+20
ASCII byte encoded RLE:
'lbe5<b?5K3O2M2N4...'
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Region of Interest (ROI) formats
 Contour Vertices
compact, slow to convert to mask
 One-hot mask
easy to ingest, hard to visualize
 Integer mask
easy to ingest, easy to visualize
 RLE mask
compact, fast to convert
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Formatting slides into MS-COCO format
 Why MS-COCO format?
 A standard dataset for object
detection tasks with its
associated standard format
 A number of open-source tools
accept MS-COCO format as
input
{'annotations':[...],
'images' :[...],
'type' :[...],
'categories' :[...],
'info' :[...]}
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Formatting slides into MS-COCO format
$ XML="~/Documents/some_dcis_slide.xml"
$ COCODIR="~/Documents/coco_patches_dcis/"
$ python3 sample_patches_lowres_coco.py 
--rle  # include RLE mask
--out-root "$COCODIR"  # output root folder
--target-side 512  # output size (pixels)
--magnlevel 2  # magnification 4^n
$XML # slide annotation path
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Structure of MS-COCO dataset: JSON file
In [2]: coco['images'][0]
{'file_name': '0977c-x28-
y1020.png',
'height': 512,
'width': 512,
'id': 15,
'location-x': 28326,
'location-y': 10204,
'slide_name': '0977c.svs',
'set': 'train'}
In [3]: coco['annotations'][0]
{'area': 1661.5,
'bbox': [363.0, 75.0, 44.0, 50.0],
'category_id': 1,
'category_name': 'glom',
'counts': 'lbe5<b?5K3O2M2N4...',
'size': [512, 512],
'id’: 124,
'image_id': 15,
'iscrowd': 0,
'segmentation': [[383,75,383,75,...]],
'set': 'train',
'slide_name': '0977c.svs'}
In [1]: coco.keys()
['annotations',
'images',
'type',
'categories',
'info']
Images and annotations are linked by:
images.id <-> annotations.image_id
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Structure of MS-COCO dataset: JSON file
In [2]: coco['images'][0]
{'file_name': '0977c-x28-
y1020.png',
'height': 512,
'width': 512,
'id': 15,
'location-x': 28326,
'location-y': 10204,
'slide_name': '0977c.svs',
'set': 'train'}
In [3]: coco['annotations'][0]
{'area': 1661.5,
'bbox': [363.0, 75.0, 44.0, 50.0],
'category_id': 1,
'category_name': 'glom',
'counts': 'lbe5<b?5K3O2M2N4...',
'size': [512, 512],
'id’: 124,
'image_id': 15,
'iscrowd': 0,
'segmentation': [[383,75,383,75,...]],
'set': 'train',
'slide_name': '0977c.svs'}
In [1]: coco.keys()
['annotations',
'images',
'type',
'categories',
'info']
slideslicer's custom fields to track
location of a patch within the slide
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Converting between formats
 Contour -> Mask
 Mask -> Contour
 Mask <-> MS-COCO RLE
>>> mask = convert_contour2mask(contour)
>>> contour = convert_mask2contour(mask)
>>> from pycocotools.mask import encode, decode
>>> coco_rle = encode(mask)
>>> mask = encode(coco_rle)
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Conclusion and Further Considerations
 Check if you have labels already
 If yes – find an automated way to extract them
 if not -- create your own
 Know what format you need for your downstream application
 Work on communication between your clinical and computational collaborators.
 Know what matters and what is possible (sometimes you wouldn't dream of it!)
 Take breaks and drink water
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Download and install
https://github.com/DSLituiev/slideslicer
Check also tools for training keras models:
https://github.com/DSLituiev/kerastrainutils
Image augmentation toolset:
https://github.com/aleju/imgaug
Connect
DSLituiev
@DimaLituiev
DIGITALPATHOLOGYASSOCIATION.ORG #PATHVISIONgithub.com/DSLituiev/slideslicer
Thank you!

More Related Content

What's hot

Minarcik robbins 2013_ch14-wbc
Minarcik robbins 2013_ch14-wbcMinarcik robbins 2013_ch14-wbc
Minarcik robbins 2013_ch14-wbc
Elsa von Licy
 
Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)
butest
 

What's hot (20)

Introduction to Grad-CAM (complete version)
Introduction to Grad-CAM (complete version)Introduction to Grad-CAM (complete version)
Introduction to Grad-CAM (complete version)
 
Lymphoid malignancies
Lymphoid malignanciesLymphoid malignancies
Lymphoid malignancies
 
Artificial Intelligence in pathology
Artificial Intelligence in pathologyArtificial Intelligence in pathology
Artificial Intelligence in pathology
 
AI and whole slide imaging biomarkers
AI and whole slide imaging biomarkersAI and whole slide imaging biomarkers
AI and whole slide imaging biomarkers
 
Minarcik robbins 2013_ch14-wbc
Minarcik robbins 2013_ch14-wbcMinarcik robbins 2013_ch14-wbc
Minarcik robbins 2013_ch14-wbc
 
WHO BRAIN TUMOR CLASSIFICATION 5th EDITION
WHO BRAIN TUMOR CLASSIFICATION 5th EDITIONWHO BRAIN TUMOR CLASSIFICATION 5th EDITION
WHO BRAIN TUMOR CLASSIFICATION 5th EDITION
 
Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)
 
Medical image processing studies
Medical image processing studiesMedical image processing studies
Medical image processing studies
 
Tumors of lung seminar dr. swarupa
Tumors of lung seminar dr. swarupaTumors of lung seminar dr. swarupa
Tumors of lung seminar dr. swarupa
 
PR-231: A Simple Framework for Contrastive Learning of Visual Representations
PR-231: A Simple Framework for Contrastive Learning of Visual RepresentationsPR-231: A Simple Framework for Contrastive Learning of Visual Representations
PR-231: A Simple Framework for Contrastive Learning of Visual Representations
 
Deep learning health care
Deep learning health care  Deep learning health care
Deep learning health care
 
Analysis and Prediction of Diabetes Diseases using Machine Learning Algorithm...
Analysis and Prediction of Diabetes Diseases using Machine Learning Algorithm...Analysis and Prediction of Diabetes Diseases using Machine Learning Algorithm...
Analysis and Prediction of Diabetes Diseases using Machine Learning Algorithm...
 
AI IN PATH final PPT.pptx
AI IN PATH final PPT.pptxAI IN PATH final PPT.pptx
AI IN PATH final PPT.pptx
 
DEEP LEARNING IN HEALTHCARE
DEEP LEARNING IN HEALTHCAREDEEP LEARNING IN HEALTHCARE
DEEP LEARNING IN HEALTHCARE
 
Postmortem CT (PMCT)
Postmortem CT (PMCT)Postmortem CT (PMCT)
Postmortem CT (PMCT)
 
Deep learning - Conceptual understanding and applications
Deep learning - Conceptual understanding and applicationsDeep learning - Conceptual understanding and applications
Deep learning - Conceptual understanding and applications
 
AI and Healthcare 2023.pdf
AI and Healthcare 2023.pdfAI and Healthcare 2023.pdf
AI and Healthcare 2023.pdf
 
Explainable AI
Explainable AIExplainable AI
Explainable AI
 
X-linked adrenoleukodystrophy: Radiological assessment
X-linked adrenoleukodystrophy: Radiological assessmentX-linked adrenoleukodystrophy: Radiological assessment
X-linked adrenoleukodystrophy: Radiological assessment
 
Yapay Zeka Güvenliği : Machine Learning & Deep Learning & Computer Vision Sec...
Yapay Zeka Güvenliği : Machine Learning & Deep Learning & Computer Vision Sec...Yapay Zeka Güvenliği : Machine Learning & Deep Learning & Computer Vision Sec...
Yapay Zeka Güvenliği : Machine Learning & Deep Learning & Computer Vision Sec...
 

Similar to Preparing Pathology WSI data for Machine Learning Experiments

Infrastructure for digital pathology in a tissue bank
Infrastructure for digital pathology in a tissue bankInfrastructure for digital pathology in a tissue bank
Infrastructure for digital pathology in a tissue bank
BIT002
 

Similar to Preparing Pathology WSI data for Machine Learning Experiments (20)

Data Management
Data ManagementData Management
Data Management
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
 
2015 genome-center
2015 genome-center2015 genome-center
2015 genome-center
 
2014 aus-agta
2014 aus-agta2014 aus-agta
2014 aus-agta
 
Building a Biomedical Knowledge Garden
Building a Biomedical Knowledge Garden Building a Biomedical Knowledge Garden
Building a Biomedical Knowledge Garden
 
Infrastructure for digital pathology in a tissue bank
Infrastructure for digital pathology in a tissue bankInfrastructure for digital pathology in a tissue bank
Infrastructure for digital pathology in a tissue bank
 
2014-10-10-SBC361-Reproducible research
2014-10-10-SBC361-Reproducible research2014-10-10-SBC361-Reproducible research
2014-10-10-SBC361-Reproducible research
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
Efficient Data Labelling for Ocular Imaging
Efficient Data Labelling for Ocular ImagingEfficient Data Labelling for Ocular Imaging
Efficient Data Labelling for Ocular Imaging
 
Pine education-platform
Pine education-platformPine education-platform
Pine education-platform
 
2015 aem-grs-keynote
2015 aem-grs-keynote2015 aem-grs-keynote
2015 aem-grs-keynote
 
A Workflow-Driven Discovery and Training Ecosystem for Distributed Analysis o...
A Workflow-Driven Discovery and Training Ecosystem for Distributed Analysis o...A Workflow-Driven Discovery and Training Ecosystem for Distributed Analysis o...
A Workflow-Driven Discovery and Training Ecosystem for Distributed Analysis o...
 
Overview of Next Gen Sequencing Data Analysis
Overview of Next Gen Sequencing Data AnalysisOverview of Next Gen Sequencing Data Analysis
Overview of Next Gen Sequencing Data Analysis
 
Converged IT and Data Commons
Converged IT and Data CommonsConverged IT and Data Commons
Converged IT and Data Commons
 
Cheminformatics Software Development: Case Studies
Cheminformatics Software Development: Case StudiesCheminformatics Software Development: Case Studies
Cheminformatics Software Development: Case Studies
 
PRISM Project Update
PRISM Project UpdatePRISM Project Update
PRISM Project Update
 
Cloud Accelerated Genomics
Cloud Accelerated GenomicsCloud Accelerated Genomics
Cloud Accelerated Genomics
 
20170315 Cloud Accelerated Genomics - Tel Aviv / Phoenix
20170315 Cloud Accelerated Genomics - Tel Aviv / Phoenix20170315 Cloud Accelerated Genomics - Tel Aviv / Phoenix
20170315 Cloud Accelerated Genomics - Tel Aviv / Phoenix
 
MMDS 2014: Myria (and Scalable Graph Clustering with RelaxMap)
MMDS 2014: Myria (and Scalable Graph Clustering with RelaxMap)MMDS 2014: Myria (and Scalable Graph Clustering with RelaxMap)
MMDS 2014: Myria (and Scalable Graph Clustering with RelaxMap)
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 

Recently uploaded

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Recently uploaded (20)

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 

Preparing Pathology WSI data for Machine Learning Experiments

Editor's Notes

  1. Thresholding and morphologic smoothing Removal of small pieces Removal of tissue pieces with no ROI inside
  2. Thresholding and morphologic smoothing Removal of small pieces Removal of tissue pieces with no ROI inside