SlideShare a Scribd company logo
Journal Club Sep. 18, 2020 (Ryohei Suzuki)
J. R. Soc. Interface 16.158 (2019): 16:20190531
Medical image analysis 55 (2019): 1-14.
Topological data analysis (TDA)
Why TDA?
• TDA provides metric-invariant
summarization of complex and
high-dimensional data
cf. many normalization modes of RNA-seq
• TDA robustly handles the global
structure of data in intuitive way
Applications of TDA
• Material science (crystal structure)
• Network analysis
• Peak detection, etc.
Topology(位相幾何学)
= mathematical framework for
describing the “shape” of object
that is invariant with respect to
continuous deformation
Basic framework of topological analysis
Figure copied from https://www.wpi-aimr.tohoku.ac.jp/hiraoka_labo/introduction_j.pdf ← recommended reference!
# Connected
Components
# Rings
(1d-cycle)
# Hollows
(2d-cycle)
Persistent homology
Assuming the input to be a point set, observe the transition of topological
features of the complex given by connecting points with growing radius ε
Lifetime of individual
connected components
Lifetime of individual rings
Robust ring
structure
Called “barcode”
Birth of ring Death of ring
Persistent diagram
Scatter graph showing the
birth-(x-axis) and death-time (y-axis)
of topological components
→ representing the information of
global structure of the data
Further analysis
- Calculating summarized values
e.g. sum of the cycle length SLk
- Classification of diagrams as images
Figure from https://www.pnas.org/content/113/26/7035
Robust ring
Transient
rings
Goal: discover the transcriptomic characteristics of ASD patients’ brains
• ASD is known to be highly heritable, but no key genetic variant contributing to the disease
is found. Rather, >100 genes are considered to contribute to the risk.
• Several studies have showed transcriptomic differences e.g., the downregulation of
neuronal synaptic genes and the upregulation of immune genes in ASD patients
• More comprehensive study is required to understand the disease
Approach: directly apply persistent homology to expression data
• To see the inter-patient and inter-gene geometries of ASD/healthy groups
Patient-space
Densely-packed topology
= patients have similar expression profiles
Sparsely-packed topology
= patients have heterogeneous expression
Dataset and study overview
Datasets
• Dataset 1: microarray (9934 genes, 29 ASD / 29 control) [1], log2-transformed
• Dataset 2: RNA-seq (22399 genes, 82 ASD / 82 control) [2], RPKM & log2-transformed
Procedure
• Calculate the inter-sample and inter-gene
distance matrices for ASD/control expression
• Dissimilarity measure: 1-r (r=Pearson correlation)
• Compute the persistent diagrams
• Derive the summary values
• SDT0
= sum of death times of connected components.
• Euler characteristics = SL0 – SL1 + SL2
※SLk is sum of lifespan of connected components (k=0), rings (k=1), hollows (k=2).
[1] Voineagu et al., (2011) Nature 474, 380-384 [2] Parikshak et al., (2016) Nature 540, 423-427
Sample 1 Sample 2 Sample 3
Gene 1 0.01 0.52 …
Gene 2 0.25
Gene 3 …
Inter-sample
Inter-gene
Results (inter-patient)
Dataset1
(Microarray)
Dataset2
(RNA-seq)
ASD-PD Control-PD diff SDT0 diff Euler
Random
permutation
distribution
ASD vs.
control
p=0.00017 p=0.00024
p=0.011 p=0.012
Conclusion:
ASD group have more
heterogeneous expression
profiles than control group
Results (inter-gene)
p=0.316 p=0.403
p=0.998 p=0.997
ASD-PD Control-PD diff SDT0 diff Euler
Author’s conclusion:
ASD/healthy groups don’t have
significant difference in their
transcriptomic organization
Insignificant??
Dense topology
→ expression of
genes correlate well
among samples
Sparse topology
→ less correlation
Goal: fast tumor-region segmentation on WSI of colorectal cancer (CRC)
• CRC is the third/second most diagnosed cancer in males/females
• Fast automatic detection of possible tumor regions is vital for clinical use
• CNN-based methods are actively studied, but suffer from computational costs
Approach: use PH-inspired feature to classify patches
• PH of image pixels is calculated via thresholding
• Birth/death time distribution is used as feature
• Comparison of the feature with ~100 exemplars
provides very fast classification model
Connecting pixels by thresholding
• Common way to calculate persistent homology for 2D image data
• By lowering the threshold, connected components advent and vanish
(merge) one after another.
Left image from: https://www.nature.com/articles/s41598-018-36798-y
Persistent homology profiles (PHP)
• From the thresholding result, probability distributions of birth/death-
time called PHP are constructed (green lines)
• These distributions are
treated as feature vectors
• By comparing PHP of
input data with those of
exemplar T/N images,
fast classification can be
performed.
birth death
tumor
mean
normal
mean
PHP
Exemplar selection using CNN activation
• Training dataset contains ~100000 patches
→ we should compare the PHP of input with some representative values
• Improper selection of exemplars
causes overfitting to significant
texture patterns
• Authors proposes a CNN-based
selection strategy where patches
with various feature activation
are equally respected Select k exemplars from
each bin of activation
strength
(highest 1/Q ~ lowest 1/Q)
Quantitative classification results
• Proposed algorithm outperforms existing
methods in terms of F1-score in two
distinct dataset
• Generalization has room for improvement,
but best among the tested methods
• Why good? → PHP efficiently captures
connectivity between cells in rotation-
invariant way, which is difficult for convnets
Qualitative segmentation results
Comments
• Comparison to the recent deep encoder-decoder models was not conducted
• Batch effects (e.g., contrast) may significantly influence the calculation of PHP
Reflection
• (+) Persistent homology provides unique information about the global
structure of the dataset, which is difficult to calculate in raw-data space,
which would be useful for very high-dimensional data with large noise
• (-) Persistent homology only provides highly summarized statistics,
discarding the information about contributions of individual data points,
e.g., which gene set is contributing in ASD patients.
• Combination with CNNs, which perform very good at discovering local
features, seems to be a promising idea for image analysis.

More Related Content

What's hot

Whale optimization mirjalili
Whale optimization mirjaliliWhale optimization mirjalili
Whale optimization mirjalili
Prashant Kumar
 
Basics of bioinformatics
Basics of bioinformaticsBasics of bioinformatics
Basics of bioinformaticsAbhishek Vatsa
 
Performance Evaluation of Different Data Mining Classification Algorithm and ...
Performance Evaluation of Different Data Mining Classification Algorithm and ...Performance Evaluation of Different Data Mining Classification Algorithm and ...
Performance Evaluation of Different Data Mining Classification Algorithm and ...
IOSR Journals
 
Extensive Analysis on Generation and Consensus Mechanisms of Clustering Ensem...
Extensive Analysis on Generation and Consensus Mechanisms of Clustering Ensem...Extensive Analysis on Generation and Consensus Mechanisms of Clustering Ensem...
Extensive Analysis on Generation and Consensus Mechanisms of Clustering Ensem...
IJECEIAES
 
Protein Structure, Databases and Structural Alignment
Protein Structure, Databases and Structural AlignmentProtein Structure, Databases and Structural Alignment
Protein Structure, Databases and Structural Alignment
Saramita De Chakravarti
 
Edbt2014 talk
Edbt2014 talkEdbt2014 talk
Edbt2014 talk
Khalid Belhajjame
 
Automatic Feature Subset Selection using Genetic Algorithm for Clustering
Automatic Feature Subset Selection using Genetic Algorithm for ClusteringAutomatic Feature Subset Selection using Genetic Algorithm for Clustering
Automatic Feature Subset Selection using Genetic Algorithm for Clustering
idescitation
 
Standardization of the HIPC Data Templates: The Story So Far
Standardization of the HIPC Data Templates: The Story So FarStandardization of the HIPC Data Templates: The Story So Far
Standardization of the HIPC Data Templates: The Story So Far
Ahmad C. Bukhari
 
Delineation of techniques to implement on the enhanced proposed model using d...
Delineation of techniques to implement on the enhanced proposed model using d...Delineation of techniques to implement on the enhanced proposed model using d...
Delineation of techniques to implement on the enhanced proposed model using d...
ijdms
 
Seminar Slides
Seminar SlidesSeminar Slides
Seminar Slidespannicle
 
An improved teaching learning
An improved teaching learningAn improved teaching learning
An improved teaching learning
csandit
 
Iaetsd an efficient and large data base using subset selection algorithm
Iaetsd an efficient and large data base using subset selection algorithmIaetsd an efficient and large data base using subset selection algorithm
Iaetsd an efficient and large data base using subset selection algorithm
Iaetsd Iaetsd
 
SEQUENCE ANALYSIS
SEQUENCE ANALYSISSEQUENCE ANALYSIS
SEQUENCE ANALYSIS
prashant tripathi
 
Bioinformatics data mining
Bioinformatics data miningBioinformatics data mining
Bioinformatics data mining
Sangeeta Das
 
Discovery of Jumping Emerging Patterns Using Genetic Algorithm
Discovery of Jumping Emerging Patterns Using Genetic AlgorithmDiscovery of Jumping Emerging Patterns Using Genetic Algorithm
Discovery of Jumping Emerging Patterns Using Genetic Algorithm
IJCSIS Research Publications
 

What's hot (16)

Whale optimization mirjalili
Whale optimization mirjaliliWhale optimization mirjalili
Whale optimization mirjalili
 
Basics of bioinformatics
Basics of bioinformaticsBasics of bioinformatics
Basics of bioinformatics
 
Performance Evaluation of Different Data Mining Classification Algorithm and ...
Performance Evaluation of Different Data Mining Classification Algorithm and ...Performance Evaluation of Different Data Mining Classification Algorithm and ...
Performance Evaluation of Different Data Mining Classification Algorithm and ...
 
Extensive Analysis on Generation and Consensus Mechanisms of Clustering Ensem...
Extensive Analysis on Generation and Consensus Mechanisms of Clustering Ensem...Extensive Analysis on Generation and Consensus Mechanisms of Clustering Ensem...
Extensive Analysis on Generation and Consensus Mechanisms of Clustering Ensem...
 
Protein Structure, Databases and Structural Alignment
Protein Structure, Databases and Structural AlignmentProtein Structure, Databases and Structural Alignment
Protein Structure, Databases and Structural Alignment
 
Edbt2014 talk
Edbt2014 talkEdbt2014 talk
Edbt2014 talk
 
Automatic Feature Subset Selection using Genetic Algorithm for Clustering
Automatic Feature Subset Selection using Genetic Algorithm for ClusteringAutomatic Feature Subset Selection using Genetic Algorithm for Clustering
Automatic Feature Subset Selection using Genetic Algorithm for Clustering
 
Standardization of the HIPC Data Templates: The Story So Far
Standardization of the HIPC Data Templates: The Story So FarStandardization of the HIPC Data Templates: The Story So Far
Standardization of the HIPC Data Templates: The Story So Far
 
Deliverable_5.1.2
Deliverable_5.1.2Deliverable_5.1.2
Deliverable_5.1.2
 
Delineation of techniques to implement on the enhanced proposed model using d...
Delineation of techniques to implement on the enhanced proposed model using d...Delineation of techniques to implement on the enhanced proposed model using d...
Delineation of techniques to implement on the enhanced proposed model using d...
 
Seminar Slides
Seminar SlidesSeminar Slides
Seminar Slides
 
An improved teaching learning
An improved teaching learningAn improved teaching learning
An improved teaching learning
 
Iaetsd an efficient and large data base using subset selection algorithm
Iaetsd an efficient and large data base using subset selection algorithmIaetsd an efficient and large data base using subset selection algorithm
Iaetsd an efficient and large data base using subset selection algorithm
 
SEQUENCE ANALYSIS
SEQUENCE ANALYSISSEQUENCE ANALYSIS
SEQUENCE ANALYSIS
 
Bioinformatics data mining
Bioinformatics data miningBioinformatics data mining
Bioinformatics data mining
 
Discovery of Jumping Emerging Patterns Using Genetic Algorithm
Discovery of Jumping Emerging Patterns Using Genetic AlgorithmDiscovery of Jumping Emerging Patterns Using Genetic Algorithm
Discovery of Jumping Emerging Patterns Using Genetic Algorithm
 

Similar to Paper memo: persistent homology on biological problems

Yeasin
YeasinYeasin
Yeasin
Cody Behles
 
Unified Framework for Learning Representation from EEG Data
Unified Framework for Learning Representation from EEG DataUnified Framework for Learning Representation from EEG Data
Unified Framework for Learning Representation from EEG Data
FedEx Institute of Technology
 
Developmental Mega Sample: Exploring Inter-Individual Variation
Developmental Mega Sample: Exploring Inter-Individual VariationDevelopmental Mega Sample: Exploring Inter-Individual Variation
Developmental Mega Sample: Exploring Inter-Individual Variation
SaigeRutherford
 
Protein structure prediction with a focus on Rosetta
Protein structure prediction with a focus on RosettaProtein structure prediction with a focus on Rosetta
Protein structure prediction with a focus on Rosetta
Bioinformatics and Computational Biosciences Branch
 
NIST-JARVIS infrastructure for Improved Materials Design
NIST-JARVIS infrastructure for Improved Materials DesignNIST-JARVIS infrastructure for Improved Materials Design
NIST-JARVIS infrastructure for Improved Materials Design
KAMAL CHOUDHARY
 
Presentation 2007 Journal Club Azhar Ali Shah
Presentation 2007 Journal Club Azhar Ali ShahPresentation 2007 Journal Club Azhar Ali Shah
Presentation 2007 Journal Club Azhar Ali Shah
guest5de83e
 
An efficient fuzzy classifier with feature selection based
An efficient fuzzy classifier with feature selection basedAn efficient fuzzy classifier with feature selection based
An efficient fuzzy classifier with feature selection basedssairayousaf
 
Problems in Task Scheduling in Multiprocessor System
Problems in Task Scheduling in Multiprocessor SystemProblems in Task Scheduling in Multiprocessor System
Problems in Task Scheduling in Multiprocessor System
ijtsrd
 
Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)
Dmitry Grapov
 
Prote-OMIC Data Analysis and Visualization
Prote-OMIC Data Analysis and VisualizationProte-OMIC Data Analysis and Visualization
Prote-OMIC Data Analysis and Visualization
Dmitry Grapov
 
BPSO&1-NN algorithm-based variable selection for power system stability ident...
BPSO&1-NN algorithm-based variable selection for power system stability ident...BPSO&1-NN algorithm-based variable selection for power system stability ident...
BPSO&1-NN algorithm-based variable selection for power system stability ident...
IJAEMSJORNAL
 
Intelligent Controller Design for a Chemical Process
Intelligent Controller Design for a Chemical ProcessIntelligent Controller Design for a Chemical Process
Intelligent Controller Design for a Chemical Process
CSCJournals
 
16 siddareddy.bathini 13
16 siddareddy.bathini 1316 siddareddy.bathini 13
16 siddareddy.bathini 13
Alexander Decker
 
NS-CUK Joint Journal Club: Minwoo Choi, Review on "Spatio-Temporal Wind Speed...
NS-CUK Joint Journal Club: Minwoo Choi, Review on "Spatio-Temporal Wind Speed...NS-CUK Joint Journal Club: Minwoo Choi, Review on "Spatio-Temporal Wind Speed...
NS-CUK Joint Journal Club: Minwoo Choi, Review on "Spatio-Temporal Wind Speed...
ssuser4b1f48
 
Comparative Performance Analysis of Teen Sep Leach EAMMH and PEGASIS Routing ...
Comparative Performance Analysis of Teen Sep Leach EAMMH and PEGASIS Routing ...Comparative Performance Analysis of Teen Sep Leach EAMMH and PEGASIS Routing ...
Comparative Performance Analysis of Teen Sep Leach EAMMH and PEGASIS Routing ...
IRJET Journal
 
Implementing a neural network potential for exascale molecular dynamics
Implementing a neural network potential for exascale molecular dynamicsImplementing a neural network potential for exascale molecular dynamics
Implementing a neural network potential for exascale molecular dynamics
PFHub PFHub
 
Inverse Mixed-Solvent Molecular Dynamics for Visualization of Amino Acid Resi...
Inverse Mixed-Solvent Molecular Dynamics for Visualization of Amino Acid Resi...Inverse Mixed-Solvent Molecular Dynamics for Visualization of Amino Acid Resi...
Inverse Mixed-Solvent Molecular Dynamics for Visualization of Amino Acid Resi...
Keisuke Yanagisawa
 
High Dimensional Biological Data Analysis and Visualization
High Dimensional Biological Data Analysis and VisualizationHigh Dimensional Biological Data Analysis and Visualization
High Dimensional Biological Data Analysis and Visualization
Dmitry Grapov
 
Data analysis
Data analysisData analysis
Data analysis
amlbinder
 
Lecture 9 molecular descriptors
Lecture 9  molecular descriptorsLecture 9  molecular descriptors
Lecture 9 molecular descriptors
RAJAN ROLTA
 

Similar to Paper memo: persistent homology on biological problems (20)

Yeasin
YeasinYeasin
Yeasin
 
Unified Framework for Learning Representation from EEG Data
Unified Framework for Learning Representation from EEG DataUnified Framework for Learning Representation from EEG Data
Unified Framework for Learning Representation from EEG Data
 
Developmental Mega Sample: Exploring Inter-Individual Variation
Developmental Mega Sample: Exploring Inter-Individual VariationDevelopmental Mega Sample: Exploring Inter-Individual Variation
Developmental Mega Sample: Exploring Inter-Individual Variation
 
Protein structure prediction with a focus on Rosetta
Protein structure prediction with a focus on RosettaProtein structure prediction with a focus on Rosetta
Protein structure prediction with a focus on Rosetta
 
NIST-JARVIS infrastructure for Improved Materials Design
NIST-JARVIS infrastructure for Improved Materials DesignNIST-JARVIS infrastructure for Improved Materials Design
NIST-JARVIS infrastructure for Improved Materials Design
 
Presentation 2007 Journal Club Azhar Ali Shah
Presentation 2007 Journal Club Azhar Ali ShahPresentation 2007 Journal Club Azhar Ali Shah
Presentation 2007 Journal Club Azhar Ali Shah
 
An efficient fuzzy classifier with feature selection based
An efficient fuzzy classifier with feature selection basedAn efficient fuzzy classifier with feature selection based
An efficient fuzzy classifier with feature selection based
 
Problems in Task Scheduling in Multiprocessor System
Problems in Task Scheduling in Multiprocessor SystemProblems in Task Scheduling in Multiprocessor System
Problems in Task Scheduling in Multiprocessor System
 
Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)
 
Prote-OMIC Data Analysis and Visualization
Prote-OMIC Data Analysis and VisualizationProte-OMIC Data Analysis and Visualization
Prote-OMIC Data Analysis and Visualization
 
BPSO&1-NN algorithm-based variable selection for power system stability ident...
BPSO&1-NN algorithm-based variable selection for power system stability ident...BPSO&1-NN algorithm-based variable selection for power system stability ident...
BPSO&1-NN algorithm-based variable selection for power system stability ident...
 
Intelligent Controller Design for a Chemical Process
Intelligent Controller Design for a Chemical ProcessIntelligent Controller Design for a Chemical Process
Intelligent Controller Design for a Chemical Process
 
16 siddareddy.bathini 13
16 siddareddy.bathini 1316 siddareddy.bathini 13
16 siddareddy.bathini 13
 
NS-CUK Joint Journal Club: Minwoo Choi, Review on "Spatio-Temporal Wind Speed...
NS-CUK Joint Journal Club: Minwoo Choi, Review on "Spatio-Temporal Wind Speed...NS-CUK Joint Journal Club: Minwoo Choi, Review on "Spatio-Temporal Wind Speed...
NS-CUK Joint Journal Club: Minwoo Choi, Review on "Spatio-Temporal Wind Speed...
 
Comparative Performance Analysis of Teen Sep Leach EAMMH and PEGASIS Routing ...
Comparative Performance Analysis of Teen Sep Leach EAMMH and PEGASIS Routing ...Comparative Performance Analysis of Teen Sep Leach EAMMH and PEGASIS Routing ...
Comparative Performance Analysis of Teen Sep Leach EAMMH and PEGASIS Routing ...
 
Implementing a neural network potential for exascale molecular dynamics
Implementing a neural network potential for exascale molecular dynamicsImplementing a neural network potential for exascale molecular dynamics
Implementing a neural network potential for exascale molecular dynamics
 
Inverse Mixed-Solvent Molecular Dynamics for Visualization of Amino Acid Resi...
Inverse Mixed-Solvent Molecular Dynamics for Visualization of Amino Acid Resi...Inverse Mixed-Solvent Molecular Dynamics for Visualization of Amino Acid Resi...
Inverse Mixed-Solvent Molecular Dynamics for Visualization of Amino Acid Resi...
 
High Dimensional Biological Data Analysis and Visualization
High Dimensional Biological Data Analysis and VisualizationHigh Dimensional Biological Data Analysis and Visualization
High Dimensional Biological Data Analysis and Visualization
 
Data analysis
Data analysisData analysis
Data analysis
 
Lecture 9 molecular descriptors
Lecture 9  molecular descriptorsLecture 9  molecular descriptors
Lecture 9 molecular descriptors
 

More from Ryohei Suzuki

Transformer based approaches for visual representation learning
Transformer based approaches for visual representation learningTransformer based approaches for visual representation learning
Transformer based approaches for visual representation learning
Ryohei Suzuki
 
Paper memo: Optimal-Transport Analysis of Single-Cell Gene Expression Identif...
Paper memo: Optimal-Transport Analysis of Single-Cell Gene Expression Identif...Paper memo: Optimal-Transport Analysis of Single-Cell Gene Expression Identif...
Paper memo: Optimal-Transport Analysis of Single-Cell Gene Expression Identif...
Ryohei Suzuki
 
Basic Concepts of Entanglement Measures
Basic Concepts of Entanglement MeasuresBasic Concepts of Entanglement Measures
Basic Concepts of Entanglement Measures
Ryohei Suzuki
 
Disentangled Representation Learning of Deep Generative Models
Disentangled Representation Learning of Deep Generative ModelsDisentangled Representation Learning of Deep Generative Models
Disentangled Representation Learning of Deep Generative Models
Ryohei Suzuki
 
論文紹介: "MolGAN: An implicit generative model for small molecular graphs"
論文紹介: "MolGAN: An implicit generative model for small molecular graphs"論文紹介: "MolGAN: An implicit generative model for small molecular graphs"
論文紹介: "MolGAN: An implicit generative model for small molecular graphs"
Ryohei Suzuki
 
等号と不等号の物理学
等号と不等号の物理学等号と不等号の物理学
等号と不等号の物理学
Ryohei Suzuki
 
Wolf et al. "Graph abstraction reconciles clustering with trajectory inferen...
Wolf et al. "Graph abstraction reconciles clustering with trajectory inferen...Wolf et al. "Graph abstraction reconciles clustering with trajectory inferen...
Wolf et al. "Graph abstraction reconciles clustering with trajectory inferen...
Ryohei Suzuki
 
コンピュータは知恵熱を出すか?
コンピュータは知恵熱を出すか?コンピュータは知恵熱を出すか?
コンピュータは知恵熱を出すか?
Ryohei Suzuki
 
身体の中の小宇宙:免疫研究の最前線
身体の中の小宇宙:免疫研究の最前線身体の中の小宇宙:免疫研究の最前線
身体の中の小宇宙:免疫研究の最前線
Ryohei Suzuki
 
Single-cell pseudo-temporal ordering 近年の技術動向
Single-cell pseudo-temporal ordering 近年の技術動向Single-cell pseudo-temporal ordering 近年の技術動向
Single-cell pseudo-temporal ordering 近年の技術動向
Ryohei Suzuki
 
Collaborative 3D Modeling by the Crowd
Collaborative 3D Modeling by the CrowdCollaborative 3D Modeling by the Crowd
Collaborative 3D Modeling by the Crowd
Ryohei Suzuki
 
汝は計算機なりや?
汝は計算機なりや?汝は計算機なりや?
汝は計算機なりや?
Ryohei Suzuki
 
アナログとはなんだろう。―古くて新しい、もう一つの計算―
アナログとはなんだろう。―古くて新しい、もう一つの計算―アナログとはなんだろう。―古くて新しい、もう一つの計算―
アナログとはなんだろう。―古くて新しい、もう一つの計算―
Ryohei Suzuki
 
AnnoTone (CHI 2015)
AnnoTone (CHI 2015)AnnoTone (CHI 2015)
AnnoTone (CHI 2015)
Ryohei Suzuki
 
色字共感覚と書記素学習
色字共感覚と書記素学習色字共感覚と書記素学習
色字共感覚と書記素学習
Ryohei Suzuki
 
AnnoTone: 高周波音の映像収録時 埋め込みによる編集支援
AnnoTone: 高周波音の映像収録時埋め込みによる編集支援AnnoTone: 高周波音の映像収録時埋め込みによる編集支援
AnnoTone: 高周波音の映像収録時 埋め込みによる編集支援
Ryohei Suzuki
 
立体音響とインタラクション
立体音響とインタラクション立体音響とインタラクション
立体音響とインタラクション
Ryohei Suzuki
 
SIGGRAPH 2014 Preview -"Shape Collection" Session
SIGGRAPH 2014 Preview -"Shape Collection" SessionSIGGRAPH 2014 Preview -"Shape Collection" Session
SIGGRAPH 2014 Preview -"Shape Collection" Session
Ryohei Suzuki
 
Overview of User Interfaces
Overview of User InterfacesOverview of User Interfaces
Overview of User InterfacesRyohei Suzuki
 
Brief Introduction to Recent Spatial Interfaces
Brief Introduction to Recent Spatial InterfacesBrief Introduction to Recent Spatial Interfaces
Brief Introduction to Recent Spatial InterfacesRyohei Suzuki
 

More from Ryohei Suzuki (20)

Transformer based approaches for visual representation learning
Transformer based approaches for visual representation learningTransformer based approaches for visual representation learning
Transformer based approaches for visual representation learning
 
Paper memo: Optimal-Transport Analysis of Single-Cell Gene Expression Identif...
Paper memo: Optimal-Transport Analysis of Single-Cell Gene Expression Identif...Paper memo: Optimal-Transport Analysis of Single-Cell Gene Expression Identif...
Paper memo: Optimal-Transport Analysis of Single-Cell Gene Expression Identif...
 
Basic Concepts of Entanglement Measures
Basic Concepts of Entanglement MeasuresBasic Concepts of Entanglement Measures
Basic Concepts of Entanglement Measures
 
Disentangled Representation Learning of Deep Generative Models
Disentangled Representation Learning of Deep Generative ModelsDisentangled Representation Learning of Deep Generative Models
Disentangled Representation Learning of Deep Generative Models
 
論文紹介: "MolGAN: An implicit generative model for small molecular graphs"
論文紹介: "MolGAN: An implicit generative model for small molecular graphs"論文紹介: "MolGAN: An implicit generative model for small molecular graphs"
論文紹介: "MolGAN: An implicit generative model for small molecular graphs"
 
等号と不等号の物理学
等号と不等号の物理学等号と不等号の物理学
等号と不等号の物理学
 
Wolf et al. "Graph abstraction reconciles clustering with trajectory inferen...
Wolf et al. "Graph abstraction reconciles clustering with trajectory inferen...Wolf et al. "Graph abstraction reconciles clustering with trajectory inferen...
Wolf et al. "Graph abstraction reconciles clustering with trajectory inferen...
 
コンピュータは知恵熱を出すか?
コンピュータは知恵熱を出すか?コンピュータは知恵熱を出すか?
コンピュータは知恵熱を出すか?
 
身体の中の小宇宙:免疫研究の最前線
身体の中の小宇宙:免疫研究の最前線身体の中の小宇宙:免疫研究の最前線
身体の中の小宇宙:免疫研究の最前線
 
Single-cell pseudo-temporal ordering 近年の技術動向
Single-cell pseudo-temporal ordering 近年の技術動向Single-cell pseudo-temporal ordering 近年の技術動向
Single-cell pseudo-temporal ordering 近年の技術動向
 
Collaborative 3D Modeling by the Crowd
Collaborative 3D Modeling by the CrowdCollaborative 3D Modeling by the Crowd
Collaborative 3D Modeling by the Crowd
 
汝は計算機なりや?
汝は計算機なりや?汝は計算機なりや?
汝は計算機なりや?
 
アナログとはなんだろう。―古くて新しい、もう一つの計算―
アナログとはなんだろう。―古くて新しい、もう一つの計算―アナログとはなんだろう。―古くて新しい、もう一つの計算―
アナログとはなんだろう。―古くて新しい、もう一つの計算―
 
AnnoTone (CHI 2015)
AnnoTone (CHI 2015)AnnoTone (CHI 2015)
AnnoTone (CHI 2015)
 
色字共感覚と書記素学習
色字共感覚と書記素学習色字共感覚と書記素学習
色字共感覚と書記素学習
 
AnnoTone: 高周波音の映像収録時 埋め込みによる編集支援
AnnoTone: 高周波音の映像収録時埋め込みによる編集支援AnnoTone: 高周波音の映像収録時埋め込みによる編集支援
AnnoTone: 高周波音の映像収録時 埋め込みによる編集支援
 
立体音響とインタラクション
立体音響とインタラクション立体音響とインタラクション
立体音響とインタラクション
 
SIGGRAPH 2014 Preview -"Shape Collection" Session
SIGGRAPH 2014 Preview -"Shape Collection" SessionSIGGRAPH 2014 Preview -"Shape Collection" Session
SIGGRAPH 2014 Preview -"Shape Collection" Session
 
Overview of User Interfaces
Overview of User InterfacesOverview of User Interfaces
Overview of User Interfaces
 
Brief Introduction to Recent Spatial Interfaces
Brief Introduction to Recent Spatial InterfacesBrief Introduction to Recent Spatial Interfaces
Brief Introduction to Recent Spatial Interfaces
 

Recently uploaded

Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
IshaGoswami9
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
IqrimaNabilatulhusni
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
University of Maribor
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
yusufzako14
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
Columbia Weather Systems
 
Introduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptxIntroduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptx
zeex60
 
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Studia Poinsotiana
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
yqqaatn0
 
BLOOD AND BLOOD COMPONENT- introduction to blood physiology
BLOOD AND BLOOD COMPONENT- introduction to blood physiologyBLOOD AND BLOOD COMPONENT- introduction to blood physiology
BLOOD AND BLOOD COMPONENT- introduction to blood physiology
NoelManyise1
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
AlaminAfendy1
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
SAMIR PANDA
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
moosaasad1975
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
ChetanK57
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
Areesha Ahmad
 
Hemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptxHemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptx
muralinath2
 
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdfMudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
frank0071
 
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
Wasswaderrick3
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
University of Maribor
 
S.1 chemistry scheme term 2 for ordinary level
S.1 chemistry scheme term 2 for ordinary levelS.1 chemistry scheme term 2 for ordinary level
S.1 chemistry scheme term 2 for ordinary level
ronaldlakony0
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
Lokesh Patil
 

Recently uploaded (20)

Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
 
Introduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptxIntroduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptx
 
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
 
BLOOD AND BLOOD COMPONENT- introduction to blood physiology
BLOOD AND BLOOD COMPONENT- introduction to blood physiologyBLOOD AND BLOOD COMPONENT- introduction to blood physiology
BLOOD AND BLOOD COMPONENT- introduction to blood physiology
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
 
Hemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptxHemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptx
 
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdfMudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
 
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
 
S.1 chemistry scheme term 2 for ordinary level
S.1 chemistry scheme term 2 for ordinary levelS.1 chemistry scheme term 2 for ordinary level
S.1 chemistry scheme term 2 for ordinary level
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
 

Paper memo: persistent homology on biological problems

  • 1. Journal Club Sep. 18, 2020 (Ryohei Suzuki) J. R. Soc. Interface 16.158 (2019): 16:20190531 Medical image analysis 55 (2019): 1-14.
  • 2. Topological data analysis (TDA) Why TDA? • TDA provides metric-invariant summarization of complex and high-dimensional data cf. many normalization modes of RNA-seq • TDA robustly handles the global structure of data in intuitive way Applications of TDA • Material science (crystal structure) • Network analysis • Peak detection, etc. Topology(位相幾何学) = mathematical framework for describing the “shape” of object that is invariant with respect to continuous deformation
  • 3. Basic framework of topological analysis Figure copied from https://www.wpi-aimr.tohoku.ac.jp/hiraoka_labo/introduction_j.pdf ← recommended reference! # Connected Components # Rings (1d-cycle) # Hollows (2d-cycle)
  • 4. Persistent homology Assuming the input to be a point set, observe the transition of topological features of the complex given by connecting points with growing radius ε Lifetime of individual connected components Lifetime of individual rings Robust ring structure Called “barcode” Birth of ring Death of ring
  • 5. Persistent diagram Scatter graph showing the birth-(x-axis) and death-time (y-axis) of topological components → representing the information of global structure of the data Further analysis - Calculating summarized values e.g. sum of the cycle length SLk - Classification of diagrams as images Figure from https://www.pnas.org/content/113/26/7035 Robust ring Transient rings
  • 6. Goal: discover the transcriptomic characteristics of ASD patients’ brains • ASD is known to be highly heritable, but no key genetic variant contributing to the disease is found. Rather, >100 genes are considered to contribute to the risk. • Several studies have showed transcriptomic differences e.g., the downregulation of neuronal synaptic genes and the upregulation of immune genes in ASD patients • More comprehensive study is required to understand the disease Approach: directly apply persistent homology to expression data • To see the inter-patient and inter-gene geometries of ASD/healthy groups Patient-space Densely-packed topology = patients have similar expression profiles Sparsely-packed topology = patients have heterogeneous expression
  • 7. Dataset and study overview Datasets • Dataset 1: microarray (9934 genes, 29 ASD / 29 control) [1], log2-transformed • Dataset 2: RNA-seq (22399 genes, 82 ASD / 82 control) [2], RPKM & log2-transformed Procedure • Calculate the inter-sample and inter-gene distance matrices for ASD/control expression • Dissimilarity measure: 1-r (r=Pearson correlation) • Compute the persistent diagrams • Derive the summary values • SDT0 = sum of death times of connected components. • Euler characteristics = SL0 – SL1 + SL2 ※SLk is sum of lifespan of connected components (k=0), rings (k=1), hollows (k=2). [1] Voineagu et al., (2011) Nature 474, 380-384 [2] Parikshak et al., (2016) Nature 540, 423-427 Sample 1 Sample 2 Sample 3 Gene 1 0.01 0.52 … Gene 2 0.25 Gene 3 … Inter-sample Inter-gene
  • 8. Results (inter-patient) Dataset1 (Microarray) Dataset2 (RNA-seq) ASD-PD Control-PD diff SDT0 diff Euler Random permutation distribution ASD vs. control p=0.00017 p=0.00024 p=0.011 p=0.012 Conclusion: ASD group have more heterogeneous expression profiles than control group
  • 9. Results (inter-gene) p=0.316 p=0.403 p=0.998 p=0.997 ASD-PD Control-PD diff SDT0 diff Euler Author’s conclusion: ASD/healthy groups don’t have significant difference in their transcriptomic organization Insignificant?? Dense topology → expression of genes correlate well among samples Sparse topology → less correlation
  • 10. Goal: fast tumor-region segmentation on WSI of colorectal cancer (CRC) • CRC is the third/second most diagnosed cancer in males/females • Fast automatic detection of possible tumor regions is vital for clinical use • CNN-based methods are actively studied, but suffer from computational costs Approach: use PH-inspired feature to classify patches • PH of image pixels is calculated via thresholding • Birth/death time distribution is used as feature • Comparison of the feature with ~100 exemplars provides very fast classification model
  • 11. Connecting pixels by thresholding • Common way to calculate persistent homology for 2D image data • By lowering the threshold, connected components advent and vanish (merge) one after another. Left image from: https://www.nature.com/articles/s41598-018-36798-y
  • 12. Persistent homology profiles (PHP) • From the thresholding result, probability distributions of birth/death- time called PHP are constructed (green lines) • These distributions are treated as feature vectors • By comparing PHP of input data with those of exemplar T/N images, fast classification can be performed. birth death tumor mean normal mean PHP
  • 13. Exemplar selection using CNN activation • Training dataset contains ~100000 patches → we should compare the PHP of input with some representative values • Improper selection of exemplars causes overfitting to significant texture patterns • Authors proposes a CNN-based selection strategy where patches with various feature activation are equally respected Select k exemplars from each bin of activation strength (highest 1/Q ~ lowest 1/Q)
  • 14. Quantitative classification results • Proposed algorithm outperforms existing methods in terms of F1-score in two distinct dataset • Generalization has room for improvement, but best among the tested methods • Why good? → PHP efficiently captures connectivity between cells in rotation- invariant way, which is difficult for convnets
  • 15. Qualitative segmentation results Comments • Comparison to the recent deep encoder-decoder models was not conducted • Batch effects (e.g., contrast) may significantly influence the calculation of PHP
  • 16. Reflection • (+) Persistent homology provides unique information about the global structure of the dataset, which is difficult to calculate in raw-data space, which would be useful for very high-dimensional data with large noise • (-) Persistent homology only provides highly summarized statistics, discarding the information about contributions of individual data points, e.g., which gene set is contributing in ASD patients. • Combination with CNNs, which perform very good at discovering local features, seems to be a promising idea for image analysis.