SlideShare a Scribd company logo
Graph Kernels for Chemical Informatics
Fall 2015 Data Lunch Seminar
Mukund Raj
10th Nov, 2015
1 / 48
Outline
1 Introduction
2 Expressiveness versus complexity
3 Walk kernels
4 Conclusion and future directions
5 Data depth for labeled graph ensembles
2 / 48
Liband-Based Virtual Screening1
Objective
Build models to predict biochemical properties of small molecules
from their structures.
Structures
C15H14CIN3O3
Properties
toxicity
pharmacokinetics (absorption)
binding to target
1
Slide from: Jean-Phillipe Vert, ParisTech
3 / 48
Example1
NCI AIDS screen results (from http://cactus.nci.nih.gov)
1
Slide from: Jean-Phillipe Vert, ParisTech
4 / 48
Chemical space2
Stars Small Molecules
Existing 1022 107
Virtual 0 1060
Access Difficult “Easy”
2
Slide from: Pierre Baldi, UC Irvine
5 / 48
Formalization
Problem statement
Given a set of training instances (x1, y1), . . . , (xn, yn), where :
xi ’s are graphs and yi ’s are continuous or discrete variables of
interest.
Estimate a function
y = f (x)
where x is any graph.
6 / 48
Classical Approaches1
Classical approaches
1 Map each molecule to a vector of fixed dimension.
2 Apply an algorithm for regression or classification over vectors.
Example: 2D structural keys in chemoinformatics
Then use NN, decision tree, least squares e.t.c
1
Slide from: Jean-Phillipe Vert, ParisTech
7 / 48
Difficulties
Expressiveness of features (which features are relevant?)
Large dimension of vector representation.
8 / 48
The kernel trick
Kernel
Let φ(x) be a vector representation of the graph x.
The kernel between two graphs is defined by:
K(x, x ) = φ(x)T
φ(x ).
Many linear algorithms can be expressed only in terms of inner
products between vectors.
Often computing kernel is more efficient than computing φ(x).
9 / 48
Kernel trick example: computing distances in the feature
space1
1
Slide from: Jean-Phillipe Vert, ParisTech
10 / 48
Positive definite (p.d.) Kernels
Definition
A positive definite (p.d.) kernel on a set χ is a function
K : χ × χ → R that is symmetric and satisfies, for all
N ∈ N, (x1, x2, . . . , xn) ∈ χN and (a1, a2, . . . , an) ∈ RN:
N
i=1
N
j=1
ai aj K(xi , xj ) ≥ 0
11 / 48
Positive definite kernels are inner products1
Mercer’s property
K is a p.d. kernel on the set χ if and only if there exists a Hilbert
space H and a mapping
φ : χ → H,
such that, for any x, x’ in χ:
K(x, x ) = φ(x), φ(x ) H
1
Slide from: Jean-Phillipe Vert, ParisTech
12 / 48
Graph kernels 1
Definition
A graph kernel K(x, x ) is a p.d. kernel over the set of
(labeled) graphs.
It is equivalent to an embedding φ : χ → H of the set of
graphs to a Hilbert space through the relation:
K(x, x ) = φ(x)T
φ(x ).
1
Slide from: Jean-Phillipe Vert, ParisTech
13 / 48
Clarification
Descriptors and kernels in chemoinformatics
1D- SMILES strings
2D- Graph of chemical bonds
2.5D- Surfaces
3D- Atomic coordinates
4D- Temporal evolution
14 / 48
Outline
1 Introduction
2 Expressiveness versus complexity
3 Walk kernels
4 Conclusion and future directions
5 Data depth for labeled graph ensembles
15 / 48
Expressibility versus complexity
Definition: Complete graph kernels
A graph kernel is complete if it separates nonisomorphic graphs.
16 / 48
Expressibility versus complexity
Definition: Complete graph kernels
A graph kernel is complete if it separates nonisomorphic graphs.
Graph isomorphism
Figure from: Wikipedia
17 / 48
Expressibility versus complexity
Definition: Complete graph kernels
A graph kernel is complete if it separates nonisomorphic graphs.
Implication
If graph kernel not complete, then it cannot differentiate all
nonisomorphic graphs.
18 / 48
Expressibility versus complexity
Definition: Complete graph kernels
A graph kernel is complete if it separates nonisomorphic graphs.
Implication
If graph kernel not complete, then it cannot differentiate all
nonisomorphic graphs.
Tractability
Computing any complete graph kernel is at least as hard as the
graph isomorphism problem (Gartner et al, 2003)
19 / 48
Subgraph kernel
Definition: Subgraph
A subgraph of a graph (V , E) is a graph (V , E ) with V ⊂ V and
E ⊂ E.
20 / 48
Subgraph kernel
Definition: Subgraph
A subgraph of a graph (V , E) is a graph (V , E ) with V ⊂ V and
E ⊂ E.
Definition: Subgraph kernel
Ksubgraph(G1, G2) =
H∈χ
λHφH(G1)φH(G2).
where H ⊂ χ, λH is weight associated with H and φH(Gx ) returns
the number of occurrences of H in Gx .
21 / 48
Subgraph kernel
Definition: Subgraph
A subgraph of a graph (V , E) is a graph (V , E ) with V ⊂ V and
E ⊂ E.
Definition: Subgraph kernel
Ksubgraph(G1, G2) =
H∈χ
λHφH(G1)φH(G2).
where H ⊂ χ, λH is weight associated with H and φH(Gx ) returns
the number of occurrences of H in Gx .
Subgraph kernel complexity
Computing the subgraph kernel is NP hard (Gartner et.al. 2003)
22 / 48
Path kernels
Definition: Path
A path of a graph (V,E) is a sequence of distinct vertices such that
consecutive vertices share an edge.
23 / 48
Path kernels
Definition: Path
A path of a graph (V,E) is a sequence of distinct vertices such that
consecutive vertices share an edge.
Definition: Path kernel
Kpath(G1, G2) =
H∈P
λHφH(G1)φH(G2).
where P ⊂ χ is the set of path graphs.
24 / 48
Path kernels
Definition: Path
A path of a graph (V,E) is a sequence of distinct vertices such that
consecutive vertices share an edge.
Definition: Path kernel
Kpath(G1, G2) =
H∈P
λHφH(G1)φH(G2).
where P ⊂ χ is the set of path graphs.
Path kernel complexity
Computing the path kernel is NP hard (Gartner et.al. 2003)
25 / 48
Outline
1 Introduction
2 Expressiveness versus complexity
3 Walk kernels
4 Conclusion and future directions
5 Data depth for labeled graph ensembles
26 / 48
Walks
Definition
A walk of a graph (V,E) is a sequence of distinct vertices such that
consecutive vertices share an edge. Edge cannot appear in path
only once.
27 / 48
Walks
Definition
A walk of a graph (V,E) is a sequence of distinct vertices such that
consecutive vertices share an edge. Edge cannot appear in path
only once.
Definition: Walk kernel
Kwalk(G1, G2) =
w∈S
λw φw (G1)φw (G2).
where S is the set of all walks and φw (Gx ) returns the count of
walk w in Gx .
28 / 48
Walk kernel examples
nth order walk kernel
λG (w) = 1 if the length of w is n, 0 other wise.
29 / 48
Walk kernel examples
nth order walk kernel
λG (w) = 1 if the length of w is n, 0 other wise.
Geometric walk kernel
λG (w) = βlength(w), for β > 0
30 / 48
Walk kernel examples
nth order walk kernel
λG (w) = 1 if the length of w is n, 0 other wise.
Geometric walk kernel
λG (w) = βlength(w), for β > 0
Random walk kernel
λG (w) = PG (w)
31 / 48
Walk kernel examples
nth order walk kernel
λG (w) = 1 if the length of w is n, 0 other wise.
Geometric walk kernel
λG (w) = βlength(w), for β > 0
Random walk kernel
λG (w) = PG (w)
Fingerprint based kernels
Dot product kernel
Tanimoto kernel
MinMax kernel
32 / 48
Computation of walk kernels
Yay!
All the above walk kernels can be computed efficiently in
polynomial time.
33 / 48
Computation of n-th order walk kernel (1/2)1
Product graphs
Let G1 = (V1, E1) and G2 = (V1, E2). Then product graph
G = G1 × G2 is the graph G = (V , E) with:
1 V = {(v1, v2) ∈ V1 × V2: v1 and v2 have same label},
2 E = { (v1, v2), (v1, v2) ∈ V × V : (v1, v1) ∈
E1 and (v2, v2) ∈ E2}
1
Slide from: Jean-Phillipe Vert, ParisTech
34 / 48
Computation of n-th order walk kernel (2/2)1
For nth order walk kernel we have λG1×G2 = 1 if length of w
is n, 0 otherwise.
Therefore:
Knth−order (G1, G2) =
w∈Sn(G1×G2)
1 =
i,j
[An
]i,j = 1T
An
1.
Computation in O(n|G1||G2|d1d2), where di is the maximum
degree of Gi .
1
Slide from: Jean-Phillipe Vert, ParisTech
35 / 48
Traditional molecular fingerprints
Bit vectors of size ( usually = 512 or 1024)
Steps (as summarized in Ralaivola et al. 2005)
1 DFS exploration from each atom to get set of walks
2 Each path initializes a random number generator to form b
integers.
3 b integers reduced molulo then used to set corresponding
bits in fingerprint vector.
Complexity O(nm) or O(nαd ), where n := # atoms and
m := #edges, α := branching factor and d := depth of walk
36 / 48
Generalized molecular fingerprints
Avoids clashes/information loss with reserved bit positions.
Let P(d) be set of all atom-bond labeled path containing max
d bonds.
Binary feature map given depth d:
φd (u) = φpath(u) path∈P(d)
Binary feature map given depth d and fixed vector size :
φd (u) = φγ (path)(u) path∈P(d)
where γ :→ {1, . . . , }b
37 / 48
Fingerprint based kernel (Ralaivola et.al. 2005)2
Complexity O(d(n1m1 + n2m2)) using suffix tree data
structure.
2
Slide from: Pierre Baldi, UC Irvine
38 / 48
Extensions for walk kernels
Label enrichment
Non tottering walk
3D kernels
Mutual information in fingerprint construction
39 / 48
Results(Mahe et al., 2005, Ralaivola et al, 2005)
MUTAG Dataset
Collection of 188 compounds.
Classification of mutagenic activity :high(125) or none(63), as
assayed in Salmonella typhimurium.
Method Accuracy
Progol1(1D) 81.4 %
Random walk kernel (2D) 91.2 %
MinMax kernel (2D) 91.5%
40 / 48
Outline
1 Introduction
2 Expressiveness versus complexity
3 Walk kernels
4 Conclusion and future directions
5 Data depth for labeled graph ensembles
41 / 48
Conclusion
Summary
Extension of ML algorithms to graph data using definition of
positive definite kernels.
Two classes of 2D kernels for chemical molecule structures.
What next?
Can we use graph kernel machinery for computing depth.
42 / 48
Outline
1 Introduction
2 Expressiveness versus complexity
3 Walk kernels
4 Conclusion and future directions
5 Data depth for labeled graph ensembles
43 / 48
Data depth
What is depth function?
A depth function is designed to provide a P-based center outward
ordering( and thus a ranking) for ensemble of data objects drawn
from any arbitrary distribution P.
Taxonomy of data depth definitions (Mosler 2012)
Distance based depth functions
Simplex/Halfspace based depth
Weighted mean based depth
44 / 48
Band depth
Band depth: A type of simplex based data depth method.
Many definitions for various kinds of data.
Functions Multivariate functions
Paths on graph
45 / 48
Band formed by graphs
When alignment is known..
Graphs → Adjacency matrices → Functions
Alignment: A mapping from θ : VGx → VGy , where Gx and Gy
are any two graphs.
When alignment is unknown..
????
46 / 48
Product graphs!
V = {(v1, v2) ∈ V1 × V2: v1 and v2 have same label},
E = { (v1, v2), (v1, v2) ∈ V × V : (v1, v1) ∈ E1 and (v2, v2) ∈ E2}
Weak Direct Product (aka Tensor product or Kronecker product..)
E× = { (v1, v2), (v1, v2) ∈ V × V : (v1, v1) ∈ E1 and (v2, v2) ∈
E2}
Strong Product
E = E× ∪ E where
E = { (v1, v2), (v1, v2) ∈ V × V : v1 = v1 and (v2, v2) ∈
E2} or v2 = v2 and (v1, v1) ∈ E1}
47 / 48
References
Ralaivola, Liva, Sanjay J. Swamidass, Hiroto Saigo, and Pierre
Baldi. ”Graph kernels for chemical informatics.” Neural
Networks 18, no. 8 (2005): 1093-1110.
Mahe, Pierre, et al. ”Graph kernels for molecular
structure-activity relationship analysis with support vector
machines.” Journal of chemical information and modeling
45.4 (2005): 939-951.
Gartner, Thomas, Peter Flach, and Stefan Wrobel. ”On graph
kernels: Hardness results and efficient alternatives.” Learning
Theory and Kernel Machines. Springer Berlin Heidelberg,
2003. 129-143.
http://videolectures.net/site/normal_dl/tag=9127/
gbr07_vert_ckac_01.pdf
http:
//www.ics.uci.edu/~dock/upload/UCI_CHEM_05.ppt
48 / 48

More Related Content

What's hot

Information Content of Complex Networks
Information Content of Complex NetworksInformation Content of Complex Networks
Information Content of Complex Networks
Hector Zenil
 
Fractal dimension versus Computational Complexity
Fractal dimension versus Computational ComplexityFractal dimension versus Computational Complexity
Fractal dimension versus Computational Complexity
Hector Zenil
 
Fractal Dimension of Space-time Diagrams and the Runtime Complexity of Small ...
Fractal Dimension of Space-time Diagrams and the Runtime Complexity of Small ...Fractal Dimension of Space-time Diagrams and the Runtime Complexity of Small ...
Fractal Dimension of Space-time Diagrams and the Runtime Complexity of Small ...
Hector Zenil
 
Graphical Model Selection for Big Data
Graphical Model Selection for Big DataGraphical Model Selection for Big Data
Graphical Model Selection for Big Data
Alexander Jung
 
Differential analyses of structures in HiC data
Differential analyses of structures in HiC dataDifferential analyses of structures in HiC data
Differential analyses of structures in HiC data
tuxette
 
Kernelization algorithms for graph and other structure modification problems
Kernelization algorithms for graph and other structure modification problemsKernelization algorithms for graph and other structure modification problems
Kernelization algorithms for graph and other structure modification problems
Anthony Perez
 
Graph kernels
Graph kernelsGraph kernels
Graph kernels
Luc Brun
 
ADAPTIVE FUZZY KERNEL CLUSTERING ALGORITHM
ADAPTIVE FUZZY KERNEL CLUSTERING ALGORITHMADAPTIVE FUZZY KERNEL CLUSTERING ALGORITHM
ADAPTIVE FUZZY KERNEL CLUSTERING ALGORITHM
ijfls
 
A Numerical Method for the Evaluation of Kolmogorov Complexity, An alternativ...
A Numerical Method for the Evaluation of Kolmogorov Complexity, An alternativ...A Numerical Method for the Evaluation of Kolmogorov Complexity, An alternativ...
A Numerical Method for the Evaluation of Kolmogorov Complexity, An alternativ...
Hector Zenil
 
Graph Edit Distance: Basics & Trends
Graph Edit Distance: Basics & TrendsGraph Edit Distance: Basics & Trends
Graph Edit Distance: Basics & Trends
Luc Brun
 
High Speed Memory Efficient Multiplier-less 1-D 9/7 Wavelet Filters Based NED...
High Speed Memory Efficient Multiplier-less 1-D 9/7 Wavelet Filters Based NED...High Speed Memory Efficient Multiplier-less 1-D 9/7 Wavelet Filters Based NED...
High Speed Memory Efficient Multiplier-less 1-D 9/7 Wavelet Filters Based NED...
IJERA Editor
 
Linear Classifiers
Linear ClassifiersLinear Classifiers
Linear Classifiers
Alexander Jung
 
D E S I G N A N D A N A L Y S I S O F A L G O R I T H M S J N T U M O D E L...
D E S I G N A N D A N A L Y S I S O F A L G O R I T H M S  J N T U  M O D E L...D E S I G N A N D A N A L Y S I S O F A L G O R I T H M S  J N T U  M O D E L...
D E S I G N A N D A N A L Y S I S O F A L G O R I T H M S J N T U M O D E L...guest3f9c6b
 
A lattice-based consensus clustering
A lattice-based consensus clusteringA lattice-based consensus clustering
A lattice-based consensus clustering
Dmitrii Ignatov
 
Ee693 questionshomework
Ee693 questionshomeworkEe693 questionshomework
Ee693 questionshomework
Gopi Saiteja
 
THE RESULT FOR THE GRUNDY NUMBER ON P4- CLASSES
THE RESULT FOR THE GRUNDY NUMBER ON P4- CLASSESTHE RESULT FOR THE GRUNDY NUMBER ON P4- CLASSES
THE RESULT FOR THE GRUNDY NUMBER ON P4- CLASSES
graphhoc
 
Predicting organic reaction outcomes with weisfeiler lehman network
Predicting organic reaction outcomes with weisfeiler lehman networkPredicting organic reaction outcomes with weisfeiler lehman network
Predicting organic reaction outcomes with weisfeiler lehman network
Kazuki Fujikawa
 
R package 'bayesImageS': a case study in Bayesian computation using Rcpp and ...
R package 'bayesImageS': a case study in Bayesian computation using Rcpp and ...R package 'bayesImageS': a case study in Bayesian computation using Rcpp and ...
R package 'bayesImageS': a case study in Bayesian computation using Rcpp and ...
Matt Moores
 
Computational Information Geometry on Matrix Manifolds (ICTP 2013)
Computational Information Geometry on Matrix Manifolds (ICTP 2013)Computational Information Geometry on Matrix Manifolds (ICTP 2013)
Computational Information Geometry on Matrix Manifolds (ICTP 2013)
Frank Nielsen
 

What's hot (20)

Information Content of Complex Networks
Information Content of Complex NetworksInformation Content of Complex Networks
Information Content of Complex Networks
 
Fractal dimension versus Computational Complexity
Fractal dimension versus Computational ComplexityFractal dimension versus Computational Complexity
Fractal dimension versus Computational Complexity
 
Fractal Dimension of Space-time Diagrams and the Runtime Complexity of Small ...
Fractal Dimension of Space-time Diagrams and the Runtime Complexity of Small ...Fractal Dimension of Space-time Diagrams and the Runtime Complexity of Small ...
Fractal Dimension of Space-time Diagrams and the Runtime Complexity of Small ...
 
Graphical Model Selection for Big Data
Graphical Model Selection for Big DataGraphical Model Selection for Big Data
Graphical Model Selection for Big Data
 
Differential analyses of structures in HiC data
Differential analyses of structures in HiC dataDifferential analyses of structures in HiC data
Differential analyses of structures in HiC data
 
Kernelization algorithms for graph and other structure modification problems
Kernelization algorithms for graph and other structure modification problemsKernelization algorithms for graph and other structure modification problems
Kernelization algorithms for graph and other structure modification problems
 
Graph kernels
Graph kernelsGraph kernels
Graph kernels
 
ADAPTIVE FUZZY KERNEL CLUSTERING ALGORITHM
ADAPTIVE FUZZY KERNEL CLUSTERING ALGORITHMADAPTIVE FUZZY KERNEL CLUSTERING ALGORITHM
ADAPTIVE FUZZY KERNEL CLUSTERING ALGORITHM
 
A Numerical Method for the Evaluation of Kolmogorov Complexity, An alternativ...
A Numerical Method for the Evaluation of Kolmogorov Complexity, An alternativ...A Numerical Method for the Evaluation of Kolmogorov Complexity, An alternativ...
A Numerical Method for the Evaluation of Kolmogorov Complexity, An alternativ...
 
Graph Edit Distance: Basics & Trends
Graph Edit Distance: Basics & TrendsGraph Edit Distance: Basics & Trends
Graph Edit Distance: Basics & Trends
 
High Speed Memory Efficient Multiplier-less 1-D 9/7 Wavelet Filters Based NED...
High Speed Memory Efficient Multiplier-less 1-D 9/7 Wavelet Filters Based NED...High Speed Memory Efficient Multiplier-less 1-D 9/7 Wavelet Filters Based NED...
High Speed Memory Efficient Multiplier-less 1-D 9/7 Wavelet Filters Based NED...
 
Linear Classifiers
Linear ClassifiersLinear Classifiers
Linear Classifiers
 
D E S I G N A N D A N A L Y S I S O F A L G O R I T H M S J N T U M O D E L...
D E S I G N A N D A N A L Y S I S O F A L G O R I T H M S  J N T U  M O D E L...D E S I G N A N D A N A L Y S I S O F A L G O R I T H M S  J N T U  M O D E L...
D E S I G N A N D A N A L Y S I S O F A L G O R I T H M S J N T U M O D E L...
 
A lattice-based consensus clustering
A lattice-based consensus clusteringA lattice-based consensus clustering
A lattice-based consensus clustering
 
Ee693 questionshomework
Ee693 questionshomeworkEe693 questionshomework
Ee693 questionshomework
 
THE RESULT FOR THE GRUNDY NUMBER ON P4- CLASSES
THE RESULT FOR THE GRUNDY NUMBER ON P4- CLASSESTHE RESULT FOR THE GRUNDY NUMBER ON P4- CLASSES
THE RESULT FOR THE GRUNDY NUMBER ON P4- CLASSES
 
Predicting organic reaction outcomes with weisfeiler lehman network
Predicting organic reaction outcomes with weisfeiler lehman networkPredicting organic reaction outcomes with weisfeiler lehman network
Predicting organic reaction outcomes with weisfeiler lehman network
 
R package 'bayesImageS': a case study in Bayesian computation using Rcpp and ...
R package 'bayesImageS': a case study in Bayesian computation using Rcpp and ...R package 'bayesImageS': a case study in Bayesian computation using Rcpp and ...
R package 'bayesImageS': a case study in Bayesian computation using Rcpp and ...
 
Computational Information Geometry on Matrix Manifolds (ICTP 2013)
Computational Information Geometry on Matrix Manifolds (ICTP 2013)Computational Information Geometry on Matrix Manifolds (ICTP 2013)
Computational Information Geometry on Matrix Manifolds (ICTP 2013)
 
Pixelrelationships
PixelrelationshipsPixelrelationships
Pixelrelationships
 

Viewers also liked

Planning du 14 au 18 mars 2016
Planning du 14 au 18 mars 2016Planning du 14 au 18 mars 2016
Planning du 14 au 18 mars 2016
esidocvigan
 
Regra de três composta
Regra de três compostaRegra de três composta
Regra de três composta
Carolina Souza
 
What to expect when you're expecting a nanny via Care.com
What to expect when you're expecting a nanny via Care.comWhat to expect when you're expecting a nanny via Care.com
What to expect when you're expecting a nanny via Care.com
FlutterbyBarb
 
CUESTIONARIO MATEMÁTICAS - TAREA VACACIONAL 9NO
CUESTIONARIO MATEMÁTICAS - TAREA VACACIONAL 9NOCUESTIONARIO MATEMÁTICAS - TAREA VACACIONAL 9NO
CUESTIONARIO MATEMÁTICAS - TAREA VACACIONAL 9NO
enrique0975
 
Housekeeping department with Other Department
Housekeeping department with Other DepartmentHousekeeping department with Other Department
Housekeeping department with Other Department
Tyrara Xieleen
 
Ejercicios resueltos de c++
Ejercicios resueltos de c++Ejercicios resueltos de c++
Ejercicios resueltos de c++
Jhon TRUJILLO
 
FENICIA "País de las palmeras"
FENICIA "País de las palmeras"FENICIA "País de las palmeras"
FENICIA "País de las palmeras"
Rafael Moreno Yupanqui
 

Viewers also liked (9)

MHC - Mazzola (1)
MHC - Mazzola (1)MHC - Mazzola (1)
MHC - Mazzola (1)
 
Planning du 14 au 18 mars 2016
Planning du 14 au 18 mars 2016Planning du 14 au 18 mars 2016
Planning du 14 au 18 mars 2016
 
msc U-PHONE
msc U-PHONEmsc U-PHONE
msc U-PHONE
 
Regra de três composta
Regra de três compostaRegra de três composta
Regra de três composta
 
What to expect when you're expecting a nanny via Care.com
What to expect when you're expecting a nanny via Care.comWhat to expect when you're expecting a nanny via Care.com
What to expect when you're expecting a nanny via Care.com
 
CUESTIONARIO MATEMÁTICAS - TAREA VACACIONAL 9NO
CUESTIONARIO MATEMÁTICAS - TAREA VACACIONAL 9NOCUESTIONARIO MATEMÁTICAS - TAREA VACACIONAL 9NO
CUESTIONARIO MATEMÁTICAS - TAREA VACACIONAL 9NO
 
Housekeeping department with Other Department
Housekeeping department with Other DepartmentHousekeeping department with Other Department
Housekeeping department with Other Department
 
Ejercicios resueltos de c++
Ejercicios resueltos de c++Ejercicios resueltos de c++
Ejercicios resueltos de c++
 
FENICIA "País de las palmeras"
FENICIA "País de las palmeras"FENICIA "País de las palmeras"
FENICIA "País de las palmeras"
 

Similar to Graph Kernels for Chemical Informatics

Patch Matching with Polynomial Exponential Families and Projective Divergences
Patch Matching with Polynomial Exponential Families and Projective DivergencesPatch Matching with Polynomial Exponential Families and Projective Divergences
Patch Matching with Polynomial Exponential Families and Projective Divergences
Frank Nielsen
 
Tensor Train data format for uncertainty quantification
Tensor Train data format for uncertainty quantificationTensor Train data format for uncertainty quantification
Tensor Train data format for uncertainty quantification
Alexander Litvinenko
 
Divergence center-based clustering and their applications
Divergence center-based clustering and their applicationsDivergence center-based clustering and their applications
Divergence center-based clustering and their applications
Frank Nielsen
 
Vector Distance Transform Maps for Autonomous Mobile Robot Navigation
Vector Distance Transform Maps for Autonomous Mobile Robot NavigationVector Distance Transform Maps for Autonomous Mobile Robot Navigation
Vector Distance Transform Maps for Autonomous Mobile Robot Navigation
Janindu Arukgoda
 
Scalable Global Alignment Graph Kernel Using Random Features: From Node Embed...
Scalable Global Alignment Graph Kernel Using Random Features: From Node Embed...Scalable Global Alignment Graph Kernel Using Random Features: From Node Embed...
Scalable Global Alignment Graph Kernel Using Random Features: From Node Embed...
seijihagawa
 
Problem Solving with Algorithms and Data Structure - Graphs
Problem Solving with Algorithms and Data Structure - GraphsProblem Solving with Algorithms and Data Structure - Graphs
Problem Solving with Algorithms and Data Structure - Graphs
Yi-Lung Tsai
 
Image trnsformations
Image trnsformationsImage trnsformations
Image trnsformationsJohn Williams
 
Low rank tensor approximation of probability density and characteristic funct...
Low rank tensor approximation of probability density and characteristic funct...Low rank tensor approximation of probability density and characteristic funct...
Low rank tensor approximation of probability density and characteristic funct...
Alexander Litvinenko
 
Q-Metrics in Theory And Practice
Q-Metrics in Theory And PracticeQ-Metrics in Theory And Practice
Q-Metrics in Theory And Practice
guest3550292
 
Q-Metrics in Theory and Practice
Q-Metrics in Theory and PracticeQ-Metrics in Theory and Practice
Q-Metrics in Theory and Practice
Magdi Mohamed
 
Lecture4 kenrels functions_rkhs
Lecture4 kenrels functions_rkhsLecture4 kenrels functions_rkhs
Lecture4 kenrels functions_rkhsStéphane Canu
 
Inria Tech Talk - La classification de données complexes avec MASSICCC
Inria Tech Talk - La classification de données complexes avec MASSICCCInria Tech Talk - La classification de données complexes avec MASSICCC
Inria Tech Talk - La classification de données complexes avec MASSICCC
Stéphanie Roger
 
Clustering in Hilbert geometry for machine learning
Clustering in Hilbert geometry for machine learningClustering in Hilbert geometry for machine learning
Clustering in Hilbert geometry for machine learning
Frank Nielsen
 
Distributed solution of stochastic optimal control problem on GPUs
Distributed solution of stochastic optimal control problem on GPUsDistributed solution of stochastic optimal control problem on GPUs
Distributed solution of stochastic optimal control problem on GPUs
Pantelis Sopasakis
 
Gate Computer Science Solved Paper 2007
Gate Computer Science Solved Paper 2007 Gate Computer Science Solved Paper 2007
Gate Computer Science Solved Paper 2007
Rohit Garg
 
QMC Error SAMSI Tutorial Aug 2017
QMC Error SAMSI Tutorial Aug 2017QMC Error SAMSI Tutorial Aug 2017
QMC Error SAMSI Tutorial Aug 2017
Fred J. Hickernell
 
Automatic bayesian cubature
Automatic bayesian cubatureAutomatic bayesian cubature
Automatic bayesian cubature
Jagadeeswaran Rathinavel
 
Fourier_Pricing_ICCF_2022.pdf
Fourier_Pricing_ICCF_2022.pdfFourier_Pricing_ICCF_2022.pdf
Fourier_Pricing_ICCF_2022.pdf
Chiheb Ben Hammouda
 
Hierarchical matrices for approximating large covariance matries and computin...
Hierarchical matrices for approximating large covariance matries and computin...Hierarchical matrices for approximating large covariance matries and computin...
Hierarchical matrices for approximating large covariance matries and computin...
Alexander Litvinenko
 
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
The Statistical and Applied Mathematical Sciences Institute
 

Similar to Graph Kernels for Chemical Informatics (20)

Patch Matching with Polynomial Exponential Families and Projective Divergences
Patch Matching with Polynomial Exponential Families and Projective DivergencesPatch Matching with Polynomial Exponential Families and Projective Divergences
Patch Matching with Polynomial Exponential Families and Projective Divergences
 
Tensor Train data format for uncertainty quantification
Tensor Train data format for uncertainty quantificationTensor Train data format for uncertainty quantification
Tensor Train data format for uncertainty quantification
 
Divergence center-based clustering and their applications
Divergence center-based clustering and their applicationsDivergence center-based clustering and their applications
Divergence center-based clustering and their applications
 
Vector Distance Transform Maps for Autonomous Mobile Robot Navigation
Vector Distance Transform Maps for Autonomous Mobile Robot NavigationVector Distance Transform Maps for Autonomous Mobile Robot Navigation
Vector Distance Transform Maps for Autonomous Mobile Robot Navigation
 
Scalable Global Alignment Graph Kernel Using Random Features: From Node Embed...
Scalable Global Alignment Graph Kernel Using Random Features: From Node Embed...Scalable Global Alignment Graph Kernel Using Random Features: From Node Embed...
Scalable Global Alignment Graph Kernel Using Random Features: From Node Embed...
 
Problem Solving with Algorithms and Data Structure - Graphs
Problem Solving with Algorithms and Data Structure - GraphsProblem Solving with Algorithms and Data Structure - Graphs
Problem Solving with Algorithms and Data Structure - Graphs
 
Image trnsformations
Image trnsformationsImage trnsformations
Image trnsformations
 
Low rank tensor approximation of probability density and characteristic funct...
Low rank tensor approximation of probability density and characteristic funct...Low rank tensor approximation of probability density and characteristic funct...
Low rank tensor approximation of probability density and characteristic funct...
 
Q-Metrics in Theory And Practice
Q-Metrics in Theory And PracticeQ-Metrics in Theory And Practice
Q-Metrics in Theory And Practice
 
Q-Metrics in Theory and Practice
Q-Metrics in Theory and PracticeQ-Metrics in Theory and Practice
Q-Metrics in Theory and Practice
 
Lecture4 kenrels functions_rkhs
Lecture4 kenrels functions_rkhsLecture4 kenrels functions_rkhs
Lecture4 kenrels functions_rkhs
 
Inria Tech Talk - La classification de données complexes avec MASSICCC
Inria Tech Talk - La classification de données complexes avec MASSICCCInria Tech Talk - La classification de données complexes avec MASSICCC
Inria Tech Talk - La classification de données complexes avec MASSICCC
 
Clustering in Hilbert geometry for machine learning
Clustering in Hilbert geometry for machine learningClustering in Hilbert geometry for machine learning
Clustering in Hilbert geometry for machine learning
 
Distributed solution of stochastic optimal control problem on GPUs
Distributed solution of stochastic optimal control problem on GPUsDistributed solution of stochastic optimal control problem on GPUs
Distributed solution of stochastic optimal control problem on GPUs
 
Gate Computer Science Solved Paper 2007
Gate Computer Science Solved Paper 2007 Gate Computer Science Solved Paper 2007
Gate Computer Science Solved Paper 2007
 
QMC Error SAMSI Tutorial Aug 2017
QMC Error SAMSI Tutorial Aug 2017QMC Error SAMSI Tutorial Aug 2017
QMC Error SAMSI Tutorial Aug 2017
 
Automatic bayesian cubature
Automatic bayesian cubatureAutomatic bayesian cubature
Automatic bayesian cubature
 
Fourier_Pricing_ICCF_2022.pdf
Fourier_Pricing_ICCF_2022.pdfFourier_Pricing_ICCF_2022.pdf
Fourier_Pricing_ICCF_2022.pdf
 
Hierarchical matrices for approximating large covariance matries and computin...
Hierarchical matrices for approximating large covariance matries and computin...Hierarchical matrices for approximating large covariance matries and computin...
Hierarchical matrices for approximating large covariance matries and computin...
 
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
 

Recently uploaded

Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfUnleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Enterprise Wired
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
dwreak4tg
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
u86oixdj
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
jerlynmaetalle
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTESAdjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Subhajit Sahu
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
apvysm8
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
GetInData
 

Recently uploaded (20)

Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfUnleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTESAdjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
 

Graph Kernels for Chemical Informatics

  • 1. Graph Kernels for Chemical Informatics Fall 2015 Data Lunch Seminar Mukund Raj 10th Nov, 2015 1 / 48
  • 2. Outline 1 Introduction 2 Expressiveness versus complexity 3 Walk kernels 4 Conclusion and future directions 5 Data depth for labeled graph ensembles 2 / 48
  • 3. Liband-Based Virtual Screening1 Objective Build models to predict biochemical properties of small molecules from their structures. Structures C15H14CIN3O3 Properties toxicity pharmacokinetics (absorption) binding to target 1 Slide from: Jean-Phillipe Vert, ParisTech 3 / 48
  • 4. Example1 NCI AIDS screen results (from http://cactus.nci.nih.gov) 1 Slide from: Jean-Phillipe Vert, ParisTech 4 / 48
  • 5. Chemical space2 Stars Small Molecules Existing 1022 107 Virtual 0 1060 Access Difficult “Easy” 2 Slide from: Pierre Baldi, UC Irvine 5 / 48
  • 6. Formalization Problem statement Given a set of training instances (x1, y1), . . . , (xn, yn), where : xi ’s are graphs and yi ’s are continuous or discrete variables of interest. Estimate a function y = f (x) where x is any graph. 6 / 48
  • 7. Classical Approaches1 Classical approaches 1 Map each molecule to a vector of fixed dimension. 2 Apply an algorithm for regression or classification over vectors. Example: 2D structural keys in chemoinformatics Then use NN, decision tree, least squares e.t.c 1 Slide from: Jean-Phillipe Vert, ParisTech 7 / 48
  • 8. Difficulties Expressiveness of features (which features are relevant?) Large dimension of vector representation. 8 / 48
  • 9. The kernel trick Kernel Let φ(x) be a vector representation of the graph x. The kernel between two graphs is defined by: K(x, x ) = φ(x)T φ(x ). Many linear algorithms can be expressed only in terms of inner products between vectors. Often computing kernel is more efficient than computing φ(x). 9 / 48
  • 10. Kernel trick example: computing distances in the feature space1 1 Slide from: Jean-Phillipe Vert, ParisTech 10 / 48
  • 11. Positive definite (p.d.) Kernels Definition A positive definite (p.d.) kernel on a set χ is a function K : χ × χ → R that is symmetric and satisfies, for all N ∈ N, (x1, x2, . . . , xn) ∈ χN and (a1, a2, . . . , an) ∈ RN: N i=1 N j=1 ai aj K(xi , xj ) ≥ 0 11 / 48
  • 12. Positive definite kernels are inner products1 Mercer’s property K is a p.d. kernel on the set χ if and only if there exists a Hilbert space H and a mapping φ : χ → H, such that, for any x, x’ in χ: K(x, x ) = φ(x), φ(x ) H 1 Slide from: Jean-Phillipe Vert, ParisTech 12 / 48
  • 13. Graph kernels 1 Definition A graph kernel K(x, x ) is a p.d. kernel over the set of (labeled) graphs. It is equivalent to an embedding φ : χ → H of the set of graphs to a Hilbert space through the relation: K(x, x ) = φ(x)T φ(x ). 1 Slide from: Jean-Phillipe Vert, ParisTech 13 / 48
  • 14. Clarification Descriptors and kernels in chemoinformatics 1D- SMILES strings 2D- Graph of chemical bonds 2.5D- Surfaces 3D- Atomic coordinates 4D- Temporal evolution 14 / 48
  • 15. Outline 1 Introduction 2 Expressiveness versus complexity 3 Walk kernels 4 Conclusion and future directions 5 Data depth for labeled graph ensembles 15 / 48
  • 16. Expressibility versus complexity Definition: Complete graph kernels A graph kernel is complete if it separates nonisomorphic graphs. 16 / 48
  • 17. Expressibility versus complexity Definition: Complete graph kernels A graph kernel is complete if it separates nonisomorphic graphs. Graph isomorphism Figure from: Wikipedia 17 / 48
  • 18. Expressibility versus complexity Definition: Complete graph kernels A graph kernel is complete if it separates nonisomorphic graphs. Implication If graph kernel not complete, then it cannot differentiate all nonisomorphic graphs. 18 / 48
  • 19. Expressibility versus complexity Definition: Complete graph kernels A graph kernel is complete if it separates nonisomorphic graphs. Implication If graph kernel not complete, then it cannot differentiate all nonisomorphic graphs. Tractability Computing any complete graph kernel is at least as hard as the graph isomorphism problem (Gartner et al, 2003) 19 / 48
  • 20. Subgraph kernel Definition: Subgraph A subgraph of a graph (V , E) is a graph (V , E ) with V ⊂ V and E ⊂ E. 20 / 48
  • 21. Subgraph kernel Definition: Subgraph A subgraph of a graph (V , E) is a graph (V , E ) with V ⊂ V and E ⊂ E. Definition: Subgraph kernel Ksubgraph(G1, G2) = H∈χ λHφH(G1)φH(G2). where H ⊂ χ, λH is weight associated with H and φH(Gx ) returns the number of occurrences of H in Gx . 21 / 48
  • 22. Subgraph kernel Definition: Subgraph A subgraph of a graph (V , E) is a graph (V , E ) with V ⊂ V and E ⊂ E. Definition: Subgraph kernel Ksubgraph(G1, G2) = H∈χ λHφH(G1)φH(G2). where H ⊂ χ, λH is weight associated with H and φH(Gx ) returns the number of occurrences of H in Gx . Subgraph kernel complexity Computing the subgraph kernel is NP hard (Gartner et.al. 2003) 22 / 48
  • 23. Path kernels Definition: Path A path of a graph (V,E) is a sequence of distinct vertices such that consecutive vertices share an edge. 23 / 48
  • 24. Path kernels Definition: Path A path of a graph (V,E) is a sequence of distinct vertices such that consecutive vertices share an edge. Definition: Path kernel Kpath(G1, G2) = H∈P λHφH(G1)φH(G2). where P ⊂ χ is the set of path graphs. 24 / 48
  • 25. Path kernels Definition: Path A path of a graph (V,E) is a sequence of distinct vertices such that consecutive vertices share an edge. Definition: Path kernel Kpath(G1, G2) = H∈P λHφH(G1)φH(G2). where P ⊂ χ is the set of path graphs. Path kernel complexity Computing the path kernel is NP hard (Gartner et.al. 2003) 25 / 48
  • 26. Outline 1 Introduction 2 Expressiveness versus complexity 3 Walk kernels 4 Conclusion and future directions 5 Data depth for labeled graph ensembles 26 / 48
  • 27. Walks Definition A walk of a graph (V,E) is a sequence of distinct vertices such that consecutive vertices share an edge. Edge cannot appear in path only once. 27 / 48
  • 28. Walks Definition A walk of a graph (V,E) is a sequence of distinct vertices such that consecutive vertices share an edge. Edge cannot appear in path only once. Definition: Walk kernel Kwalk(G1, G2) = w∈S λw φw (G1)φw (G2). where S is the set of all walks and φw (Gx ) returns the count of walk w in Gx . 28 / 48
  • 29. Walk kernel examples nth order walk kernel λG (w) = 1 if the length of w is n, 0 other wise. 29 / 48
  • 30. Walk kernel examples nth order walk kernel λG (w) = 1 if the length of w is n, 0 other wise. Geometric walk kernel λG (w) = βlength(w), for β > 0 30 / 48
  • 31. Walk kernel examples nth order walk kernel λG (w) = 1 if the length of w is n, 0 other wise. Geometric walk kernel λG (w) = βlength(w), for β > 0 Random walk kernel λG (w) = PG (w) 31 / 48
  • 32. Walk kernel examples nth order walk kernel λG (w) = 1 if the length of w is n, 0 other wise. Geometric walk kernel λG (w) = βlength(w), for β > 0 Random walk kernel λG (w) = PG (w) Fingerprint based kernels Dot product kernel Tanimoto kernel MinMax kernel 32 / 48
  • 33. Computation of walk kernels Yay! All the above walk kernels can be computed efficiently in polynomial time. 33 / 48
  • 34. Computation of n-th order walk kernel (1/2)1 Product graphs Let G1 = (V1, E1) and G2 = (V1, E2). Then product graph G = G1 × G2 is the graph G = (V , E) with: 1 V = {(v1, v2) ∈ V1 × V2: v1 and v2 have same label}, 2 E = { (v1, v2), (v1, v2) ∈ V × V : (v1, v1) ∈ E1 and (v2, v2) ∈ E2} 1 Slide from: Jean-Phillipe Vert, ParisTech 34 / 48
  • 35. Computation of n-th order walk kernel (2/2)1 For nth order walk kernel we have λG1×G2 = 1 if length of w is n, 0 otherwise. Therefore: Knth−order (G1, G2) = w∈Sn(G1×G2) 1 = i,j [An ]i,j = 1T An 1. Computation in O(n|G1||G2|d1d2), where di is the maximum degree of Gi . 1 Slide from: Jean-Phillipe Vert, ParisTech 35 / 48
  • 36. Traditional molecular fingerprints Bit vectors of size ( usually = 512 or 1024) Steps (as summarized in Ralaivola et al. 2005) 1 DFS exploration from each atom to get set of walks 2 Each path initializes a random number generator to form b integers. 3 b integers reduced molulo then used to set corresponding bits in fingerprint vector. Complexity O(nm) or O(nαd ), where n := # atoms and m := #edges, α := branching factor and d := depth of walk 36 / 48
  • 37. Generalized molecular fingerprints Avoids clashes/information loss with reserved bit positions. Let P(d) be set of all atom-bond labeled path containing max d bonds. Binary feature map given depth d: φd (u) = φpath(u) path∈P(d) Binary feature map given depth d and fixed vector size : φd (u) = φγ (path)(u) path∈P(d) where γ :→ {1, . . . , }b 37 / 48
  • 38. Fingerprint based kernel (Ralaivola et.al. 2005)2 Complexity O(d(n1m1 + n2m2)) using suffix tree data structure. 2 Slide from: Pierre Baldi, UC Irvine 38 / 48
  • 39. Extensions for walk kernels Label enrichment Non tottering walk 3D kernels Mutual information in fingerprint construction 39 / 48
  • 40. Results(Mahe et al., 2005, Ralaivola et al, 2005) MUTAG Dataset Collection of 188 compounds. Classification of mutagenic activity :high(125) or none(63), as assayed in Salmonella typhimurium. Method Accuracy Progol1(1D) 81.4 % Random walk kernel (2D) 91.2 % MinMax kernel (2D) 91.5% 40 / 48
  • 41. Outline 1 Introduction 2 Expressiveness versus complexity 3 Walk kernels 4 Conclusion and future directions 5 Data depth for labeled graph ensembles 41 / 48
  • 42. Conclusion Summary Extension of ML algorithms to graph data using definition of positive definite kernels. Two classes of 2D kernels for chemical molecule structures. What next? Can we use graph kernel machinery for computing depth. 42 / 48
  • 43. Outline 1 Introduction 2 Expressiveness versus complexity 3 Walk kernels 4 Conclusion and future directions 5 Data depth for labeled graph ensembles 43 / 48
  • 44. Data depth What is depth function? A depth function is designed to provide a P-based center outward ordering( and thus a ranking) for ensemble of data objects drawn from any arbitrary distribution P. Taxonomy of data depth definitions (Mosler 2012) Distance based depth functions Simplex/Halfspace based depth Weighted mean based depth 44 / 48
  • 45. Band depth Band depth: A type of simplex based data depth method. Many definitions for various kinds of data. Functions Multivariate functions Paths on graph 45 / 48
  • 46. Band formed by graphs When alignment is known.. Graphs → Adjacency matrices → Functions Alignment: A mapping from θ : VGx → VGy , where Gx and Gy are any two graphs. When alignment is unknown.. ???? 46 / 48
  • 47. Product graphs! V = {(v1, v2) ∈ V1 × V2: v1 and v2 have same label}, E = { (v1, v2), (v1, v2) ∈ V × V : (v1, v1) ∈ E1 and (v2, v2) ∈ E2} Weak Direct Product (aka Tensor product or Kronecker product..) E× = { (v1, v2), (v1, v2) ∈ V × V : (v1, v1) ∈ E1 and (v2, v2) ∈ E2} Strong Product E = E× ∪ E where E = { (v1, v2), (v1, v2) ∈ V × V : v1 = v1 and (v2, v2) ∈ E2} or v2 = v2 and (v1, v1) ∈ E1} 47 / 48
  • 48. References Ralaivola, Liva, Sanjay J. Swamidass, Hiroto Saigo, and Pierre Baldi. ”Graph kernels for chemical informatics.” Neural Networks 18, no. 8 (2005): 1093-1110. Mahe, Pierre, et al. ”Graph kernels for molecular structure-activity relationship analysis with support vector machines.” Journal of chemical information and modeling 45.4 (2005): 939-951. Gartner, Thomas, Peter Flach, and Stefan Wrobel. ”On graph kernels: Hardness results and efficient alternatives.” Learning Theory and Kernel Machines. Springer Berlin Heidelberg, 2003. 129-143. http://videolectures.net/site/normal_dl/tag=9127/ gbr07_vert_ckac_01.pdf http: //www.ics.uci.edu/~dock/upload/UCI_CHEM_05.ppt 48 / 48