SlideShare a Scribd company logo
1 of 27
Download to read offline
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
A two-step method to incorporate task features
for large output spaces
Michiel Stock1, Tapio Pahikkala2, Antti Airola2, Bernard
De Baets1 & Willem Waegeman1
1KERMIT
Department of Mathematical Modelling, Statistics and Bioinformatics
Ghent University
2Department of Computer Science
University of Turku
NIPS: extreme classification workshop
December 12, 2015
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
What will we read next?
5 4
1 4
4
2 1
4 3
Alice Bob Cedric Daphne
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
What will we read next?
5 4
1 4
4
2 1
4 3
Alice Bob Cedric Daphne
Social
graph
Genre
1 1 0 1
0 0 1 0
1 1 0 0
0 0 1 0
0 1 0 1
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
What will we read next?
5 2.3 4 3.1
1 4.5 1.3 4
3.9 4 3.8 0.8
2 5.2 1 4.5
4 2.5 3 3.6
Alice Bob Cedric Daphne
Social
graph
Genre
1 1 0 1
0 0 1 0
1 1 0 0
0 0 1 0
0 1 0 1
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
What will we read next?
5 2.3 4 3.1
1 4.5 1.3 4
3.9 4 3.8 0.8
2 5.2 1 4.5
4 2.5 3 3.6
Alice Bob Cedric Daphne
Social
graph
Genre
1 1 0 1
0 0 1 0
1 1 0 0
0 0 1 0
0 1 0 1
4.8 1.1 3.7 2.31 1 0 1
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
What will we read next?
5 2.3 4 3.1
1 4.5 1.3 4
3.9 4 3.8 0.8
2 5.2 1 4.5
4 2.5 3 3.6
Alice Bob Cedric Daphne
Social
graph
Genre
1 1 0 1
0 0 1 0
1 1 0 0
0 0 1 0
0 1 0 1
4.8 1.1 3.7 2.31 1 0 1
2.3
4.0
1.7
4.8
2.9
Eric
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
What will we read next?
5 2.3 4 3.1
1 4.5 1.3 4
3.9 4 3.8 0.8
2 5.2 1 4.5
4 2.5 3 3.6
Alice Bob Cedric Daphne
Social
graph
Genre
1 1 0 1
0 0 1 0
1 1 0 0
0 0 1 0
0 1 0 1
4.8 1.1 3.7 2.31 1 0 1
2.3
4.0
1.7
4.8
2.9
Eric
2.4
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
Learning relations
Setting A
Setting B
Setting C
Setting D
Training
In-sample
tasks
Out-of-sample
tasks
Out-of-
sample
instances
In-sample
instances
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
Other cool applications: drug design
Predicting interaction between proteins and small compounds
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
Other cool applications: social network analysis
Predicting links between people
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
Other cool applications: food pairing
Finding ingredients that pair well
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
Learning with pairwise feature representations
Features
books
Features
readers
d : instance (e.g. book)
φ(d) : instance features
(e.g. genre)
t : task (e.g. reader)
ψ(t) : task features (e.g.
social network)
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
Learning with pairwise feature representations
Features
books
Features
readers
⌦
⌦ =
d : instance (e.g. book)
φ(d) : instance features
(e.g. genre)
t : task (e.g. reader)
ψ(t) : task features (e.g.
social network)
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
Learning with pairwise feature representations
Features
books
Features
readers
⌦
⌦ =
d : instance (e.g. book)
φ(d) : instance features
(e.g. genre)
t : task (e.g. reader)
ψ(t) : task features (e.g.
social network)
Pairwise prediction function: f (d, t) = w (φ(d) ⊗ ψ(t))
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
Learning relations in two steps
In-sample
tasks
Out-of-sample
tasks
Task KRR
Instance
KRR
Virtual instances
In-sample
instances
Out-of-
sample
instances
1 Build a ridge
regression model to
generalize to new
instances
2 Build a ridge
regression model to
generalize to new
tasks
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
The two-step ridge regression
Prediction function:
f (d, t) = φ(d) Wψ(t)
Parameters can be found by solving:
Φ YΨ = (Φ Φ + λd I)W(Ψ Ψ + λtI)
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
The two-step ridge regression
Prediction function:
f (d, t) = φ(d) Wψ(t)
Parameters can be found by solving:
Φ YΨ = (Φ Φ + λd I)W(Ψ Ψ + λtI)
Two hyperparameters: λd and λt!
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
Four ways of cross validation
Setting A Setting B
Setting DSetting C
Train
Test
Discarded
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
Four ways of cross validation
Setting A Setting B
Setting DSetting C
Train
Test
Discarded
Analytic shortcuts
can be derived to
perform LOOCV for
each setting!
Tuning λd and λt
essentially free!
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
Effect of regularization for the four settings
Data: protein-ligand interactions.
Evaluation by AUC (lighter = better performance)
lambda drugs
lambdatargets
0.550
0.600
0.6500.700
0.750
0.800
0.800
0.800
0.850nr data
CV for Setting A
lambda drugs
lambdatargets
0.560
0.6000.640
0.680
0.720
0.760
nr data
CV for Setting B
lambda drugs
lambdatargets
0.790
0.800
0.800
0.810
0.810
0.820
0.820
0.830
0.830
0.840
0.850
nr data
CV for Setting C
lambda drugs
lambdatargets
0.600
0.625
0.625
0.650
0.6750.700
0.725
nr data
CV for Setting D
0.50
0.55
0.60
0.65
0.70
0.75
0.80
0.85
0.90
0.95
1.00
Clear difference between four settings and λd and λt!
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
Learning with mini-batches
Initial
training data
New training
instances
Even more training
instances
Tasks
Instances
Newtrainingtasks
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
Learning with mini-batches
Initial
training data
New training
instances
Even more training
instances
Tasks
Instances
Newtrainingtasks
Exact updating of the
parameters when new
training instances and/or
taks become available
scalable for “Big
Data” applications
updating model in
dynamic
environment
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
Exact online learning for hierarchical text
classification
Hierarchical text classification (> 12, 000 labels): from 5,000
to 350,000 instances in steps of 1,000 instances.
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
Why two-step ridge regression?
Zero-shot learning, transfer learning, multi-task learning...
in one line of code
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
Why two-step ridge regression?
Zero-shot learning, transfer learning, multi-task learning...
in one line of code
Theoretically well founded
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
Why two-step ridge regression?
Zero-shot learning, transfer learning, multi-task learning...
in one line of code
Theoretically well founded
Allows for nifty computational tricks
‘free’ tuning for the hyperparameters
‘free’ LOOCV for all four settings!
closed-form solution for updating with mini-batches
A two-step
method for
large output
spaces
Michiel Stock
twitter:
@michielstock
Motivation
Introductory
example
Relational
learning
Other
applications
Pairwise
learning
methods
Kronecker kernel
ridge regression
Two-step kernel
ridge regression
Computational
aspects
Cross-validation
Exact online
learning
Take home
messages
KERMIT
A two-step method to incorporate task features
for large output spaces
Michiel Stock1, Tapio Pahikkala2, Antti Airola2, Bernard
De Baets1 & Willem Waegeman1
1KERMIT
Department of Mathematical Modelling, Statistics and Bioinformatics
Ghent University
2Department of Computer Science
University of Turku
NIPS: extreme classification workshop
December 12, 2015

More Related Content

Similar to A two-step method for large output spaces: pairwise learning and cross-validation shortcuts

Symbolic Background Knowledge for Machine Learning
Symbolic Background Knowledge for Machine LearningSymbolic Background Knowledge for Machine Learning
Symbolic Background Knowledge for Machine LearningSteffen Staab
 
Deep Learning And Business Models (VNITC 2015-09-13)
Deep Learning And Business Models (VNITC 2015-09-13)Deep Learning And Business Models (VNITC 2015-09-13)
Deep Learning And Business Models (VNITC 2015-09-13)Ha Phuong
 
VIZBI 2015 Tutorial: Cytoscape, IPython, Docker, and Reproducible Network Dat...
VIZBI 2015 Tutorial: Cytoscape, IPython, Docker, and Reproducible Network Dat...VIZBI 2015 Tutorial: Cytoscape, IPython, Docker, and Reproducible Network Dat...
VIZBI 2015 Tutorial: Cytoscape, IPython, Docker, and Reproducible Network Dat...Keiichiro Ono
 
The deep learning tour - Q1 2017
The deep learning tour - Q1 2017 The deep learning tour - Q1 2017
The deep learning tour - Q1 2017 Eran Shlomo
 
Developing Computational Skills in the Sciences with Matlab Webinar 2017
Developing Computational Skills in the Sciences with Matlab Webinar 2017Developing Computational Skills in the Sciences with Matlab Webinar 2017
Developing Computational Skills in the Sciences with Matlab Webinar 2017SERC at Carleton College
 
A Modern Introduction to Decision Tree Ensembles
A Modern Introduction to Decision Tree EnsemblesA Modern Introduction to Decision Tree Ensembles
A Modern Introduction to Decision Tree EnsemblesIchigaku Takigawa
 
Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt
Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.pptProto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt
Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.pptAnirbanBhar3
 
Current clustering techniques
Current clustering techniquesCurrent clustering techniques
Current clustering techniquesPoonam Kshirsagar
 
An Online Spark Pipeline: Semi-Supervised Learning and Automatic Retraining w...
An Online Spark Pipeline: Semi-Supervised Learning and Automatic Retraining w...An Online Spark Pipeline: Semi-Supervised Learning and Automatic Retraining w...
An Online Spark Pipeline: Semi-Supervised Learning and Automatic Retraining w...Databricks
 
A Practical Approach to Design, Implementation, and Management A Practical Ap...
A Practical Approach to Design, Implementation, and Management A Practical Ap...A Practical Approach to Design, Implementation, and Management A Practical Ap...
A Practical Approach to Design, Implementation, and Management A Practical Ap...Cynthia Velynne
 
ChenXin_Daniel_Han
ChenXin_Daniel_HanChenXin_Daniel_Han
ChenXin_Daniel_HanDaniel Han
 
What Academia Can Learn from Open Source
What Academia Can Learn from Open SourceWhat Academia Can Learn from Open Source
What Academia Can Learn from Open SourceAll Things Open
 
A computational scientist's wish list for tomorrow's computing systems
A computational scientist's wish list for tomorrow's computing systemsA computational scientist's wish list for tomorrow's computing systems
A computational scientist's wish list for tomorrow's computing systemskhinsen
 
Sentiment Analysis In Retail Domain
Sentiment Analysis In Retail DomainSentiment Analysis In Retail Domain
Sentiment Analysis In Retail DomainEdureka!
 
Download presentation source
Download presentation sourceDownload presentation source
Download presentation sourcebutest
 
Cambridgestructuraldesignseminar
CambridgestructuraldesignseminarCambridgestructuraldesignseminar
Cambridgestructuraldesignseminarpdemian
 
Lancaster UCREL Summer School 2017 - Big Data and NLP
Lancaster UCREL Summer School 2017 - Big Data and NLPLancaster UCREL Summer School 2017 - Big Data and NLP
Lancaster UCREL Summer School 2017 - Big Data and NLPDaniel Kershaw
 

Similar to A two-step method for large output spaces: pairwise learning and cross-validation shortcuts (20)

Symbolic Background Knowledge for Machine Learning
Symbolic Background Knowledge for Machine LearningSymbolic Background Knowledge for Machine Learning
Symbolic Background Knowledge for Machine Learning
 
Deep Learning And Business Models (VNITC 2015-09-13)
Deep Learning And Business Models (VNITC 2015-09-13)Deep Learning And Business Models (VNITC 2015-09-13)
Deep Learning And Business Models (VNITC 2015-09-13)
 
VIZBI 2015 Tutorial: Cytoscape, IPython, Docker, and Reproducible Network Dat...
VIZBI 2015 Tutorial: Cytoscape, IPython, Docker, and Reproducible Network Dat...VIZBI 2015 Tutorial: Cytoscape, IPython, Docker, and Reproducible Network Dat...
VIZBI 2015 Tutorial: Cytoscape, IPython, Docker, and Reproducible Network Dat...
 
My lectures
My lecturesMy lectures
My lectures
 
The deep learning tour - Q1 2017
The deep learning tour - Q1 2017 The deep learning tour - Q1 2017
The deep learning tour - Q1 2017
 
Developing Computational Skills in the Sciences with Matlab Webinar 2017
Developing Computational Skills in the Sciences with Matlab Webinar 2017Developing Computational Skills in the Sciences with Matlab Webinar 2017
Developing Computational Skills in the Sciences with Matlab Webinar 2017
 
A Modern Introduction to Decision Tree Ensembles
A Modern Introduction to Decision Tree EnsemblesA Modern Introduction to Decision Tree Ensembles
A Modern Introduction to Decision Tree Ensembles
 
Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt
Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.pptProto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt
Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt
 
Current clustering techniques
Current clustering techniquesCurrent clustering techniques
Current clustering techniques
 
An Online Spark Pipeline: Semi-Supervised Learning and Automatic Retraining w...
An Online Spark Pipeline: Semi-Supervised Learning and Automatic Retraining w...An Online Spark Pipeline: Semi-Supervised Learning and Automatic Retraining w...
An Online Spark Pipeline: Semi-Supervised Learning and Automatic Retraining w...
 
A Practical Approach to Design, Implementation, and Management A Practical Ap...
A Practical Approach to Design, Implementation, and Management A Practical Ap...A Practical Approach to Design, Implementation, and Management A Practical Ap...
A Practical Approach to Design, Implementation, and Management A Practical Ap...
 
ChenXin_Daniel_Han
ChenXin_Daniel_HanChenXin_Daniel_Han
ChenXin_Daniel_Han
 
What Academia Can Learn from Open Source
What Academia Can Learn from Open SourceWhat Academia Can Learn from Open Source
What Academia Can Learn from Open Source
 
A computational scientist's wish list for tomorrow's computing systems
A computational scientist's wish list for tomorrow's computing systemsA computational scientist's wish list for tomorrow's computing systems
A computational scientist's wish list for tomorrow's computing systems
 
Sentiment Analysis In Retail Domain
Sentiment Analysis In Retail DomainSentiment Analysis In Retail Domain
Sentiment Analysis In Retail Domain
 
CSE509 Lecture 5
CSE509 Lecture 5CSE509 Lecture 5
CSE509 Lecture 5
 
Download presentation source
Download presentation sourceDownload presentation source
Download presentation source
 
Cambridgestructuraldesignseminar
CambridgestructuraldesignseminarCambridgestructuraldesignseminar
Cambridgestructuraldesignseminar
 
A Day In The Life V4.1
A Day In The Life V4.1A Day In The Life V4.1
A Day In The Life V4.1
 
Lancaster UCREL Summer School 2017 - Big Data and NLP
Lancaster UCREL Summer School 2017 - Big Data and NLPLancaster UCREL Summer School 2017 - Big Data and NLP
Lancaster UCREL Summer School 2017 - Big Data and NLP
 

More from Michiel Stock

Wiskunde voor Waterbeheer
Wiskunde voor WaterbeheerWiskunde voor Waterbeheer
Wiskunde voor WaterbeheerMichiel Stock
 
How the mathematics behind Netflix will save the world
How the mathematics behind Netflix will save the worldHow the mathematics behind Netflix will save the world
How the mathematics behind Netflix will save the worldMichiel Stock
 
Disentangling ecological networks using graph embedding methods
Disentangling ecological networks using graph embedding methodsDisentangling ecological networks using graph embedding methods
Disentangling ecological networks using graph embedding methodsMichiel Stock
 
2018 presentation montréal_handouts
2018 presentation montréal_handouts2018 presentation montréal_handouts
2018 presentation montréal_handoutsMichiel Stock
 
A tour in optimal transport
A tour in optimal transportA tour in optimal transport
A tour in optimal transportMichiel Stock
 
Pairwise Learning for Synthetic Biology
Pairwise Learning for Synthetic BiologyPairwise Learning for Synthetic Biology
Pairwise Learning for Synthetic BiologyMichiel Stock
 
PhD defence pairwise learning
PhD defence pairwise learningPhD defence pairwise learning
PhD defence pairwise learningMichiel Stock
 
Bioscience engineering together: participating at iGEM
Bioscience engineering together: participating at iGEMBioscience engineering together: participating at iGEM
Bioscience engineering together: participating at iGEMMichiel Stock
 
Exact and efficient top-K inference for multi-target prediction by querying s...
Exact and efficient top-K inference for multi-target prediction by querying s...Exact and efficient top-K inference for multi-target prediction by querying s...
Exact and efficient top-K inference for multi-target prediction by querying s...Michiel Stock
 
Poster genome engineering & Synthetic Biology 2016
Poster genome engineering & Synthetic Biology 2016Poster genome engineering & Synthetic Biology 2016
Poster genome engineering & Synthetic Biology 2016Michiel Stock
 
Kernel Methods and Relational Learning in Computational Biology
Kernel Methods and Relational Learning in Computational BiologyKernel Methods and Relational Learning in Computational Biology
Kernel Methods and Relational Learning in Computational BiologyMichiel Stock
 
Enzyme Annotation using Conditional Ranking Algorithms
Enzyme Annotation using Conditional Ranking AlgorithmsEnzyme Annotation using Conditional Ranking Algorithms
Enzyme Annotation using Conditional Ranking AlgorithmsMichiel Stock
 

More from Michiel Stock (12)

Wiskunde voor Waterbeheer
Wiskunde voor WaterbeheerWiskunde voor Waterbeheer
Wiskunde voor Waterbeheer
 
How the mathematics behind Netflix will save the world
How the mathematics behind Netflix will save the worldHow the mathematics behind Netflix will save the world
How the mathematics behind Netflix will save the world
 
Disentangling ecological networks using graph embedding methods
Disentangling ecological networks using graph embedding methodsDisentangling ecological networks using graph embedding methods
Disentangling ecological networks using graph embedding methods
 
2018 presentation montréal_handouts
2018 presentation montréal_handouts2018 presentation montréal_handouts
2018 presentation montréal_handouts
 
A tour in optimal transport
A tour in optimal transportA tour in optimal transport
A tour in optimal transport
 
Pairwise Learning for Synthetic Biology
Pairwise Learning for Synthetic BiologyPairwise Learning for Synthetic Biology
Pairwise Learning for Synthetic Biology
 
PhD defence pairwise learning
PhD defence pairwise learningPhD defence pairwise learning
PhD defence pairwise learning
 
Bioscience engineering together: participating at iGEM
Bioscience engineering together: participating at iGEMBioscience engineering together: participating at iGEM
Bioscience engineering together: participating at iGEM
 
Exact and efficient top-K inference for multi-target prediction by querying s...
Exact and efficient top-K inference for multi-target prediction by querying s...Exact and efficient top-K inference for multi-target prediction by querying s...
Exact and efficient top-K inference for multi-target prediction by querying s...
 
Poster genome engineering & Synthetic Biology 2016
Poster genome engineering & Synthetic Biology 2016Poster genome engineering & Synthetic Biology 2016
Poster genome engineering & Synthetic Biology 2016
 
Kernel Methods and Relational Learning in Computational Biology
Kernel Methods and Relational Learning in Computational BiologyKernel Methods and Relational Learning in Computational Biology
Kernel Methods and Relational Learning in Computational Biology
 
Enzyme Annotation using Conditional Ranking Algorithms
Enzyme Annotation using Conditional Ranking AlgorithmsEnzyme Annotation using Conditional Ranking Algorithms
Enzyme Annotation using Conditional Ranking Algorithms
 

Recently uploaded

High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...ranjana rawat
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...RajaP95
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingrknatarajan
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝soniya singh
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 

Recently uploaded (20)

High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 

A two-step method for large output spaces: pairwise learning and cross-validation shortcuts

  • 1. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT A two-step method to incorporate task features for large output spaces Michiel Stock1, Tapio Pahikkala2, Antti Airola2, Bernard De Baets1 & Willem Waegeman1 1KERMIT Department of Mathematical Modelling, Statistics and Bioinformatics Ghent University 2Department of Computer Science University of Turku NIPS: extreme classification workshop December 12, 2015
  • 2. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT What will we read next? 5 4 1 4 4 2 1 4 3 Alice Bob Cedric Daphne
  • 3. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT What will we read next? 5 4 1 4 4 2 1 4 3 Alice Bob Cedric Daphne Social graph Genre 1 1 0 1 0 0 1 0 1 1 0 0 0 0 1 0 0 1 0 1
  • 4. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT What will we read next? 5 2.3 4 3.1 1 4.5 1.3 4 3.9 4 3.8 0.8 2 5.2 1 4.5 4 2.5 3 3.6 Alice Bob Cedric Daphne Social graph Genre 1 1 0 1 0 0 1 0 1 1 0 0 0 0 1 0 0 1 0 1
  • 5. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT What will we read next? 5 2.3 4 3.1 1 4.5 1.3 4 3.9 4 3.8 0.8 2 5.2 1 4.5 4 2.5 3 3.6 Alice Bob Cedric Daphne Social graph Genre 1 1 0 1 0 0 1 0 1 1 0 0 0 0 1 0 0 1 0 1 4.8 1.1 3.7 2.31 1 0 1
  • 6. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT What will we read next? 5 2.3 4 3.1 1 4.5 1.3 4 3.9 4 3.8 0.8 2 5.2 1 4.5 4 2.5 3 3.6 Alice Bob Cedric Daphne Social graph Genre 1 1 0 1 0 0 1 0 1 1 0 0 0 0 1 0 0 1 0 1 4.8 1.1 3.7 2.31 1 0 1 2.3 4.0 1.7 4.8 2.9 Eric
  • 7. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT What will we read next? 5 2.3 4 3.1 1 4.5 1.3 4 3.9 4 3.8 0.8 2 5.2 1 4.5 4 2.5 3 3.6 Alice Bob Cedric Daphne Social graph Genre 1 1 0 1 0 0 1 0 1 1 0 0 0 0 1 0 0 1 0 1 4.8 1.1 3.7 2.31 1 0 1 2.3 4.0 1.7 4.8 2.9 Eric 2.4
  • 8. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT Learning relations Setting A Setting B Setting C Setting D Training In-sample tasks Out-of-sample tasks Out-of- sample instances In-sample instances
  • 9. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT Other cool applications: drug design Predicting interaction between proteins and small compounds
  • 10. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT Other cool applications: social network analysis Predicting links between people
  • 11. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT Other cool applications: food pairing Finding ingredients that pair well
  • 12. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT Learning with pairwise feature representations Features books Features readers d : instance (e.g. book) φ(d) : instance features (e.g. genre) t : task (e.g. reader) ψ(t) : task features (e.g. social network)
  • 13. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT Learning with pairwise feature representations Features books Features readers ⌦ ⌦ = d : instance (e.g. book) φ(d) : instance features (e.g. genre) t : task (e.g. reader) ψ(t) : task features (e.g. social network)
  • 14. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT Learning with pairwise feature representations Features books Features readers ⌦ ⌦ = d : instance (e.g. book) φ(d) : instance features (e.g. genre) t : task (e.g. reader) ψ(t) : task features (e.g. social network) Pairwise prediction function: f (d, t) = w (φ(d) ⊗ ψ(t))
  • 15. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT Learning relations in two steps In-sample tasks Out-of-sample tasks Task KRR Instance KRR Virtual instances In-sample instances Out-of- sample instances 1 Build a ridge regression model to generalize to new instances 2 Build a ridge regression model to generalize to new tasks
  • 16. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT The two-step ridge regression Prediction function: f (d, t) = φ(d) Wψ(t) Parameters can be found by solving: Φ YΨ = (Φ Φ + λd I)W(Ψ Ψ + λtI)
  • 17. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT The two-step ridge regression Prediction function: f (d, t) = φ(d) Wψ(t) Parameters can be found by solving: Φ YΨ = (Φ Φ + λd I)W(Ψ Ψ + λtI) Two hyperparameters: λd and λt!
  • 18. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT Four ways of cross validation Setting A Setting B Setting DSetting C Train Test Discarded
  • 19. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT Four ways of cross validation Setting A Setting B Setting DSetting C Train Test Discarded Analytic shortcuts can be derived to perform LOOCV for each setting! Tuning λd and λt essentially free!
  • 20. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT Effect of regularization for the four settings Data: protein-ligand interactions. Evaluation by AUC (lighter = better performance) lambda drugs lambdatargets 0.550 0.600 0.6500.700 0.750 0.800 0.800 0.800 0.850nr data CV for Setting A lambda drugs lambdatargets 0.560 0.6000.640 0.680 0.720 0.760 nr data CV for Setting B lambda drugs lambdatargets 0.790 0.800 0.800 0.810 0.810 0.820 0.820 0.830 0.830 0.840 0.850 nr data CV for Setting C lambda drugs lambdatargets 0.600 0.625 0.625 0.650 0.6750.700 0.725 nr data CV for Setting D 0.50 0.55 0.60 0.65 0.70 0.75 0.80 0.85 0.90 0.95 1.00 Clear difference between four settings and λd and λt!
  • 21. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT Learning with mini-batches Initial training data New training instances Even more training instances Tasks Instances Newtrainingtasks
  • 22. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT Learning with mini-batches Initial training data New training instances Even more training instances Tasks Instances Newtrainingtasks Exact updating of the parameters when new training instances and/or taks become available scalable for “Big Data” applications updating model in dynamic environment
  • 23. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT Exact online learning for hierarchical text classification Hierarchical text classification (> 12, 000 labels): from 5,000 to 350,000 instances in steps of 1,000 instances.
  • 24. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT Why two-step ridge regression? Zero-shot learning, transfer learning, multi-task learning... in one line of code
  • 25. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT Why two-step ridge regression? Zero-shot learning, transfer learning, multi-task learning... in one line of code Theoretically well founded
  • 26. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT Why two-step ridge regression? Zero-shot learning, transfer learning, multi-task learning... in one line of code Theoretically well founded Allows for nifty computational tricks ‘free’ tuning for the hyperparameters ‘free’ LOOCV for all four settings! closed-form solution for updating with mini-batches
  • 27. A two-step method for large output spaces Michiel Stock twitter: @michielstock Motivation Introductory example Relational learning Other applications Pairwise learning methods Kronecker kernel ridge regression Two-step kernel ridge regression Computational aspects Cross-validation Exact online learning Take home messages KERMIT A two-step method to incorporate task features for large output spaces Michiel Stock1, Tapio Pahikkala2, Antti Airola2, Bernard De Baets1 & Willem Waegeman1 1KERMIT Department of Mathematical Modelling, Statistics and Bioinformatics Ghent University 2Department of Computer Science University of Turku NIPS: extreme classification workshop December 12, 2015