SlideShare a Scribd company logo
1 of 28
1
http://chalearn.org/
Causality and
Graph Reconstruction
MLconf 2015
Isabelle Guyon, ChaLearn
2
Motivation
BIG data makes lots of BIG promises, but…
… will the promises be held?
DIFFICULTY
VALUE
Classical statistics Machine learning
What
happened?
How
happened?
Explicative
power
Forecasting
power
http://chalearn.org/
Decisional
power
What will
happen?
What is
Causal graph reconstruction? http://chalearn.org/
4
Problem setting http://chalearn.org/
A
F
I
H
E
B
D G
C
J
INPUT
OUTPUT
5
Causal questions http://chalearn.org/
actions
…your health?
…climate
changes?… the economy?
What affects…
6
Scientific method http://chalearn.org/
7
Thanks to Jonas Peters for this example
Observe correlations http://chalearn.org/
8
Hypothesize causal relationships http://chalearn.org/
Thanks to Jonas Peters for this example
9
Hypothesize causal relationships http://chalearn.org/
Thanks to Jonas Peters for this example
10
Hypothesize causal relationships http://chalearn.org/
Chocolate Nobel
Chocolate Nobel
Chocolate Nobel
Chocolate Nobel
?
11
“Please test your
researchers for ten
years: Randomly pick
half of them and give
them chocolate for
desert and give apples
to the other half. Then
compare the number
of Nobel prizes in the
two populations.”
Perform randomized
controlled experiments http://chalearn.org/
12







How far can we get
to improve causal hypotheses …
… to minimize the need for experiments?
13
• Pioneer work: Glymour, Scheines, Spirtes, Pearl (Turing
Award, 2011), Rubin, in the US, since the 80’s.
• New wave: Hyvärinen, Schölkopf, Bühlmann in the EU.
• Nobel prizes in econometrics: Haavelmo (1989),
Granger (2003), Sargent and Sims (2011).
• DARPA programs: Big mechanisms (2014), upcoming
program (Schwartz, program manager).
Landmark work http://chalearn.org/
14
Game changing work:
Causality challenges http://chalearn.org/
Cause-Effect Pairs (2013)
Neural Connectomics (2014)
Causation and Prediction (2007)
Pot-luck challenge (2008)
15
To make a long story short… http://chalearn.org/
1. Discovering dependencies: easiest = classical
feature selection. Hard to beat!
2. Removing spurious dependencies: harder and
“dangerous” because removing good features
is more harmful than keeping bad ones.
3. Orienting dependencies: hardest.
16
Cause-effect pair challenge
(2013) http://chalearn.org/
Initial impulse: Joris Mooij, Dominik Janzing, and Bernhard Schölkopf.
Examples of algorithms and data: Povilas Daniušis, Arthur Gretton,
Patrik O. Hoyer, Dominik Janzing, Antti Kerminen, Joris Mooij, Jonas
Peters, Bernhard Schölkopf, Shohei Shimizu, Oliver Stegle, and Kun
Zhang, Jakob Zscheischler.
Datasets and result analysis: Isabelle Guyon + Mehreen Saeed +
{Mikael Henaff, Sisi Ma, and Alexander Statnikov}, from NYU.
Website and sample code: Isabelle Guyon +
Phase 1: Ben Hamner (Kaggle) https://www.kaggle.com/c/cause-
effect-pairs
Phase 2: Ivan Judson, Christophe Poulain, Evelyne Viegas,
Michael Zyskowski https://www.codalab.org/competitions/1381
Review, testing: Marc Boullé,
Hugo Jair Escalante, Frederick Eberhardt,
Seth Flaxman, Patrik Hoyer,
Dominik Janzing, Richard Kennaway,
Vincent Lemaire, Joris Mooij,
Jonas Peters, Florin Popescu, Peter Spirtes,
Ioannis Tsamardinos, Jianxin Yin, Kun Zhang.
Mehreen
Evelyne
Joris Dominik
Bernhard
Kun
Ben
Alexander
Marc
17
Problem setting http://chalearn.org/
A
F
I
H
E
B
D G
C
J
INPUT
OUTPUT
18
Problem setting http://chalearn.org/
A
F
I
H
E
B
D G
C
J
INPUT
OUTPUT
A -> B ?
0 / 1
19
B =Temperature
A = log(Altitude)
A  B ? http://chalearn.org/
20
A  B A  B
Best fit: A  B http://chalearn.org/
21
The data:
A
B
Z
A  B
B
A
Z
A <- B
A B
Z
ZBZA
A  Z  B
A B A | B
Demographics:
Sex  Height
Age  Wages
Country  Education
Latitude  Infant mortality
Ecology:
City elevation  Temperature
Water level  Algal frequency
Elevation  Vegetation
Dist. to hydrology  Fire
Econometrics:
Mileage  Car resell price
Num.rooms  House price
Trade price last day  Trade price
Medicine:
Cancer vol.  Recurrence
Metastasis  Prognosis
Age  Blood pressure
Genomics (mRNA level):
transcription factor  protein
induced
Engineering:
Car model year  Horsepower
Number of cylinders  MPG
Cache memory  Compute power
Roof area  Heating load
Cement used  Compressive strength
20% 80%
http://chalearn.org/
22
0.5
0.55
0.6
0.65
0.7
0.75
0.8
0.85
267
test
The results:
http://chalearn.org/
23
Amazing: an operational
causation coefficient!
http://chalearn.org/
24
Neural connectomics
Challenge (2014) http://chalearn.org/
Coordinator:
Isabelle Guyon
Data Providers:
Demian Battaglia
Javier Orlandi
Jordi Soriano Fradera
Olav Stetter
Advisors:
Gavin Cawley
Gideon Dror
Hugo-Jair Escalante
Alice Guyon
Vincent Lemaire
Sisi Ma
Eric Peskin
Florin Popescu
Bisakha Ray,
Mehreen Saeed
Alexander Statnikov
Demian
Olav
Jordi
Javier
Bisakha
Mehreen
25
Problem setting http://chalearn.org/
A
F
I
H
E
B
D G
C
J
INPUT
OUTPUT
26
Network deconvolution http://chalearn.org/
27
Conclusion
• Causal models:
– Better explain data.
– Make decisions.
• Challenges:
– Fair evaluations.
– Innovation.
• Machine Learning:
– Novel approaches to causal discovery.
– Operational “causation coefficient”:
• First detect oriented pairs, then prune indirect effects and confounders.
• First build undirected graph, then orient edges.
28
http://chalearn.org/
Fully automatic machine learning
without ANY human intervention
automl.chalearn.org
December 2014 – January 2016
$30,000 in prizes
Thank you!
AutoML Challenge

More Related Content

More from MLconf

Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceMLconf
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...MLconf
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...MLconf
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMLconf
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionMLconf
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLMLconf
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksMLconf
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...MLconf
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldMLconf
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...MLconf
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...MLconf
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...MLconf
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeMLconf
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...MLconf
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareMLconf
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesMLconf
 
Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...
Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...
Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...MLconf
 
Niels Bantilan - Augmenting Mental Health Care in the Digital Age: Machine Le...
Niels Bantilan - Augmenting Mental Health Care in the Digital Age: Machine Le...Niels Bantilan - Augmenting Mental Health Care in the Digital Age: Machine Le...
Niels Bantilan - Augmenting Mental Health Care in the Digital Age: Machine Le...MLconf
 
LeAnna Kent - Using Network Analysis to Detect Kickback Schemes Among Medical...
LeAnna Kent - Using Network Analysis to Detect Kickback Schemes Among Medical...LeAnna Kent - Using Network Analysis to Detect Kickback Schemes Among Medical...
LeAnna Kent - Using Network Analysis to Detect Kickback Schemes Among Medical...MLconf
 
Liliana Cruz Lopez - Deep Reinforcement Learning based Insulin Controller for...
Liliana Cruz Lopez - Deep Reinforcement Learning based Insulin Controller for...Liliana Cruz Lopez - Deep Reinforcement Learning based Insulin Controller for...
Liliana Cruz Lopez - Deep Reinforcement Learning based Insulin Controller for...MLconf
 

More from MLconf (20)

Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious Experience
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the Cheap
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data Collection
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of ML
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI World
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to code
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better Software
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime Changes
 
Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...
Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...
Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...
 
Niels Bantilan - Augmenting Mental Health Care in the Digital Age: Machine Le...
Niels Bantilan - Augmenting Mental Health Care in the Digital Age: Machine Le...Niels Bantilan - Augmenting Mental Health Care in the Digital Age: Machine Le...
Niels Bantilan - Augmenting Mental Health Care in the Digital Age: Machine Le...
 
LeAnna Kent - Using Network Analysis to Detect Kickback Schemes Among Medical...
LeAnna Kent - Using Network Analysis to Detect Kickback Schemes Among Medical...LeAnna Kent - Using Network Analysis to Detect Kickback Schemes Among Medical...
LeAnna Kent - Using Network Analysis to Detect Kickback Schemes Among Medical...
 
Liliana Cruz Lopez - Deep Reinforcement Learning based Insulin Controller for...
Liliana Cruz Lopez - Deep Reinforcement Learning based Insulin Controller for...Liliana Cruz Lopez - Deep Reinforcement Learning based Insulin Controller for...
Liliana Cruz Lopez - Deep Reinforcement Learning based Insulin Controller for...
 

Recently uploaded

Choreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringChoreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringWSO2
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Decarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational PerformanceDecarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational PerformanceIES VE
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Modernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaModernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaWSO2
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAnitaRaj43
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
API Governance and Monetization - The evolution of API governance
API Governance and Monetization -  The evolution of API governanceAPI Governance and Monetization -  The evolution of API governance
API Governance and Monetization - The evolution of API governanceWSO2
 
JavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuideJavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuidePixlogix Infotech
 
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...WSO2
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)Samir Dash
 
Simplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxSimplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxMarkSteadman7
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard37
 
Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...caitlingebhard1
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 

Recently uploaded (20)

Choreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringChoreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software Engineering
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Decarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational PerformanceDecarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational Performance
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Modernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaModernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using Ballerina
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
API Governance and Monetization - The evolution of API governance
API Governance and Monetization -  The evolution of API governanceAPI Governance and Monetization -  The evolution of API governance
API Governance and Monetization - The evolution of API governance
 
JavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuideJavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate Guide
 
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
Simplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxSimplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptx
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 

Isabelle Guyon, President, ChaLearn at MLconf SF - 11/13/15

  • 2. 2 Motivation BIG data makes lots of BIG promises, but… … will the promises be held? DIFFICULTY VALUE Classical statistics Machine learning What happened? How happened? Explicative power Forecasting power http://chalearn.org/ Decisional power What will happen?
  • 3. What is Causal graph reconstruction? http://chalearn.org/
  • 5. 5 Causal questions http://chalearn.org/ actions …your health? …climate changes?… the economy? What affects…
  • 7. 7 Thanks to Jonas Peters for this example Observe correlations http://chalearn.org/
  • 8. 8 Hypothesize causal relationships http://chalearn.org/ Thanks to Jonas Peters for this example
  • 9. 9 Hypothesize causal relationships http://chalearn.org/ Thanks to Jonas Peters for this example
  • 10. 10 Hypothesize causal relationships http://chalearn.org/ Chocolate Nobel Chocolate Nobel Chocolate Nobel Chocolate Nobel ?
  • 11. 11 “Please test your researchers for ten years: Randomly pick half of them and give them chocolate for desert and give apples to the other half. Then compare the number of Nobel prizes in the two populations.” Perform randomized controlled experiments http://chalearn.org/
  • 12. 12        How far can we get to improve causal hypotheses … … to minimize the need for experiments?
  • 13. 13 • Pioneer work: Glymour, Scheines, Spirtes, Pearl (Turing Award, 2011), Rubin, in the US, since the 80’s. • New wave: Hyvärinen, Schölkopf, Bühlmann in the EU. • Nobel prizes in econometrics: Haavelmo (1989), Granger (2003), Sargent and Sims (2011). • DARPA programs: Big mechanisms (2014), upcoming program (Schwartz, program manager). Landmark work http://chalearn.org/
  • 14. 14 Game changing work: Causality challenges http://chalearn.org/ Cause-Effect Pairs (2013) Neural Connectomics (2014) Causation and Prediction (2007) Pot-luck challenge (2008)
  • 15. 15 To make a long story short… http://chalearn.org/ 1. Discovering dependencies: easiest = classical feature selection. Hard to beat! 2. Removing spurious dependencies: harder and “dangerous” because removing good features is more harmful than keeping bad ones. 3. Orienting dependencies: hardest.
  • 16. 16 Cause-effect pair challenge (2013) http://chalearn.org/ Initial impulse: Joris Mooij, Dominik Janzing, and Bernhard Schölkopf. Examples of algorithms and data: Povilas Daniušis, Arthur Gretton, Patrik O. Hoyer, Dominik Janzing, Antti Kerminen, Joris Mooij, Jonas Peters, Bernhard Schölkopf, Shohei Shimizu, Oliver Stegle, and Kun Zhang, Jakob Zscheischler. Datasets and result analysis: Isabelle Guyon + Mehreen Saeed + {Mikael Henaff, Sisi Ma, and Alexander Statnikov}, from NYU. Website and sample code: Isabelle Guyon + Phase 1: Ben Hamner (Kaggle) https://www.kaggle.com/c/cause- effect-pairs Phase 2: Ivan Judson, Christophe Poulain, Evelyne Viegas, Michael Zyskowski https://www.codalab.org/competitions/1381 Review, testing: Marc Boullé, Hugo Jair Escalante, Frederick Eberhardt, Seth Flaxman, Patrik Hoyer, Dominik Janzing, Richard Kennaway, Vincent Lemaire, Joris Mooij, Jonas Peters, Florin Popescu, Peter Spirtes, Ioannis Tsamardinos, Jianxin Yin, Kun Zhang. Mehreen Evelyne Joris Dominik Bernhard Kun Ben Alexander Marc
  • 18. 18 Problem setting http://chalearn.org/ A F I H E B D G C J INPUT OUTPUT A -> B ? 0 / 1
  • 19. 19 B =Temperature A = log(Altitude) A  B ? http://chalearn.org/
  • 20. 20 A  B A  B Best fit: A  B http://chalearn.org/
  • 21. 21 The data: A B Z A  B B A Z A <- B A B Z ZBZA A  Z  B A B A | B Demographics: Sex  Height Age  Wages Country  Education Latitude  Infant mortality Ecology: City elevation  Temperature Water level  Algal frequency Elevation  Vegetation Dist. to hydrology  Fire Econometrics: Mileage  Car resell price Num.rooms  House price Trade price last day  Trade price Medicine: Cancer vol.  Recurrence Metastasis  Prognosis Age  Blood pressure Genomics (mRNA level): transcription factor  protein induced Engineering: Car model year  Horsepower Number of cylinders  MPG Cache memory  Compute power Roof area  Heating load Cement used  Compressive strength 20% 80% http://chalearn.org/
  • 23. 23 Amazing: an operational causation coefficient! http://chalearn.org/
  • 24. 24 Neural connectomics Challenge (2014) http://chalearn.org/ Coordinator: Isabelle Guyon Data Providers: Demian Battaglia Javier Orlandi Jordi Soriano Fradera Olav Stetter Advisors: Gavin Cawley Gideon Dror Hugo-Jair Escalante Alice Guyon Vincent Lemaire Sisi Ma Eric Peskin Florin Popescu Bisakha Ray, Mehreen Saeed Alexander Statnikov Demian Olav Jordi Javier Bisakha Mehreen
  • 27. 27 Conclusion • Causal models: – Better explain data. – Make decisions. • Challenges: – Fair evaluations. – Innovation. • Machine Learning: – Novel approaches to causal discovery. – Operational “causation coefficient”: • First detect oriented pairs, then prune indirect effects and confounders. • First build undirected graph, then orient edges.
  • 28. 28 http://chalearn.org/ Fully automatic machine learning without ANY human intervention automl.chalearn.org December 2014 – January 2016 $30,000 in prizes Thank you! AutoML Challenge