SlideShare a Scribd company logo
1 of 18
Why big data
didn’t end causal
inference
Totte Harinen
Structure of talk
1. The case against causal
inference in the big data era
2. Reasons why big data did not
(and will not) end causal
inference
3. A reconciliation between big
data and causal inference?
4. Conclusions
Causal inference: Any modelling
approach where some parts of the
model are assumed to correspond
to some aspects of the causal
structure of the world.
Big data: Lots of observations and/or
variables per observation.
The case against causal inference in the big data era
“Scientists are trained to recognize that correlation is
not causation, that no conclusions should be drawn
simply on the basis of correlation between X and Y (it
could just be a coincidence). Instead, you must
understand the underlying mechanisms that connect
the two. […]
There is now a better way. Petabytes allow us to say:
‘Correlation is enough.’ We can stop looking for
models. We can analyze the data without hypotheses
about what it might show. We can throw the numbers
into the biggest computing clusters the world has ever
seen and let statistical algorithms find patterns where
science cannot.”
Chris Anderson, Wired 2008
1. Humans are bad at coming up with
causal hypotheses
Coming up with hypotheses about the world
and then testing them against experimental
evidence seems old-fashioned, and there’s a
sense that humans are somehow bad at this.
The experimental method was certainly
developed before powerful computing, so it’s
not a crazy idea that it’s due for a revolution
just like so many other things have been.
2. Correlational models form a more
accurate picture of reality
Anderson refers to the fact that models are
abstractions of the underlying reality. He
suggests that the correlational approach
results in a more accurate picture of how the
world because such an approach is more
flexible in its assumptions and can
incorporate more complexity.
3. Data analysis just seems to be
headed towards the correlational
approach
It’s clear that running correlational analyses
on big datasets has resulted in progress both
in science and business, and this big data
driven progress has presumably become
greater over time. So we could extrapolate
that progress is increasingly going to be
based on the correlational approach.
Reasons why big data might have ended causal inference
And still… causal inference seems
to be doing just fine
Science
Randomised controlled trials, mediation
analysis and quasi-experiments: 41400
Google Scholar hits since the beginning
of the year
Business and policy
A/B testing in business, incrementality
measurement in advertising,
experimental methods in public policy
And last but not least…
At Uber, we’re applying causal inference
methods to answer questions relevant to
our business
Reasons why big data did not (and will not) end causal
inference
Humans are good at causal
hypotheses
This is because during our
evolutionary history it’s been useful
to be able to answer the question:
“If I changed X, what would happen
to Y?”
Abstractness is what makes
models useful
When we abstract away from the
particulars of a situation, we can
generalize into other similar
contexts.
Correlational approaches don’t
give us the counterfactual
Estimating a causal effect requires
estimating what would have
happened in the absence of the
cause.
Three quick considerations
Bigger data = better causal
inference
Bigger sample sizes enable us to
identify smaller causal effects and/
or have a greater number of
treatment arms in a standard RCT.
Participant matching approaches
benefit from a larger number of
covariates. Time series based causal
inference methods require multiple
observations… All of which were
difficult to achieve before the big
data era.
Technology
Email open rate before
and after a
personalised title
Interrupted time series
analysis is a classic method
for inferring the causal
impact of an intervention,
based on qualitative
assumptions of the
underlying causal
structure. However, until
recently, high quality time
series data was hard to get.
Example:
interrupted time
series analysis
Causal inference is in its
infancy
The formal language to describe
causal relationships has been
developed fairly recently. This has
enabled both the development of
better computational methods for
causal inference as well as the
clarification of key assumptions.
New methodologies
How much of the
impact of an email is
mediated via click
through?
Causal mediation modeling
is designed to reveal the
mechanisms through which
the impact of an
intervention is mediated.
Until recently, there
weren’t easy to implement
methods to run mediation
models with nonparametric
data.
Example: causal
mediation modeling
A reconciliation between big data and causal inference?
Big data enables us to do more and better causal inference
Better participant matching, subpopulation analyses, multi-arm trials and the ability to
identify smaller effects are some of the benefits to causal inference that arise from the
existence of big data.
Big data findings can inspire causal hypotheses
Experiments, quasi-experiments and causal modelling can be used to test hypotheses
about patterns that arise from correlational analysis.
Machine learning methods can help us to estimate causal quantities
Exciting developments in machine learning ay help us to estimate counterfactuals like what
the outcome would have looked like for the treated in the absence of the intervention.
Three immediate ways in which big data and causal inference compliment each other
Conclusions
The rumours of the death of causal
inference were strongly exaggerated
The arguments probably weren’t very good to
begin with, but they did have the merit of
drawing our attention to the intersection of
these two important fields.
Causal inference is here to stay
If anything, the field has become more active
in recent years, thanks to technological and
methodological developments. Correlational
methods alone don’t answer the question of
what would have happened in the absence of
an intervention.
The future belongs to both big data
and causal inference
When it comes to the relationship between
big data and causal inference, perhaps the
most exciting recent developments are in the
area of combining causal inference methods
with big data approaches.
Conclusions
Totte Harinen
Data Scientist II

totte@uber.com
Thank you

More Related Content

What's hot

Talk on reproducibility in EEG research
Talk on reproducibility in EEG researchTalk on reproducibility in EEG research
Talk on reproducibility in EEG researchDorothy Bishop
 
Gelman psych crisis_2
Gelman psych crisis_2Gelman psych crisis_2
Gelman psych crisis_2jemille6
 
Advancing Foundation and Practice of Software Analytics
Advancing Foundation and Practice of Software AnalyticsAdvancing Foundation and Practice of Software Analytics
Advancing Foundation and Practice of Software AnalyticsTao Xie
 
Balance between insight and noise indicia v2
Balance between insight and noise indicia v2Balance between insight and noise indicia v2
Balance between insight and noise indicia v2Nick Barthram
 
Statistics for Business Decision-making
Statistics for Business Decision-makingStatistics for Business Decision-making
Statistics for Business Decision-makingJason Martuscello
 
Deep learning for fun and profit (a simple introduction to Artificial Intelli...
Deep learning for fun and profit (a simple introduction to Artificial Intelli...Deep learning for fun and profit (a simple introduction to Artificial Intelli...
Deep learning for fun and profit (a simple introduction to Artificial Intelli...Thomas da Silva Paula
 
Analysis of the article "A Predictive Analytics Primer" by Thomas H. Davenport
Analysis of the article "A Predictive Analytics Primer" by Thomas H. DavenportAnalysis of the article "A Predictive Analytics Primer" by Thomas H. Davenport
Analysis of the article "A Predictive Analytics Primer" by Thomas H. DavenportVaibhav Srivastav
 
The Role of Scientific Method in Software Development
The Role of Scientific Method in Software Development The Role of Scientific Method in Software Development
The Role of Scientific Method in Software Development Natalia Juristo
 
Senn repligate
Senn repligateSenn repligate
Senn repligatejemille6
 
Predictive Analytics - Get Skilled or Die Trying
Predictive Analytics - Get Skilled or Die TryingPredictive Analytics - Get Skilled or Die Trying
Predictive Analytics - Get Skilled or Die TryingKienco
 
"The Statistical Replication Crisis: Paradoxes and Scapegoats”
"The Statistical Replication Crisis: Paradoxes and Scapegoats”"The Statistical Replication Crisis: Paradoxes and Scapegoats”
"The Statistical Replication Crisis: Paradoxes and Scapegoats”jemille6
 
WPIPosterPresentation24x36
WPIPosterPresentation24x36WPIPosterPresentation24x36
WPIPosterPresentation24x36Allan La
 
Final mayo's aps_talk
Final mayo's aps_talkFinal mayo's aps_talk
Final mayo's aps_talkjemille6
 
Program theory evaluation
Program theory evaluationProgram theory evaluation
Program theory evaluationMatti Heino
 
Why the EPV≥10 sample size rule is rubbish and what to use instead
Why the EPV≥10 sample size rule is rubbish and what to use instead Why the EPV≥10 sample size rule is rubbish and what to use instead
Why the EPV≥10 sample size rule is rubbish and what to use instead Maarten van Smeden
 

What's hot (19)

Talk on reproducibility in EEG research
Talk on reproducibility in EEG researchTalk on reproducibility in EEG research
Talk on reproducibility in EEG research
 
Discussion
Discussion                               Discussion
Discussion
 
Sti2018 jws
Sti2018 jwsSti2018 jws
Sti2018 jws
 
Gelman psych crisis_2
Gelman psych crisis_2Gelman psych crisis_2
Gelman psych crisis_2
 
Advancing Foundation and Practice of Software Analytics
Advancing Foundation and Practice of Software AnalyticsAdvancing Foundation and Practice of Software Analytics
Advancing Foundation and Practice of Software Analytics
 
Balance between insight and noise indicia v2
Balance between insight and noise indicia v2Balance between insight and noise indicia v2
Balance between insight and noise indicia v2
 
Statistics for Business Decision-making
Statistics for Business Decision-makingStatistics for Business Decision-making
Statistics for Business Decision-making
 
Deep learning for fun and profit (a simple introduction to Artificial Intelli...
Deep learning for fun and profit (a simple introduction to Artificial Intelli...Deep learning for fun and profit (a simple introduction to Artificial Intelli...
Deep learning for fun and profit (a simple introduction to Artificial Intelli...
 
Analysis of the article "A Predictive Analytics Primer" by Thomas H. Davenport
Analysis of the article "A Predictive Analytics Primer" by Thomas H. DavenportAnalysis of the article "A Predictive Analytics Primer" by Thomas H. Davenport
Analysis of the article "A Predictive Analytics Primer" by Thomas H. Davenport
 
The Role of Scientific Method in Software Development
The Role of Scientific Method in Software Development The Role of Scientific Method in Software Development
The Role of Scientific Method in Software Development
 
Senn repligate
Senn repligateSenn repligate
Senn repligate
 
Predictive Analytics - Get Skilled or Die Trying
Predictive Analytics - Get Skilled or Die TryingPredictive Analytics - Get Skilled or Die Trying
Predictive Analytics - Get Skilled or Die Trying
 
"The Statistical Replication Crisis: Paradoxes and Scapegoats”
"The Statistical Replication Crisis: Paradoxes and Scapegoats”"The Statistical Replication Crisis: Paradoxes and Scapegoats”
"The Statistical Replication Crisis: Paradoxes and Scapegoats”
 
WPIPosterPresentation24x36
WPIPosterPresentation24x36WPIPosterPresentation24x36
WPIPosterPresentation24x36
 
Final mayo's aps_talk
Final mayo's aps_talkFinal mayo's aps_talk
Final mayo's aps_talk
 
Program theory evaluation
Program theory evaluationProgram theory evaluation
Program theory evaluation
 
00952
0095200952
00952
 
Why the EPV≥10 sample size rule is rubbish and what to use instead
Why the EPV≥10 sample size rule is rubbish and what to use instead Why the EPV≥10 sample size rule is rubbish and what to use instead
Why the EPV≥10 sample size rule is rubbish and what to use instead
 
07.10.08 POLI 399
07.10.08 POLI 39907.10.08 POLI 399
07.10.08 POLI 399
 

Similar to BDW17 London - Totte Harinen, Uber - Why Big Data Didn’t End Causal Inference

Why big data didn’t end causal inference by Totte Harinen at Big Data Spain 2017
Why big data didn’t end causal inference by Totte Harinen at Big Data Spain 2017Why big data didn’t end causal inference by Totte Harinen at Big Data Spain 2017
Why big data didn’t end causal inference by Totte Harinen at Big Data Spain 2017Big Data Spain
 
'Drinking from the fire hose? The pitfalls and potential of Big Data'.
'Drinking from the fire hose? The pitfalls and potential of Big Data'.'Drinking from the fire hose? The pitfalls and potential of Big Data'.
'Drinking from the fire hose? The pitfalls and potential of Big Data'.Josh Cowls
 
A Qualititative Approach To HCI Research
A Qualititative Approach To HCI ResearchA Qualititative Approach To HCI Research
A Qualititative Approach To HCI ResearchNathan Mathis
 
Not sure how to do this case analysis please help me do it!1.Are t.pdf
Not sure how to do this case analysis please help me do it!1.Are t.pdfNot sure how to do this case analysis please help me do it!1.Are t.pdf
Not sure how to do this case analysis please help me do it!1.Are t.pdfamitbagga0808
 
Supervised Multi Attribute Gene Manipulation For Cancer
Supervised Multi Attribute Gene Manipulation For CancerSupervised Multi Attribute Gene Manipulation For Cancer
Supervised Multi Attribute Gene Manipulation For Cancerpaperpublications3
 
Crowdsourcing and Cognitive Data Analytics for Conflict Transformation - Istv...
Crowdsourcing and Cognitive Data Analytics for Conflict Transformation - Istv...Crowdsourcing and Cognitive Data Analytics for Conflict Transformation - Istv...
Crowdsourcing and Cognitive Data Analytics for Conflict Transformation - Istv...Istvan Csakany
 
304 Part II • Predictive AnalyticsMachine LearningIntrodu.docx
304 Part II • Predictive AnalyticsMachine LearningIntrodu.docx304 Part II • Predictive AnalyticsMachine LearningIntrodu.docx
304 Part II • Predictive AnalyticsMachine LearningIntrodu.docxpriestmanmable
 
Building theory from case study
Building theory from case studyBuilding theory from case study
Building theory from case studyKrite Infotech
 
On Machine Learning and Data Mining
On Machine Learning and Data MiningOn Machine Learning and Data Mining
On Machine Learning and Data Miningbutest
 
MIXED METHOD APPROACH BY LEOPTRA MUTAZU GREAT ZIMBABWE UNIVERSITY(2017)
MIXED METHOD APPROACH BY LEOPTRA MUTAZU GREAT ZIMBABWE UNIVERSITY(2017)MIXED METHOD APPROACH BY LEOPTRA MUTAZU GREAT ZIMBABWE UNIVERSITY(2017)
MIXED METHOD APPROACH BY LEOPTRA MUTAZU GREAT ZIMBABWE UNIVERSITY(2017)leomutazu
 
Chapter3-Methods_of_Research-Module.pdf
Chapter3-Methods_of_Research-Module.pdfChapter3-Methods_of_Research-Module.pdf
Chapter3-Methods_of_Research-Module.pdfJUNGERONA
 
Mayo O&M slides (4-28-13)
Mayo O&M slides (4-28-13)Mayo O&M slides (4-28-13)
Mayo O&M slides (4-28-13)jemille6
 
Causality for Policy Assessment and 
Impact Analysis
Causality for Policy Assessment and 
Impact AnalysisCausality for Policy Assessment and 
Impact Analysis
Causality for Policy Assessment and 
Impact AnalysisBayesia USA
 
Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...The Higher Education Academy
 
A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...
A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...
A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...Editor IJCATR
 

Similar to BDW17 London - Totte Harinen, Uber - Why Big Data Didn’t End Causal Inference (20)

Why big data didn’t end causal inference by Totte Harinen at Big Data Spain 2017
Why big data didn’t end causal inference by Totte Harinen at Big Data Spain 2017Why big data didn’t end causal inference by Totte Harinen at Big Data Spain 2017
Why big data didn’t end causal inference by Totte Harinen at Big Data Spain 2017
 
Sample Methodology Essay
Sample Methodology EssaySample Methodology Essay
Sample Methodology Essay
 
'Drinking from the fire hose? The pitfalls and potential of Big Data'.
'Drinking from the fire hose? The pitfalls and potential of Big Data'.'Drinking from the fire hose? The pitfalls and potential of Big Data'.
'Drinking from the fire hose? The pitfalls and potential of Big Data'.
 
A Qualititative Approach To HCI Research
A Qualititative Approach To HCI ResearchA Qualititative Approach To HCI Research
A Qualititative Approach To HCI Research
 
Not sure how to do this case analysis please help me do it!1.Are t.pdf
Not sure how to do this case analysis please help me do it!1.Are t.pdfNot sure how to do this case analysis please help me do it!1.Are t.pdf
Not sure how to do this case analysis please help me do it!1.Are t.pdf
 
Supervised Multi Attribute Gene Manipulation For Cancer
Supervised Multi Attribute Gene Manipulation For CancerSupervised Multi Attribute Gene Manipulation For Cancer
Supervised Multi Attribute Gene Manipulation For Cancer
 
Analytic Essay Examples
Analytic Essay ExamplesAnalytic Essay Examples
Analytic Essay Examples
 
Crowdsourcing and Cognitive Data Analytics for Conflict Transformation - Istv...
Crowdsourcing and Cognitive Data Analytics for Conflict Transformation - Istv...Crowdsourcing and Cognitive Data Analytics for Conflict Transformation - Istv...
Crowdsourcing and Cognitive Data Analytics for Conflict Transformation - Istv...
 
304 Part II • Predictive AnalyticsMachine LearningIntrodu.docx
304 Part II • Predictive AnalyticsMachine LearningIntrodu.docx304 Part II • Predictive AnalyticsMachine LearningIntrodu.docx
304 Part II • Predictive AnalyticsMachine LearningIntrodu.docx
 
Building theory from case study
Building theory from case studyBuilding theory from case study
Building theory from case study
 
On Machine Learning and Data Mining
On Machine Learning and Data MiningOn Machine Learning and Data Mining
On Machine Learning and Data Mining
 
Methodology Essay
Methodology EssayMethodology Essay
Methodology Essay
 
Data collection, Data Integration, Data Understanding e Data Cleaning & Prepa...
Data collection, Data Integration, Data Understanding e Data Cleaning & Prepa...Data collection, Data Integration, Data Understanding e Data Cleaning & Prepa...
Data collection, Data Integration, Data Understanding e Data Cleaning & Prepa...
 
MIXED METHOD APPROACH BY LEOPTRA MUTAZU GREAT ZIMBABWE UNIVERSITY(2017)
MIXED METHOD APPROACH BY LEOPTRA MUTAZU GREAT ZIMBABWE UNIVERSITY(2017)MIXED METHOD APPROACH BY LEOPTRA MUTAZU GREAT ZIMBABWE UNIVERSITY(2017)
MIXED METHOD APPROACH BY LEOPTRA MUTAZU GREAT ZIMBABWE UNIVERSITY(2017)
 
Chapter3-Methods_of_Research-Module.pdf
Chapter3-Methods_of_Research-Module.pdfChapter3-Methods_of_Research-Module.pdf
Chapter3-Methods_of_Research-Module.pdf
 
Mayo O&M slides (4-28-13)
Mayo O&M slides (4-28-13)Mayo O&M slides (4-28-13)
Mayo O&M slides (4-28-13)
 
Causality for Policy Assessment and 
Impact Analysis
Causality for Policy Assessment and 
Impact AnalysisCausality for Policy Assessment and 
Impact Analysis
Causality for Policy Assessment and 
Impact Analysis
 
Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...
 
A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...
A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...
A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...
 
PR2_Wk1-2.pptx
PR2_Wk1-2.pptxPR2_Wk1-2.pptx
PR2_Wk1-2.pptx
 

More from Big Data Week

BDW17 London - Edward Kibardin - Mitie PLC - Learning and Topological Data A...
 BDW17 London - Edward Kibardin - Mitie PLC - Learning and Topological Data A... BDW17 London - Edward Kibardin - Mitie PLC - Learning and Topological Data A...
BDW17 London - Edward Kibardin - Mitie PLC - Learning and Topological Data A...Big Data Week
 
BDWW17 London - Steve Bradbury, GRSC - Big Data to the Rescue: A Fraud Case S...
BDWW17 London - Steve Bradbury, GRSC - Big Data to the Rescue: A Fraud Case S...BDWW17 London - Steve Bradbury, GRSC - Big Data to the Rescue: A Fraud Case S...
BDWW17 London - Steve Bradbury, GRSC - Big Data to the Rescue: A Fraud Case S...Big Data Week
 
BDW17 London - Rita Simoes, Boehringer Ingelheim - Big Data in Pharma: Sittin...
BDW17 London - Rita Simoes, Boehringer Ingelheim - Big Data in Pharma: Sittin...BDW17 London - Rita Simoes, Boehringer Ingelheim - Big Data in Pharma: Sittin...
BDW17 London - Rita Simoes, Boehringer Ingelheim - Big Data in Pharma: Sittin...Big Data Week
 
BDW17 London - Mick Ridley, Exterion Media & Dale Campbell , TfL - Transformi...
BDW17 London - Mick Ridley, Exterion Media & Dale Campbell , TfL - Transformi...BDW17 London - Mick Ridley, Exterion Media & Dale Campbell , TfL - Transformi...
BDW17 London - Mick Ridley, Exterion Media & Dale Campbell , TfL - Transformi...Big Data Week
 
BDW17 London - Abed Ajraou - First Utility - Putting Data Science in your Bus...
BDW17 London - Abed Ajraou - First Utility - Putting Data Science in your Bus...BDW17 London - Abed Ajraou - First Utility - Putting Data Science in your Bus...
BDW17 London - Abed Ajraou - First Utility - Putting Data Science in your Bus...Big Data Week
 
BDW17 London - Steve Bradbury - GRSC - Making Sense of the Chaos of Data
BDW17 London - Steve Bradbury - GRSC - Making Sense of the Chaos of DataBDW17 London - Steve Bradbury - GRSC - Making Sense of the Chaos of Data
BDW17 London - Steve Bradbury - GRSC - Making Sense of the Chaos of DataBig Data Week
 
BDW17 London - Andy Boura - Thomson Reuters - Does Big Data Have to Mean Big ...
BDW17 London - Andy Boura - Thomson Reuters - Does Big Data Have to Mean Big ...BDW17 London - Andy Boura - Thomson Reuters - Does Big Data Have to Mean Big ...
BDW17 London - Andy Boura - Thomson Reuters - Does Big Data Have to Mean Big ...Big Data Week
 
BDW17 London - Tom Woolrich, Financial Times - What Does Big Data Mean for th...
BDW17 London - Tom Woolrich, Financial Times - What Does Big Data Mean for th...BDW17 London - Tom Woolrich, Financial Times - What Does Big Data Mean for th...
BDW17 London - Tom Woolrich, Financial Times - What Does Big Data Mean for th...Big Data Week
 
BDW17 London - Andrew Fryer, Microsoft - Everybody Needs a Bit of Science in ...
BDW17 London - Andrew Fryer, Microsoft - Everybody Needs a Bit of Science in ...BDW17 London - Andrew Fryer, Microsoft - Everybody Needs a Bit of Science in ...
BDW17 London - Andrew Fryer, Microsoft - Everybody Needs a Bit of Science in ...Big Data Week
 
BDW16 London - Alex Bordei, Bigstep - Building Data Labs in the Cloud
BDW16 London - Alex Bordei, Bigstep - Building Data Labs in the CloudBDW16 London - Alex Bordei, Bigstep - Building Data Labs in the Cloud
BDW16 London - Alex Bordei, Bigstep - Building Data Labs in the CloudBig Data Week
 
BDW16 London - William Vambenepe, Google - 3rd Generation Data Platform
BDW16 London - William Vambenepe, Google - 3rd Generation Data PlatformBDW16 London - William Vambenepe, Google - 3rd Generation Data Platform
BDW16 London - William Vambenepe, Google - 3rd Generation Data PlatformBig Data Week
 
BDW16 London - Scott Krueger, skyscanner - Does More Data Mean Better Decisio...
BDW16 London - Scott Krueger, skyscanner - Does More Data Mean Better Decisio...BDW16 London - Scott Krueger, skyscanner - Does More Data Mean Better Decisio...
BDW16 London - Scott Krueger, skyscanner - Does More Data Mean Better Decisio...Big Data Week
 
BDW16 London - Nondas Sourlas, Bupa - Big Data in Healthcare
BDW16 London  - Nondas Sourlas, Bupa - Big Data in HealthcareBDW16 London  - Nondas Sourlas, Bupa - Big Data in Healthcare
BDW16 London - Nondas Sourlas, Bupa - Big Data in HealthcareBig Data Week
 
BDW16 London - John Callan, Boxever - Data and Analytics - The Fuel Your Bran...
BDW16 London - John Callan, Boxever - Data and Analytics - The Fuel Your Bran...BDW16 London - John Callan, Boxever - Data and Analytics - The Fuel Your Bran...
BDW16 London - John Callan, Boxever - Data and Analytics - The Fuel Your Bran...Big Data Week
 
BDW16 London - John Belchamber, Telefonica - New Data, New Strategies, New Op...
BDW16 London - John Belchamber, Telefonica - New Data, New Strategies, New Op...BDW16 London - John Belchamber, Telefonica - New Data, New Strategies, New Op...
BDW16 London - John Belchamber, Telefonica - New Data, New Strategies, New Op...Big Data Week
 
BDW16 London - Deenar Toraskar, Think Reactive - Fast Data Key to Efficient C...
BDW16 London - Deenar Toraskar, Think Reactive - Fast Data Key to Efficient C...BDW16 London - Deenar Toraskar, Think Reactive - Fast Data Key to Efficient C...
BDW16 London - Deenar Toraskar, Think Reactive - Fast Data Key to Efficient C...Big Data Week
 
BDW16 London - Jonny Voon, Innovate UK - Smart Cities and the Buzz Word Bingo
BDW16 London - Jonny Voon, Innovate UK - Smart Cities and the Buzz Word BingoBDW16 London - Jonny Voon, Innovate UK - Smart Cities and the Buzz Word Bingo
BDW16 London - Jonny Voon, Innovate UK - Smart Cities and the Buzz Word BingoBig Data Week
 
BDW16 London - Marius Boeru, Bigstep - How to Automate Big Data with Ansible
BDW16 London -  Marius Boeru, Bigstep - How to Automate Big Data with AnsibleBDW16 London -  Marius Boeru, Bigstep - How to Automate Big Data with Ansible
BDW16 London - Marius Boeru, Bigstep - How to Automate Big Data with AnsibleBig Data Week
 
BDW16 London - Josh Partridge, Shazam - How Labels, Radio Stations and Brand...
BDW16 London - Josh Partridge, Shazam -  How Labels, Radio Stations and Brand...BDW16 London - Josh Partridge, Shazam -  How Labels, Radio Stations and Brand...
BDW16 London - Josh Partridge, Shazam - How Labels, Radio Stations and Brand...Big Data Week
 
BDW16 London - Wael Elrifai, Pentaho - Big Data-Driven Innovatiom
BDW16 London - Wael Elrifai, Pentaho - Big Data-Driven InnovatiomBDW16 London - Wael Elrifai, Pentaho - Big Data-Driven Innovatiom
BDW16 London - Wael Elrifai, Pentaho - Big Data-Driven InnovatiomBig Data Week
 

More from Big Data Week (20)

BDW17 London - Edward Kibardin - Mitie PLC - Learning and Topological Data A...
 BDW17 London - Edward Kibardin - Mitie PLC - Learning and Topological Data A... BDW17 London - Edward Kibardin - Mitie PLC - Learning and Topological Data A...
BDW17 London - Edward Kibardin - Mitie PLC - Learning and Topological Data A...
 
BDWW17 London - Steve Bradbury, GRSC - Big Data to the Rescue: A Fraud Case S...
BDWW17 London - Steve Bradbury, GRSC - Big Data to the Rescue: A Fraud Case S...BDWW17 London - Steve Bradbury, GRSC - Big Data to the Rescue: A Fraud Case S...
BDWW17 London - Steve Bradbury, GRSC - Big Data to the Rescue: A Fraud Case S...
 
BDW17 London - Rita Simoes, Boehringer Ingelheim - Big Data in Pharma: Sittin...
BDW17 London - Rita Simoes, Boehringer Ingelheim - Big Data in Pharma: Sittin...BDW17 London - Rita Simoes, Boehringer Ingelheim - Big Data in Pharma: Sittin...
BDW17 London - Rita Simoes, Boehringer Ingelheim - Big Data in Pharma: Sittin...
 
BDW17 London - Mick Ridley, Exterion Media & Dale Campbell , TfL - Transformi...
BDW17 London - Mick Ridley, Exterion Media & Dale Campbell , TfL - Transformi...BDW17 London - Mick Ridley, Exterion Media & Dale Campbell , TfL - Transformi...
BDW17 London - Mick Ridley, Exterion Media & Dale Campbell , TfL - Transformi...
 
BDW17 London - Abed Ajraou - First Utility - Putting Data Science in your Bus...
BDW17 London - Abed Ajraou - First Utility - Putting Data Science in your Bus...BDW17 London - Abed Ajraou - First Utility - Putting Data Science in your Bus...
BDW17 London - Abed Ajraou - First Utility - Putting Data Science in your Bus...
 
BDW17 London - Steve Bradbury - GRSC - Making Sense of the Chaos of Data
BDW17 London - Steve Bradbury - GRSC - Making Sense of the Chaos of DataBDW17 London - Steve Bradbury - GRSC - Making Sense of the Chaos of Data
BDW17 London - Steve Bradbury - GRSC - Making Sense of the Chaos of Data
 
BDW17 London - Andy Boura - Thomson Reuters - Does Big Data Have to Mean Big ...
BDW17 London - Andy Boura - Thomson Reuters - Does Big Data Have to Mean Big ...BDW17 London - Andy Boura - Thomson Reuters - Does Big Data Have to Mean Big ...
BDW17 London - Andy Boura - Thomson Reuters - Does Big Data Have to Mean Big ...
 
BDW17 London - Tom Woolrich, Financial Times - What Does Big Data Mean for th...
BDW17 London - Tom Woolrich, Financial Times - What Does Big Data Mean for th...BDW17 London - Tom Woolrich, Financial Times - What Does Big Data Mean for th...
BDW17 London - Tom Woolrich, Financial Times - What Does Big Data Mean for th...
 
BDW17 London - Andrew Fryer, Microsoft - Everybody Needs a Bit of Science in ...
BDW17 London - Andrew Fryer, Microsoft - Everybody Needs a Bit of Science in ...BDW17 London - Andrew Fryer, Microsoft - Everybody Needs a Bit of Science in ...
BDW17 London - Andrew Fryer, Microsoft - Everybody Needs a Bit of Science in ...
 
BDW16 London - Alex Bordei, Bigstep - Building Data Labs in the Cloud
BDW16 London - Alex Bordei, Bigstep - Building Data Labs in the CloudBDW16 London - Alex Bordei, Bigstep - Building Data Labs in the Cloud
BDW16 London - Alex Bordei, Bigstep - Building Data Labs in the Cloud
 
BDW16 London - William Vambenepe, Google - 3rd Generation Data Platform
BDW16 London - William Vambenepe, Google - 3rd Generation Data PlatformBDW16 London - William Vambenepe, Google - 3rd Generation Data Platform
BDW16 London - William Vambenepe, Google - 3rd Generation Data Platform
 
BDW16 London - Scott Krueger, skyscanner - Does More Data Mean Better Decisio...
BDW16 London - Scott Krueger, skyscanner - Does More Data Mean Better Decisio...BDW16 London - Scott Krueger, skyscanner - Does More Data Mean Better Decisio...
BDW16 London - Scott Krueger, skyscanner - Does More Data Mean Better Decisio...
 
BDW16 London - Nondas Sourlas, Bupa - Big Data in Healthcare
BDW16 London  - Nondas Sourlas, Bupa - Big Data in HealthcareBDW16 London  - Nondas Sourlas, Bupa - Big Data in Healthcare
BDW16 London - Nondas Sourlas, Bupa - Big Data in Healthcare
 
BDW16 London - John Callan, Boxever - Data and Analytics - The Fuel Your Bran...
BDW16 London - John Callan, Boxever - Data and Analytics - The Fuel Your Bran...BDW16 London - John Callan, Boxever - Data and Analytics - The Fuel Your Bran...
BDW16 London - John Callan, Boxever - Data and Analytics - The Fuel Your Bran...
 
BDW16 London - John Belchamber, Telefonica - New Data, New Strategies, New Op...
BDW16 London - John Belchamber, Telefonica - New Data, New Strategies, New Op...BDW16 London - John Belchamber, Telefonica - New Data, New Strategies, New Op...
BDW16 London - John Belchamber, Telefonica - New Data, New Strategies, New Op...
 
BDW16 London - Deenar Toraskar, Think Reactive - Fast Data Key to Efficient C...
BDW16 London - Deenar Toraskar, Think Reactive - Fast Data Key to Efficient C...BDW16 London - Deenar Toraskar, Think Reactive - Fast Data Key to Efficient C...
BDW16 London - Deenar Toraskar, Think Reactive - Fast Data Key to Efficient C...
 
BDW16 London - Jonny Voon, Innovate UK - Smart Cities and the Buzz Word Bingo
BDW16 London - Jonny Voon, Innovate UK - Smart Cities and the Buzz Word BingoBDW16 London - Jonny Voon, Innovate UK - Smart Cities and the Buzz Word Bingo
BDW16 London - Jonny Voon, Innovate UK - Smart Cities and the Buzz Word Bingo
 
BDW16 London - Marius Boeru, Bigstep - How to Automate Big Data with Ansible
BDW16 London -  Marius Boeru, Bigstep - How to Automate Big Data with AnsibleBDW16 London -  Marius Boeru, Bigstep - How to Automate Big Data with Ansible
BDW16 London - Marius Boeru, Bigstep - How to Automate Big Data with Ansible
 
BDW16 London - Josh Partridge, Shazam - How Labels, Radio Stations and Brand...
BDW16 London - Josh Partridge, Shazam -  How Labels, Radio Stations and Brand...BDW16 London - Josh Partridge, Shazam -  How Labels, Radio Stations and Brand...
BDW16 London - Josh Partridge, Shazam - How Labels, Radio Stations and Brand...
 
BDW16 London - Wael Elrifai, Pentaho - Big Data-Driven Innovatiom
BDW16 London - Wael Elrifai, Pentaho - Big Data-Driven InnovatiomBDW16 London - Wael Elrifai, Pentaho - Big Data-Driven Innovatiom
BDW16 London - Wael Elrifai, Pentaho - Big Data-Driven Innovatiom
 

Recently uploaded

SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 

Recently uploaded (20)

SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 

BDW17 London - Totte Harinen, Uber - Why Big Data Didn’t End Causal Inference

  • 1. Why big data didn’t end causal inference Totte Harinen
  • 2. Structure of talk 1. The case against causal inference in the big data era 2. Reasons why big data did not (and will not) end causal inference 3. A reconciliation between big data and causal inference? 4. Conclusions
  • 3. Causal inference: Any modelling approach where some parts of the model are assumed to correspond to some aspects of the causal structure of the world. Big data: Lots of observations and/or variables per observation.
  • 4. The case against causal inference in the big data era
  • 5. “Scientists are trained to recognize that correlation is not causation, that no conclusions should be drawn simply on the basis of correlation between X and Y (it could just be a coincidence). Instead, you must understand the underlying mechanisms that connect the two. […] There is now a better way. Petabytes allow us to say: ‘Correlation is enough.’ We can stop looking for models. We can analyze the data without hypotheses about what it might show. We can throw the numbers into the biggest computing clusters the world has ever seen and let statistical algorithms find patterns where science cannot.” Chris Anderson, Wired 2008
  • 6. 1. Humans are bad at coming up with causal hypotheses Coming up with hypotheses about the world and then testing them against experimental evidence seems old-fashioned, and there’s a sense that humans are somehow bad at this. The experimental method was certainly developed before powerful computing, so it’s not a crazy idea that it’s due for a revolution just like so many other things have been. 2. Correlational models form a more accurate picture of reality Anderson refers to the fact that models are abstractions of the underlying reality. He suggests that the correlational approach results in a more accurate picture of how the world because such an approach is more flexible in its assumptions and can incorporate more complexity. 3. Data analysis just seems to be headed towards the correlational approach It’s clear that running correlational analyses on big datasets has resulted in progress both in science and business, and this big data driven progress has presumably become greater over time. So we could extrapolate that progress is increasingly going to be based on the correlational approach. Reasons why big data might have ended causal inference
  • 7. And still… causal inference seems to be doing just fine Science Randomised controlled trials, mediation analysis and quasi-experiments: 41400 Google Scholar hits since the beginning of the year Business and policy A/B testing in business, incrementality measurement in advertising, experimental methods in public policy And last but not least… At Uber, we’re applying causal inference methods to answer questions relevant to our business
  • 8. Reasons why big data did not (and will not) end causal inference
  • 9. Humans are good at causal hypotheses This is because during our evolutionary history it’s been useful to be able to answer the question: “If I changed X, what would happen to Y?” Abstractness is what makes models useful When we abstract away from the particulars of a situation, we can generalize into other similar contexts. Correlational approaches don’t give us the counterfactual Estimating a causal effect requires estimating what would have happened in the absence of the cause. Three quick considerations
  • 10. Bigger data = better causal inference Bigger sample sizes enable us to identify smaller causal effects and/ or have a greater number of treatment arms in a standard RCT. Participant matching approaches benefit from a larger number of covariates. Time series based causal inference methods require multiple observations… All of which were difficult to achieve before the big data era. Technology
  • 11. Email open rate before and after a personalised title Interrupted time series analysis is a classic method for inferring the causal impact of an intervention, based on qualitative assumptions of the underlying causal structure. However, until recently, high quality time series data was hard to get. Example: interrupted time series analysis
  • 12. Causal inference is in its infancy The formal language to describe causal relationships has been developed fairly recently. This has enabled both the development of better computational methods for causal inference as well as the clarification of key assumptions. New methodologies
  • 13. How much of the impact of an email is mediated via click through? Causal mediation modeling is designed to reveal the mechanisms through which the impact of an intervention is mediated. Until recently, there weren’t easy to implement methods to run mediation models with nonparametric data. Example: causal mediation modeling
  • 14. A reconciliation between big data and causal inference?
  • 15. Big data enables us to do more and better causal inference Better participant matching, subpopulation analyses, multi-arm trials and the ability to identify smaller effects are some of the benefits to causal inference that arise from the existence of big data. Big data findings can inspire causal hypotheses Experiments, quasi-experiments and causal modelling can be used to test hypotheses about patterns that arise from correlational analysis. Machine learning methods can help us to estimate causal quantities Exciting developments in machine learning ay help us to estimate counterfactuals like what the outcome would have looked like for the treated in the absence of the intervention. Three immediate ways in which big data and causal inference compliment each other
  • 17. The rumours of the death of causal inference were strongly exaggerated The arguments probably weren’t very good to begin with, but they did have the merit of drawing our attention to the intersection of these two important fields. Causal inference is here to stay If anything, the field has become more active in recent years, thanks to technological and methodological developments. Correlational methods alone don’t answer the question of what would have happened in the absence of an intervention. The future belongs to both big data and causal inference When it comes to the relationship between big data and causal inference, perhaps the most exciting recent developments are in the area of combining causal inference methods with big data approaches. Conclusions
  • 18. Totte Harinen Data Scientist II
 totte@uber.com Thank you