SlideShare a Scribd company logo
calculation | consulting
data science leadership
(TM)
c|c
(TM)
charles@calculationconsulting.com
calculation|consulting
Data Science Leadership
(TM)
charles@caclulationconsulting.com
calculation | consulting data science leadership
Who Are We?
c|c
(TM)
Dr. Charles H. Martin, PhD
University of Chicago, Chemical Physics
NSF Fellow in Theoretical Chemistry
Over 10 years experience in applied Machine Learning
Developed ML algos for Demand Media; the first $1B IPO since Google
Lean Start Ups: Aardvark (acquired by Google), eHow,
Wall Street: GLG, BGI, BlackRock
Fortune 500: Roche, France Telecom
Tech / Retail: GoDaddy, eBay, Walmart
Investment: Griffin Advisors, Page Family Offices

www.calculationconsulting.com
charles@calculationconsulting.com
(TM)
3
Recent AI News: Epic Systems
When machine learning FAILS
c|c
(TM)
calculation | consulting data science leadership
(TM)
4
Recent AI News: Epic Systems
c|c
(TM)
calculation | consulting data science leadership
(TM)
5
“[The] definition of sepsis based on
billing codes alone is imprecise and not
the one that is clinically meaningful to a
health system or to patients.”
When machine learning FAILS
Recent AI News: Zillow
When machine learning (or AI) FAILS
c|c
(TM)
calculation | consulting data science leadership
(TM)
6
Recent AI News: Zillow
When machine learning (or AI) FAILS
c|c
(TM)
calculation | consulting data science leadership
(TM)
7
Data Science is Different
c|c
(TM)
calculation | consulting data science leadership
(TM)
8
Data Science Leadership : Becoming Data-Led
c|c
(TM)
(TM)
calculation | consulting data science leadership
9
1. Data Informed: OperationalVisibility
2. Data Driven: Tooling and Insights for Growth
3. Data Led Automation and Innovation
creating the data-led organization
Data Science Leadership : 4 Steps to Leading
c|c
(TM)
(TM)
calculation | consulting data science leadership
10
• Strategy: How can you leverage data ?
• Stage: How mature is your data ?
• Team: What team do we need ?
• Tools: What tools do they need ?
creating the data-led organization
Strategy:Algo Gas Station Analogy
Problem: where to open a gas station ?
Need: good traffic, weak competition
c|c
(TM)
less competitors
no traffic
sweet spot
great traffic
too many competitors
calculation | consulting data science leadership
ML algorithms can predict supply and demand
(TM)
11
Strategy: Data Science Process
• Acquire Domain Knowledge
• Formulate Hypothesis
• Generate Model(s) from the Data
• Predict Revenue Gains
• Backtest Predictions on your Data
• A/B Test in Production
• Attribute Gains to Model(s)
c|c
(TM)
(TM)
acting
solving
framing
calculation | consulting data science leadership
12
c|c
(TM)
• Systems Thinking: leveraging the inter-relationships
between data, marketing, and the customer
• Knowledge Transfer: mentoring — not training — to
develop both personal mastery and team learning
• Mental Models: create a base of small-scale models for
thinking about how to use your data
• Knowledge Sharing: foster collaboration between
research, engineering, and product to drive revenue
Strategy: Learning from Data
calculation | consulting data science leadership
(TM)
13
c|c
(TM)
• Cross-functional engineering, product, marketing, finance
• Autonomous: separate from the traditional engineering
product lifecycle. self-organizing and self-managing
• Experimental: form hypothesis, analyze data, make
predictions, run backtests, A/B testing
• Self-sustaining: not a cost center; generates revenue
(TM)
calculation | consulting data science leadership
14
Strategy: Data Science is not IT
c|c
(TM)
(TM)
Problem: Externalities
calculation | consulting data science leadership
15
external factors can change
(TM)
c|c
(TM)
Data is only is as accurate as it’s original intent demanded
calculation | consulting data science leadership
16
Stage: Your Data Maturity
• Where is your data ? Transaction Database? Web Logs ?
3rd party system ? Data Lake ?
• What product does it service ? Billing ? CRM ?
• Can you access it ? Security ? Regulations ?
• Who owns it ? Responsible for quality ?
Problem: Data Quality Mismatch
(TM)
c|c
(TM)
Data is only is as accurate as it’s original intent demanded
calculation | consulting data science leadership
17
?
Problem: Data Quality Mismatch
(TM)
c|c
(TM)
Data is only is as accurate as it’s original intent demanded
calculation | consulting data science leadership
18
Recommender System
Problem: Data Quality Mismatch
(TM)
c|c
(TM)
Data is only is as accurate as it’s original intent demanded
calculation | consulting data science leadership
19
Recommender System
Quality of product metadata
May not materially impact billing
x
? wrong
missing
(TM)
c|c
(TM)
“Only the paranoid survive” Andy Grove (Intel)
calculation | consulting data science leadership
20
Recommender System
Solution: Be Paranoid and Test Everything
(TM)
c|c
(TM)
“Only the paranoid survive” Andy Grove (Intel)
calculation | consulting data science leadership
21
Recommender System
Solution: Test Everything
Software engineers can be paranoid about programming.
In fact, Paranoid Programming is a thing.
You have to be paranoid about your data.
Thing is, bad code can usually be fixed.
But bad data has usually has to be thrown away
(TM)
c|c
(TM)
calculation | consulting data science leadership
22
Recommender System
Problem: Data Contraband
(TM)
c|c
(TM)
data 'from a friend’ that may violate compliance
calculation | consulting data science leadership
23
Recommender System
Data pulled into
spreadsheet / csv
Data actually stored in DB Data passed around
by email, etc
(TM)
c|c
(TM)
calculation | consulting data science leadership
24
Recommender System
Google Sheets, SAP, etc
(where you can track everything)
Move functions to the data
(stored procedures, Spark, etc)
Jira, GitHub, Confluence, .
Document tracking systems
Solutions: Data Contraband
Team: Data Scientists are Different
c|c
(TM)
calculation | consulting data science leadership
(TM)
25
not all techies are the same
Team: Data Scientists are Different
c|c
(TM)
calculation | consulting data science leadership
theoretical physics
machine learning / AI specialist
(TM)
26
applied physics
data scientist
engineer
software, browser tech, dev ops, …
not all techies are the same
Team: Data Scientists are Different
c|c
(TM)
calculation | consulting data science leadership
Data science group. Can be very isolated.
Very research-y & difficult to productionalize
(TM)
27
Embedded data scientist, solves problems
builds solutions, and deploys them
Software and IT services
Great at managing code and systems
Not great with data, math — or ambiguity
not all techies are the same
FANNG Managers: Fallen Gods
c|c
(TM)
(TM)
calculation | consulting data science leadership
28
the Earth is flat and they fallen off
FANNG Managers: Fallen Gods
c|c
(TM)
(TM)
calculation | consulting data science leadership
29
FAANG infrastructure is 10-20 years ahead
FANNG Managers: Fallen Gods
c|c
(TM)
(TM)
calculation | consulting data science leadership
30
you need infrastructure to deliver data products
Data Strategy : Think like a Beginner
c|c
(TM)
(TM)
calculation | consulting data science leadership
31
cultivate a beginner’s mind
- Test your assumptions. Literally
- Look for problems early on. And never stop looking
- Distinguish between statistical structural outliers.
- Repair your data, if possible.
- Start with simple, robust methods.
- Sophisticated models are more sensitive to
errors
- and are more easily overtrained.
- Evaluate your predictions on real data, and figure
out how to attribute results to your models.
- Re-calibrate your models if necessary.
Tools: What the Team Needs
(TM)
c|c
(TM)
• Infrastructure: Data storage, cloud services, etc
• Analytics: Measuring whats going on
• Operations: Keeping things running
• Machine Learning and AI: Growth and Innovation
Algorithms, not data lakes, generate revenue
calculation | consulting data science leadership
32
Tools: What the Team Needs to Know
(TM)
c|c
(TM)
• Metrics: What KPIs you have, and what to hit
• Access: How to get what they need (i.e self-service)
• Impact: How tooling (used and built) support the business
• Truth: What data is reliable, what is not
Algorithms, not data lakes, generate revenue
calculation | consulting data science leadership
33
c|c
(TM)
(TM)
Final Thoughts: Algorithmic Accountability
calculation | consulting data science leadership
An asset is an economic resource.
Anything tangible or intangible that is capable of
being owned or controlled to produce value and
that is held to have positive economic value is
considered an asset.
algorithms can be valuable assets
34
c|c
(TM)
(TM)
Algorithmic Accountability
calculation | consulting data science leadership
35
does revenue depends on hidden algos ?
• WebMD Google SEO
• Amazon Product Listing Algo
• Pinterest Relevance Algo
• Twitter Spam filter
• Apple App Store Rankings
c|c
(TM)
(TM)
Algorithmic Accountability
calculation | consulting data science leadership
36
do decisions depend on hidden factors ?
A 'Crisis' in Online Ads: One-Third of Traffic Is Bogus
http://www.wsj.com/articles/SB10001424052702304026304579453253860786362
Now Algorithms Are DecidingWhomTo Hire…
http://www.npr.org/blogs/alltechconsidered/2015/03/23/394827451/now-algorithms-are-deciding-whom-to-hire-based-on-voice
What you don’t know about Internet algorithms is hurting you…
http://www.washingtonpost.com/news/the-intersect/wp/2015/03/23/what-you-dont-know-about-internet-algorithms-is-hurting-you-and-you-probably-dont-know-very-much/
c|c
(TM)
(TM)
Solution: Algorithmic Transparency
calculation | consulting data science leadership
37
can you be transparent and not be gamed ?
http://fortune.com/2015/03/18/how-do-you-govern-a-hidden-fluid-and-amoral-algorithm/
83% of the participants in the study changed their behavior
once they knew about the algorithm
How do you govern a (hidden, fluid and amoral) algorithm?
participants mistakenly believed that their friends intentionally
chose not to show them stories
c|c
(TM)
(TM)
Algorithmic Accountability
calculation | consulting data science leadership
Do you depend on some else’s marketplace?
How does your revenue depend on algos?
Do you need an internal algo ?
Who will manage it? build it? maintain it?
algorithms have unforeseen liabilities
38
(TM)
c|c
(TM)
c | c
charles@calculationconsulting.com

More Related Content

What's hot

Why Deep Learning Works: Self Regularization in Deep Neural Networks
Why Deep Learning Works: Self Regularization in Deep Neural NetworksWhy Deep Learning Works: Self Regularization in Deep Neural Networks
Why Deep Learning Works: Self Regularization in Deep Neural Networks
Charles Martin
 
Why Deep Learning Works: Self Regularization in Deep Neural Networks
Why Deep Learning Works: Self Regularization in Deep Neural NetworksWhy Deep Learning Works: Self Regularization in Deep Neural Networks
Why Deep Learning Works: Self Regularization in Deep Neural Networks
Charles Martin
 
CC mmds talk 2106
CC mmds talk 2106CC mmds talk 2106
CC mmds talk 2106
Charles Martin
 
Cc hass b school talk 2105
Cc hass b school talk  2105Cc hass b school talk  2105
Cc hass b school talk 2105
Charles Martin
 
Cari2020 Parallel Hybridization for SAT: An Efficient Combination of Search S...
Cari2020 Parallel Hybridization for SAT: An Efficient Combination of Search S...Cari2020 Parallel Hybridization for SAT: An Efficient Combination of Search S...
Cari2020 Parallel Hybridization for SAT: An Efficient Combination of Search S...
Mokhtar SELLAMI
 
A BA-based algorithm for parameter optimization of support vector machine
A BA-based algorithm for parameter optimization of support vector machineA BA-based algorithm for parameter optimization of support vector machine
A BA-based algorithm for parameter optimization of support vector machine
Aboul Ella Hassanien
 
CC Talk at Berekely
CC Talk at BerekelyCC Talk at Berekely
CC Talk at Berekely
Charles Martin
 
Cari presentation maurice-tchoupe-joskelngoufo
Cari presentation maurice-tchoupe-joskelngoufoCari presentation maurice-tchoupe-joskelngoufo
Cari presentation maurice-tchoupe-joskelngoufo
Mokhtar SELLAMI
 
Data-Driven Recommender Systems
Data-Driven Recommender SystemsData-Driven Recommender Systems
Data-Driven Recommender Systems
recsysfr
 
Dictionary Learning for Massive Matrix Factorization
Dictionary Learning for Massive Matrix FactorizationDictionary Learning for Massive Matrix Factorization
Dictionary Learning for Massive Matrix Factorization
recsysfr
 
Chap 8. Optimization for training deep models
Chap 8. Optimization for training deep modelsChap 8. Optimization for training deep models
Chap 8. Optimization for training deep models
Young-Geun Choi
 
Introduction to behavior based recommendation system
Introduction to behavior based recommendation systemIntroduction to behavior based recommendation system
Introduction to behavior based recommendation system
Kimikazu Kato
 
iccv2009 tutorial: boosting and random forest - part II
iccv2009 tutorial: boosting and random forest - part IIiccv2009 tutorial: boosting and random forest - part II
iccv2009 tutorial: boosting and random forest - part II
zukun
 
Self-Adapting Large Neighborhood Search: Application to single-mode schedulin...
Self-Adapting Large Neighborhood Search: Application to single-mode schedulin...Self-Adapting Large Neighborhood Search: Application to single-mode schedulin...
Self-Adapting Large Neighborhood Search: Application to single-mode schedulin...
Philippe Laborie
 
Industrial project and machine scheduling with Constraint Programming
Industrial project and machine scheduling with Constraint ProgrammingIndustrial project and machine scheduling with Constraint Programming
Industrial project and machine scheduling with Constraint Programming
Philippe Laborie
 
Predicting churn in telco industry: machine learning approach - Marko Mitić
 Predicting churn in telco industry: machine learning approach - Marko Mitić Predicting churn in telco industry: machine learning approach - Marko Mitić
Predicting churn in telco industry: machine learning approach - Marko Mitić
Institute of Contemporary Sciences
 
Winner of EY NextWave Data Science Challenge 2019
Winner of EY NextWave Data Science Challenge 2019Winner of EY NextWave Data Science Challenge 2019
Winner of EY NextWave Data Science Challenge 2019
ByungEunJeon
 
WeightWatcher Update: January 2021
WeightWatcher Update:  January 2021WeightWatcher Update:  January 2021
WeightWatcher Update: January 2021
Charles Martin
 
Optimization for Deep Learning
Optimization for Deep LearningOptimization for Deep Learning
Optimization for Deep Learning
Sebastian Ruder
 

What's hot (19)

Why Deep Learning Works: Self Regularization in Deep Neural Networks
Why Deep Learning Works: Self Regularization in Deep Neural NetworksWhy Deep Learning Works: Self Regularization in Deep Neural Networks
Why Deep Learning Works: Self Regularization in Deep Neural Networks
 
Why Deep Learning Works: Self Regularization in Deep Neural Networks
Why Deep Learning Works: Self Regularization in Deep Neural NetworksWhy Deep Learning Works: Self Regularization in Deep Neural Networks
Why Deep Learning Works: Self Regularization in Deep Neural Networks
 
CC mmds talk 2106
CC mmds talk 2106CC mmds talk 2106
CC mmds talk 2106
 
Cc hass b school talk 2105
Cc hass b school talk  2105Cc hass b school talk  2105
Cc hass b school talk 2105
 
Cari2020 Parallel Hybridization for SAT: An Efficient Combination of Search S...
Cari2020 Parallel Hybridization for SAT: An Efficient Combination of Search S...Cari2020 Parallel Hybridization for SAT: An Efficient Combination of Search S...
Cari2020 Parallel Hybridization for SAT: An Efficient Combination of Search S...
 
A BA-based algorithm for parameter optimization of support vector machine
A BA-based algorithm for parameter optimization of support vector machineA BA-based algorithm for parameter optimization of support vector machine
A BA-based algorithm for parameter optimization of support vector machine
 
CC Talk at Berekely
CC Talk at BerekelyCC Talk at Berekely
CC Talk at Berekely
 
Cari presentation maurice-tchoupe-joskelngoufo
Cari presentation maurice-tchoupe-joskelngoufoCari presentation maurice-tchoupe-joskelngoufo
Cari presentation maurice-tchoupe-joskelngoufo
 
Data-Driven Recommender Systems
Data-Driven Recommender SystemsData-Driven Recommender Systems
Data-Driven Recommender Systems
 
Dictionary Learning for Massive Matrix Factorization
Dictionary Learning for Massive Matrix FactorizationDictionary Learning for Massive Matrix Factorization
Dictionary Learning for Massive Matrix Factorization
 
Chap 8. Optimization for training deep models
Chap 8. Optimization for training deep modelsChap 8. Optimization for training deep models
Chap 8. Optimization for training deep models
 
Introduction to behavior based recommendation system
Introduction to behavior based recommendation systemIntroduction to behavior based recommendation system
Introduction to behavior based recommendation system
 
iccv2009 tutorial: boosting and random forest - part II
iccv2009 tutorial: boosting and random forest - part IIiccv2009 tutorial: boosting and random forest - part II
iccv2009 tutorial: boosting and random forest - part II
 
Self-Adapting Large Neighborhood Search: Application to single-mode schedulin...
Self-Adapting Large Neighborhood Search: Application to single-mode schedulin...Self-Adapting Large Neighborhood Search: Application to single-mode schedulin...
Self-Adapting Large Neighborhood Search: Application to single-mode schedulin...
 
Industrial project and machine scheduling with Constraint Programming
Industrial project and machine scheduling with Constraint ProgrammingIndustrial project and machine scheduling with Constraint Programming
Industrial project and machine scheduling with Constraint Programming
 
Predicting churn in telco industry: machine learning approach - Marko Mitić
 Predicting churn in telco industry: machine learning approach - Marko Mitić Predicting churn in telco industry: machine learning approach - Marko Mitić
Predicting churn in telco industry: machine learning approach - Marko Mitić
 
Winner of EY NextWave Data Science Challenge 2019
Winner of EY NextWave Data Science Challenge 2019Winner of EY NextWave Data Science Challenge 2019
Winner of EY NextWave Data Science Challenge 2019
 
WeightWatcher Update: January 2021
WeightWatcher Update:  January 2021WeightWatcher Update:  January 2021
WeightWatcher Update: January 2021
 
Optimization for Deep Learning
Optimization for Deep LearningOptimization for Deep Learning
Optimization for Deep Learning
 

Similar to Georgetown B-school Talk 2021

Building AI Products: Delivery Vs Discovery
Building AI Products: Delivery Vs Discovery Building AI Products: Delivery Vs Discovery
Building AI Products: Delivery Vs Discovery
Charles Martin
 
Building the Cognitive Era : Big Data Strategies
Building the Cognitive Era : Big Data StrategiesBuilding the Cognitive Era : Big Data Strategies
Building the Cognitive Era : Big Data Strategies
Kevin Sigliano
 
Big Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao PauloBig Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao Paulo
OCTO Technology
 
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Mathieu DESPRIEE
 
Palo alto university rotary club talk Sep 29, 2107
Palo alto university rotary club talk Sep 29, 2107Palo alto university rotary club talk Sep 29, 2107
Palo alto university rotary club talk Sep 29, 2107
Charles Martin
 
DSDT Meetup April 2021
DSDT Meetup April 2021DSDT Meetup April 2021
DSDT Meetup April 2021
DSDT_MTL
 
Mohammed AL Madhani
Mohammed AL MadhaniMohammed AL Madhani
Mohammed AL Madhani
Mohammad Al Madhani
 
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
Santiago Cabrera-Naranjo
 
Data Analytics Today - Data, Tech, and Regulation.pdf
Data Analytics Today - Data, Tech, and Regulation.pdfData Analytics Today - Data, Tech, and Regulation.pdf
Data Analytics Today - Data, Tech, and Regulation.pdf
Hendri Karisma
 
Pan Dhoni - Modernizing Data And Analytics using AI.pdf
Pan Dhoni - Modernizing Data And Analytics using AI.pdfPan Dhoni - Modernizing Data And Analytics using AI.pdf
Pan Dhoni - Modernizing Data And Analytics using AI.pdf
SOLTUIONSpeople, THINKubators, THINKathons
 
The Future of HR: From Metrics to Analytics [Webcast]
The Future of HR: From Metrics to Analytics [Webcast]The Future of HR: From Metrics to Analytics [Webcast]
The Future of HR: From Metrics to Analytics [Webcast]
LinkedIn Talent Solutions
 
Intelligence Data Day 2020
Intelligence Data Day 2020Intelligence Data Day 2020
Intelligence Data Day 2020
Patrick Deglon
 
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Sri Ambati
 
1 machine learning demystified
1 machine learning demystified1 machine learning demystified
1 machine learning demystified
Dr Nisha Arora
 
The Sky’s the Limit – The Rise of Machine Learnin
The Sky’s the Limit – The Rise of Machine LearninThe Sky’s the Limit – The Rise of Machine Learnin
The Sky’s the Limit – The Rise of Machine Learnin
Inside Analysis
 
Data Competitive
Data CompetitiveData Competitive
Data Competitive
June Andrews
 
Data empathy - A Design Thinking approach to AI application development
Data empathy  -  A Design Thinking approach to AI application development Data empathy  -  A Design Thinking approach to AI application development
Data empathy - A Design Thinking approach to AI application development
Franki Chamaki
 
Build Intelligence System with AI. Antimo Musone, Ernst & Young
Build Intelligence System with AI. Antimo Musone, Ernst & YoungBuild Intelligence System with AI. Antimo Musone, Ernst & Young
Build Intelligence System with AI. Antimo Musone, Ernst & Young
Data Driven Innovation
 
Getting started-jan-9-2018
Getting started-jan-9-2018Getting started-jan-9-2018
Getting started-jan-9-2018
Thinkful
 
Optimize supply chains using machine learning superpowers webinar deck
Optimize supply chains using machine learning superpowers webinar deckOptimize supply chains using machine learning superpowers webinar deck
Optimize supply chains using machine learning superpowers webinar deck
TamrMarketing
 

Similar to Georgetown B-school Talk 2021 (20)

Building AI Products: Delivery Vs Discovery
Building AI Products: Delivery Vs Discovery Building AI Products: Delivery Vs Discovery
Building AI Products: Delivery Vs Discovery
 
Building the Cognitive Era : Big Data Strategies
Building the Cognitive Era : Big Data StrategiesBuilding the Cognitive Era : Big Data Strategies
Building the Cognitive Era : Big Data Strategies
 
Big Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao PauloBig Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao Paulo
 
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
 
Palo alto university rotary club talk Sep 29, 2107
Palo alto university rotary club talk Sep 29, 2107Palo alto university rotary club talk Sep 29, 2107
Palo alto university rotary club talk Sep 29, 2107
 
DSDT Meetup April 2021
DSDT Meetup April 2021DSDT Meetup April 2021
DSDT Meetup April 2021
 
Mohammed AL Madhani
Mohammed AL MadhaniMohammed AL Madhani
Mohammed AL Madhani
 
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
 
Data Analytics Today - Data, Tech, and Regulation.pdf
Data Analytics Today - Data, Tech, and Regulation.pdfData Analytics Today - Data, Tech, and Regulation.pdf
Data Analytics Today - Data, Tech, and Regulation.pdf
 
Pan Dhoni - Modernizing Data And Analytics using AI.pdf
Pan Dhoni - Modernizing Data And Analytics using AI.pdfPan Dhoni - Modernizing Data And Analytics using AI.pdf
Pan Dhoni - Modernizing Data And Analytics using AI.pdf
 
The Future of HR: From Metrics to Analytics [Webcast]
The Future of HR: From Metrics to Analytics [Webcast]The Future of HR: From Metrics to Analytics [Webcast]
The Future of HR: From Metrics to Analytics [Webcast]
 
Intelligence Data Day 2020
Intelligence Data Day 2020Intelligence Data Day 2020
Intelligence Data Day 2020
 
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
 
1 machine learning demystified
1 machine learning demystified1 machine learning demystified
1 machine learning demystified
 
The Sky’s the Limit – The Rise of Machine Learnin
The Sky’s the Limit – The Rise of Machine LearninThe Sky’s the Limit – The Rise of Machine Learnin
The Sky’s the Limit – The Rise of Machine Learnin
 
Data Competitive
Data CompetitiveData Competitive
Data Competitive
 
Data empathy - A Design Thinking approach to AI application development
Data empathy  -  A Design Thinking approach to AI application development Data empathy  -  A Design Thinking approach to AI application development
Data empathy - A Design Thinking approach to AI application development
 
Build Intelligence System with AI. Antimo Musone, Ernst & Young
Build Intelligence System with AI. Antimo Musone, Ernst & YoungBuild Intelligence System with AI. Antimo Musone, Ernst & Young
Build Intelligence System with AI. Antimo Musone, Ernst & Young
 
Getting started-jan-9-2018
Getting started-jan-9-2018Getting started-jan-9-2018
Getting started-jan-9-2018
 
Optimize supply chains using machine learning superpowers webinar deck
Optimize supply chains using machine learning superpowers webinar deckOptimize supply chains using machine learning superpowers webinar deck
Optimize supply chains using machine learning superpowers webinar deck
 

More from Charles Martin

Heavy Tails Workshop NeurIPS2023.pdf
Heavy Tails Workshop NeurIPS2023.pdfHeavy Tails Workshop NeurIPS2023.pdf
Heavy Tails Workshop NeurIPS2023.pdf
Charles Martin
 
LLM avalanche June 2023.pdf
LLM avalanche June 2023.pdfLLM avalanche June 2023.pdf
LLM avalanche June 2023.pdf
Charles Martin
 
WeightWatcher LLM Update
WeightWatcher LLM UpdateWeightWatcher LLM Update
WeightWatcher LLM Update
Charles Martin
 
ICCF24.pdf
ICCF24.pdfICCF24.pdf
ICCF24.pdf
Charles Martin
 
WeightWatcher Introduction
WeightWatcher IntroductionWeightWatcher Introduction
WeightWatcher Introduction
Charles Martin
 
Statistical Mechanics Methods for Discovering Knowledge from Production-Scale...
Statistical Mechanics Methods for Discovering Knowledge from Production-Scale...Statistical Mechanics Methods for Discovering Knowledge from Production-Scale...
Statistical Mechanics Methods for Discovering Knowledge from Production-Scale...
Charles Martin
 
Capsule Networks
Capsule NetworksCapsule Networks
Capsule Networks
Charles Martin
 
Cc stat phys draft
Cc stat phys draftCc stat phys draft
Cc stat phys draft
Charles Martin
 
Applied machine learning for search engine relevance 3
Applied machine learning for search engine relevance 3Applied machine learning for search engine relevance 3
Applied machine learning for search engine relevance 3
Charles Martin
 

More from Charles Martin (9)

Heavy Tails Workshop NeurIPS2023.pdf
Heavy Tails Workshop NeurIPS2023.pdfHeavy Tails Workshop NeurIPS2023.pdf
Heavy Tails Workshop NeurIPS2023.pdf
 
LLM avalanche June 2023.pdf
LLM avalanche June 2023.pdfLLM avalanche June 2023.pdf
LLM avalanche June 2023.pdf
 
WeightWatcher LLM Update
WeightWatcher LLM UpdateWeightWatcher LLM Update
WeightWatcher LLM Update
 
ICCF24.pdf
ICCF24.pdfICCF24.pdf
ICCF24.pdf
 
WeightWatcher Introduction
WeightWatcher IntroductionWeightWatcher Introduction
WeightWatcher Introduction
 
Statistical Mechanics Methods for Discovering Knowledge from Production-Scale...
Statistical Mechanics Methods for Discovering Knowledge from Production-Scale...Statistical Mechanics Methods for Discovering Knowledge from Production-Scale...
Statistical Mechanics Methods for Discovering Knowledge from Production-Scale...
 
Capsule Networks
Capsule NetworksCapsule Networks
Capsule Networks
 
Cc stat phys draft
Cc stat phys draftCc stat phys draft
Cc stat phys draft
 
Applied machine learning for search engine relevance 3
Applied machine learning for search engine relevance 3Applied machine learning for search engine relevance 3
Applied machine learning for search engine relevance 3
 

Recently uploaded

Creative Web Design Company in Singapore
Creative Web Design Company in SingaporeCreative Web Design Company in Singapore
Creative Web Design Company in Singapore
techboxsqauremedia
 
amptalk_RecruitingDeck_english_2024.06.05
amptalk_RecruitingDeck_english_2024.06.05amptalk_RecruitingDeck_english_2024.06.05
amptalk_RecruitingDeck_english_2024.06.05
marketing317746
 
Business storytelling: key ingredients to a story
Business storytelling: key ingredients to a storyBusiness storytelling: key ingredients to a story
Business storytelling: key ingredients to a story
Alexandra Fulford
 
Creative Web Design Company in Singapore
Creative Web Design Company in SingaporeCreative Web Design Company in Singapore
Creative Web Design Company in Singapore
techboxsqauremedia
 
Company Valuation webinar series - Tuesday, 4 June 2024
Company Valuation webinar series - Tuesday, 4 June 2024Company Valuation webinar series - Tuesday, 4 June 2024
Company Valuation webinar series - Tuesday, 4 June 2024
FelixPerez547899
 
Mastering B2B Payments Webinar from BlueSnap
Mastering B2B Payments Webinar from BlueSnapMastering B2B Payments Webinar from BlueSnap
Mastering B2B Payments Webinar from BlueSnap
Norma Mushkat Gaffin
 
Understanding User Needs and Satisfying Them
Understanding User Needs and Satisfying ThemUnderstanding User Needs and Satisfying Them
Understanding User Needs and Satisfying Them
Aggregage
 
How to Implement a Strategy: Transform Your Strategy with BSC Designer's Comp...
How to Implement a Strategy: Transform Your Strategy with BSC Designer's Comp...How to Implement a Strategy: Transform Your Strategy with BSC Designer's Comp...
How to Implement a Strategy: Transform Your Strategy with BSC Designer's Comp...
Aleksey Savkin
 
Structural Design Process: Step-by-Step Guide for Buildings
Structural Design Process: Step-by-Step Guide for BuildingsStructural Design Process: Step-by-Step Guide for Buildings
Structural Design Process: Step-by-Step Guide for Buildings
Chandresh Chudasama
 
Organizational Change Leadership Agile Tour Geneve 2024
Organizational Change Leadership Agile Tour Geneve 2024Organizational Change Leadership Agile Tour Geneve 2024
Organizational Change Leadership Agile Tour Geneve 2024
Kirill Klimov
 
Part 2 Deep Dive: Navigating the 2024 Slowdown
Part 2 Deep Dive: Navigating the 2024 SlowdownPart 2 Deep Dive: Navigating the 2024 Slowdown
Part 2 Deep Dive: Navigating the 2024 Slowdown
jeffkluth1
 
BeMetals Investor Presentation_June 1, 2024.pdf
BeMetals Investor Presentation_June 1, 2024.pdfBeMetals Investor Presentation_June 1, 2024.pdf
BeMetals Investor Presentation_June 1, 2024.pdf
DerekIwanaka1
 
Digital Marketing with a Focus on Sustainability
Digital Marketing with a Focus on SustainabilityDigital Marketing with a Focus on Sustainability
Digital Marketing with a Focus on Sustainability
sssourabhsharma
 
Lundin Gold Corporate Presentation - June 2024
Lundin Gold Corporate Presentation - June 2024Lundin Gold Corporate Presentation - June 2024
Lundin Gold Corporate Presentation - June 2024
Adnet Communications
 
Brian Fitzsimmons on the Business Strategy and Content Flywheel of Barstool S...
Brian Fitzsimmons on the Business Strategy and Content Flywheel of Barstool S...Brian Fitzsimmons on the Business Strategy and Content Flywheel of Barstool S...
Brian Fitzsimmons on the Business Strategy and Content Flywheel of Barstool S...
Neil Horowitz
 
How MJ Global Leads the Packaging Industry.pdf
How MJ Global Leads the Packaging Industry.pdfHow MJ Global Leads the Packaging Industry.pdf
How MJ Global Leads the Packaging Industry.pdf
MJ Global
 
2022 Vintage Roman Numerals Men Rings
2022 Vintage Roman  Numerals  Men  Rings2022 Vintage Roman  Numerals  Men  Rings
2022 Vintage Roman Numerals Men Rings
aragme
 
Unveiling the Dynamic Personalities, Key Dates, and Horoscope Insights: Gemin...
Unveiling the Dynamic Personalities, Key Dates, and Horoscope Insights: Gemin...Unveiling the Dynamic Personalities, Key Dates, and Horoscope Insights: Gemin...
Unveiling the Dynamic Personalities, Key Dates, and Horoscope Insights: Gemin...
my Pandit
 
Easily Verify Compliance and Security with Binance KYC
Easily Verify Compliance and Security with Binance KYCEasily Verify Compliance and Security with Binance KYC
Easily Verify Compliance and Security with Binance KYC
Any kyc Account
 
Chapter 7 Final business management sciences .ppt
Chapter 7 Final business management sciences .pptChapter 7 Final business management sciences .ppt
Chapter 7 Final business management sciences .ppt
ssuser567e2d
 

Recently uploaded (20)

Creative Web Design Company in Singapore
Creative Web Design Company in SingaporeCreative Web Design Company in Singapore
Creative Web Design Company in Singapore
 
amptalk_RecruitingDeck_english_2024.06.05
amptalk_RecruitingDeck_english_2024.06.05amptalk_RecruitingDeck_english_2024.06.05
amptalk_RecruitingDeck_english_2024.06.05
 
Business storytelling: key ingredients to a story
Business storytelling: key ingredients to a storyBusiness storytelling: key ingredients to a story
Business storytelling: key ingredients to a story
 
Creative Web Design Company in Singapore
Creative Web Design Company in SingaporeCreative Web Design Company in Singapore
Creative Web Design Company in Singapore
 
Company Valuation webinar series - Tuesday, 4 June 2024
Company Valuation webinar series - Tuesday, 4 June 2024Company Valuation webinar series - Tuesday, 4 June 2024
Company Valuation webinar series - Tuesday, 4 June 2024
 
Mastering B2B Payments Webinar from BlueSnap
Mastering B2B Payments Webinar from BlueSnapMastering B2B Payments Webinar from BlueSnap
Mastering B2B Payments Webinar from BlueSnap
 
Understanding User Needs and Satisfying Them
Understanding User Needs and Satisfying ThemUnderstanding User Needs and Satisfying Them
Understanding User Needs and Satisfying Them
 
How to Implement a Strategy: Transform Your Strategy with BSC Designer's Comp...
How to Implement a Strategy: Transform Your Strategy with BSC Designer's Comp...How to Implement a Strategy: Transform Your Strategy with BSC Designer's Comp...
How to Implement a Strategy: Transform Your Strategy with BSC Designer's Comp...
 
Structural Design Process: Step-by-Step Guide for Buildings
Structural Design Process: Step-by-Step Guide for BuildingsStructural Design Process: Step-by-Step Guide for Buildings
Structural Design Process: Step-by-Step Guide for Buildings
 
Organizational Change Leadership Agile Tour Geneve 2024
Organizational Change Leadership Agile Tour Geneve 2024Organizational Change Leadership Agile Tour Geneve 2024
Organizational Change Leadership Agile Tour Geneve 2024
 
Part 2 Deep Dive: Navigating the 2024 Slowdown
Part 2 Deep Dive: Navigating the 2024 SlowdownPart 2 Deep Dive: Navigating the 2024 Slowdown
Part 2 Deep Dive: Navigating the 2024 Slowdown
 
BeMetals Investor Presentation_June 1, 2024.pdf
BeMetals Investor Presentation_June 1, 2024.pdfBeMetals Investor Presentation_June 1, 2024.pdf
BeMetals Investor Presentation_June 1, 2024.pdf
 
Digital Marketing with a Focus on Sustainability
Digital Marketing with a Focus on SustainabilityDigital Marketing with a Focus on Sustainability
Digital Marketing with a Focus on Sustainability
 
Lundin Gold Corporate Presentation - June 2024
Lundin Gold Corporate Presentation - June 2024Lundin Gold Corporate Presentation - June 2024
Lundin Gold Corporate Presentation - June 2024
 
Brian Fitzsimmons on the Business Strategy and Content Flywheel of Barstool S...
Brian Fitzsimmons on the Business Strategy and Content Flywheel of Barstool S...Brian Fitzsimmons on the Business Strategy and Content Flywheel of Barstool S...
Brian Fitzsimmons on the Business Strategy and Content Flywheel of Barstool S...
 
How MJ Global Leads the Packaging Industry.pdf
How MJ Global Leads the Packaging Industry.pdfHow MJ Global Leads the Packaging Industry.pdf
How MJ Global Leads the Packaging Industry.pdf
 
2022 Vintage Roman Numerals Men Rings
2022 Vintage Roman  Numerals  Men  Rings2022 Vintage Roman  Numerals  Men  Rings
2022 Vintage Roman Numerals Men Rings
 
Unveiling the Dynamic Personalities, Key Dates, and Horoscope Insights: Gemin...
Unveiling the Dynamic Personalities, Key Dates, and Horoscope Insights: Gemin...Unveiling the Dynamic Personalities, Key Dates, and Horoscope Insights: Gemin...
Unveiling the Dynamic Personalities, Key Dates, and Horoscope Insights: Gemin...
 
Easily Verify Compliance and Security with Binance KYC
Easily Verify Compliance and Security with Binance KYCEasily Verify Compliance and Security with Binance KYC
Easily Verify Compliance and Security with Binance KYC
 
Chapter 7 Final business management sciences .ppt
Chapter 7 Final business management sciences .pptChapter 7 Final business management sciences .ppt
Chapter 7 Final business management sciences .ppt
 

Georgetown B-school Talk 2021

  • 1. calculation | consulting data science leadership (TM) c|c (TM) charles@calculationconsulting.com
  • 3. calculation | consulting data science leadership Who Are We? c|c (TM) Dr. Charles H. Martin, PhD University of Chicago, Chemical Physics NSF Fellow in Theoretical Chemistry Over 10 years experience in applied Machine Learning Developed ML algos for Demand Media; the first $1B IPO since Google Lean Start Ups: Aardvark (acquired by Google), eHow, Wall Street: GLG, BGI, BlackRock Fortune 500: Roche, France Telecom Tech / Retail: GoDaddy, eBay, Walmart Investment: Griffin Advisors, Page Family Offices
 www.calculationconsulting.com charles@calculationconsulting.com (TM) 3
  • 4. Recent AI News: Epic Systems When machine learning FAILS c|c (TM) calculation | consulting data science leadership (TM) 4
  • 5. Recent AI News: Epic Systems c|c (TM) calculation | consulting data science leadership (TM) 5 “[The] definition of sepsis based on billing codes alone is imprecise and not the one that is clinically meaningful to a health system or to patients.” When machine learning FAILS
  • 6. Recent AI News: Zillow When machine learning (or AI) FAILS c|c (TM) calculation | consulting data science leadership (TM) 6
  • 7. Recent AI News: Zillow When machine learning (or AI) FAILS c|c (TM) calculation | consulting data science leadership (TM) 7
  • 8. Data Science is Different c|c (TM) calculation | consulting data science leadership (TM) 8
  • 9. Data Science Leadership : Becoming Data-Led c|c (TM) (TM) calculation | consulting data science leadership 9 1. Data Informed: OperationalVisibility 2. Data Driven: Tooling and Insights for Growth 3. Data Led Automation and Innovation creating the data-led organization
  • 10. Data Science Leadership : 4 Steps to Leading c|c (TM) (TM) calculation | consulting data science leadership 10 • Strategy: How can you leverage data ? • Stage: How mature is your data ? • Team: What team do we need ? • Tools: What tools do they need ? creating the data-led organization
  • 11. Strategy:Algo Gas Station Analogy Problem: where to open a gas station ? Need: good traffic, weak competition c|c (TM) less competitors no traffic sweet spot great traffic too many competitors calculation | consulting data science leadership ML algorithms can predict supply and demand (TM) 11
  • 12. Strategy: Data Science Process • Acquire Domain Knowledge • Formulate Hypothesis • Generate Model(s) from the Data • Predict Revenue Gains • Backtest Predictions on your Data • A/B Test in Production • Attribute Gains to Model(s) c|c (TM) (TM) acting solving framing calculation | consulting data science leadership 12
  • 13. c|c (TM) • Systems Thinking: leveraging the inter-relationships between data, marketing, and the customer • Knowledge Transfer: mentoring — not training — to develop both personal mastery and team learning • Mental Models: create a base of small-scale models for thinking about how to use your data • Knowledge Sharing: foster collaboration between research, engineering, and product to drive revenue Strategy: Learning from Data calculation | consulting data science leadership (TM) 13
  • 14. c|c (TM) • Cross-functional engineering, product, marketing, finance • Autonomous: separate from the traditional engineering product lifecycle. self-organizing and self-managing • Experimental: form hypothesis, analyze data, make predictions, run backtests, A/B testing • Self-sustaining: not a cost center; generates revenue (TM) calculation | consulting data science leadership 14 Strategy: Data Science is not IT
  • 15. c|c (TM) (TM) Problem: Externalities calculation | consulting data science leadership 15 external factors can change
  • 16. (TM) c|c (TM) Data is only is as accurate as it’s original intent demanded calculation | consulting data science leadership 16 Stage: Your Data Maturity • Where is your data ? Transaction Database? Web Logs ? 3rd party system ? Data Lake ? • What product does it service ? Billing ? CRM ? • Can you access it ? Security ? Regulations ? • Who owns it ? Responsible for quality ?
  • 17. Problem: Data Quality Mismatch (TM) c|c (TM) Data is only is as accurate as it’s original intent demanded calculation | consulting data science leadership 17 ?
  • 18. Problem: Data Quality Mismatch (TM) c|c (TM) Data is only is as accurate as it’s original intent demanded calculation | consulting data science leadership 18 Recommender System
  • 19. Problem: Data Quality Mismatch (TM) c|c (TM) Data is only is as accurate as it’s original intent demanded calculation | consulting data science leadership 19 Recommender System Quality of product metadata May not materially impact billing x ? wrong missing
  • 20. (TM) c|c (TM) “Only the paranoid survive” Andy Grove (Intel) calculation | consulting data science leadership 20 Recommender System Solution: Be Paranoid and Test Everything
  • 21. (TM) c|c (TM) “Only the paranoid survive” Andy Grove (Intel) calculation | consulting data science leadership 21 Recommender System Solution: Test Everything Software engineers can be paranoid about programming. In fact, Paranoid Programming is a thing. You have to be paranoid about your data. Thing is, bad code can usually be fixed. But bad data has usually has to be thrown away
  • 22. (TM) c|c (TM) calculation | consulting data science leadership 22 Recommender System
  • 23. Problem: Data Contraband (TM) c|c (TM) data 'from a friend’ that may violate compliance calculation | consulting data science leadership 23 Recommender System Data pulled into spreadsheet / csv Data actually stored in DB Data passed around by email, etc
  • 24. (TM) c|c (TM) calculation | consulting data science leadership 24 Recommender System Google Sheets, SAP, etc (where you can track everything) Move functions to the data (stored procedures, Spark, etc) Jira, GitHub, Confluence, . Document tracking systems Solutions: Data Contraband
  • 25. Team: Data Scientists are Different c|c (TM) calculation | consulting data science leadership (TM) 25 not all techies are the same
  • 26. Team: Data Scientists are Different c|c (TM) calculation | consulting data science leadership theoretical physics machine learning / AI specialist (TM) 26 applied physics data scientist engineer software, browser tech, dev ops, … not all techies are the same
  • 27. Team: Data Scientists are Different c|c (TM) calculation | consulting data science leadership Data science group. Can be very isolated. Very research-y & difficult to productionalize (TM) 27 Embedded data scientist, solves problems builds solutions, and deploys them Software and IT services Great at managing code and systems Not great with data, math — or ambiguity not all techies are the same
  • 28. FANNG Managers: Fallen Gods c|c (TM) (TM) calculation | consulting data science leadership 28 the Earth is flat and they fallen off
  • 29. FANNG Managers: Fallen Gods c|c (TM) (TM) calculation | consulting data science leadership 29 FAANG infrastructure is 10-20 years ahead
  • 30. FANNG Managers: Fallen Gods c|c (TM) (TM) calculation | consulting data science leadership 30 you need infrastructure to deliver data products
  • 31. Data Strategy : Think like a Beginner c|c (TM) (TM) calculation | consulting data science leadership 31 cultivate a beginner’s mind - Test your assumptions. Literally - Look for problems early on. And never stop looking - Distinguish between statistical structural outliers. - Repair your data, if possible. - Start with simple, robust methods. - Sophisticated models are more sensitive to errors - and are more easily overtrained. - Evaluate your predictions on real data, and figure out how to attribute results to your models. - Re-calibrate your models if necessary.
  • 32. Tools: What the Team Needs (TM) c|c (TM) • Infrastructure: Data storage, cloud services, etc • Analytics: Measuring whats going on • Operations: Keeping things running • Machine Learning and AI: Growth and Innovation Algorithms, not data lakes, generate revenue calculation | consulting data science leadership 32
  • 33. Tools: What the Team Needs to Know (TM) c|c (TM) • Metrics: What KPIs you have, and what to hit • Access: How to get what they need (i.e self-service) • Impact: How tooling (used and built) support the business • Truth: What data is reliable, what is not Algorithms, not data lakes, generate revenue calculation | consulting data science leadership 33
  • 34. c|c (TM) (TM) Final Thoughts: Algorithmic Accountability calculation | consulting data science leadership An asset is an economic resource. Anything tangible or intangible that is capable of being owned or controlled to produce value and that is held to have positive economic value is considered an asset. algorithms can be valuable assets 34
  • 35. c|c (TM) (TM) Algorithmic Accountability calculation | consulting data science leadership 35 does revenue depends on hidden algos ? • WebMD Google SEO • Amazon Product Listing Algo • Pinterest Relevance Algo • Twitter Spam filter • Apple App Store Rankings
  • 36. c|c (TM) (TM) Algorithmic Accountability calculation | consulting data science leadership 36 do decisions depend on hidden factors ? A 'Crisis' in Online Ads: One-Third of Traffic Is Bogus http://www.wsj.com/articles/SB10001424052702304026304579453253860786362 Now Algorithms Are DecidingWhomTo Hire… http://www.npr.org/blogs/alltechconsidered/2015/03/23/394827451/now-algorithms-are-deciding-whom-to-hire-based-on-voice What you don’t know about Internet algorithms is hurting you… http://www.washingtonpost.com/news/the-intersect/wp/2015/03/23/what-you-dont-know-about-internet-algorithms-is-hurting-you-and-you-probably-dont-know-very-much/
  • 37. c|c (TM) (TM) Solution: Algorithmic Transparency calculation | consulting data science leadership 37 can you be transparent and not be gamed ? http://fortune.com/2015/03/18/how-do-you-govern-a-hidden-fluid-and-amoral-algorithm/ 83% of the participants in the study changed their behavior once they knew about the algorithm How do you govern a (hidden, fluid and amoral) algorithm? participants mistakenly believed that their friends intentionally chose not to show them stories
  • 38. c|c (TM) (TM) Algorithmic Accountability calculation | consulting data science leadership Do you depend on some else’s marketplace? How does your revenue depend on algos? Do you need an internal algo ? Who will manage it? build it? maintain it? algorithms have unforeseen liabilities 38