SlideShare a Scribd company logo
calculation | consulting
data science leadership
(TM)
c|c
(TM)
charles@calculationconsulting.com
calculation|consulting
Data Science Leadership
(TM)
charles@caclulationconsulting.com
calculation | consulting data science leadership
Who Are We?
c|c
(TM)
Dr. Charles H. Martin, PhD
University of Chicago, Chemical Physics
NSF Fellow in Theoretical Chemistry
Over 10 years experience in applied Machine Learning
Developed ML algos for Demand Media; the first $1B IPO since Google
Lean Start Ups: Aardvark (acquired by Google), eHow, Mode
Wall Street: BlackRock, GLG
Fortune 500: Big Pharma, Telecom, eBay
www.calculationconsulting.com
charles@calculationconsulting.com
(TM)
3
BackStory: in 2011, Search Changed. Forever.
• first $1B IPO since Google
• Machine Learning based SEO algorithms
• Measure the demand for search, and fulfill it
data science algorithms created a billion $ company
c|c
(TM)
(TM)
Demand Media
calculation | consulting data science leadership(TM)
4
eHow.com
BackStory: in 2011, Search Changed. Forever.
• Google adapted (Panda)
• Lack of diversification
• Lack of adaptation
• Stock price never recovered
algorithmic accountability: DMD or Google?
c|c
(TM)
IPO
Panda
stock price 2011-2012
(TM)
calculation | consulting data science leadership
DMD
(TM)
5
• first $1B collapse due to Panda ?
• CPC revenues down
• premium online publishers died
collapse
?
stock price 2011-2012
c|c
(TM)
$1B in ad revenue was repriced and reallocated
Problem: Cornering the market on
search induced a market crash
calculation | consulting data science leadership(TM)
6
c|c
(TM)
Panda-Induced ‘Market Crash’
Google CPC dropped just after Panda
calculation | consulting data science leadership(TM)
7
Data Science is Different
c|c
(TM)
Davenport
calculation | consulting data science leadership
Generating sustainable revenue requires
Data Science Leadership and Execution
(TM)
8
“Companies need a Spock in the boardroom”
Data Science is Different
c|c
(TM)
Davenport
calculation | consulting data science leadership
Generating sustainable revenue requires
Data Science Leadership and Execution
(TM)
9
http://www.theonion.com/articles/national-science-foundation-science-hard,1405/
Problem: Data Scientists are Different
c|c
(TM)
Davenport
calculation | consulting data science leadership(TM)
10
not all techies are the same
Problem: Data Scientists are Different
c|c
(TM)
Davenport
calculation | consulting data science leadership
theoretical physics
machine learning specialist
(TM)
11
experimental physics
data scientist
engineer
software, browser tech, dev ops, …
not all techies are the same
Problem: Data Scientists are Different
c|c
(TM)
Davenport
calculation | consulting data science leadership(TM)
12
not all techies are the same
Managing: Data Science Process
• Acquire Domain Knowledge
• Formulate Hypothesis
• Generate Model(s) from the Data
• Predict Revenue Gains
• Backtest Predictions on your Data
• A/B Test in Production
• Attribute Gains to Model(s)
c|c
(TM)
(TM)
acting
solving
framing
calculation | consulting data science leadership
13
Managing: Data Science Process
c|c
(TM)
(TM)
calculation | consulting data science leadership
14
c|c
(TM)
• Systems Thinking: leveraging the inter-relationships
between data, marketing, and the customer
• Knowledge Transfer: mentoring — not training — to
develop both personal mastery and team learning
• Mental Models: create a base of small-scale models for
thinking about how to use your data
• Knowledge Sharing: foster collaboration between
research, engineering, and product to drive revenue
Managing: Learning from Data
calculation | consulting data science leadership(TM)
15
c|c
(TM)
• Cross-functional engineering, product, marketing, finance
• Autonomous: separate from the traditional engineering
product lifecycle. self-organizing and self-managing
• Experimental: form hypothesis, analyze data, make
predictions, run backtests, A/B testing
• Self-sustaining: not a cost center; generates revenue
(TM)
Data Science is Different
calculation | consulting data science leadership
16
Solution: Collecting and Organizing Data
(TM)
c|c
(TM)
• Most companies are struggling organizing their data
• Data needs to be examined
• Don’t assume data is correct or useful
• More is More: simple algos work
• More is Less: noise is noise
Data not examined is not collected
calculation | consulting data science leadership
17
Solutions: Hadoop and Big Data
(TM)
c|c
(TM)
• Hadoop is an internal data ecosystem
• Hadoop appears to have won the adoption wars ?
• Hadoop : 90% deployments internal
• Hadoop is a cost center
• ROI needs cut across business divisions
Algorithms, not data, generate revenue
calculation | consulting data science leadership
18
Solutions: Cloud
(TM)
c|c
(TM)
• Startups don’t need infrastructure
• long term Data Storage is virtually free
• Amazon Redshift
• Google Big Query
• Cloud is the future ?
Algorithms, not data, generate revenue
calculation | consulting data science leadership
19
Solutions: Spark
(TM)
c|c
(TM)
• Next Gen Platform for Machine Learning
• Sits on Hadoop or the Cloud
• Still very high touch
• Limited algos
Algorithms, not data, generate revenue
calculation | consulting data science leadership
20
Problem: Measurements
(TM)
c|c
(TM)
good experiments are amazing
calculation | consulting data science leadership
21
“If you can’t measure it, you can’t fix it.”
DJ Patil,White House Chief Data Scientist
Data Science’s Measurement Problem
(TM)
c|c
(TM)
good experiments are hard to design
calculation | consulting data science leadership
22
http://www.forbes.com/sites/lizryan/2014/02/10/if-you-cant-measure-it-you-cant-manage-it-is-bs/
Data Science’s Measurement Problem
(TM)
c|c
(TM)
good experiments are hard to design
calculation | consulting data science leadership
23
“Data science has a measurement problem.
Simple metrics may not address complex situations.
But complex metrics present myriad problems.”
“As we strive for better algorithms,
we often fail to think critically about what it means
for predictions to be ‘good’”
http://www.kdnuggets.com/2015/03/data-science-measurement-problem-accuracy-auroc-f1.html
Data Science’s Measurement Problem
(TM)
c|c
(TM)
good experiments are hard to design
calculation | consulting data science leadership
24
“Buffett found it 'extraordinary' that academics studied such things.
They studied what was measurable, rather than what was meaningful.‘
… to a man with a hammer,
everything looks like a nail.”
― Roger Lowenstein, Buffett:
The Making of an American Capitalist
c|c
(TM)
(TM)
Problem: The Cult of the Algorithm
calculation | consulting data science leadership
25
what can algos actually do ?
“We have a new machine learning algo that anticipate
your needs over time and behave accordingly”
c|c
(TM)
(TM)
Problem: What can Machine Learning Do?
calculation | consulting data science leadership
26
what can algos actually do ?
Demand Algos: Gas Station Analogy
Problem: where to open a gas station ?
Need: good traffic, weak competition
c|c
(TM)
less competitors
no traffic
sweet spot
great traffic
too many competitors
calculation | consulting data science leadership
all businesses balance supply and demand
(TM)
27
SAAS Machine Learning Algos
c|c
(TM)
calculation | consulting data science leadership
(TM)
28
$100,000 • 167 teams
Diabetic Retinopathy Detection
$15,000 • 341 teams
March Machine Learning Mania 2015
machine learning contests
SAAS Machine Learning Algos
c|c
(TM)
calculation | consulting data science leadership
(TM)
29
machine learning apis
c|c
(TM)
(TM)
Problem: What can Deep Learning Do?
calculation | consulting data science leadership
30
what can algos actually do ?
c|c
(TM)
(TM)
Problem: Externalities
calculation | consulting data science leadership
31
external factors can change
c|c
(TM)
(TM)
Problem: Externalities
calculation | consulting data science leadership
32
“Zynga is our best company ever!” (2010)
John Doerr, Google Investor, LegendaryVC
http://venturebeat.com/2010/11/16/google-investor-john-doerr-zynga-is-our-best-company-ever/
one marketplace | big risks
c|c
(TM)
(TM)
Solution: Algorithmic Accountability
calculation | consulting data science leadership
An asset is an economic resource.
Anything tangible or intangible that is capable of
being owned or controlled to produce value and
that is held to have positive economic value is
considered an asset.
algorithms can be valuable assets
33
c|c
(TM)
(TM)
Algorithmic Accountability
calculation | consulting data science leadership
34
does revenue depends on hidden algos ?
• WebMD Google SEO
• Amazon Product Listing Algo
• Pinterest Relevance Algo
• Twitter Spam filter
• Apple App Store Rankings
c|c
(TM)
(TM)
Algorithmic Accountability
calculation | consulting data science leadership
35
do decisions depend on hidden factors ?
A 'Crisis' in Online Ads: One-Third of Traffic Is Bogus
http://www.wsj.com/articles/SB10001424052702304026304579453253860786362
Now Algorithms Are DecidingWhomTo Hire…
http://www.npr.org/blogs/alltechconsidered/2015/03/23/394827451/now-algorithms-are-deciding-whom-to-hire-based-on-voice
What you don’t know about Internet algorithms is hurting you…
http://www.washingtonpost.com/news/the-intersect/wp/2015/03/23/what-you-dont-know-about-internet-algorithms-is-hurting-you-and-you-probably-dont-know-very-much/
c|c
(TM)
(TM)
Solution: Algorithmic Transparency
calculation | consulting data science leadership
36
can you be transparent and not be gamed ?
http://fortune.com/2015/03/18/how-do-you-govern-a-hidden-fluid-and-amoral-algorithm/
83% of the participants in the study changed their behavior
once they knew about the algorithm
How do you govern a (hidden, fluid and amoral) algorithm?
participants mistakenly believed that their friends intentionally
chose not to show them stories
c|c
(TM)
(TM)
Algorithmic Accountability
calculation | consulting data science leadership
Do you depend on some else’s marketplace?
How does your revenue depend on algos?
Do you need an internal algo ?
Who will manage it? build it? maintain it?
algorithms have unforeseen liabilities
37
(TM)
c|c
(TM)
c | c
charles@calculationconsulting.com

More Related Content

Similar to Cc hass b school talk 2105

Building AI Products: Delivery Vs Discovery
Building AI Products: Delivery Vs Discovery Building AI Products: Delivery Vs Discovery
Building AI Products: Delivery Vs Discovery
Charles Martin
 
Big Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao PauloBig Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao Paulo
OCTO Technology
 
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Mathieu DESPRIEE
 
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Sri Ambati
 
The Sky’s the Limit – The Rise of Machine Learnin
The Sky’s the Limit – The Rise of Machine LearninThe Sky’s the Limit – The Rise of Machine Learnin
The Sky’s the Limit – The Rise of Machine Learnin
Inside Analysis
 
Building the Cognitive Era : Big Data Strategies
Building the Cognitive Era : Big Data StrategiesBuilding the Cognitive Era : Big Data Strategies
Building the Cognitive Era : Big Data Strategies
Kevin Sigliano
 
Machine Learning - why the hype and how it does its magic
Machine Learning - why the hype and how it does its magicMachine Learning - why the hype and how it does its magic
Machine Learning - why the hype and how it does its magic
Amirali Charania
 
DSDT Meetup April 2021
DSDT Meetup April 2021DSDT Meetup April 2021
DSDT Meetup April 2021
DSDT_MTL
 
20210907 jim spohrer berkeley ai_do_dont v1
20210907 jim spohrer berkeley ai_do_dont  v120210907 jim spohrer berkeley ai_do_dont  v1
20210907 jim spohrer berkeley ai_do_dont v1
ISSIP
 
DutchMLSchool. ML Business Perspective
DutchMLSchool. ML Business PerspectiveDutchMLSchool. ML Business Perspective
DutchMLSchool. ML Business Perspective
BigML, Inc
 
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
Santiago Cabrera-Naranjo
 
Intelligence Data Day 2020
Intelligence Data Day 2020Intelligence Data Day 2020
Intelligence Data Day 2020
Patrick Deglon
 
Palo alto university rotary club talk Sep 29, 2107
Palo alto university rotary club talk Sep 29, 2107Palo alto university rotary club talk Sep 29, 2107
Palo alto university rotary club talk Sep 29, 2107
Charles Martin
 
Metric-X - My Entrepreneurial Journey
Metric-X - My Entrepreneurial JourneyMetric-X - My Entrepreneurial Journey
Metric-X - My Entrepreneurial Journey
Metric-X
 
Data Competitive
Data CompetitiveData Competitive
Data Competitive
June Andrews
 
Lunch and Learn: You have the data, now what?
Lunch and Learn: You have the data, now what?Lunch and Learn: You have the data, now what?
Lunch and Learn: You have the data, now what?
DiUS
 
AI Orange Belt - Session 4
AI Orange Belt - Session 4AI Orange Belt - Session 4
AI Orange Belt - Session 4
AI Black Belt
 
Model Factory at ING Bank
Model Factory at ING BankModel Factory at ING Bank
Model Factory at ING Bank
DataWorks Summit
 
Minne analytics presentation 2018 12 03 final compressed
Minne analytics presentation 2018 12 03 final   compressedMinne analytics presentation 2018 12 03 final   compressed
Minne analytics presentation 2018 12 03 final compressed
Bonnie Holub
 
Data Architecture Strategies: Artificial Intelligence - Real-World Applicatio...
Data Architecture Strategies: Artificial Intelligence - Real-World Applicatio...Data Architecture Strategies: Artificial Intelligence - Real-World Applicatio...
Data Architecture Strategies: Artificial Intelligence - Real-World Applicatio...
DATAVERSITY
 

Similar to Cc hass b school talk 2105 (20)

Building AI Products: Delivery Vs Discovery
Building AI Products: Delivery Vs Discovery Building AI Products: Delivery Vs Discovery
Building AI Products: Delivery Vs Discovery
 
Big Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao PauloBig Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao Paulo
 
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
 
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
 
The Sky’s the Limit – The Rise of Machine Learnin
The Sky’s the Limit – The Rise of Machine LearninThe Sky’s the Limit – The Rise of Machine Learnin
The Sky’s the Limit – The Rise of Machine Learnin
 
Building the Cognitive Era : Big Data Strategies
Building the Cognitive Era : Big Data StrategiesBuilding the Cognitive Era : Big Data Strategies
Building the Cognitive Era : Big Data Strategies
 
Machine Learning - why the hype and how it does its magic
Machine Learning - why the hype and how it does its magicMachine Learning - why the hype and how it does its magic
Machine Learning - why the hype and how it does its magic
 
DSDT Meetup April 2021
DSDT Meetup April 2021DSDT Meetup April 2021
DSDT Meetup April 2021
 
20210907 jim spohrer berkeley ai_do_dont v1
20210907 jim spohrer berkeley ai_do_dont  v120210907 jim spohrer berkeley ai_do_dont  v1
20210907 jim spohrer berkeley ai_do_dont v1
 
DutchMLSchool. ML Business Perspective
DutchMLSchool. ML Business PerspectiveDutchMLSchool. ML Business Perspective
DutchMLSchool. ML Business Perspective
 
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
 
Intelligence Data Day 2020
Intelligence Data Day 2020Intelligence Data Day 2020
Intelligence Data Day 2020
 
Palo alto university rotary club talk Sep 29, 2107
Palo alto university rotary club talk Sep 29, 2107Palo alto university rotary club talk Sep 29, 2107
Palo alto university rotary club talk Sep 29, 2107
 
Metric-X - My Entrepreneurial Journey
Metric-X - My Entrepreneurial JourneyMetric-X - My Entrepreneurial Journey
Metric-X - My Entrepreneurial Journey
 
Data Competitive
Data CompetitiveData Competitive
Data Competitive
 
Lunch and Learn: You have the data, now what?
Lunch and Learn: You have the data, now what?Lunch and Learn: You have the data, now what?
Lunch and Learn: You have the data, now what?
 
AI Orange Belt - Session 4
AI Orange Belt - Session 4AI Orange Belt - Session 4
AI Orange Belt - Session 4
 
Model Factory at ING Bank
Model Factory at ING BankModel Factory at ING Bank
Model Factory at ING Bank
 
Minne analytics presentation 2018 12 03 final compressed
Minne analytics presentation 2018 12 03 final   compressedMinne analytics presentation 2018 12 03 final   compressed
Minne analytics presentation 2018 12 03 final compressed
 
Data Architecture Strategies: Artificial Intelligence - Real-World Applicatio...
Data Architecture Strategies: Artificial Intelligence - Real-World Applicatio...Data Architecture Strategies: Artificial Intelligence - Real-World Applicatio...
Data Architecture Strategies: Artificial Intelligence - Real-World Applicatio...
 

More from Charles Martin

Heavy Tails Workshop NeurIPS2023.pdf
Heavy Tails Workshop NeurIPS2023.pdfHeavy Tails Workshop NeurIPS2023.pdf
Heavy Tails Workshop NeurIPS2023.pdf
Charles Martin
 
LLM avalanche June 2023.pdf
LLM avalanche June 2023.pdfLLM avalanche June 2023.pdf
LLM avalanche June 2023.pdf
Charles Martin
 
WeightWatcher LLM Update
WeightWatcher LLM UpdateWeightWatcher LLM Update
WeightWatcher LLM Update
Charles Martin
 
ICCF24.pdf
ICCF24.pdfICCF24.pdf
ICCF24.pdf
Charles Martin
 
ENS Macrh 2022.pdf
ENS Macrh 2022.pdfENS Macrh 2022.pdf
ENS Macrh 2022.pdf
Charles Martin
 
Weight watcher Bay Area ACM Feb 28, 2022
Weight watcher Bay Area ACM Feb 28, 2022 Weight watcher Bay Area ACM Feb 28, 2022
Weight watcher Bay Area ACM Feb 28, 2022
Charles Martin
 
WeightWatcher Introduction
WeightWatcher IntroductionWeightWatcher Introduction
WeightWatcher Introduction
Charles Martin
 
WeightWatcher Update: January 2021
WeightWatcher Update:  January 2021WeightWatcher Update:  January 2021
WeightWatcher Update: January 2021
Charles Martin
 
Stanford ICME Lecture on Why Deep Learning Works
Stanford ICME Lecture on Why Deep Learning WorksStanford ICME Lecture on Why Deep Learning Works
Stanford ICME Lecture on Why Deep Learning Works
Charles Martin
 
Statistical Mechanics Methods for Discovering Knowledge from Production-Scale...
Statistical Mechanics Methods for Discovering Knowledge from Production-Scale...Statistical Mechanics Methods for Discovering Knowledge from Production-Scale...
Statistical Mechanics Methods for Discovering Knowledge from Production-Scale...
Charles Martin
 
This Week in Machine Learning and AI Feb 2019
This Week in Machine Learning and AI Feb 2019This Week in Machine Learning and AI Feb 2019
This Week in Machine Learning and AI Feb 2019
Charles Martin
 
Why Deep Learning Works: Self Regularization in Deep Neural Networks
Why Deep Learning Works: Self Regularization in Deep Neural Networks Why Deep Learning Works: Self Regularization in Deep Neural Networks
Why Deep Learning Works: Self Regularization in Deep Neural Networks
Charles Martin
 
Why Deep Learning Works: Self Regularization in Deep Neural Networks
Why Deep Learning Works: Self Regularization in Deep Neural NetworksWhy Deep Learning Works: Self Regularization in Deep Neural Networks
Why Deep Learning Works: Self Regularization in Deep Neural Networks
Charles Martin
 
Why Deep Learning Works: Self Regularization in Deep Neural Networks
Why Deep Learning Works: Self Regularization in Deep Neural NetworksWhy Deep Learning Works: Self Regularization in Deep Neural Networks
Why Deep Learning Works: Self Regularization in Deep Neural Networks
Charles Martin
 
Capsule Networks
Capsule NetworksCapsule Networks
Capsule Networks
Charles Martin
 
Cc stat phys draft
Cc stat phys draftCc stat phys draft
Cc stat phys draft
Charles Martin
 
CC mmds talk 2106
CC mmds talk 2106CC mmds talk 2106
CC mmds talk 2106
Charles Martin
 
Applied machine learning for search engine relevance 3
Applied machine learning for search engine relevance 3Applied machine learning for search engine relevance 3
Applied machine learning for search engine relevance 3
Charles Martin
 

More from Charles Martin (18)

Heavy Tails Workshop NeurIPS2023.pdf
Heavy Tails Workshop NeurIPS2023.pdfHeavy Tails Workshop NeurIPS2023.pdf
Heavy Tails Workshop NeurIPS2023.pdf
 
LLM avalanche June 2023.pdf
LLM avalanche June 2023.pdfLLM avalanche June 2023.pdf
LLM avalanche June 2023.pdf
 
WeightWatcher LLM Update
WeightWatcher LLM UpdateWeightWatcher LLM Update
WeightWatcher LLM Update
 
ICCF24.pdf
ICCF24.pdfICCF24.pdf
ICCF24.pdf
 
ENS Macrh 2022.pdf
ENS Macrh 2022.pdfENS Macrh 2022.pdf
ENS Macrh 2022.pdf
 
Weight watcher Bay Area ACM Feb 28, 2022
Weight watcher Bay Area ACM Feb 28, 2022 Weight watcher Bay Area ACM Feb 28, 2022
Weight watcher Bay Area ACM Feb 28, 2022
 
WeightWatcher Introduction
WeightWatcher IntroductionWeightWatcher Introduction
WeightWatcher Introduction
 
WeightWatcher Update: January 2021
WeightWatcher Update:  January 2021WeightWatcher Update:  January 2021
WeightWatcher Update: January 2021
 
Stanford ICME Lecture on Why Deep Learning Works
Stanford ICME Lecture on Why Deep Learning WorksStanford ICME Lecture on Why Deep Learning Works
Stanford ICME Lecture on Why Deep Learning Works
 
Statistical Mechanics Methods for Discovering Knowledge from Production-Scale...
Statistical Mechanics Methods for Discovering Knowledge from Production-Scale...Statistical Mechanics Methods for Discovering Knowledge from Production-Scale...
Statistical Mechanics Methods for Discovering Knowledge from Production-Scale...
 
This Week in Machine Learning and AI Feb 2019
This Week in Machine Learning and AI Feb 2019This Week in Machine Learning and AI Feb 2019
This Week in Machine Learning and AI Feb 2019
 
Why Deep Learning Works: Self Regularization in Deep Neural Networks
Why Deep Learning Works: Self Regularization in Deep Neural Networks Why Deep Learning Works: Self Regularization in Deep Neural Networks
Why Deep Learning Works: Self Regularization in Deep Neural Networks
 
Why Deep Learning Works: Self Regularization in Deep Neural Networks
Why Deep Learning Works: Self Regularization in Deep Neural NetworksWhy Deep Learning Works: Self Regularization in Deep Neural Networks
Why Deep Learning Works: Self Regularization in Deep Neural Networks
 
Why Deep Learning Works: Self Regularization in Deep Neural Networks
Why Deep Learning Works: Self Regularization in Deep Neural NetworksWhy Deep Learning Works: Self Regularization in Deep Neural Networks
Why Deep Learning Works: Self Regularization in Deep Neural Networks
 
Capsule Networks
Capsule NetworksCapsule Networks
Capsule Networks
 
Cc stat phys draft
Cc stat phys draftCc stat phys draft
Cc stat phys draft
 
CC mmds talk 2106
CC mmds talk 2106CC mmds talk 2106
CC mmds talk 2106
 
Applied machine learning for search engine relevance 3
Applied machine learning for search engine relevance 3Applied machine learning for search engine relevance 3
Applied machine learning for search engine relevance 3
 

Recently uploaded

Anny Serafina Love - Letter of Recommendation by Kellen Harkins, MS.
Anny Serafina Love - Letter of Recommendation by Kellen Harkins, MS.Anny Serafina Love - Letter of Recommendation by Kellen Harkins, MS.
Anny Serafina Love - Letter of Recommendation by Kellen Harkins, MS.
AnnySerafinaLove
 
Brand Analysis for an artist named Struan
Brand Analysis for an artist named StruanBrand Analysis for an artist named Struan
Brand Analysis for an artist named Struan
sarahvanessa51503
 
Recruiting in the Digital Age: A Social Media Masterclass
Recruiting in the Digital Age: A Social Media MasterclassRecruiting in the Digital Age: A Social Media Masterclass
Recruiting in the Digital Age: A Social Media Masterclass
LuanWise
 
Authentically Social Presented by Corey Perlman
Authentically Social Presented by Corey PerlmanAuthentically Social Presented by Corey Perlman
Authentically Social Presented by Corey Perlman
Corey Perlman, Social Media Speaker and Consultant
 
一比一原版加拿大渥太华大学毕业证(uottawa毕业证书)如何办理
一比一原版加拿大渥太华大学毕业证(uottawa毕业证书)如何办理一比一原版加拿大渥太华大学毕业证(uottawa毕业证书)如何办理
一比一原版加拿大渥太华大学毕业证(uottawa毕业证书)如何办理
taqyed
 
Observation Lab PowerPoint Assignment for TEM 431
Observation Lab PowerPoint Assignment for TEM 431Observation Lab PowerPoint Assignment for TEM 431
Observation Lab PowerPoint Assignment for TEM 431
ecamare2
 
LA HUG - Video Testimonials with Chynna Morgan - June 2024
LA HUG - Video Testimonials with Chynna Morgan - June 2024LA HUG - Video Testimonials with Chynna Morgan - June 2024
LA HUG - Video Testimonials with Chynna Morgan - June 2024
Lital Barkan
 
FIA officials brutally tortured innocent and snatched 200 Bitcoins of worth 4...
FIA officials brutally tortured innocent and snatched 200 Bitcoins of worth 4...FIA officials brutally tortured innocent and snatched 200 Bitcoins of worth 4...
FIA officials brutally tortured innocent and snatched 200 Bitcoins of worth 4...
jamalseoexpert1978
 
Maksym Vyshnivetskyi: PMO Quality Management (UA)
Maksym Vyshnivetskyi: PMO Quality Management (UA)Maksym Vyshnivetskyi: PMO Quality Management (UA)
Maksym Vyshnivetskyi: PMO Quality Management (UA)
Lviv Startup Club
 
Evgen Osmak: Methods of key project parameters estimation: from the shaman-in...
Evgen Osmak: Methods of key project parameters estimation: from the shaman-in...Evgen Osmak: Methods of key project parameters estimation: from the shaman-in...
Evgen Osmak: Methods of key project parameters estimation: from the shaman-in...
Lviv Startup Club
 
Meas_Dylan_DMBS_PB1_2024-05XX_Revised.pdf
Meas_Dylan_DMBS_PB1_2024-05XX_Revised.pdfMeas_Dylan_DMBS_PB1_2024-05XX_Revised.pdf
Meas_Dylan_DMBS_PB1_2024-05XX_Revised.pdf
dylandmeas
 
Digital Transformation and IT Strategy Toolkit and Templates
Digital Transformation and IT Strategy Toolkit and TemplatesDigital Transformation and IT Strategy Toolkit and Templates
Digital Transformation and IT Strategy Toolkit and Templates
Aurelien Domont, MBA
 
Hamster Kombat' Telegram Game Surpasses 100 Million Players—Token Release Sch...
Hamster Kombat' Telegram Game Surpasses 100 Million Players—Token Release Sch...Hamster Kombat' Telegram Game Surpasses 100 Million Players—Token Release Sch...
Hamster Kombat' Telegram Game Surpasses 100 Million Players—Token Release Sch...
SOFTTECHHUB
 
Discover the innovative and creative projects that highlight my journey throu...
Discover the innovative and creative projects that highlight my journey throu...Discover the innovative and creative projects that highlight my journey throu...
Discover the innovative and creative projects that highlight my journey throu...
dylandmeas
 
Creative Web Design Company in Singapore
Creative Web Design Company in SingaporeCreative Web Design Company in Singapore
Creative Web Design Company in Singapore
techboxsqauremedia
 
Call 8867766396 Satta Matka Dpboss Matka Guessing Satta batta Matka 420 Satta...
Call 8867766396 Satta Matka Dpboss Matka Guessing Satta batta Matka 420 Satta...Call 8867766396 Satta Matka Dpboss Matka Guessing Satta batta Matka 420 Satta...
Call 8867766396 Satta Matka Dpboss Matka Guessing Satta batta Matka 420 Satta...
bosssp10
 
The Influence of Marketing Strategy and Market Competition on Business Perfor...
The Influence of Marketing Strategy and Market Competition on Business Perfor...The Influence of Marketing Strategy and Market Competition on Business Perfor...
The Influence of Marketing Strategy and Market Competition on Business Perfor...
Adam Smith
 
Project File Report BBA 6th semester.pdf
Project File Report BBA 6th semester.pdfProject File Report BBA 6th semester.pdf
Project File Report BBA 6th semester.pdf
RajPriye
 
Building Your Employer Brand with Social Media
Building Your Employer Brand with Social MediaBuilding Your Employer Brand with Social Media
Building Your Employer Brand with Social Media
LuanWise
 
Best Forex Brokers Comparison in INDIA 2024
Best Forex Brokers Comparison in INDIA 2024Best Forex Brokers Comparison in INDIA 2024
Best Forex Brokers Comparison in INDIA 2024
Top Forex Brokers Review
 

Recently uploaded (20)

Anny Serafina Love - Letter of Recommendation by Kellen Harkins, MS.
Anny Serafina Love - Letter of Recommendation by Kellen Harkins, MS.Anny Serafina Love - Letter of Recommendation by Kellen Harkins, MS.
Anny Serafina Love - Letter of Recommendation by Kellen Harkins, MS.
 
Brand Analysis for an artist named Struan
Brand Analysis for an artist named StruanBrand Analysis for an artist named Struan
Brand Analysis for an artist named Struan
 
Recruiting in the Digital Age: A Social Media Masterclass
Recruiting in the Digital Age: A Social Media MasterclassRecruiting in the Digital Age: A Social Media Masterclass
Recruiting in the Digital Age: A Social Media Masterclass
 
Authentically Social Presented by Corey Perlman
Authentically Social Presented by Corey PerlmanAuthentically Social Presented by Corey Perlman
Authentically Social Presented by Corey Perlman
 
一比一原版加拿大渥太华大学毕业证(uottawa毕业证书)如何办理
一比一原版加拿大渥太华大学毕业证(uottawa毕业证书)如何办理一比一原版加拿大渥太华大学毕业证(uottawa毕业证书)如何办理
一比一原版加拿大渥太华大学毕业证(uottawa毕业证书)如何办理
 
Observation Lab PowerPoint Assignment for TEM 431
Observation Lab PowerPoint Assignment for TEM 431Observation Lab PowerPoint Assignment for TEM 431
Observation Lab PowerPoint Assignment for TEM 431
 
LA HUG - Video Testimonials with Chynna Morgan - June 2024
LA HUG - Video Testimonials with Chynna Morgan - June 2024LA HUG - Video Testimonials with Chynna Morgan - June 2024
LA HUG - Video Testimonials with Chynna Morgan - June 2024
 
FIA officials brutally tortured innocent and snatched 200 Bitcoins of worth 4...
FIA officials brutally tortured innocent and snatched 200 Bitcoins of worth 4...FIA officials brutally tortured innocent and snatched 200 Bitcoins of worth 4...
FIA officials brutally tortured innocent and snatched 200 Bitcoins of worth 4...
 
Maksym Vyshnivetskyi: PMO Quality Management (UA)
Maksym Vyshnivetskyi: PMO Quality Management (UA)Maksym Vyshnivetskyi: PMO Quality Management (UA)
Maksym Vyshnivetskyi: PMO Quality Management (UA)
 
Evgen Osmak: Methods of key project parameters estimation: from the shaman-in...
Evgen Osmak: Methods of key project parameters estimation: from the shaman-in...Evgen Osmak: Methods of key project parameters estimation: from the shaman-in...
Evgen Osmak: Methods of key project parameters estimation: from the shaman-in...
 
Meas_Dylan_DMBS_PB1_2024-05XX_Revised.pdf
Meas_Dylan_DMBS_PB1_2024-05XX_Revised.pdfMeas_Dylan_DMBS_PB1_2024-05XX_Revised.pdf
Meas_Dylan_DMBS_PB1_2024-05XX_Revised.pdf
 
Digital Transformation and IT Strategy Toolkit and Templates
Digital Transformation and IT Strategy Toolkit and TemplatesDigital Transformation and IT Strategy Toolkit and Templates
Digital Transformation and IT Strategy Toolkit and Templates
 
Hamster Kombat' Telegram Game Surpasses 100 Million Players—Token Release Sch...
Hamster Kombat' Telegram Game Surpasses 100 Million Players—Token Release Sch...Hamster Kombat' Telegram Game Surpasses 100 Million Players—Token Release Sch...
Hamster Kombat' Telegram Game Surpasses 100 Million Players—Token Release Sch...
 
Discover the innovative and creative projects that highlight my journey throu...
Discover the innovative and creative projects that highlight my journey throu...Discover the innovative and creative projects that highlight my journey throu...
Discover the innovative and creative projects that highlight my journey throu...
 
Creative Web Design Company in Singapore
Creative Web Design Company in SingaporeCreative Web Design Company in Singapore
Creative Web Design Company in Singapore
 
Call 8867766396 Satta Matka Dpboss Matka Guessing Satta batta Matka 420 Satta...
Call 8867766396 Satta Matka Dpboss Matka Guessing Satta batta Matka 420 Satta...Call 8867766396 Satta Matka Dpboss Matka Guessing Satta batta Matka 420 Satta...
Call 8867766396 Satta Matka Dpboss Matka Guessing Satta batta Matka 420 Satta...
 
The Influence of Marketing Strategy and Market Competition on Business Perfor...
The Influence of Marketing Strategy and Market Competition on Business Perfor...The Influence of Marketing Strategy and Market Competition on Business Perfor...
The Influence of Marketing Strategy and Market Competition on Business Perfor...
 
Project File Report BBA 6th semester.pdf
Project File Report BBA 6th semester.pdfProject File Report BBA 6th semester.pdf
Project File Report BBA 6th semester.pdf
 
Building Your Employer Brand with Social Media
Building Your Employer Brand with Social MediaBuilding Your Employer Brand with Social Media
Building Your Employer Brand with Social Media
 
Best Forex Brokers Comparison in INDIA 2024
Best Forex Brokers Comparison in INDIA 2024Best Forex Brokers Comparison in INDIA 2024
Best Forex Brokers Comparison in INDIA 2024
 

Cc hass b school talk 2105

  • 1. calculation | consulting data science leadership (TM) c|c (TM) charles@calculationconsulting.com
  • 3. calculation | consulting data science leadership Who Are We? c|c (TM) Dr. Charles H. Martin, PhD University of Chicago, Chemical Physics NSF Fellow in Theoretical Chemistry Over 10 years experience in applied Machine Learning Developed ML algos for Demand Media; the first $1B IPO since Google Lean Start Ups: Aardvark (acquired by Google), eHow, Mode Wall Street: BlackRock, GLG Fortune 500: Big Pharma, Telecom, eBay www.calculationconsulting.com charles@calculationconsulting.com (TM) 3
  • 4. BackStory: in 2011, Search Changed. Forever. • first $1B IPO since Google • Machine Learning based SEO algorithms • Measure the demand for search, and fulfill it data science algorithms created a billion $ company c|c (TM) (TM) Demand Media calculation | consulting data science leadership(TM) 4 eHow.com
  • 5. BackStory: in 2011, Search Changed. Forever. • Google adapted (Panda) • Lack of diversification • Lack of adaptation • Stock price never recovered algorithmic accountability: DMD or Google? c|c (TM) IPO Panda stock price 2011-2012 (TM) calculation | consulting data science leadership DMD (TM) 5
  • 6. • first $1B collapse due to Panda ? • CPC revenues down • premium online publishers died collapse ? stock price 2011-2012 c|c (TM) $1B in ad revenue was repriced and reallocated Problem: Cornering the market on search induced a market crash calculation | consulting data science leadership(TM) 6
  • 7. c|c (TM) Panda-Induced ‘Market Crash’ Google CPC dropped just after Panda calculation | consulting data science leadership(TM) 7
  • 8. Data Science is Different c|c (TM) Davenport calculation | consulting data science leadership Generating sustainable revenue requires Data Science Leadership and Execution (TM) 8 “Companies need a Spock in the boardroom”
  • 9. Data Science is Different c|c (TM) Davenport calculation | consulting data science leadership Generating sustainable revenue requires Data Science Leadership and Execution (TM) 9 http://www.theonion.com/articles/national-science-foundation-science-hard,1405/
  • 10. Problem: Data Scientists are Different c|c (TM) Davenport calculation | consulting data science leadership(TM) 10 not all techies are the same
  • 11. Problem: Data Scientists are Different c|c (TM) Davenport calculation | consulting data science leadership theoretical physics machine learning specialist (TM) 11 experimental physics data scientist engineer software, browser tech, dev ops, … not all techies are the same
  • 12. Problem: Data Scientists are Different c|c (TM) Davenport calculation | consulting data science leadership(TM) 12 not all techies are the same
  • 13. Managing: Data Science Process • Acquire Domain Knowledge • Formulate Hypothesis • Generate Model(s) from the Data • Predict Revenue Gains • Backtest Predictions on your Data • A/B Test in Production • Attribute Gains to Model(s) c|c (TM) (TM) acting solving framing calculation | consulting data science leadership 13
  • 14. Managing: Data Science Process c|c (TM) (TM) calculation | consulting data science leadership 14
  • 15. c|c (TM) • Systems Thinking: leveraging the inter-relationships between data, marketing, and the customer • Knowledge Transfer: mentoring — not training — to develop both personal mastery and team learning • Mental Models: create a base of small-scale models for thinking about how to use your data • Knowledge Sharing: foster collaboration between research, engineering, and product to drive revenue Managing: Learning from Data calculation | consulting data science leadership(TM) 15
  • 16. c|c (TM) • Cross-functional engineering, product, marketing, finance • Autonomous: separate from the traditional engineering product lifecycle. self-organizing and self-managing • Experimental: form hypothesis, analyze data, make predictions, run backtests, A/B testing • Self-sustaining: not a cost center; generates revenue (TM) Data Science is Different calculation | consulting data science leadership 16
  • 17. Solution: Collecting and Organizing Data (TM) c|c (TM) • Most companies are struggling organizing their data • Data needs to be examined • Don’t assume data is correct or useful • More is More: simple algos work • More is Less: noise is noise Data not examined is not collected calculation | consulting data science leadership 17
  • 18. Solutions: Hadoop and Big Data (TM) c|c (TM) • Hadoop is an internal data ecosystem • Hadoop appears to have won the adoption wars ? • Hadoop : 90% deployments internal • Hadoop is a cost center • ROI needs cut across business divisions Algorithms, not data, generate revenue calculation | consulting data science leadership 18
  • 19. Solutions: Cloud (TM) c|c (TM) • Startups don’t need infrastructure • long term Data Storage is virtually free • Amazon Redshift • Google Big Query • Cloud is the future ? Algorithms, not data, generate revenue calculation | consulting data science leadership 19
  • 20. Solutions: Spark (TM) c|c (TM) • Next Gen Platform for Machine Learning • Sits on Hadoop or the Cloud • Still very high touch • Limited algos Algorithms, not data, generate revenue calculation | consulting data science leadership 20
  • 21. Problem: Measurements (TM) c|c (TM) good experiments are amazing calculation | consulting data science leadership 21 “If you can’t measure it, you can’t fix it.” DJ Patil,White House Chief Data Scientist
  • 22. Data Science’s Measurement Problem (TM) c|c (TM) good experiments are hard to design calculation | consulting data science leadership 22 http://www.forbes.com/sites/lizryan/2014/02/10/if-you-cant-measure-it-you-cant-manage-it-is-bs/
  • 23. Data Science’s Measurement Problem (TM) c|c (TM) good experiments are hard to design calculation | consulting data science leadership 23 “Data science has a measurement problem. Simple metrics may not address complex situations. But complex metrics present myriad problems.” “As we strive for better algorithms, we often fail to think critically about what it means for predictions to be ‘good’” http://www.kdnuggets.com/2015/03/data-science-measurement-problem-accuracy-auroc-f1.html
  • 24. Data Science’s Measurement Problem (TM) c|c (TM) good experiments are hard to design calculation | consulting data science leadership 24 “Buffett found it 'extraordinary' that academics studied such things. They studied what was measurable, rather than what was meaningful.‘ … to a man with a hammer, everything looks like a nail.” ― Roger Lowenstein, Buffett: The Making of an American Capitalist
  • 25. c|c (TM) (TM) Problem: The Cult of the Algorithm calculation | consulting data science leadership 25 what can algos actually do ? “We have a new machine learning algo that anticipate your needs over time and behave accordingly”
  • 26. c|c (TM) (TM) Problem: What can Machine Learning Do? calculation | consulting data science leadership 26 what can algos actually do ?
  • 27. Demand Algos: Gas Station Analogy Problem: where to open a gas station ? Need: good traffic, weak competition c|c (TM) less competitors no traffic sweet spot great traffic too many competitors calculation | consulting data science leadership all businesses balance supply and demand (TM) 27
  • 28. SAAS Machine Learning Algos c|c (TM) calculation | consulting data science leadership (TM) 28 $100,000 • 167 teams Diabetic Retinopathy Detection $15,000 • 341 teams March Machine Learning Mania 2015 machine learning contests
  • 29. SAAS Machine Learning Algos c|c (TM) calculation | consulting data science leadership (TM) 29 machine learning apis
  • 30. c|c (TM) (TM) Problem: What can Deep Learning Do? calculation | consulting data science leadership 30 what can algos actually do ?
  • 31. c|c (TM) (TM) Problem: Externalities calculation | consulting data science leadership 31 external factors can change
  • 32. c|c (TM) (TM) Problem: Externalities calculation | consulting data science leadership 32 “Zynga is our best company ever!” (2010) John Doerr, Google Investor, LegendaryVC http://venturebeat.com/2010/11/16/google-investor-john-doerr-zynga-is-our-best-company-ever/ one marketplace | big risks
  • 33. c|c (TM) (TM) Solution: Algorithmic Accountability calculation | consulting data science leadership An asset is an economic resource. Anything tangible or intangible that is capable of being owned or controlled to produce value and that is held to have positive economic value is considered an asset. algorithms can be valuable assets 33
  • 34. c|c (TM) (TM) Algorithmic Accountability calculation | consulting data science leadership 34 does revenue depends on hidden algos ? • WebMD Google SEO • Amazon Product Listing Algo • Pinterest Relevance Algo • Twitter Spam filter • Apple App Store Rankings
  • 35. c|c (TM) (TM) Algorithmic Accountability calculation | consulting data science leadership 35 do decisions depend on hidden factors ? A 'Crisis' in Online Ads: One-Third of Traffic Is Bogus http://www.wsj.com/articles/SB10001424052702304026304579453253860786362 Now Algorithms Are DecidingWhomTo Hire… http://www.npr.org/blogs/alltechconsidered/2015/03/23/394827451/now-algorithms-are-deciding-whom-to-hire-based-on-voice What you don’t know about Internet algorithms is hurting you… http://www.washingtonpost.com/news/the-intersect/wp/2015/03/23/what-you-dont-know-about-internet-algorithms-is-hurting-you-and-you-probably-dont-know-very-much/
  • 36. c|c (TM) (TM) Solution: Algorithmic Transparency calculation | consulting data science leadership 36 can you be transparent and not be gamed ? http://fortune.com/2015/03/18/how-do-you-govern-a-hidden-fluid-and-amoral-algorithm/ 83% of the participants in the study changed their behavior once they knew about the algorithm How do you govern a (hidden, fluid and amoral) algorithm? participants mistakenly believed that their friends intentionally chose not to show them stories
  • 37. c|c (TM) (TM) Algorithmic Accountability calculation | consulting data science leadership Do you depend on some else’s marketplace? How does your revenue depend on algos? Do you need an internal algo ? Who will manage it? build it? maintain it? algorithms have unforeseen liabilities 37