SlideShare a Scribd company logo
1 of 22
Download to read offline
Building Data Products: 
The Right Order of Things 
Gloria Lau 
VP of Data, Timeful 
Keynote @ Big Data Tech Con
http://www.linkedin.com/in/gloriatlau/ 
@gloriatlau
What do they have in common?
Right order of things 
def __init__(self): 
data infrastructure 
for x in range(3): 
offline modeling 
online data product 
user feedback
Model Product
Model Product
The challenge 
Exception: tracking code missing/ 
overloaded! 
Debug: Power user computation 
takes forever! 
def __init__(self): 
data infrastructure 
for x in range(3): 
offline modeling 
online data product 
user feedback
The challenge 
Data viz --> ID'ed new data potential 
--> Yet another data product 
Sparse data --> Crappy model --> 
Need to nudge users for *more* data 
Non-standardized data --> Crappy 
model --> Need to standardize 
def __init__(self): 
data infrastructure 
for x in range(3): 
offline modeling 
online data product 
user feedback
• Four diseases have broken out in the world and it 
is up to a team of specialists in various fields to 
find cures for these diseases before mankind is 
wiped out ... the diseases are out breaking fast 
and time is running out: the team must try to stem 
the tide of infection in diseased areas while also 
towards cures. A truly cooperative game where 
you all win or you all lose. 
• How do you win? 
• Optimally deploy minimal resources in the right 
order
• What is optimal 
• Do you fix that tracking issue first? 
• Do you optimize your power user computation? 
• Do you double down on standardization? 
• Relevant classifications 
• P0 vs P1 
• big company vs small company
2 Questions to ask 
1 Quote answers them all
“Premature optimization is the root of all evil.” 
–Donald Knuth
What is the one metric that 
your data product will move? 
• Retention. Growth. Engagement. Money. Etc. 
• Find it, and focus
If your users use your product a min/ 
day/user, how would you spend that? 
• Data scientists love data. More 
the merrier. 
• More data solves your data 
scientist's problem. It does not 
solve your user's problem.
Do you fix that tracking issue first? 
• Q1: Is it in the critical path of measuring that 
metric? 
• Q2: Are you throwing away user's time?
Do you optimize your power user 
computation? 
• Q1: Are power users your key user metric to lift? 
• Q2: What fraction of total user's time is affected 
by this?
Do you double down on 
standardization? 
• Q1: Peel the onion. How will x 
% increase in standardization 
rate affect your current and 
projected metric? 
• Q2: Does it add friction to the 
funnel?
“Premature optimization is the root of all evil.” 
–Donald Knuth
• Right order: 
• talent first 
• assimilation 
• the 3%; fail fast
“Programmers waste enormous amounts of time thinking about, or 
worrying about, the speed of noncritical parts of their programs, and 
these attempts at efficiency actually have a strong negative impact when 
debugging and maintenance are considered. We should forget about 
small efficiencies, say about 97% of the time: premature optimization is 
the root of all evil. Yet we should not pass up our opportunities in that 
critical 3%. A good programmer will not be lulled into complacency by 
such reasoning, he will be wise to look carefully at the critical code; but 
only after that code has been identified. It is often a mistake to make a 
priori judgments about what parts of a program are really critical, since 
the universal experience of programmers who have been using 
measurement tools has been that their intuitive guesses fail.” 
–Donald Knuth
It's an art.

More Related Content

What's hot

Dataiku r users group v2
Dataiku   r users group v2Dataiku   r users group v2
Dataiku r users group v2
Cdiscount
 
Enable Advanced Analytics with Hadoop and an Enterprise Data Hub
Enable Advanced Analytics with Hadoop and an Enterprise Data HubEnable Advanced Analytics with Hadoop and an Enterprise Data Hub
Enable Advanced Analytics with Hadoop and an Enterprise Data Hub
Cloudera, Inc.
 
Online Games Analytics - Data Science for Fun
Online Games Analytics - Data Science for FunOnline Games Analytics - Data Science for Fun
Online Games Analytics - Data Science for Fun
Dataiku
 

What's hot (19)

Simplify your analytics strategy by Jayesh Dosi
Simplify your analytics strategy by Jayesh DosiSimplify your analytics strategy by Jayesh Dosi
Simplify your analytics strategy by Jayesh Dosi
 
Worst Practices in Artificial Intelligence
Worst Practices in Artificial IntelligenceWorst Practices in Artificial Intelligence
Worst Practices in Artificial Intelligence
 
Levelling up your data infrastructure
Levelling up your data infrastructureLevelling up your data infrastructure
Levelling up your data infrastructure
 
Intro to Data and Analytics for Startups
Intro to Data and Analytics for StartupsIntro to Data and Analytics for Startups
Intro to Data and Analytics for Startups
 
Acceptance, Accessible, Actionable and Auditable
Acceptance, Accessible, Actionable and AuditableAcceptance, Accessible, Actionable and Auditable
Acceptance, Accessible, Actionable and Auditable
 
MassIntelligence 2018: How to Rapidly Prototype an AI Solution
MassIntelligence 2018: How to Rapidly Prototype an AI SolutionMassIntelligence 2018: How to Rapidly Prototype an AI Solution
MassIntelligence 2018: How to Rapidly Prototype an AI Solution
 
Data Mashups -Data Science Summit
Data Mashups -Data Science SummitData Mashups -Data Science Summit
Data Mashups -Data Science Summit
 
Adoption is the only option hadoop is changing our world and changing yours f...
Adoption is the only option hadoop is changing our world and changing yours f...Adoption is the only option hadoop is changing our world and changing yours f...
Adoption is the only option hadoop is changing our world and changing yours f...
 
Correlation does not mean causation
Correlation does not mean causationCorrelation does not mean causation
Correlation does not mean causation
 
LJC 2014 "Professional Software Development: Thinking Fast and Slow"
LJC 2014 "Professional Software Development: Thinking Fast and Slow"LJC 2014 "Professional Software Development: Thinking Fast and Slow"
LJC 2014 "Professional Software Development: Thinking Fast and Slow"
 
Books! Google isn't the only source of information
Books! Google isn't the only source of informationBooks! Google isn't the only source of information
Books! Google isn't the only source of information
 
Artificial Intelligence and the Data Center
Artificial Intelligence and the Data CenterArtificial Intelligence and the Data Center
Artificial Intelligence and the Data Center
 
Boosting Customer Engagement
Boosting Customer EngagementBoosting Customer Engagement
Boosting Customer Engagement
 
Dataiku r users group v2
Dataiku   r users group v2Dataiku   r users group v2
Dataiku r users group v2
 
Enable Advanced Analytics with Hadoop and an Enterprise Data Hub
Enable Advanced Analytics with Hadoop and an Enterprise Data HubEnable Advanced Analytics with Hadoop and an Enterprise Data Hub
Enable Advanced Analytics with Hadoop and an Enterprise Data Hub
 
Online Games Analytics - Data Science for Fun
Online Games Analytics - Data Science for FunOnline Games Analytics - Data Science for Fun
Online Games Analytics - Data Science for Fun
 
Big Data Ppt Powerpoint Presentation Slides
Big Data Ppt Powerpoint Presentation SlidesBig Data Ppt Powerpoint Presentation Slides
Big Data Ppt Powerpoint Presentation Slides
 
Adi Wijaya - Scrum in Data Science, What Works and What Doesn’t
Adi Wijaya - Scrum in Data Science, What Works and What Doesn’tAdi Wijaya - Scrum in Data Science, What Works and What Doesn’t
Adi Wijaya - Scrum in Data Science, What Works and What Doesn’t
 
Dataiku - data driven nyc - april 2016 - the solitude of the data team m...
Dataiku  -  data driven nyc  - april  2016 - the  solitude of the data team m...Dataiku  -  data driven nyc  - april  2016 - the  solitude of the data team m...
Dataiku - data driven nyc - april 2016 - the solitude of the data team m...
 

Similar to Keynote at Big Data Tech Con SF 2014

Similar to Keynote at Big Data Tech Con SF 2014 (20)

Doing Analytics Right - Building the Analytics Environment
Doing Analytics Right - Building the Analytics EnvironmentDoing Analytics Right - Building the Analytics Environment
Doing Analytics Right - Building the Analytics Environment
 
predictive analysis and usage in procurement ppt 2017
predictive analysis and usage in procurement  ppt 2017predictive analysis and usage in procurement  ppt 2017
predictive analysis and usage in procurement ppt 2017
 
Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Barga Galvanize Sept 2015
Barga Galvanize Sept 2015
 
Machine learning in production
Machine learning in productionMachine learning in production
Machine learning in production
 
Doing Analytics Right - Designing and Automating Analytics
Doing Analytics Right - Designing and Automating AnalyticsDoing Analytics Right - Designing and Automating Analytics
Doing Analytics Right - Designing and Automating Analytics
 
Testing metrics webinar
Testing metrics webinarTesting metrics webinar
Testing metrics webinar
 
Building and Scaling High Performing Technology Organizations by Jez Humble a...
Building and Scaling High Performing Technology Organizations by Jez Humble a...Building and Scaling High Performing Technology Organizations by Jez Humble a...
Building and Scaling High Performing Technology Organizations by Jez Humble a...
 
The UX Analyst
The UX AnalystThe UX Analyst
The UX Analyst
 
More content in less time
More content in less timeMore content in less time
More content in less time
 
Lean analytics
Lean analyticsLean analytics
Lean analytics
 
Splunk at MetLife
Splunk at MetLifeSplunk at MetLife
Splunk at MetLife
 
Unit 2 SEPM_ Requirement Engineering
Unit 2 SEPM_ Requirement EngineeringUnit 2 SEPM_ Requirement Engineering
Unit 2 SEPM_ Requirement Engineering
 
Engaging with Users on Public Social Media
Engaging with Users on Public Social MediaEngaging with Users on Public Social Media
Engaging with Users on Public Social Media
 
Artificial Intelligence Primer
Artificial Intelligence PrimerArtificial Intelligence Primer
Artificial Intelligence Primer
 
DevOps Enterprise Summit 2019 - How Swarming Enables Enterprise Support to wo...
DevOps Enterprise Summit 2019 - How Swarming Enables EnterpriseSupport to wo...DevOps Enterprise Summit 2019 - How Swarming Enables EnterpriseSupport to wo...
DevOps Enterprise Summit 2019 - How Swarming Enables Enterprise Support to wo...
 
Digital Transformation, Testing and Automation
Digital Transformation, Testing and AutomationDigital Transformation, Testing and Automation
Digital Transformation, Testing and Automation
 
If You Are Not Embedding Analytics Into Your Day To Day Processes, You Are Do...
If You Are Not Embedding Analytics Into Your Day To Day Processes, You Are Do...If You Are Not Embedding Analytics Into Your Day To Day Processes, You Are Do...
If You Are Not Embedding Analytics Into Your Day To Day Processes, You Are Do...
 
Chapter3
Chapter3 Chapter3
Chapter3
 
iCTRE: The Informal community Transformer into Recommendation Engine
iCTRE: The Informal community Transformer into Recommendation EngineiCTRE: The Informal community Transformer into Recommendation Engine
iCTRE: The Informal community Transformer into Recommendation Engine
 
Learning Data Analytics
Learning Data AnalyticsLearning Data Analytics
Learning Data Analytics
 

Recently uploaded

Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
gajnagarg
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
HyderabadDolls
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
nirzagarg
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
chadhar227
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
ahmedjiabur940
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Klinik kandungan
 
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
gajnagarg
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
ranjankumarbehera14
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
nirzagarg
 

Recently uploaded (20)

Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
 
Digital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham WareDigital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham Ware
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
 
Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...
Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...
Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...
 
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRanking and Scoring Exercises for Research
Ranking and Scoring Exercises for Research
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
 
7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
 
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubai
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
 

Keynote at Big Data Tech Con SF 2014

  • 1. Building Data Products: The Right Order of Things Gloria Lau VP of Data, Timeful Keynote @ Big Data Tech Con
  • 3. What do they have in common?
  • 4. Right order of things def __init__(self): data infrastructure for x in range(3): offline modeling online data product user feedback
  • 7. The challenge Exception: tracking code missing/ overloaded! Debug: Power user computation takes forever! def __init__(self): data infrastructure for x in range(3): offline modeling online data product user feedback
  • 8. The challenge Data viz --> ID'ed new data potential --> Yet another data product Sparse data --> Crappy model --> Need to nudge users for *more* data Non-standardized data --> Crappy model --> Need to standardize def __init__(self): data infrastructure for x in range(3): offline modeling online data product user feedback
  • 9.
  • 10. • Four diseases have broken out in the world and it is up to a team of specialists in various fields to find cures for these diseases before mankind is wiped out ... the diseases are out breaking fast and time is running out: the team must try to stem the tide of infection in diseased areas while also towards cures. A truly cooperative game where you all win or you all lose. • How do you win? • Optimally deploy minimal resources in the right order
  • 11. • What is optimal • Do you fix that tracking issue first? • Do you optimize your power user computation? • Do you double down on standardization? • Relevant classifications • P0 vs P1 • big company vs small company
  • 12. 2 Questions to ask 1 Quote answers them all
  • 13. “Premature optimization is the root of all evil.” –Donald Knuth
  • 14. What is the one metric that your data product will move? • Retention. Growth. Engagement. Money. Etc. • Find it, and focus
  • 15. If your users use your product a min/ day/user, how would you spend that? • Data scientists love data. More the merrier. • More data solves your data scientist's problem. It does not solve your user's problem.
  • 16. Do you fix that tracking issue first? • Q1: Is it in the critical path of measuring that metric? • Q2: Are you throwing away user's time?
  • 17. Do you optimize your power user computation? • Q1: Are power users your key user metric to lift? • Q2: What fraction of total user's time is affected by this?
  • 18. Do you double down on standardization? • Q1: Peel the onion. How will x % increase in standardization rate affect your current and projected metric? • Q2: Does it add friction to the funnel?
  • 19. “Premature optimization is the root of all evil.” –Donald Knuth
  • 20. • Right order: • talent first • assimilation • the 3%; fail fast
  • 21. “Programmers waste enormous amounts of time thinking about, or worrying about, the speed of noncritical parts of their programs, and these attempts at efficiency actually have a strong negative impact when debugging and maintenance are considered. We should forget about small efficiencies, say about 97% of the time: premature optimization is the root of all evil. Yet we should not pass up our opportunities in that critical 3%. A good programmer will not be lulled into complacency by such reasoning, he will be wise to look carefully at the critical code; but only after that code has been identified. It is often a mistake to make a priori judgments about what parts of a program are really critical, since the universal experience of programmers who have been using measurement tools has been that their intuitive guesses fail.” –Donald Knuth