SlideShare a Scribd company logo
Data 101:
How to use your data science team:
Becoming a data-driven organization
March 29, 2016
Yael Garten
Director of Data Science, LinkedIn
http://www.linkedin.com/in/yaelgarten, @yaelgarten
Data is the present: every industry, organization,
product, decision
data driven, or at least data informed.
Good vs Great? Data is Expected, Discussed, & Accessible
Expected
For decision making
e.g. Determining strategy, goal setting, impact estimates of initiatives
Discussed
Talked about
e.g. product strategy reviews, design discussions, in the hallway
Accessible
When questions come up, easy to get hands on data?
e.g. democratized data, self-serve tools
Culture
Process
Tools
How do we get started?
(whether from scratch or to transform existing organization)
Invest in the data
Hire the right “data scientists”
Culture Process Tools
Data Scientist – which kind?
human machine
Find & communicate data insights
Producing for:
Deliverables:
Skills:
Machine learning models
This talk will be a success if we:
1. Review the steps of data driven product innovation
2. Understand what is needed to best enable fostering of (or transformation into)
a data-driven organization: culture, process, and tools.
1. Culture: Create a culture which expects decisions are informed by
data & which treats data as a first class citizen
2. Process: Consciously map how you use data
(in each phase of the product lifecycle, in making exec decisions)
3. Tools: invest in your data ecosystem
(data quality, pipeline, and access tools)
To create a data driven organization
Data driven product innovation
Product Lifecycle
Ideation Design and Specs Development Test & Iterate Release
Data driven product innovation framework:
Use data to measure, understand, and improve the product:
Build Measure Track Ship Tweak
If data scientist is not involved until
this stage, it may be too late.
Well-connected.
Get relevance right.
Few connections.
Give them inventory.
1. Opportunity sizing: how big or important
is the problem?
2. Use data to predict successful product
initiatives:
• Show news articles
• Suggest new connections
• Suggest following active content creators
Build Measure Track Ship Tweak
Hypothesis: Following active sources leads to improved user
experience with LinkedIn Feed
Success Metric – progression of definition:
• Total clicks on Follow
• # clicks / #impressions of Follow suggestions
• % Feed Inventory created by new followees
• Downstream sustained engagement with items created by these
followees
• What is engaging? # of clicks? Time spent? # Shares?
Combo?
Actionable insights lead
to product feature
Success Metric
Definition
Tracking
Instrumentation
spec
Experimentation
Setup and Analysis
Post-Launch
Analysis
Build Measure Track Ship Tweak
is creating content
we think you’ll find
interesting
Invest in developing the right success metric.
~Metric = Downstream engagement with items created by these followees
Must enable attributing future clicks on feed items to that campaign as a
source for the Follow.
FeedActivityClick
{
memberID = 77777
actor = 55555
}
FollowSources
{
followCampaign666
memberID = 77777
followeeID = 55555
}
Need accurate reliable standardized data logging to enable metric computation.
Actionable insights lead
to product feature
Success Metric
Definition
Tracking
Instrumentation
spec
Experimentation
Setup and Analysis
Post-Launch
Analysis
Build Measure Track Ship Tweak
Actionable insights lead
to product feature
Success Metric
Definition
Tracking
Instrumentation
spec
Experimentation
Setup and Analysis
Post-Launch
Analysis
Design: How long to run experiment, on whom?
Implement: properly set up & randomize to ensure no bias
Analyze: Go or no-go? monitor success metric, ideally automated
on company-wide platform for holistic view of impacts
Build Measure Track Ship Tweak
is creating content
we think you’ll find
interesting
Rigorously set up, then identify whether the feature increased the success metric.
Reporting, monitoring, ad hoc analysis
Long term measures of engagement/success
Analysis to inform revision of design
Actionable insights lead
to product feature
Success Metric
Definition
Tracking
Instrumentation
spec
Experimentation
Setup and Analysis
Post-Launch
Analysis
Build Measure Track Ship Tweak
is creating content
we think you’ll find
interesting
Iterate. How can we revise? How can we tweak to optimize?
Invest in foundations. Culture Process Tools
1. Culture: Create a culture which expects decisions are informed by
data & which treats data as a first class citizen
2. Process: Consciously map how you use data
(in each phase of the product lifecycle, in making exec decisions)
3. Tools: invest in your data ecosystem
(data quality, pipeline, and access tools)
To create a data driven organization
Data scientist as a partner, not a service – give context and ownership
Strong bias to actionable impactful insights, speed of iteration & feedback
• Data Foundations: governed datasets, consistent shared datasets and metrics
• Data Democratization: self serve data exploration platform
• Enable innovation: environment supports speedy ad-hoc analysis
Culture
Process
Tools
Actionable insights lead
to product feature
Success Metric
Definition
Tracking
Instrumentation
spec
Experimentation
Setup and Analysis
Post-Launch
Analysis
Build Measure Track Ship Tweak
Actionable insights lead
to product feature
Success Metric
Definition
Tracking
Instrumentation
spec
Experimentation
Setup and Analysis
Post-Launch
Analysis
Build Measure Track Ship Tweak
Democratize data – self served data exploration platform
Enable people in your organization (execs, product managers, designers) to have data at
their fingertips – to ask and answer questions
s
• Invest in creating the right metrics
• user centric mindset; optimize for user value not team success
all stakeholders agree upon success metrics prior to launching the
feature test
• Platform of shared tiered metrics visible to entire company
• a metrics pipeline that enables easy implementation of metrics
(and not manual one-offs)
Actionable insights lead
to product feature
Success Metric
Definition
Tracking
Instrumentation
spec
Experimentation
Setup and Analysis
Post-Launch
Analysis
Build Measure Track Ship Tweak
Culture
Process
Tools
sActionable insights lead
to product feature
Success Metric
Definition
Tracking
Instrumentation
spec
Experimentation
Setup and Analysis
Post-Launch
Analysis
Build Measure Track Ship Tweak
• Data as a first class citizen
• Data tracking bugs as a launch blocker
• Proactive joint definition of data requirements and contract (schemas)
by producers and consumers
• Data Model Review Committee
• Data tracking spec tool for source of truth
Automated testing of data tracking
• Data Quality monitoring tool to ensure data contract
is met
Actionable insights lead
to product feature
Success Metric
Definition
Tracking
Instrumentation
spec
Experimentation
Setup and Analysis
Post-Launch
Analysis
Build Measure Track Ship Tweak
Culture
Process
Tools
s
• Belief in proactive controlled experiment as decision making tool
• Efficient and principled lifecycle, from inception to decision
• Before: Review experiment design & implementation
• After: stakeholders discuss impacts and implications of tradeoffs
• Company-wide experimentation platform, with tiered key metrics
• Automated metric reporting & analysis capability
Actionable insights lead
to product feature
Success Metric
Definition
Tracking
Instrumentation
spec
Experimentation
Setup and Analysis
Post-Launch
Analysis
Build Measure Track Ship Tweak
Culture
Process
Tools
sActionable insights lead
to product feature
Success Metric
Definition
Tracking
Instrumentation
spec
Experimentation
Setup and Analysis
Post-Launch
Analysis
Build Measure Track Ship Tweak
s
• Iteration and innovation
• Metrics meeting: weekly to understand
performance and product value
• Combine qualitative user feedback with
quantitative results
• Long term hold-out groups for deep
understanding of impacts
Actionable insights lead
to product feature
Success Metric
Definition
Tracking
Instrumentation
spec
Experimentation
Setup and Analysis
Post-Launch
Analysis
Build Measure Track Ship Tweak
Culture
Process
Tools
This talk will be a success if we:
1. Review the steps of data driven product innovation
2. Understand what is needed to best enable fostering of (or transformation
into) a data-driven organization: culture, process, and tools.
1. Culture: Create a culture which expects decisions are informed by
data & which treats data as a first class citizen
2. Process: Consciously map how you use data
(in each phase of the product lifecycle, in making exec decisions)
3. Tools: invest in your data ecosystem
(data quality, pipeline, and access tools)
To create a data driven organization
Thank you.
http://www.linkedin.com/in/yaelgarten
@yaelgarten

More Related Content

What's hot

Data Governance by stealth v0.0.2
Data Governance by stealth v0.0.2Data Governance by stealth v0.0.2
Data Governance by stealth v0.0.2
Christopher Bradley
 

What's hot (20)

seven steps to dataops @ dataops.rocks conference Oct 2019
seven steps to dataops @ dataops.rocks conference Oct 2019seven steps to dataops @ dataops.rocks conference Oct 2019
seven steps to dataops @ dataops.rocks conference Oct 2019
 
DAS Slides: Enterprise Architecture vs. Data Architecture
DAS Slides: Enterprise Architecture vs. Data ArchitectureDAS Slides: Enterprise Architecture vs. Data Architecture
DAS Slides: Enterprise Architecture vs. Data Architecture
 
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
 
Driven by data - Why we need a Modern Enterprise Data Analytics Platform
Driven by data - Why we need a Modern Enterprise Data Analytics PlatformDriven by data - Why we need a Modern Enterprise Data Analytics Platform
Driven by data - Why we need a Modern Enterprise Data Analytics Platform
 
Creating a data driven culture
Creating a data driven cultureCreating a data driven culture
Creating a data driven culture
 
DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?
DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?
DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?
 
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
 
Why an AI-Powered Data Catalog Tool is Critical to Business Success
Why an AI-Powered Data Catalog Tool is Critical to Business SuccessWhy an AI-Powered Data Catalog Tool is Critical to Business Success
Why an AI-Powered Data Catalog Tool is Critical to Business Success
 
Creating a Data-Driven Organization -- thisismetis meetup
Creating a Data-Driven Organization -- thisismetis meetupCreating a Data-Driven Organization -- thisismetis meetup
Creating a Data-Driven Organization -- thisismetis meetup
 
Creating your Center of Excellence (CoE) for data driven use cases
Creating your Center of Excellence (CoE) for data driven use casesCreating your Center of Excellence (CoE) for data driven use cases
Creating your Center of Excellence (CoE) for data driven use cases
 
Data Mesh in Practice - How Europe's Leading Online Platform for Fashion Goes...
Data Mesh in Practice - How Europe's Leading Online Platform for Fashion Goes...Data Mesh in Practice - How Europe's Leading Online Platform for Fashion Goes...
Data Mesh in Practice - How Europe's Leading Online Platform for Fashion Goes...
 
Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...
Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...
Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...
 
Data Modeling on Azure for Analytics
Data Modeling on Azure for AnalyticsData Modeling on Azure for Analytics
Data Modeling on Azure for Analytics
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
 
Implementing a Data Lake
Implementing a Data LakeImplementing a Data Lake
Implementing a Data Lake
 
Data Quality Best Practices
Data Quality Best PracticesData Quality Best Practices
Data Quality Best Practices
 
Session découverte de la Data Virtualization
Session découverte de la Data VirtualizationSession découverte de la Data Virtualization
Session découverte de la Data Virtualization
 
Data Governance by stealth v0.0.2
Data Governance by stealth v0.0.2Data Governance by stealth v0.0.2
Data Governance by stealth v0.0.2
 
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
 

Viewers also liked

Architecting for change: LinkedIn's new data ecosystem
Architecting for change: LinkedIn's new data ecosystemArchitecting for change: LinkedIn's new data ecosystem
Architecting for change: LinkedIn's new data ecosystem
Yael Garten
 
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Yael Garten
 
Admission Control in Impala
Admission Control in ImpalaAdmission Control in Impala
Admission Control in Impala
Cloudera, Inc.
 

Viewers also liked (20)

Architecting for change: LinkedIn's new data ecosystem
Architecting for change: LinkedIn's new data ecosystemArchitecting for change: LinkedIn's new data ecosystem
Architecting for change: LinkedIn's new data ecosystem
 
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
 
White paper hadoop performancetuning
White paper hadoop performancetuningWhite paper hadoop performancetuning
White paper hadoop performancetuning
 
A Perspective from the intersection Data Science, Mobility, and Mobile Devices
A Perspective from the intersection Data Science, Mobility, and Mobile DevicesA Perspective from the intersection Data Science, Mobility, and Mobile Devices
A Perspective from the intersection Data Science, Mobility, and Mobile Devices
 
Data Infused Product Design and Insights at LinkedIn
Data Infused Product Design and Insights at LinkedInData Infused Product Design and Insights at LinkedIn
Data Infused Product Design and Insights at LinkedIn
 
Remix: On-demand Live Randomization (Fine-grained live ASLR during runtime)
Remix: On-demand Live Randomization (Fine-grained live ASLR during runtime)Remix: On-demand Live Randomization (Fine-grained live ASLR during runtime)
Remix: On-demand Live Randomization (Fine-grained live ASLR during runtime)
 
Impala SQL Support
Impala SQL SupportImpala SQL Support
Impala SQL Support
 
Admission Control in Impala
Admission Control in ImpalaAdmission Control in Impala
Admission Control in Impala
 
Cloudera Impala Source Code Explanation and Analysis
Cloudera Impala Source Code Explanation and AnalysisCloudera Impala Source Code Explanation and Analysis
Cloudera Impala Source Code Explanation and Analysis
 
Hadoop application architectures - Fraud detection tutorial
Hadoop application architectures - Fraud detection tutorialHadoop application architectures - Fraud detection tutorial
Hadoop application architectures - Fraud detection tutorial
 
Apache Impala (incubating) 2.5 Performance Update
Apache Impala (incubating) 2.5 Performance UpdateApache Impala (incubating) 2.5 Performance Update
Apache Impala (incubating) 2.5 Performance Update
 
Présentation Micropole 3eme Forum MDM - 19 novembre 2014
Présentation Micropole 3eme Forum MDM - 19 novembre 2014Présentation Micropole 3eme Forum MDM - 19 novembre 2014
Présentation Micropole 3eme Forum MDM - 19 novembre 2014
 
SecPod: A Framework for Virtualization-based Security Systems
SecPod: A Framework for Virtualization-based Security SystemsSecPod: A Framework for Virtualization-based Security Systems
SecPod: A Framework for Virtualization-based Security Systems
 
DATA FORUM MICROPOLE - 2015
DATA FORUM MICROPOLE - 2015DATA FORUM MICROPOLE - 2015
DATA FORUM MICROPOLE - 2015
 
Data Modeling for Data Science: Simplify Your Workload with Complex Types in ...
Data Modeling for Data Science: Simplify Your Workload with Complex Types in ...Data Modeling for Data Science: Simplify Your Workload with Complex Types in ...
Data Modeling for Data Science: Simplify Your Workload with Complex Types in ...
 
Nested Types in Impala
Nested Types in ImpalaNested Types in Impala
Nested Types in Impala
 
Building a Data Driven Organization
Building a Data Driven OrganizationBuilding a Data Driven Organization
Building a Data Driven Organization
 
Architecting next generation big data platform
Architecting next generation big data platformArchitecting next generation big data platform
Architecting next generation big data platform
 
Faster Batch Processing with Cloudera 5.7: Hive-on-Spark is ready for production
Faster Batch Processing with Cloudera 5.7: Hive-on-Spark is ready for productionFaster Batch Processing with Cloudera 5.7: Hive-on-Spark is ready for production
Faster Batch Processing with Cloudera 5.7: Hive-on-Spark is ready for production
 
What no one tells you about writing a streaming app
What no one tells you about writing a streaming appWhat no one tells you about writing a streaming app
What no one tells you about writing a streaming app
 

Similar to How to use your data science team: Becoming a data-driven organization

Supercharge Your Corporate Dashboards With UX Analytics
Supercharge Your Corporate Dashboards With UX AnalyticsSupercharge Your Corporate Dashboards With UX Analytics
Supercharge Your Corporate Dashboards With UX Analytics
UserZoom
 
Telling the Full Story: Adding Qualitative Data To Executive Dashboards
Telling the Full Story: Adding Qualitative Data To Executive DashboardsTelling the Full Story: Adding Qualitative Data To Executive Dashboards
Telling the Full Story: Adding Qualitative Data To Executive Dashboards
UserZoom
 

Similar to How to use your data science team: Becoming a data-driven organization (20)

Getting Started with Product Analytics - A 101 Implementation Guide for Begin...
Getting Started with Product Analytics - A 101 Implementation Guide for Begin...Getting Started with Product Analytics - A 101 Implementation Guide for Begin...
Getting Started with Product Analytics - A 101 Implementation Guide for Begin...
 
Supercharge Your Corporate Dashboards With UX Analytics
Supercharge Your Corporate Dashboards With UX AnalyticsSupercharge Your Corporate Dashboards With UX Analytics
Supercharge Your Corporate Dashboards With UX Analytics
 
The New Self-Service Analytics - Going Beyond the Tools
The New Self-Service Analytics - Going Beyond the ToolsThe New Self-Service Analytics - Going Beyond the Tools
The New Self-Service Analytics - Going Beyond the Tools
 
Data driven decision making
Data driven decision makingData driven decision making
Data driven decision making
 
Telling the Full Story: Adding Qualitative Data To Executive Dashboards
Telling the Full Story: Adding Qualitative Data To Executive DashboardsTelling the Full Story: Adding Qualitative Data To Executive Dashboards
Telling the Full Story: Adding Qualitative Data To Executive Dashboards
 
Tableau Conference 2014 Presentation
Tableau Conference 2014 PresentationTableau Conference 2014 Presentation
Tableau Conference 2014 Presentation
 
Liberating data power of APIs
Liberating data power of APIsLiberating data power of APIs
Liberating data power of APIs
 
Agile data science
Agile data scienceAgile data science
Agile data science
 
Importance of Data Analytics
 Importance of Data Analytics Importance of Data Analytics
Importance of Data Analytics
 
Data mining
Data miningData mining
Data mining
 
Build Your Community Professional Services
Build Your Community Professional ServicesBuild Your Community Professional Services
Build Your Community Professional Services
 
Big Data, Analytics and Data Science
Big Data, Analytics and Data ScienceBig Data, Analytics and Data Science
Big Data, Analytics and Data Science
 
Big data and Marketing by Edward Chenard
Big data and Marketing by Edward ChenardBig data and Marketing by Edward Chenard
Big data and Marketing by Edward Chenard
 
#MarketingShake - Edward Chenard - Descubrí el poder del Big Data para Transf...
#MarketingShake - Edward Chenard - Descubrí el poder del Big Data para Transf...#MarketingShake - Edward Chenard - Descubrí el poder del Big Data para Transf...
#MarketingShake - Edward Chenard - Descubrí el poder del Big Data para Transf...
 
Data Science in Python.pptx
Data Science in Python.pptxData Science in Python.pptx
Data Science in Python.pptx
 
Intro of Key Features of Soft CAAT Ent Software
Intro of Key Features of Soft CAAT Ent SoftwareIntro of Key Features of Soft CAAT Ent Software
Intro of Key Features of Soft CAAT Ent Software
 
semana1.pptx
semana1.pptxsemana1.pptx
semana1.pptx
 
Data Analytics: Better Decision, Better Business
Data Analytics: Better Decision, Better BusinessData Analytics: Better Decision, Better Business
Data Analytics: Better Decision, Better Business
 
Approaching Big Data: Lesson Plan
Approaching Big Data: Lesson Plan Approaching Big Data: Lesson Plan
Approaching Big Data: Lesson Plan
 
Digital strategy overview
Digital strategy overviewDigital strategy overview
Digital strategy overview
 

Recently uploaded

一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
ocavb
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
StarCompliance.io
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
ewymefz
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
ukgaet
 

Recently uploaded (20)

Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictSupply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
 
Uber Ride Supply Demand Gap Analysis Report
Uber Ride Supply Demand Gap Analysis ReportUber Ride Supply Demand Gap Analysis Report
Uber Ride Supply Demand Gap Analysis Report
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsWebinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 

How to use your data science team: Becoming a data-driven organization

  • 1. Data 101: How to use your data science team: Becoming a data-driven organization March 29, 2016 Yael Garten Director of Data Science, LinkedIn http://www.linkedin.com/in/yaelgarten, @yaelgarten
  • 2. Data is the present: every industry, organization, product, decision data driven, or at least data informed.
  • 3. Good vs Great? Data is Expected, Discussed, & Accessible Expected For decision making e.g. Determining strategy, goal setting, impact estimates of initiatives Discussed Talked about e.g. product strategy reviews, design discussions, in the hallway Accessible When questions come up, easy to get hands on data? e.g. democratized data, self-serve tools Culture Process Tools
  • 4. How do we get started? (whether from scratch or to transform existing organization) Invest in the data Hire the right “data scientists” Culture Process Tools
  • 5. Data Scientist – which kind? human machine Find & communicate data insights Producing for: Deliverables: Skills: Machine learning models
  • 6. This talk will be a success if we: 1. Review the steps of data driven product innovation 2. Understand what is needed to best enable fostering of (or transformation into) a data-driven organization: culture, process, and tools.
  • 7. 1. Culture: Create a culture which expects decisions are informed by data & which treats data as a first class citizen 2. Process: Consciously map how you use data (in each phase of the product lifecycle, in making exec decisions) 3. Tools: invest in your data ecosystem (data quality, pipeline, and access tools) To create a data driven organization
  • 8. Data driven product innovation Product Lifecycle Ideation Design and Specs Development Test & Iterate Release Data driven product innovation framework: Use data to measure, understand, and improve the product: Build Measure Track Ship Tweak If data scientist is not involved until this stage, it may be too late.
  • 9. Well-connected. Get relevance right. Few connections. Give them inventory. 1. Opportunity sizing: how big or important is the problem? 2. Use data to predict successful product initiatives: • Show news articles • Suggest new connections • Suggest following active content creators Build Measure Track Ship Tweak
  • 10. Hypothesis: Following active sources leads to improved user experience with LinkedIn Feed Success Metric – progression of definition: • Total clicks on Follow • # clicks / #impressions of Follow suggestions • % Feed Inventory created by new followees • Downstream sustained engagement with items created by these followees • What is engaging? # of clicks? Time spent? # Shares? Combo? Actionable insights lead to product feature Success Metric Definition Tracking Instrumentation spec Experimentation Setup and Analysis Post-Launch Analysis Build Measure Track Ship Tweak is creating content we think you’ll find interesting Invest in developing the right success metric.
  • 11. ~Metric = Downstream engagement with items created by these followees Must enable attributing future clicks on feed items to that campaign as a source for the Follow. FeedActivityClick { memberID = 77777 actor = 55555 } FollowSources { followCampaign666 memberID = 77777 followeeID = 55555 } Need accurate reliable standardized data logging to enable metric computation. Actionable insights lead to product feature Success Metric Definition Tracking Instrumentation spec Experimentation Setup and Analysis Post-Launch Analysis Build Measure Track Ship Tweak
  • 12. Actionable insights lead to product feature Success Metric Definition Tracking Instrumentation spec Experimentation Setup and Analysis Post-Launch Analysis Design: How long to run experiment, on whom? Implement: properly set up & randomize to ensure no bias Analyze: Go or no-go? monitor success metric, ideally automated on company-wide platform for holistic view of impacts Build Measure Track Ship Tweak is creating content we think you’ll find interesting Rigorously set up, then identify whether the feature increased the success metric.
  • 13. Reporting, monitoring, ad hoc analysis Long term measures of engagement/success Analysis to inform revision of design Actionable insights lead to product feature Success Metric Definition Tracking Instrumentation spec Experimentation Setup and Analysis Post-Launch Analysis Build Measure Track Ship Tweak is creating content we think you’ll find interesting Iterate. How can we revise? How can we tweak to optimize?
  • 14. Invest in foundations. Culture Process Tools
  • 15. 1. Culture: Create a culture which expects decisions are informed by data & which treats data as a first class citizen 2. Process: Consciously map how you use data (in each phase of the product lifecycle, in making exec decisions) 3. Tools: invest in your data ecosystem (data quality, pipeline, and access tools) To create a data driven organization
  • 16. Data scientist as a partner, not a service – give context and ownership Strong bias to actionable impactful insights, speed of iteration & feedback • Data Foundations: governed datasets, consistent shared datasets and metrics • Data Democratization: self serve data exploration platform • Enable innovation: environment supports speedy ad-hoc analysis Culture Process Tools Actionable insights lead to product feature Success Metric Definition Tracking Instrumentation spec Experimentation Setup and Analysis Post-Launch Analysis Build Measure Track Ship Tweak
  • 17. Actionable insights lead to product feature Success Metric Definition Tracking Instrumentation spec Experimentation Setup and Analysis Post-Launch Analysis Build Measure Track Ship Tweak Democratize data – self served data exploration platform Enable people in your organization (execs, product managers, designers) to have data at their fingertips – to ask and answer questions
  • 18. s • Invest in creating the right metrics • user centric mindset; optimize for user value not team success all stakeholders agree upon success metrics prior to launching the feature test • Platform of shared tiered metrics visible to entire company • a metrics pipeline that enables easy implementation of metrics (and not manual one-offs) Actionable insights lead to product feature Success Metric Definition Tracking Instrumentation spec Experimentation Setup and Analysis Post-Launch Analysis Build Measure Track Ship Tweak Culture Process Tools
  • 19. sActionable insights lead to product feature Success Metric Definition Tracking Instrumentation spec Experimentation Setup and Analysis Post-Launch Analysis Build Measure Track Ship Tweak
  • 20. • Data as a first class citizen • Data tracking bugs as a launch blocker • Proactive joint definition of data requirements and contract (schemas) by producers and consumers • Data Model Review Committee • Data tracking spec tool for source of truth Automated testing of data tracking • Data Quality monitoring tool to ensure data contract is met Actionable insights lead to product feature Success Metric Definition Tracking Instrumentation spec Experimentation Setup and Analysis Post-Launch Analysis Build Measure Track Ship Tweak Culture Process Tools
  • 21. s • Belief in proactive controlled experiment as decision making tool • Efficient and principled lifecycle, from inception to decision • Before: Review experiment design & implementation • After: stakeholders discuss impacts and implications of tradeoffs • Company-wide experimentation platform, with tiered key metrics • Automated metric reporting & analysis capability Actionable insights lead to product feature Success Metric Definition Tracking Instrumentation spec Experimentation Setup and Analysis Post-Launch Analysis Build Measure Track Ship Tweak Culture Process Tools
  • 22. sActionable insights lead to product feature Success Metric Definition Tracking Instrumentation spec Experimentation Setup and Analysis Post-Launch Analysis Build Measure Track Ship Tweak
  • 23. s • Iteration and innovation • Metrics meeting: weekly to understand performance and product value • Combine qualitative user feedback with quantitative results • Long term hold-out groups for deep understanding of impacts Actionable insights lead to product feature Success Metric Definition Tracking Instrumentation spec Experimentation Setup and Analysis Post-Launch Analysis Build Measure Track Ship Tweak Culture Process Tools
  • 24. This talk will be a success if we: 1. Review the steps of data driven product innovation 2. Understand what is needed to best enable fostering of (or transformation into) a data-driven organization: culture, process, and tools.
  • 25. 1. Culture: Create a culture which expects decisions are informed by data & which treats data as a first class citizen 2. Process: Consciously map how you use data (in each phase of the product lifecycle, in making exec decisions) 3. Tools: invest in your data ecosystem (data quality, pipeline, and access tools) To create a data driven organization

Editor's Notes

  1. To have a mental framework of a definition: Splits roughly into two types: Is your data scientist producing analytics for human or computer? Who is the consumer or decision maker? We’ll focus on analytics. So, what is the culture process and tools needed? such as: Product recommendations Opportunity identification Analyses Metrics such as: Recommendation systems Classifiers Predictive models
  2. Lets say we go with some measure like the last. So we know what we want to measure. Are we logging the data in a way that we can compute this metric?
  3. Invest in created more sophisticated measure of value created for the user or customer. If Profile team increases Profile edits (better matching), but it decreases connections made (which might impact growth, new user acquisition) what does that mean? Example - Total signups vs quality signups At LinkedIn we have true norths. And we have tiers. Total Follows vs follow that drives to more liquidity as seen by long term engagement, click through of content generated by those followees