Implementing improved and
consistent arbitrary event
tracking company-wide using
Snowplow
Nora Paymer
Sr. Business & Consumer Insights Analyst,
StumbleUpon
10/6/2015
SF Snowplow MeetUp
About me
• Hi, I’m Nora
• BS & MA in Cognitive Neuroscience
– Ask me about sign/speech bilingualism or
optical illusions in the brain!
• Previous Roles:
– UC Berkeley: Institutional Analytics
– CBS Interactive: Inventory Analytics
– SquareTrade: Marketing/Consumer Insights
Analytics
• StumbleUpon: Business & Product Analytics
About StumbleUpon
• What is StumbleUpon?
– Recommendation Engine for the Internet
– Ad Platform for native advertisement
– Social engagement platform
• Still #4 in Referral Traffic* (behind
Facebook, Twitter, and Pinterest; ahead of
Reddit)
• Still alive and kicking!
*Shareaholic, Q4 2014 (most recent data available)
My Role
• Data Science Team & Finance/Sales
Analytics Team, but no dedicated Product or
Business Analytics
• When I was hired, I was asked to:
– Help the Product team build a data-driven culture
– Make data more available company-wide
• Better, easier-to-change dashboards
• Ability for non-data people to access data
– Help clean up Data Pipelines
• With support from amazing Data Engineering Team
Problems
1. Data siloed all over the place
2. Data inaccessible to most people
• Other data all over the place
• No way to integrate with
user/stumble/activity data
• Only accessible by a couple people each
• Only place to access most real
site data
• Dashboards all made with
R/Shiny
• Queries done at terminal, only
by Data Science/Analytics
Team
• Hive/MapReduce is slow for
real-time data querying!
Data sources
[Diagram: Protobuf messages, MySQL, HBase/Hive, MixPanel, FireBase, Adjust, App Annie, Desk.com, Salesforce, StrongView]
Solutions
1. Copy product data to quicker/more
universal data solution
2. Implement BI tool (Looker)
Data sources
• Send data to RedShift for faster querying
• Connect RedShift to Looker:
– Dashboards
– GUI Query Builder
[Diagram: Protobuf messages, MySQL, HBase/Hive, MixPanel, FireBase, Adjust, App Annie, Desk.com, Salesforce, StrongView, RedShift, Looker]
Problems
1. Data siloed all over the place
2. Data inaccessible to most people
3. Difficult for teams to add new events
– Only “official” solution was protobuf messages,
which was slow and needed to go through
Engineering/Data Science/Me just to record a
button click
– Teams started using MixPanel, which is
expensive and limited
Solutions
1. Copy product data to quicker/more
universal data solution
2. Implement BI tool (Looker)
3. Replace MixPanel with Snowplow for
arbitrary Event Reporting
– Sends data to RedShift for easy integration
with other data
– Easy for teams to add new events
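To illustrate what "easy integration with other data" can look like once events and product data sit in the same SQL store, here is a minimal sketch. It uses Python's built-in sqlite3 as a stand-in for RedShift, and the table names, column names, and rows are all hypothetical, not StumbleUpon's actual schema.

```python
import sqlite3

# In-memory SQLite database standing in for the shared warehouse.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Hypothetical product table and hypothetical event table.
    CREATE TABLE users (user_id INTEGER PRIMARY KEY, signup_platform TEXT);
    CREATE TABLE events (user_id INTEGER, event_name TEXT);
    INSERT INTO users VALUES (1, 'iOS'), (2, 'site');
    INSERT INTO events VALUES (1, 'thumbup'), (1, 'thumbup'), (2, 'share');
""")

# Join tracked events to user data on a shared user id and count them.
rows = conn.execute("""
    SELECT u.signup_platform, e.event_name, COUNT(*) AS n
    FROM events AS e
    JOIN users AS u ON u.user_id = e.user_id
    GROUP BY u.signup_platform, e.event_name
    ORDER BY u.signup_platform, e.event_name
""").fetchall()

print(rows)
```

The join only works if every event carries a reliable user id, so that field needs to be right from the first event sent.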
Data Sources
[Diagram: Protobuf messages, MySQL, HBase/Hive, MixPanel, FireBase, Adjust, App Annie, Desk.com, Salesforce, StrongView, RedShift, Looker, Snowplow]
Problems
1. Data siloed all over the place
2. Data inaccessible to most people
3. Difficult for teams to add new events
4. So many teams! So much integration!
– Mobile (iOS & Android), Site (back end & front
end), Ads, Marketing (including install referral
info & email marketing & other), Firefox &
Chrome toolbars, etc. etc.
How we did it
Intended Plan:
1. Site implements default page tracker
2. Site implements 2-3 events to make sure
flow is working properly
– Structured Events
3. Assess if everything is working
4. Mobile implements 2-3 events per platform
5. Then roll out everywhere
How we did it
What Actually Happened:
1. Site implemented default page tracker
2. Site implemented ~100 events
– Structured Events
3. Mobile replaced all MixPanel events with
Snowplow
– Structured Events
– Some trouble with implementation/integration with
Android
– Used a wiki page created by a site engineer; it had
confusing language and did some things oddly
4. Testing??
Uh-Oh
• Structured Events not really the right thing:
• Didn’t have userid implemented properly
originally
• More fields were going to be needed
Snowplow Term   Our Use
Category        Event Name (e.g. thumbup)
Action          Event Type (e.g. click vs view)
Label           Platform (site, iOS…)
Property        Version #
Value           When event had a value associated with it
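The mapping in the table can be sketched as a small data structure. This is only an illustration: the class, the function, and the example version string "4.2.1" are hypothetical, and it does not call any Snowplow tracker API.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class StructuredEvent:
    """The five structured-event fields named in the table."""
    category: str
    action: str
    label: Optional[str] = None
    property_: Optional[str] = None  # trailing underscore avoids the built-in name
    value: Optional[float] = None


def to_structured(event_name: str, event_type: str, platform: str,
                  version: str, value: Optional[float] = None) -> StructuredEvent:
    """Map the team's concepts onto the structured-event fields."""
    return StructuredEvent(
        category=event_name,   # e.g. "thumbup"
        action=event_type,     # e.g. "click" or "view"
        label=platform,        # e.g. "site", "iOS"
        property_=version,     # version number
        value=value,           # only when the event carries a value
    )


event = to_structured("thumbup", "click", "iOS", "4.2.1")
print(event)
```

All five fields are already taken by this mapping, so nothing is left over for any additional field that is needed later.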
So? Switch to Unstructured Events! Easy, right?
• OK great, come up with a new framework for
Unstructured Events!
– Some required fields across all events
– Some optional fields that we know will be widely
used from day 1
– Nature of unstructured events is that more fields
could be added later
Field            Req’d?  Description
event_name       y       Event name
platform         y       site, iOS, Android, etc.
device_version   y       Version number (standard field)
event_category   n       e.g. click, view; useful for filtering
event_group      n       For defining a group of events, for filtering
value            n       For events with a value
referrer         n       Referral source (when applicable)
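A framework like the one in the table could be enforced with a small helper before events are sent. The sketch below is an assumption about how that might look, not the team's implementation. The function name is hypothetical, and the schema URI uses a placeholder vendor and event name in the `iglu:vendor/name/format/version` form that Snowplow self-describing JSON uses.

```python
from typing import Any, Dict

# Required and optional fields, taken from the table.
REQUIRED_FIELDS = ("event_name", "platform", "device_version")
OPTIONAL_FIELDS = ("event_category", "event_group", "value", "referrer")

# Hypothetical schema URI: vendor and event name are placeholders.
SCHEMA_URI = "iglu:com.example/generic_event/jsonschema/1-0-0"


def build_event(**fields: Any) -> Dict[str, Any]:
    """Check the required fields and wrap the payload as self-describing JSON.

    Fields outside the two lists are allowed through, since more fields
    may be added later.
    """
    missing = [name for name in REQUIRED_FIELDS
               if fields.get(name) in (None, "")]
    if missing:
        raise ValueError("missing required fields: " + ", ".join(missing))
    return {"schema": SCHEMA_URI, "data": dict(fields)}


payload = build_event(
    event_name="thumbup",
    platform="iOS",
    device_version="4.2.1",   # made-up version string
    event_category="click",
)
print(payload)
```

Rejecting an event that lacks a required field at build time keeps every team's events in the same shape.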
Sounds good so far!
• Teams that had already implemented
Structured did not want to implement
Unstructured
– They had already spent Eng time on this, why
spend more?
• Everyone is always on a tight timeline
– Had trouble seeing the value in the format of
their events matching the format of teams they
didn’t work with.
• Result? Arguments and top-down mandates
What should we have done differently?
1. Program management across all teams
– Didn’t have anyone officially in charge
2. Implement in phases: do test events & a
test project before going full live
3. Excellent Documentation
4. Get buy-in from everyone from day one
5. Think through dream/far-fetched use
cases: what will you need for that?
6. Use Snowplow team for advice!
So now what?
• Still working on it
• Connecting all existing data pipelines to
RedShift, sometimes via Snowplow
• Better utilizing Snowplow when back end
tracking is too cumbersome
– Referral Tracking: both reg and landing page
– Better understanding of engagement and Time
on Site (for non-stumble pages especially)
– Understanding user flow through the site
– Etc. etc. etc, hopefully!
[Diagram: Protobuf messages, MySQL, HBase/Hive, MixPanel, FireBase, Adjust, App Annie, Desk.com, Salesforce, StrongView, RedShift, Looker, Snowplow, "New Data!"]
Thank You!
Questions, etc?

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 

Implementing improved and consistent arbitrary event tracking company-wide using Snowplow

  • 1. Implementing improved and consistent arbitrary event tracking company-wide using Snowplow Nora Paymer Sr. Business & Consumer Insights Analyst, StumbleUpon 10/6/2015 SF Snowplow MeetUp
  • 2. About me • Hi, I’m Nora • BS & MA in Cognitive Neuroscience – Ask me about sign/speech bilingualism or optical illusions in the brain! • Previous Roles: – UC Berkeley: Institutional Analytics – CBS Interactive: Inventory Analytics – SquareTrade: Marketing/Consumer Insights Analytics • StumbleUpon: Business & Product Analytics
  • 3. About StumbleUpon • What is StumbleUpon? – Recommendation Engine for the Internet – Ad Platform for native advertisement – Social engagement platform • Still #4 in Referral Traffic* (behind Facebook, Twitter, and Pinterest; ahead of Reddit) • Still alive and kicking! *Shareaholic, Q4 2014 (most recent data available)
  • 4. My Role • Data Science Team & Finance/Sales Analytics Team, but no dedicated Product or Business Analytics • When I was hired, I was asked to: – Help Product team build a data-driven culture – Make data more available company-wide • Better & easier to change dashboards • Ability for non-data people to access data – Help clean up Data Pipelines • With support from amazing Data Engineering Team
  • 5. Problems 1. Data siloed all over the place 2. Data inaccessible to most people
  • 6. Data sources: Protobuf messages, MySQL, HBase/Hive, MixPanel, FireBase, Adjust, App Annie, Desk.com, Salesforce, StrongView • Other data all over the place • No way to integrate with user/stumble/activity data • Only accessible by a couple people each • Only place to access most real site data • Dashboards all made with R/Shiny • Queries done at terminal, only by Data Science/Analytics Team • Hive/MapReduce is slow for real-time data querying!
  • 7. Solutions 1. Copy product data to quicker/more universal data solution 2. Implement BI tool (Looker)
  • 8. Data sources: Protobuf messages, MySQL, HBase/Hive, MixPanel, FireBase, Adjust, App Annie, Desk.com, Salesforce, StrongView • Send data to RedShift for faster querying • Connect RedShift to Looker: – Dashboards – GUI Query Builder [Diagram labels: RedShift, Looker]
  • 9. Problems 1. Data siloed all over the place 2. Data inaccessible to most people 3. Difficult for teams to add new events – Only “official” solution was protobuf messages, which was slow and needed to go through Engineering/Data Science/Me just to record a button click – Teams started using MixPanel, which is expensive and limited
  • 10. Solutions 1. Copy product data to quicker/more universal data solution 2. Implement BI tool (Looker) 3. Replace MixPanel with Snowplow for arbitrary Event Reporting – Sends data to RedShift for easy integration with other data – Easy for teams to add new events
  • 12. Problems 1. Data siloed all over the place 2. Data inaccessible to most people 3. Difficult for teams to add new events 4. So many teams! So much integration! – Mobile (iOS & Android), Site (back end & front end), Ads, Marketing (including install referral info & email marketing & other), Firefox & Chrome toolbars, etc. etc.
  • 13. How we did it Intended Plan: 1. Site implements default page tracker 2. Site implements 2-3 events to make sure flow is working properly – Structured Events 3. Assess if everything is working 4. Mobile implements 2-3 events per platform 5. Then roll out everywhere
  • 14. How we did it What Actually Happened: 1. Site implemented default page tracker 2. Site implemented ~100 events – Structured Events 3. Mobile replaced all MixPanel events with Snowplow – Structured Events – Some trouble with implementation/integration on Android – Used a wiki page created by a site engineer, which had confusing language and did some things weirdly 4. Testing??
  • 15. Uh-Oh • Structured Events not really the right thing: – Didn’t have userid implemented properly originally – More fields were going to be needed
      Snowplow Term   Our Use
      Category        Event Name (e.g. thumbup)
      Action          Event Type (e.g. click vs view)
      Label           Platform (site, iOS…)
      Property        Version #
      Value           When event had a value associated with it
  • 16. So? Switch to Unstructured Events! Easy, right? • OK great, come up with a new framework for Unstructured Events! – Some required fields across all events – Some optional fields that we know will be widely used from day 1 – Nature of unstructured events is that more fields could be added later
      Field            Req’d?   Description
      event_name       y        Event name
      platform         y        site, iOS, Android, etc.
      device_version   y        Version number (standard field)
      event_category   n        e.g. click; view: useful for filtering
      event_group      n        For defining a group of events, for filtering
      value            n        For events with a value
      referrer         n        Referral source (when applicable)
  • 17. Sounds good so far! • Teams that had already implemented Structured did not want to implement Unstructured – They had already spent Eng time on this, why spend more? • Everyone is always on a tight timeline – Had trouble seeing the value in the format of their events matching the format of teams they didn’t work with. • Result? Arguments and top-down mandates
  • 18. What should we have done differently? 1. Program management across all teams – Didn’t have anyone officially in charge 2. Implement in phases: do test events & a test project before going full live 3. Excellent Documentation 4. Get buy-in from everyone from day one 5. Think through dream/far-fetched use cases: what will you need for that? 6. Use Snowplow team for advice!
  • 19. So now what? • Still working on it • Connecting all existing data pipelines to RedShift, sometimes via Snowplow • Better utilizing Snowplow when back end tracking is too cumbersome – Referral Tracking: both reg and landing page – Better understanding of engagement and Time on Site (for non-stumble pages especially) – Understanding user flow through the site – Etc. etc. etc, hopefully!
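
A rough sketch of how the slide 16 field framework could be checked in code before an event is sent. This is not from the talk: it assumes events are plain Python dicts, and the helper names `build_event` and `extra_fields` are hypothetical. Only the field names and their required/optional status come from the slide 16 table. No Snowplow tracker calls are shown, since the deck does not say how each team wired events into its tracker.

```python
from typing import Any, Dict, List

# Field names and required/optional status are taken from the slide 16 table.
REQUIRED_FIELDS = ("event_name", "platform", "device_version")
OPTIONAL_FIELDS = ("event_category", "event_group", "value", "referrer")


def build_event(**fields: Any) -> Dict[str, Any]:
    """Return an event payload, raising ValueError if a required field is missing.

    Hypothetical helper: the name and behavior are illustrative assumptions.
    """
    missing = [name for name in REQUIRED_FIELDS if fields.get(name) in (None, "")]
    if missing:
        raise ValueError("missing required fields: " + ", ".join(missing))
    # Fields outside the shared list are kept on purpose: the slide notes that
    # more fields could be added later.
    return {key: val for key, val in fields.items() if val is not None}


def extra_fields(event: Dict[str, Any]) -> List[str]:
    """List the fields in an event that are not part of the shared framework."""
    known = set(REQUIRED_FIELDS) | set(OPTIONAL_FIELDS)
    return sorted(key for key in event if key not in known)


if __name__ == "__main__":
    # The values and the team-specific field "button_color" are made up.
    event = build_event(
        event_name="thumbup",
        platform="iOS",
        device_version="1.0",
        event_category="click",
        button_color="blue",
    )
    print(event)
    print(extra_fields(event))
```

A shared check like this is one way to keep the required fields consistent across teams while still letting each team attach its own fields. Reporting the extras rather than rejecting them makes it easier to spot fields worth promoting into the shared list.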