SlideShare a Scribd company logo
Multi-touch Attribution
Sri Sri Perangur
Key challenge around designing a solution
ABOUT SRI²
Sr. Data Scientist at Spotify
Sr. Data Scientist at Skyscanner
Data Scientist at State
Developer/Tech analyst at RBC Capital Markets
Research Associate at BAE Systems Applied Intelligence Unit
SKYSCANNER
Multi-touch
Attribution
The challenge is to identify the
distribution of credit amongst
multiple players.
The problem is seen in marketing
and product space amongst
others.
Training Data
The players’ credit is not easy to
observe directly.
Makes it challenging to define
the label/training data.
Need labeled data
to enable comparison of the
many solution options available
these days.
Simulated
Training data
The expert knowledge &
known data trails are captured to
define the Simulated Marketing
Environment .
This enables scoring and stress
testing with a range of training
data sets.
SUMMARY
Multi-Touch Attribution
Problem space
1.
ATTRIBUTION EXPLAINED
Thanks to SegmentIO
WHERE DO WE SEE THIS OCCUR IN PRODUCTS ?
Social Networks
Search Engine
Marketing
Google Display
Network
Organic
Direct
Book
through
Time
£
£
WHERE DO WE SEE THIS OCCUR IN MARKETING ?
Day 0 Day 1 Day 7 Day 15 Day 16
www.skyscanner.net
Social Networks
Search Engine
Marketing
Google Display
Network
Organic
Direct
Book
through
Time
£
£
WHERE DO WE SEE THIS OCCUR IN MARKETING ?
Day 0 Day 1 Day 7 Day 15 Day 16
www.skyscanner.net
30%
20%
10%
45%
0%
Hypothetically
Social Networks
Search Engine
Marketing
Google Display
Network
Organic
Direct
Book
through
Time
WHERE DO WE SEE THIS OCCUR IN MARKETING ?
Day 0 Day 1 Day 7 Day 15 Day 16
30%
20%
10%
45%
0%
Hypothetically
Designing a solution
...a Data Driven solution
2.
MOVING AWAY FROM THE PARTIAL VIEW
Even Time decay Custom rules
70,000,000 users
and counting
Data Driven Attribution
DATA DRIVEN SOLUTION
WHICH IS THE BEST METHOD ?
Methods are ready to train ...Where is the training data ?
Training data3.
1010101010100101010101
0101001010100101010101
0101010100101010100101
1010101010100101010101
0101001010100101010101
0101010100101010100101
1010101011010101010100
1010101010101001010100
1010101010101010100101
0101001011010101010100
1010101010101001010100
1010101010101010100101
0101001011010101010100
1010101010101001010100
1010101010101010100101
0101001010100101010101
0101001010100101010101
0101010100101010100101
Which is the best method for MTA
Identifying the best3.
TYPICAL MACHINE LEARNING PROBLEM
● Elevation
● Price per
sqft
● Price
● Square
feet
● Year built
● Bathrooms
MULTI-TOUCH ATTRIBUTION PROBLEM
● No of
touchpoints
user sees
● Marketing channel
distribution traits
● Sequence of
occurrence
● And so on..
Channel Value
Social Network ?
Search Engine Marketing ?
Google Display Network ?
Organic ?
Direct ?
X
aka Latent variables
Hidden in the real world
Concepts we can’t
observe directly …
LATENT VARIABLES
but we can infer
with byproducts...
Social Networks
Search Engine
Marketing
Google Display
Network
Organic
Direct
Book
through
Time
EXPERT OPINION MATTERS
Day 0 Day 1 Day 7 Day 15 Day 16
30%
20%
10%
45%
0%
Hypothetically
MULTI-TOUCH ATTRIBUTION PROBLEM
● No of touchpoints
user sees
● Marketing channel
distribution traits
● Sequence of
occurrence
● And so on..
?X
MULTI-TOUCH ATTRIBUTION PROBLEM
X
● No of
touchpoints
user sees
● Marketing channel
distribution traits
● Sequence of
occurrence
● And so on..
BUT CAN AN EXPERT GAUGE MARGINS EASILY ?
Simulated
training data
to enable classifier training
SIMULATED MARKETING ENVIRONMENT
CAPTURING EXPERT OPINION
Aim is to capture the educated expectations of experts
in the field, regarding the key perspectives .
To name a few:
● Marketing channel value
● Marketing channel after effect
In a way that we can fluctuate the key variables
CAPTURING EXPERT OPINION
Aim is to capture the educated expectations of experts
in the field, regarding the key perspectives .
To name a few:
● Marketing channel value
● Marketing channel after effect
In a way that we can fluctuate the key variables
Channel Value
(hypo
thetic
ally)
Social Network 30%
Search Engine Marketing 20%
Google Display Network 10%
Organic 45%
Direct 0%
CAPTURING EXPERT OPINION
Aim is to capture the educated expectations of experts
in the field, regarding the key perspectives .
To name a few:
● Marketing channel value
● Marketing channel after effect
In a way that we can fluctuate the key variables
Impact over weeks for
channel X
CAPTURING KNOWN DATA TRAITS
Capturing known traits of channel behaviour seen in company data
● Distribution of users over marketing channels
● Distribution of various marketing touch points over all touchpoint
● Distribution of the marketing touch points over our users’ journey
● Distribution of sessions/conversions/conversion rate per channel
CAPTURING KNOWN DATA TRAITS
Capturing known traits of channel behaviour seen in company data
● Distribution of users over marketing channels
● Distribution of various marketing touch points over all touchpoint
● Distribution of the marketing touch points over our users’ journey
● Distribution of sessions/conversions/conversion rate per channel
CAPTURING KNOWN DATA TRAITS
Capturing known traits of channel behaviour seen in company data
● Distribution of users over marketing channels
● Distribution of various marketing touch points over all touchpoint
● Distribution of the marketing touch points over our users’ journey
● Distribution of sessions/conversions/conversion rate per channel
CAPTURING KNOWN DATA TRAITS
Capturing known traits of channel behaviour seen in company data
● Distribution of users over marketing channels
● Distribution of various marketing touch points over all touchpoint
● Distribution of the marketing touch points over our users’ journey
● Distribution of sessions/conversions/conversion rate per channel
Distribution of seeing channel X
our user touchpoint position
CAPTURING KNOWN DATA TRAITS
Capturing known traits of channel behaviour seen in company data
● Distribution of users over marketing channels
● Distribution of various marketing touch points over all touchpoint
● Distribution of the marketing touch points over our users’ journey
● Distribution of sessions/conversions/conversion rate per channel
SIMULATED MARKETING ENVIRONMENT
Results
Best method for Multi-touch attribution
For Skyscanner
MULTI-TOUCH ATTRIBUTION PROBLEM
X
90%
SCORING AGAINST SIMULATED GOAL
2.69%
Current
Industries
popular
method
Baseline
Stress testing
models
How durable are these solution in other
scenarios ?
WHAT IF...
The Channel estimates are wrong ?
Estimated channel value as £ X but should be …
• £ X*100
• £ X * 0.01
• ….
STRESS TESTING
1. Fluctuate probability of channel exposure
2. Fluctuate probability of purchase
3. Fluctuate relative touchpoint weights ( ex:
marketing channel after effect)
90%
STRESS TESTING RESULTS
The Method Z (i.e. the 90% accurate
one) is the one in Green.
Method Z is the best even at
adapting to change when tested in
20 scenarios.
Underestimate Overestimate
APPLYING IN PRODUCTION
SIMULATION STACK
Channel Value
(hypotheticall
y)
Social Network 30%
Search Engine Marketing 20%
Google Display Network 10%
Organic 45%
Direct 0%
PRODUCTION APPLICATION STACK
Channel Value
Social Network a%
Search Engine Marketing b%
Google Display Network c%
Organic d%
Direct e%
CONCLUSION
● Understand accuracy of a approach
● Comparison of different approaches
● Tuning of different solutions
● Stress testing solutions
Thank you
Cheers to you all
& my team and community that have made this possible
Segment IO
Growth images
Content: Pretty images:
MC Attribution Blog
Shapley value Attribution blog
Photos from Unsplash
ML Gif
Pretty Icons from Flaticons
Food for thought -
Prof. Tamara Broderick
from MIT has been working
o
n enabling estimates
of latent variables through
Bayesian sampling
methods.
Skyscanner MTA
research poster
Research icons
Movie image
Oscar pic
Thank you
Cheers to you all
& my team and community that have made this possible
Any questions?
You can find me at:
» @perangur
» www.perangur.com
» Slides: http://bit.ly/MTASummerConf

More Related Content

Similar to Data Summer Conf 2018, “Multi-touch Attribution: Key challenge around designing a solution (ENG)” — Sri Sri, Sr. Data Scientist at Spotify

How Choosing the Right Formats and Placements Can Supercharge Your Game Reven...
How Choosing the Right Formats and Placements Can Supercharge Your Game Reven...How Choosing the Right Formats and Placements Can Supercharge Your Game Reven...
How Choosing the Right Formats and Placements Can Supercharge Your Game Reven...
Jessica Tams
 
Attribution modeling 101, Mariia Bocheva
Attribution modeling 101, Mariia BochevaAttribution modeling 101, Mariia Bocheva
Attribution modeling 101, Mariia Bocheva
Mariia Bocheva
 
Attribution modeling 101
Attribution modeling 101 Attribution modeling 101
Attribution modeling 101
OWOX BI
 
The Top 5 FAQ's When Considering Programmatic
The Top 5 FAQ's When Considering ProgrammaticThe Top 5 FAQ's When Considering Programmatic
The Top 5 FAQ's When Considering Programmatic
Hanapin Marketing
 
Google Analytics Konferenz 2019_Attribution: building a model_Martin Frotzler...
Google Analytics Konferenz 2019_Attribution: building a model_Martin Frotzler...Google Analytics Konferenz 2019_Attribution: building a model_Martin Frotzler...
Google Analytics Konferenz 2019_Attribution: building a model_Martin Frotzler...
e-dialog GmbH
 
Dan Golden's Internet Summit 2015 Presentation - Practical Attribution
Dan Golden's Internet Summit 2015 Presentation - Practical AttributionDan Golden's Internet Summit 2015 Presentation - Practical Attribution
Dan Golden's Internet Summit 2015 Presentation - Practical Attribution
BFO
 
Digital marketing Nanodegree Portfolio
Digital marketing Nanodegree PortfolioDigital marketing Nanodegree Portfolio
Digital marketing Nanodegree Portfolio
Clayton Condict
 
New Adtaxi General Presentation - 9.28.16-2
New Adtaxi General Presentation - 9.28.16-2New Adtaxi General Presentation - 9.28.16-2
New Adtaxi General Presentation - 9.28.16-2Victoria Collins
 
Data driven road to success, Analytics conference, Nora Thoth
Data driven road to success, Analytics conference, Nora ThothData driven road to success, Analytics conference, Nora Thoth
Data driven road to success, Analytics conference, Nora Thoth
Red Orbit digital marketing
 
Next Gen Lead Gen for Schools - Digital Ad Strategies for 2023
Next Gen Lead Gen for Schools - Digital Ad Strategies for 2023Next Gen Lead Gen for Schools - Digital Ad Strategies for 2023
Next Gen Lead Gen for Schools - Digital Ad Strategies for 2023
Higher Education Marketing
 
Nadav Ashkenazy - Casual Connect Europe 2018
Nadav Ashkenazy - Casual Connect Europe 2018Nadav Ashkenazy - Casual Connect Europe 2018
Nadav Ashkenazy - Casual Connect Europe 2018
ironSource
 
Steve Smith - Opening Remarks
Steve Smith - Opening RemarksSteve Smith - Opening Remarks
Steve Smith - Opening Remarks
MediaPost
 
Qmd 1903 wast_googad
Qmd 1903 wast_googadQmd 1903 wast_googad
Qmd 1903 wast_googad
EvoLife.bg
 
How to Supercharge Your Marketing Automation Using Audience Research and User...
How to Supercharge Your Marketing Automation Using Audience Research and User...How to Supercharge Your Marketing Automation Using Audience Research and User...
How to Supercharge Your Marketing Automation Using Audience Research and User...
Element Three
 
Hero conference 2016 - advanced bidding
Hero conference   2016 - advanced biddingHero conference   2016 - advanced bidding
Hero conference 2016 - advanced bidding
Chris Haleua
 
Retail Media in the EU and What You Can Learn from Success and Failures in th...
Retail Media in the EU and What You Can Learn from Success and Failures in th...Retail Media in the EU and What You Can Learn from Success and Failures in th...
Retail Media in the EU and What You Can Learn from Success and Failures in th...
Elizabeth Marsten
 
Marketing in the Moment: Trends and Innovations in Real-Time Omni-Channel Mar...
Marketing in the Moment: Trends and Innovations in Real-Time Omni-Channel Mar...Marketing in the Moment: Trends and Innovations in Real-Time Omni-Channel Mar...
Marketing in the Moment: Trends and Innovations in Real-Time Omni-Channel Mar...
Ensighten
 
IronSource_WN_2018.pdf
IronSource_WN_2018.pdfIronSource_WN_2018.pdf
IronSource_WN_2018.pdf
lirabini1977
 
Operational Attribution with Google Analytics
Operational Attribution with Google AnalyticsOperational Attribution with Google Analytics
Operational Attribution with Google Analytics
Jonathan Breton
 
7 Habits of Highly Effective Personalisation Organisations | Optimizely ANZ W...
7 Habits of Highly Effective Personalisation Organisations | Optimizely ANZ W...7 Habits of Highly Effective Personalisation Organisations | Optimizely ANZ W...
7 Habits of Highly Effective Personalisation Organisations | Optimizely ANZ W...
Optimizely
 

Similar to Data Summer Conf 2018, “Multi-touch Attribution: Key challenge around designing a solution (ENG)” — Sri Sri, Sr. Data Scientist at Spotify (20)

How Choosing the Right Formats and Placements Can Supercharge Your Game Reven...
How Choosing the Right Formats and Placements Can Supercharge Your Game Reven...How Choosing the Right Formats and Placements Can Supercharge Your Game Reven...
How Choosing the Right Formats and Placements Can Supercharge Your Game Reven...
 
Attribution modeling 101, Mariia Bocheva
Attribution modeling 101, Mariia BochevaAttribution modeling 101, Mariia Bocheva
Attribution modeling 101, Mariia Bocheva
 
Attribution modeling 101
Attribution modeling 101 Attribution modeling 101
Attribution modeling 101
 
The Top 5 FAQ's When Considering Programmatic
The Top 5 FAQ's When Considering ProgrammaticThe Top 5 FAQ's When Considering Programmatic
The Top 5 FAQ's When Considering Programmatic
 
Google Analytics Konferenz 2019_Attribution: building a model_Martin Frotzler...
Google Analytics Konferenz 2019_Attribution: building a model_Martin Frotzler...Google Analytics Konferenz 2019_Attribution: building a model_Martin Frotzler...
Google Analytics Konferenz 2019_Attribution: building a model_Martin Frotzler...
 
Dan Golden's Internet Summit 2015 Presentation - Practical Attribution
Dan Golden's Internet Summit 2015 Presentation - Practical AttributionDan Golden's Internet Summit 2015 Presentation - Practical Attribution
Dan Golden's Internet Summit 2015 Presentation - Practical Attribution
 
Digital marketing Nanodegree Portfolio
Digital marketing Nanodegree PortfolioDigital marketing Nanodegree Portfolio
Digital marketing Nanodegree Portfolio
 
New Adtaxi General Presentation - 9.28.16-2
New Adtaxi General Presentation - 9.28.16-2New Adtaxi General Presentation - 9.28.16-2
New Adtaxi General Presentation - 9.28.16-2
 
Data driven road to success, Analytics conference, Nora Thoth
Data driven road to success, Analytics conference, Nora ThothData driven road to success, Analytics conference, Nora Thoth
Data driven road to success, Analytics conference, Nora Thoth
 
Next Gen Lead Gen for Schools - Digital Ad Strategies for 2023
Next Gen Lead Gen for Schools - Digital Ad Strategies for 2023Next Gen Lead Gen for Schools - Digital Ad Strategies for 2023
Next Gen Lead Gen for Schools - Digital Ad Strategies for 2023
 
Nadav Ashkenazy - Casual Connect Europe 2018
Nadav Ashkenazy - Casual Connect Europe 2018Nadav Ashkenazy - Casual Connect Europe 2018
Nadav Ashkenazy - Casual Connect Europe 2018
 
Steve Smith - Opening Remarks
Steve Smith - Opening RemarksSteve Smith - Opening Remarks
Steve Smith - Opening Remarks
 
Qmd 1903 wast_googad
Qmd 1903 wast_googadQmd 1903 wast_googad
Qmd 1903 wast_googad
 
How to Supercharge Your Marketing Automation Using Audience Research and User...
How to Supercharge Your Marketing Automation Using Audience Research and User...How to Supercharge Your Marketing Automation Using Audience Research and User...
How to Supercharge Your Marketing Automation Using Audience Research and User...
 
Hero conference 2016 - advanced bidding
Hero conference   2016 - advanced biddingHero conference   2016 - advanced bidding
Hero conference 2016 - advanced bidding
 
Retail Media in the EU and What You Can Learn from Success and Failures in th...
Retail Media in the EU and What You Can Learn from Success and Failures in th...Retail Media in the EU and What You Can Learn from Success and Failures in th...
Retail Media in the EU and What You Can Learn from Success and Failures in th...
 
Marketing in the Moment: Trends and Innovations in Real-Time Omni-Channel Mar...
Marketing in the Moment: Trends and Innovations in Real-Time Omni-Channel Mar...Marketing in the Moment: Trends and Innovations in Real-Time Omni-Channel Mar...
Marketing in the Moment: Trends and Innovations in Real-Time Omni-Channel Mar...
 
IronSource_WN_2018.pdf
IronSource_WN_2018.pdfIronSource_WN_2018.pdf
IronSource_WN_2018.pdf
 
Operational Attribution with Google Analytics
Operational Attribution with Google AnalyticsOperational Attribution with Google Analytics
Operational Attribution with Google Analytics
 
7 Habits of Highly Effective Personalisation Organisations | Optimizely ANZ W...
7 Habits of Highly Effective Personalisation Organisations | Optimizely ANZ W...7 Habits of Highly Effective Personalisation Organisations | Optimizely ANZ W...
7 Habits of Highly Effective Personalisation Organisations | Optimizely ANZ W...
 

More from Provectus

Choosing the right IDP Solution
Choosing the right IDP SolutionChoosing the right IDP Solution
Choosing the right IDP Solution
Provectus
 
Intelligent Document Processing in Healthcare. Choosing the Right Solutions.
Intelligent Document Processing in Healthcare. Choosing the Right Solutions.Intelligent Document Processing in Healthcare. Choosing the Right Solutions.
Intelligent Document Processing in Healthcare. Choosing the Right Solutions.
Provectus
 
Choosing the Right Document Processing Solution for Healthcare Organizations
Choosing the Right Document Processing Solution for Healthcare OrganizationsChoosing the Right Document Processing Solution for Healthcare Organizations
Choosing the Right Document Processing Solution for Healthcare Organizations
Provectus
 
MLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionMLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in Production
Provectus
 
AI Stack on AWS: Amazon SageMaker and Beyond
AI Stack on AWS: Amazon SageMaker and BeyondAI Stack on AWS: Amazon SageMaker and Beyond
AI Stack on AWS: Amazon SageMaker and Beyond
Provectus
 
Feature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine LearningFeature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine Learning
Provectus
 
MLOps and Reproducible ML on AWS with Kubeflow and SageMaker
MLOps and Reproducible ML on AWS with Kubeflow and SageMakerMLOps and Reproducible ML on AWS with Kubeflow and SageMaker
MLOps and Reproducible ML on AWS with Kubeflow and SageMaker
Provectus
 
Cost Optimization for Apache Hadoop/Spark Workloads with Amazon EMR
Cost Optimization for Apache Hadoop/Spark Workloads with Amazon EMRCost Optimization for Apache Hadoop/Spark Workloads with Amazon EMR
Cost Optimization for Apache Hadoop/Spark Workloads with Amazon EMR
Provectus
 
ODSC webinar "Kubeflow, MLFlow and Beyond — augmenting ML delivery" Stepan Pu...
ODSC webinar "Kubeflow, MLFlow and Beyond — augmenting ML delivery" Stepan Pu...ODSC webinar "Kubeflow, MLFlow and Beyond — augmenting ML delivery" Stepan Pu...
ODSC webinar "Kubeflow, MLFlow and Beyond — augmenting ML delivery" Stepan Pu...
Provectus
 
"Building a Modern Data platform in the Cloud", Alex Casalboni, AWS Dev Day K...
"Building a Modern Data platform in the Cloud", Alex Casalboni, AWS Dev Day K..."Building a Modern Data platform in the Cloud", Alex Casalboni, AWS Dev Day K...
"Building a Modern Data platform in the Cloud", Alex Casalboni, AWS Dev Day K...
Provectus
 
"How to build a global serverless service", Alex Casalboni, AWS Dev Day Kyiv ...
"How to build a global serverless service", Alex Casalboni, AWS Dev Day Kyiv ..."How to build a global serverless service", Alex Casalboni, AWS Dev Day Kyiv ...
"How to build a global serverless service", Alex Casalboni, AWS Dev Day Kyiv ...
Provectus
 
"Automating AWS Infrastructure with PowerShell", Martin Beeby, AWS Dev Day Ky...
"Automating AWS Infrastructure with PowerShell", Martin Beeby, AWS Dev Day Ky..."Automating AWS Infrastructure with PowerShell", Martin Beeby, AWS Dev Day Ky...
"Automating AWS Infrastructure with PowerShell", Martin Beeby, AWS Dev Day Ky...
Provectus
 
"Analyzing your web and application logs", Javier Ramirez, AWS Dev Day Kyiv 2...
"Analyzing your web and application logs", Javier Ramirez, AWS Dev Day Kyiv 2..."Analyzing your web and application logs", Javier Ramirez, AWS Dev Day Kyiv 2...
"Analyzing your web and application logs", Javier Ramirez, AWS Dev Day Kyiv 2...
Provectus
 
"Resiliency and Availability Design Patterns for the Cloud", Sebastien Storma...
"Resiliency and Availability Design Patterns for the Cloud", Sebastien Storma..."Resiliency and Availability Design Patterns for the Cloud", Sebastien Storma...
"Resiliency and Availability Design Patterns for the Cloud", Sebastien Storma...
Provectus
 
"Architecting SaaS solutions on AWS", Oleksandr Mykhalchuk, AWS Dev Day Kyiv ...
"Architecting SaaS solutions on AWS", Oleksandr Mykhalchuk, AWS Dev Day Kyiv ..."Architecting SaaS solutions on AWS", Oleksandr Mykhalchuk, AWS Dev Day Kyiv ...
"Architecting SaaS solutions on AWS", Oleksandr Mykhalchuk, AWS Dev Day Kyiv ...
Provectus
 
"Developing with .NET Core on AWS", Martin Beeby, AWS Dev Day Kyiv 2019
"Developing with .NET Core on AWS", Martin Beeby, AWS Dev Day Kyiv 2019"Developing with .NET Core on AWS", Martin Beeby, AWS Dev Day Kyiv 2019
"Developing with .NET Core on AWS", Martin Beeby, AWS Dev Day Kyiv 2019
Provectus
 
"How to build real-time backends", Martin Beeby, AWS Dev Day Kyiv 2019
"How to build real-time backends", Martin Beeby, AWS Dev Day Kyiv 2019"How to build real-time backends", Martin Beeby, AWS Dev Day Kyiv 2019
"How to build real-time backends", Martin Beeby, AWS Dev Day Kyiv 2019
Provectus
 
"Integrate your front end apps with serverless backend in the cloud", Sebasti...
"Integrate your front end apps with serverless backend in the cloud", Sebasti..."Integrate your front end apps with serverless backend in the cloud", Sebasti...
"Integrate your front end apps with serverless backend in the cloud", Sebasti...
Provectus
 
"Scaling ML from 0 to millions of users", Julien Simon, AWS Dev Day Kyiv 2019
"Scaling ML from 0 to millions of users", Julien Simon, AWS Dev Day Kyiv 2019"Scaling ML from 0 to millions of users", Julien Simon, AWS Dev Day Kyiv 2019
"Scaling ML from 0 to millions of users", Julien Simon, AWS Dev Day Kyiv 2019
Provectus
 
How to implement authorization in your backend with AWS IAM
How to implement authorization in your backend with AWS IAMHow to implement authorization in your backend with AWS IAM
How to implement authorization in your backend with AWS IAM
Provectus
 

More from Provectus (20)

Choosing the right IDP Solution
Choosing the right IDP SolutionChoosing the right IDP Solution
Choosing the right IDP Solution
 
Intelligent Document Processing in Healthcare. Choosing the Right Solutions.
Intelligent Document Processing in Healthcare. Choosing the Right Solutions.Intelligent Document Processing in Healthcare. Choosing the Right Solutions.
Intelligent Document Processing in Healthcare. Choosing the Right Solutions.
 
Choosing the Right Document Processing Solution for Healthcare Organizations
Choosing the Right Document Processing Solution for Healthcare OrganizationsChoosing the Right Document Processing Solution for Healthcare Organizations
Choosing the Right Document Processing Solution for Healthcare Organizations
 
MLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionMLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in Production
 
AI Stack on AWS: Amazon SageMaker and Beyond
AI Stack on AWS: Amazon SageMaker and BeyondAI Stack on AWS: Amazon SageMaker and Beyond
AI Stack on AWS: Amazon SageMaker and Beyond
 
Feature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine LearningFeature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine Learning
 
MLOps and Reproducible ML on AWS with Kubeflow and SageMaker
MLOps and Reproducible ML on AWS with Kubeflow and SageMakerMLOps and Reproducible ML on AWS with Kubeflow and SageMaker
MLOps and Reproducible ML on AWS with Kubeflow and SageMaker
 
Cost Optimization for Apache Hadoop/Spark Workloads with Amazon EMR
Cost Optimization for Apache Hadoop/Spark Workloads with Amazon EMRCost Optimization for Apache Hadoop/Spark Workloads with Amazon EMR
Cost Optimization for Apache Hadoop/Spark Workloads with Amazon EMR
 
ODSC webinar "Kubeflow, MLFlow and Beyond — augmenting ML delivery" Stepan Pu...
ODSC webinar "Kubeflow, MLFlow and Beyond — augmenting ML delivery" Stepan Pu...ODSC webinar "Kubeflow, MLFlow and Beyond — augmenting ML delivery" Stepan Pu...
ODSC webinar "Kubeflow, MLFlow and Beyond — augmenting ML delivery" Stepan Pu...
 
"Building a Modern Data platform in the Cloud", Alex Casalboni, AWS Dev Day K...
"Building a Modern Data platform in the Cloud", Alex Casalboni, AWS Dev Day K..."Building a Modern Data platform in the Cloud", Alex Casalboni, AWS Dev Day K...
"Building a Modern Data platform in the Cloud", Alex Casalboni, AWS Dev Day K...
 
"How to build a global serverless service", Alex Casalboni, AWS Dev Day Kyiv ...
"How to build a global serverless service", Alex Casalboni, AWS Dev Day Kyiv ..."How to build a global serverless service", Alex Casalboni, AWS Dev Day Kyiv ...
"How to build a global serverless service", Alex Casalboni, AWS Dev Day Kyiv ...
 
"Automating AWS Infrastructure with PowerShell", Martin Beeby, AWS Dev Day Ky...
"Automating AWS Infrastructure with PowerShell", Martin Beeby, AWS Dev Day Ky..."Automating AWS Infrastructure with PowerShell", Martin Beeby, AWS Dev Day Ky...
"Automating AWS Infrastructure with PowerShell", Martin Beeby, AWS Dev Day Ky...
 
"Analyzing your web and application logs", Javier Ramirez, AWS Dev Day Kyiv 2...
"Analyzing your web and application logs", Javier Ramirez, AWS Dev Day Kyiv 2..."Analyzing your web and application logs", Javier Ramirez, AWS Dev Day Kyiv 2...
"Analyzing your web and application logs", Javier Ramirez, AWS Dev Day Kyiv 2...
 
"Resiliency and Availability Design Patterns for the Cloud", Sebastien Storma...
"Resiliency and Availability Design Patterns for the Cloud", Sebastien Storma..."Resiliency and Availability Design Patterns for the Cloud", Sebastien Storma...
"Resiliency and Availability Design Patterns for the Cloud", Sebastien Storma...
 
"Architecting SaaS solutions on AWS", Oleksandr Mykhalchuk, AWS Dev Day Kyiv ...
"Architecting SaaS solutions on AWS", Oleksandr Mykhalchuk, AWS Dev Day Kyiv ..."Architecting SaaS solutions on AWS", Oleksandr Mykhalchuk, AWS Dev Day Kyiv ...
"Architecting SaaS solutions on AWS", Oleksandr Mykhalchuk, AWS Dev Day Kyiv ...
 
"Developing with .NET Core on AWS", Martin Beeby, AWS Dev Day Kyiv 2019
"Developing with .NET Core on AWS", Martin Beeby, AWS Dev Day Kyiv 2019"Developing with .NET Core on AWS", Martin Beeby, AWS Dev Day Kyiv 2019
"Developing with .NET Core on AWS", Martin Beeby, AWS Dev Day Kyiv 2019
 
"How to build real-time backends", Martin Beeby, AWS Dev Day Kyiv 2019
"How to build real-time backends", Martin Beeby, AWS Dev Day Kyiv 2019"How to build real-time backends", Martin Beeby, AWS Dev Day Kyiv 2019
"How to build real-time backends", Martin Beeby, AWS Dev Day Kyiv 2019
 
"Integrate your front end apps with serverless backend in the cloud", Sebasti...
"Integrate your front end apps with serverless backend in the cloud", Sebasti..."Integrate your front end apps with serverless backend in the cloud", Sebasti...
"Integrate your front end apps with serverless backend in the cloud", Sebasti...
 
"Scaling ML from 0 to millions of users", Julien Simon, AWS Dev Day Kyiv 2019
"Scaling ML from 0 to millions of users", Julien Simon, AWS Dev Day Kyiv 2019"Scaling ML from 0 to millions of users", Julien Simon, AWS Dev Day Kyiv 2019
"Scaling ML from 0 to millions of users", Julien Simon, AWS Dev Day Kyiv 2019
 
How to implement authorization in your backend with AWS IAM
How to implement authorization in your backend with AWS IAMHow to implement authorization in your backend with AWS IAM
How to implement authorization in your backend with AWS IAM
 

Recently uploaded

platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
muralinath2
 
Mammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also FunctionsMammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also Functions
YOGESH DOGRA
 
Cancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayCancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate Pathway
AADYARAJPANDEY1
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
silvermistyshot
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Sérgio Sacani
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
AlaminAfendy1
 
insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
anitaento25
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
Nistarini College, Purulia (W.B) India
 
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
ssuserbfdca9
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
Lokesh Patil
 
Hemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptxHemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptx
muralinath2
 
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
Scintica Instrumentation
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
Areesha Ahmad
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
IqrimaNabilatulhusni
 
Structural Classification Of Protein (SCOP)
Structural Classification Of Protein  (SCOP)Structural Classification Of Protein  (SCOP)
Structural Classification Of Protein (SCOP)
aishnasrivastava
 
platelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptxplatelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptx
muralinath2
 
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Sérgio Sacani
 
role of pramana in research.pptx in science
role of pramana in research.pptx in sciencerole of pramana in research.pptx in science
role of pramana in research.pptx in science
sonaliswain16
 
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SELF-EXPLANATORY
 
GBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture MediaGBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture Media
Areesha Ahmad
 

Recently uploaded (20)

platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
 
Mammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also FunctionsMammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also Functions
 
Cancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayCancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate Pathway
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
 
insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
 
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
 
Hemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptxHemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptx
 
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
 
Structural Classification Of Protein (SCOP)
Structural Classification Of Protein  (SCOP)Structural Classification Of Protein  (SCOP)
Structural Classification Of Protein (SCOP)
 
platelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptxplatelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptx
 
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
 
role of pramana in research.pptx in science
role of pramana in research.pptx in sciencerole of pramana in research.pptx in science
role of pramana in research.pptx in science
 
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
 
GBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture MediaGBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture Media
 

Data Summer Conf 2018, “Multi-touch Attribution: Key challenge around designing a solution (ENG)” — Sri Sri, Sr. Data Scientist at Spotify

  • 1. Multi-touch Attribution Sri Sri Perangur Key challenge around designing a solution
  • 2. ABOUT SRI² Sr. Data Scientist at Spotify Sr. Data Scientist at Skyscanner Data Scientist at State Developer/Tech analyst at RBC Capital Markets Research Associate at BAE Systems Applied Intelligence Unit
  • 4. Multi-touch Attribution The challenge is to identify the distribution of credit amongst multiple players. The problem is seen in marketing and product space amongst others. Training Data The players’ credit is not easy to observe directly. Makes it challenging to define the label/training data. Need labeled data to enable comparison of the many solution options available these days. Simulated Training data The expert knowledge & known data trails are captured to define the Simulated Marketing Environment . This enables scoring and stress testing with a range of training data sets. SUMMARY
  • 6.
  • 8. WHERE DO WE SEE THIS OCCUR IN PRODUCTS ?
  • 9. Social Networks Search Engine Marketing Google Display Network Organic Direct Book through Time £ £ WHERE DO WE SEE THIS OCCUR IN MARKETING ? Day 0 Day 1 Day 7 Day 15 Day 16 www.skyscanner.net
  • 10. Social Networks Search Engine Marketing Google Display Network Organic Direct Book through Time £ £ WHERE DO WE SEE THIS OCCUR IN MARKETING ? Day 0 Day 1 Day 7 Day 15 Day 16 www.skyscanner.net 30% 20% 10% 45% 0% Hypothetically
  • 11. Social Networks Search Engine Marketing Google Display Network Organic Direct Book through Time WHERE DO WE SEE THIS OCCUR IN MARKETING ? Day 0 Day 1 Day 7 Day 15 Day 16 30% 20% 10% 45% 0% Hypothetically
  • 12. Designing a solution ...a Data Driven solution 2.
  • 13. MOVING AWAY FROM THE PARTIAL VIEW Even Time decay Custom rules
  • 17. WHICH IS THE BEST METHOD ?
  • 18. Methods are ready to train ...Where is the training data ? Training data3. 1010101010100101010101 0101001010100101010101 0101010100101010100101 1010101010100101010101 0101001010100101010101 0101010100101010100101 1010101011010101010100 1010101010101001010100 1010101010101010100101 0101001011010101010100 1010101010101001010100 1010101010101010100101 0101001011010101010100 1010101010101001010100 1010101010101010100101 0101001010100101010101 0101001010100101010101 0101010100101010100101
  • 19. Which is the best method for MTA Identifying the best3.
  • 20. TYPICAL MACHINE LEARNING PROBLEM ● Elevation ● Price per sqft ● Price ● Square feet ● Year built ● Bathrooms
  • 21. MULTI-TOUCH ATTRIBUTION PROBLEM ● No of touchpoints user sees ● Marketing channel distribution traits ● Sequence of occurrence ● And so on.. Channel Value Social Network ? Search Engine Marketing ? Google Display Network ? Organic ? Direct ? X
  • 22. aka Latent variables Hidden in the real world
  • 23. Concepts we can’t observe directly … LATENT VARIABLES but we can infer with byproducts...
  • 24. Social Networks Search Engine Marketing Google Display Network Organic Direct Book through Time EXPERT OPINION MATTERS Day 0 Day 1 Day 7 Day 15 Day 16 30% 20% 10% 45% 0% Hypothetically
  • 25. MULTI-TOUCH ATTRIBUTION PROBLEM ● No of touchpoints user sees ● Marketing channel distribution traits ● Sequence of occurrence ● And so on.. ?X
  • 26. MULTI-TOUCH ATTRIBUTION PROBLEM X ● No of touchpoints user sees ● Marketing channel distribution traits ● Sequence of occurrence ● And so on..
  • 27. BUT CAN AN EXPERT GAUGE MARGINS EASILY ?
  • 28. Simulated training data to enable classifier training
  • 30. CAPTURING EXPERT OPINION Aim is to capture the educated expectations of experts in the field, regarding the key perspectives . To name a few: ● Marketing channel value ● Marketing channel after effect In a way that we can fluctuate the key variables
  • 31. CAPTURING EXPERT OPINION Aim is to capture the educated expectations of experts in the field, regarding the key perspectives . To name a few: ● Marketing channel value ● Marketing channel after effect In a way that we can fluctuate the key variables Channel Value (hypo thetic ally) Social Network 30% Search Engine Marketing 20% Google Display Network 10% Organic 45% Direct 0%
  • 32. CAPTURING EXPERT OPINION Aim is to capture the educated expectations of experts in the field, regarding the key perspectives . To name a few: ● Marketing channel value ● Marketing channel after effect In a way that we can fluctuate the key variables Impact over weeks for channel X
  • 33. CAPTURING KNOWN DATA TRAITS Capturing known traits of channel behaviour seen in company data ● Distribution of users over marketing channels ● Distribution of various marketing touch points over all touchpoint ● Distribution of the marketing touch points over our users’ journey ● Distribution of sessions/conversions/conversion rate per channel
  • 34. CAPTURING KNOWN DATA TRAITS Capturing known traits of channel behaviour seen in company data ● Distribution of users over marketing channels ● Distribution of various marketing touch points over all touchpoint ● Distribution of the marketing touch points over our users’ journey ● Distribution of sessions/conversions/conversion rate per channel
  • 35. CAPTURING KNOWN DATA TRAITS Capturing known traits of channel behaviour seen in company data ● Distribution of users over marketing channels ● Distribution of various marketing touch points over all touchpoint ● Distribution of the marketing touch points over our users’ journey ● Distribution of sessions/conversions/conversion rate per channel
  • 36. CAPTURING KNOWN DATA TRAITS Capturing known traits of channel behaviour seen in company data ● Distribution of users over marketing channels ● Distribution of various marketing touch points over all touchpoint ● Distribution of the marketing touch points over our users’ journey ● Distribution of sessions/conversions/conversion rate per channel Distribution of seeing channel X our user touchpoint position
  • 37. CAPTURING KNOWN DATA TRAITS Capturing known traits of channel behaviour seen in company data ● Distribution of users over marketing channels ● Distribution of various marketing touch points over all touchpoint ● Distribution of the marketing touch points over our users’ journey ● Distribution of sessions/conversions/conversion rate per channel
  • 39. Results Best method for Multi-touch attribution For Skyscanner
  • 41. SCORING AGAINST SIMULATED GOAL 2.69% Current Industries popular method Baseline
  • 42. Stress testing models How durable are these solution in other scenarios ?
  • 43. WHAT IF... The Channel estimates are wrong ? Estimated channel value as £ X but should be … • £ X*100 • £ X * 0.01 • ….
  • 44. STRESS TESTING 1. Fluctuate probability of channel exposure 2. Fluctuate probability of purchase 3. Fluctuate relative touchpoint weights ( ex: marketing channel after effect) 90%
  • 45. STRESS TESTING RESULTS The Method Z (i.e. the 90% accurate one) is the one in Green. Method Z is the best even at adapting to change when tested in 20 scenarios. Underestimate Overestimate
  • 47. SIMULATION STACK Channel Value (hypotheticall y) Social Network 30% Search Engine Marketing 20% Google Display Network 10% Organic 45% Direct 0%
  • 48. PRODUCTION APPLICATION STACK Channel Value Social Network a% Search Engine Marketing b% Google Display Network c% Organic d% Direct e%
  • 49. CONCLUSION ● Understand accuracy of a approach ● Comparison of different approaches ● Tuning of different solutions ● Stress testing solutions
  • 50. Thank you Cheers to you all & my team and community that have made this possible Segment IO Growth images Content: Pretty images: MC Attribution Blog Shapley value Attribution blog Photos from Unsplash ML Gif Pretty Icons from Flaticons Food for thought - Prof. Tamara Broderick from MIT has been working o n enabling estimates of latent variables through Bayesian sampling methods. Skyscanner MTA research poster Research icons Movie image Oscar pic
  • 51. Thank you Cheers to you all & my team and community that have made this possible
  • 52. Any questions? You can find me at: » @perangur » www.perangur.com » Slides: http://bit.ly/MTASummerConf