SlideShare a Scribd company logo
Using Simple Machine
Learning Models in a
New Ads Manager
Ruth Garcia
The Data Science Summit – Sept 12, 2018
London, UK
Online Advertising for Mobile
$5.7 $5.4 $6.2
$7.7 $8.1 $8.6 $8.9 $9.9
$8.7 $9.4
$0.7
$1.6
$2.8
$4.4
$8.2
$11.4
$5.7 $5.4
$6.2
$7.7
$8.8
$10.3
$11.7
$14.3
$16.9
$20.8
2008 2009 2010 2011 2012 2013 2014 2015 2016 2017
Mobile
Non-Mobile
15.4%
Overall CAGR
76.8%
Mobile CAGR
– Source: IAB/PwC Internet Ad Revenue Report, HY 2017
– * CAGR: Compound Annual Growth Rate
Find balance: Increase our revenue and engage users
Revenue Engaged user
Delivery of Ads
Data
ownership
?
Technical
difficulties
Black box
algorithms
Skyscanner Ads
Manager
Solution
External Ads Managers
Click prediction algorithm
Goal: Click prediction model
Test: Is it better than random?
Rankingalgorithm
Candidates
Cloud services and tools at Skyscanner
Languages
AWS services
Batch
S3
Overview of an Advertising System
Expectation vs. reality
Reality
Not flexible but fast and
easier to implement.
Tensorfiow
• Optimization technique
• Embeddings
• Crossed columns
• Hashing
Features:
• User history,
• User features,
• Route features ,
• Ad features with,
colors, text
Challenges: Which model to use?
Model Possibilities (easy to read in node.js):
• Logistic regression
• Random Forest : gets lost
• Neural networks: too slow hard to put it in json
Solvers:
• Logistic regression: Liblinear, sag
• Train data in batches
Train all data at once
SGDClassifier
Saves memory
Gridsearch for
hyperparameteres
Challenges: Categorical values
Pros:
• No collisions
• Inverse mapping
Creatives
C1
C2
C3
C1 C2 C3
1 0 0
0 1 0
0 0 1
Creatives
C1
C2
C3
C4
C1 C2 C3 C4
1 0 0 0
0 1 0 0
0 0 1 0
0 0 0 1
Cons:
• Need to know all values in advance
• Not good for online learning
• Keep dictionary in prod
One hot encoding:
Challenges: Categorical values
id features
123 creative1,
advertiser2,mobile, etc.
321 creative2,
advertiser4,mobile, etc.
id Feat_1 Feat_2 Feat_3 …. Feat_k
123 0.1 0 1 …. 0
321 0.5 0 0 …. 1
Hashing Trick: map data of arbitrary sizes to data of a fixed size
Pros:
• Memory efficient
• Online learning
• No dictionary
Cons:
• No inverse mapping
• Hash collisions
Machine Learning Performance: offline
Precision at 1: based on
target groups.
Mean Reciprocal Rank:
order of ranked ads
AUC: if caring about ranking
Log-Loss: if caring about the
value of CTR
Other metrics :
Optimizing evaluation metric
Updating model based on different
sampling methods and training days.
2 3 4 5 6 7
Histogram of training days
6/ 4/ 18 6/ 11/ 18 6/ 18/ 18 6/ 25/ 18 7/ 2/ 18 7/ 9/ 18 7/ 16/ 18 7/ 23/ 18 7/ 30/ 18 8/ 6/ 18 8/ 13/ 18 8/ 20/ 18 8/ 27/ 18 9/ 3/ 18
AUC over time
Best AUC Worst AUC
Satisficing metric: Precision at 1
Choose best AUC conditioned of precision
at 1 better than random
6/ 4/ 18 6/ 11/ 18 6/ 18/ 18 6/ 25/ 18 7/ 2/ 18 7/ 9/ 18 7/ 16/ 18 7/ 23/ 18 7/ 30/ 18 8/ 6/ 18 8/ 13/ 18 8/ 20/ 18 8/ 27/ 18 9/ 3/ 18
Precision at 1: Satisficing metric
pr ecisi on_at _1 r andom _pr ecisi on_at _1
The road ahead: Balancing exploitation and exploration
Choose ad based on
ONLY CTR
Choose ad based on
OTHER criteria
– Most common
approaches:
– ! − #$%%&'
– ! − &%($%)*+,#
Learnings
1. Start lean to prove the value of your Machine Learning project
2. Speak up front since the beginning about the benefits and requirements of
using ML in the product (talk about time and costs)
3. If you have problems with dimensionality, explore different ways of optimizing
your resources, e.g., mini batch, hashing trick.
4, Advertising systems are very dynamic so be aware how often you need to
update the model.
Eng.
Thank you
@ruthygarcia
Questions?

More Related Content

Similar to Using Simple Machine Learning Models in a New Ads Manager

Study programmatic and data in Spain Datmean
Study programmatic and data in Spain DatmeanStudy programmatic and data in Spain Datmean
Study programmatic and data in Spain Datmean
Datmean
 
Mastering CRM Analytics
Mastering CRM AnalyticsMastering CRM Analytics
Mastering CRM Analytics
Margaret Ngai (she/her)
 
Maximizing Your User Acquisition Campaigns | Katya Kornilova
Maximizing Your User Acquisition Campaigns | Katya KornilovaMaximizing Your User Acquisition Campaigns | Katya Kornilova
Maximizing Your User Acquisition Campaigns | Katya Kornilova
Jessica Tams
 
A Brief Introduction of Real-time Bidding Display Advertising and Evaluation ...
A Brief Introduction of Real-time Bidding Display Advertising and Evaluation ...A Brief Introduction of Real-time Bidding Display Advertising and Evaluation ...
A Brief Introduction of Real-time Bidding Display Advertising and Evaluation ...Jun Wang
 
Microsoft Advertising Growth Solutions Bootcamp - Afternoon Session
Microsoft Advertising Growth Solutions Bootcamp - Afternoon SessionMicrosoft Advertising Growth Solutions Bootcamp - Afternoon Session
Microsoft Advertising Growth Solutions Bootcamp - Afternoon Session
MSFTAdvertising
 
How AI is transforming Pricing_EPP Monetized_July2019
How AI is transforming Pricing_EPP Monetized_July2019How AI is transforming Pricing_EPP Monetized_July2019
How AI is transforming Pricing_EPP Monetized_July2019
Felix Krohn
 
Mixx2016_Alexei S. Archinov
Mixx2016_Alexei S. ArchinovMixx2016_Alexei S. Archinov
Mixx2016_Alexei S. Archinov
iabrussiaprez
 
ADTECH&DATA - Tendências em programático - Mobile e Cross Device - Sell side
ADTECH&DATA - Tendências em programático - Mobile e Cross Device - Sell sideADTECH&DATA - Tendências em programático - Mobile e Cross Device - Sell side
ADTECH&DATA - Tendências em programático - Mobile e Cross Device - Sell side
IAB Brasil
 
Qmd 1903 wast_googad
Qmd 1903 wast_googadQmd 1903 wast_googad
Qmd 1903 wast_googad
EvoLife.bg
 
Fighting the bullying culture in search - Adthena/Hero Conf
Fighting the bullying culture in search - Adthena/Hero ConfFighting the bullying culture in search - Adthena/Hero Conf
Fighting the bullying culture in search - Adthena/Hero Conf
Lorna Rose Gill
 
Epiphany Summer Conference 2016
Epiphany Summer Conference 2016 Epiphany Summer Conference 2016
Epiphany Summer Conference 2016
Epiphany
 
Epiphany Summer Conference 2016 London
Epiphany Summer Conference 2016 LondonEpiphany Summer Conference 2016 London
Epiphany Summer Conference 2016 London
Epiphany
 
BRINGING A WHOLE MARKET VIEW PERSPECTIVE INTO YOUR SEARCH STRATEGY
BRINGING A WHOLE MARKET VIEW PERSPECTIVE INTO YOUR SEARCH STRATEGYBRINGING A WHOLE MARKET VIEW PERSPECTIVE INTO YOUR SEARCH STRATEGY
BRINGING A WHOLE MARKET VIEW PERSPECTIVE INTO YOUR SEARCH STRATEGY
Tinuiti
 
Disruptors in Digital Marketing - Ken Vandre
Disruptors in Digital Marketing - Ken VandreDisruptors in Digital Marketing - Ken Vandre
Disruptors in Digital Marketing - Ken Vandre
Ken Vandre
 
Second Hand Cars in the Philippines: A Buyer's Guide
Second Hand Cars in the Philippines: A Buyer's GuideSecond Hand Cars in the Philippines: A Buyer's Guide
Second Hand Cars in the Philippines: A Buyer's Guide
For The Women Foundation
 
Big data: Bringing competition policy to the digital era – VARIAN – November ...
Big data: Bringing competition policy to the digital era – VARIAN – November ...Big data: Bringing competition policy to the digital era – VARIAN – November ...
Big data: Bringing competition policy to the digital era – VARIAN – November ...
OECD Directorate for Financial and Enterprise Affairs
 
Why A ‘Channel Agnostic’ Approach To Search Can Deliver Greater ROI
Why A ‘Channel Agnostic’ Approach To Search Can Deliver Greater ROIWhy A ‘Channel Agnostic’ Approach To Search Can Deliver Greater ROI
Why A ‘Channel Agnostic’ Approach To Search Can Deliver Greater ROI
Smart Insights
 
How to use Online Marketing Technology to Improve Campaign Performance - Lowe...
How to use Online Marketing Technology to Improve Campaign Performance - Lowe...How to use Online Marketing Technology to Improve Campaign Performance - Lowe...
How to use Online Marketing Technology to Improve Campaign Performance - Lowe...
Online Marketing Summit
 
Turning PPC Auction Insights Into Actionable Data - 2017 MnSearch Summit
Turning PPC Auction Insights Into Actionable Data - 2017 MnSearch SummitTurning PPC Auction Insights Into Actionable Data - 2017 MnSearch Summit
Turning PPC Auction Insights Into Actionable Data - 2017 MnSearch Summit
MnSearch, The Minnesota Search Engine Marketing Association
 
Maddie Cary - MNSearch Summit 2017 - Turning PPC Auction Insights Into Action...
Maddie Cary - MNSearch Summit 2017 - Turning PPC Auction Insights Into Action...Maddie Cary - MNSearch Summit 2017 - Turning PPC Auction Insights Into Action...
Maddie Cary - MNSearch Summit 2017 - Turning PPC Auction Insights Into Action...
Maddie Cary Deuel
 

Similar to Using Simple Machine Learning Models in a New Ads Manager (20)

Study programmatic and data in Spain Datmean
Study programmatic and data in Spain DatmeanStudy programmatic and data in Spain Datmean
Study programmatic and data in Spain Datmean
 
Mastering CRM Analytics
Mastering CRM AnalyticsMastering CRM Analytics
Mastering CRM Analytics
 
Maximizing Your User Acquisition Campaigns | Katya Kornilova
Maximizing Your User Acquisition Campaigns | Katya KornilovaMaximizing Your User Acquisition Campaigns | Katya Kornilova
Maximizing Your User Acquisition Campaigns | Katya Kornilova
 
A Brief Introduction of Real-time Bidding Display Advertising and Evaluation ...
A Brief Introduction of Real-time Bidding Display Advertising and Evaluation ...A Brief Introduction of Real-time Bidding Display Advertising and Evaluation ...
A Brief Introduction of Real-time Bidding Display Advertising and Evaluation ...
 
Microsoft Advertising Growth Solutions Bootcamp - Afternoon Session
Microsoft Advertising Growth Solutions Bootcamp - Afternoon SessionMicrosoft Advertising Growth Solutions Bootcamp - Afternoon Session
Microsoft Advertising Growth Solutions Bootcamp - Afternoon Session
 
How AI is transforming Pricing_EPP Monetized_July2019
How AI is transforming Pricing_EPP Monetized_July2019How AI is transforming Pricing_EPP Monetized_July2019
How AI is transforming Pricing_EPP Monetized_July2019
 
Mixx2016_Alexei S. Archinov
Mixx2016_Alexei S. ArchinovMixx2016_Alexei S. Archinov
Mixx2016_Alexei S. Archinov
 
ADTECH&DATA - Tendências em programático - Mobile e Cross Device - Sell side
ADTECH&DATA - Tendências em programático - Mobile e Cross Device - Sell sideADTECH&DATA - Tendências em programático - Mobile e Cross Device - Sell side
ADTECH&DATA - Tendências em programático - Mobile e Cross Device - Sell side
 
Qmd 1903 wast_googad
Qmd 1903 wast_googadQmd 1903 wast_googad
Qmd 1903 wast_googad
 
Fighting the bullying culture in search - Adthena/Hero Conf
Fighting the bullying culture in search - Adthena/Hero ConfFighting the bullying culture in search - Adthena/Hero Conf
Fighting the bullying culture in search - Adthena/Hero Conf
 
Epiphany Summer Conference 2016
Epiphany Summer Conference 2016 Epiphany Summer Conference 2016
Epiphany Summer Conference 2016
 
Epiphany Summer Conference 2016 London
Epiphany Summer Conference 2016 LondonEpiphany Summer Conference 2016 London
Epiphany Summer Conference 2016 London
 
BRINGING A WHOLE MARKET VIEW PERSPECTIVE INTO YOUR SEARCH STRATEGY
BRINGING A WHOLE MARKET VIEW PERSPECTIVE INTO YOUR SEARCH STRATEGYBRINGING A WHOLE MARKET VIEW PERSPECTIVE INTO YOUR SEARCH STRATEGY
BRINGING A WHOLE MARKET VIEW PERSPECTIVE INTO YOUR SEARCH STRATEGY
 
Disruptors in Digital Marketing - Ken Vandre
Disruptors in Digital Marketing - Ken VandreDisruptors in Digital Marketing - Ken Vandre
Disruptors in Digital Marketing - Ken Vandre
 
Second Hand Cars in the Philippines: A Buyer's Guide
Second Hand Cars in the Philippines: A Buyer's GuideSecond Hand Cars in the Philippines: A Buyer's Guide
Second Hand Cars in the Philippines: A Buyer's Guide
 
Big data: Bringing competition policy to the digital era – VARIAN – November ...
Big data: Bringing competition policy to the digital era – VARIAN – November ...Big data: Bringing competition policy to the digital era – VARIAN – November ...
Big data: Bringing competition policy to the digital era – VARIAN – November ...
 
Why A ‘Channel Agnostic’ Approach To Search Can Deliver Greater ROI
Why A ‘Channel Agnostic’ Approach To Search Can Deliver Greater ROIWhy A ‘Channel Agnostic’ Approach To Search Can Deliver Greater ROI
Why A ‘Channel Agnostic’ Approach To Search Can Deliver Greater ROI
 
How to use Online Marketing Technology to Improve Campaign Performance - Lowe...
How to use Online Marketing Technology to Improve Campaign Performance - Lowe...How to use Online Marketing Technology to Improve Campaign Performance - Lowe...
How to use Online Marketing Technology to Improve Campaign Performance - Lowe...
 
Turning PPC Auction Insights Into Actionable Data - 2017 MnSearch Summit
Turning PPC Auction Insights Into Actionable Data - 2017 MnSearch SummitTurning PPC Auction Insights Into Actionable Data - 2017 MnSearch Summit
Turning PPC Auction Insights Into Actionable Data - 2017 MnSearch Summit
 
Maddie Cary - MNSearch Summit 2017 - Turning PPC Auction Insights Into Action...
Maddie Cary - MNSearch Summit 2017 - Turning PPC Auction Insights Into Action...Maddie Cary - MNSearch Summit 2017 - Turning PPC Auction Insights Into Action...
Maddie Cary - MNSearch Summit 2017 - Turning PPC Auction Insights Into Action...
 

More from Ruth Garcia Gavilanes

Using Machine Learning in the delivery of ads
Using Machine Learning in the delivery of adsUsing Machine Learning in the delivery of ads
Using Machine Learning in the delivery of ads
Ruth Garcia Gavilanes
 
Assessing Online Ads Beyond Only Clicks
Assessing Online Ads Beyond Only ClicksAssessing Online Ads Beyond Only Clicks
Assessing Online Ads Beyond Only Clicks
Ruth Garcia Gavilanes
 
An Analysis of Human-Generated Friendship Recommendations
An Analysis of Human-Generated Friendship Recommendations An Analysis of Human-Generated Friendship Recommendations
An Analysis of Human-Generated Friendship Recommendations
Ruth Garcia Gavilanes
 
Discovering Culture in Social Media and a Brief Case of Collective Memory
Discovering Culture in Social Media and a Brief Case of Collective MemoryDiscovering Culture in Social Media and a Brief Case of Collective Memory
Discovering Culture in Social Media and a Brief Case of Collective Memory
Ruth Garcia Gavilanes
 
Language, Twitter and Academic Conferences
Language,  Twitter  and Academic Conferences Language,  Twitter  and Academic Conferences
Language, Twitter and Academic Conferences
Ruth Garcia Gavilanes
 
USER BEHAVIOR IN MICROBLOGS WITH A CULTURAL EMPHASIS
USER BEHAVIOR IN MICROBLOGS WITH A CULTURAL EMPHASISUSER BEHAVIOR IN MICROBLOGS WITH A CULTURAL EMPHASIS
USER BEHAVIOR IN MICROBLOGS WITH A CULTURAL EMPHASIS
Ruth Garcia Gavilanes
 
Who are my Audiences? Evolution of Target Audiences in Microblogs
Who are my Audiences? Evolution of Target Audiences in MicroblogsWho are my Audiences? Evolution of Target Audiences in Microblogs
Who are my Audiences? Evolution of Target Audiences in Microblogs
Ruth Garcia Gavilanes
 
Follow My Friends This Friday! An Analysis of Human-generated Friendship Reco...
Follow My Friends This Friday! An Analysis of Human-generated Friendship Reco...Follow My Friends This Friday! An Analysis of Human-generated Friendship Reco...
Follow My Friends This Friday! An Analysis of Human-generated Friendship Reco...
Ruth Garcia Gavilanes
 
Twitter: Time, Individualism and Power
Twitter: Time, Individualism and PowerTwitter: Time, Individualism and Power
Twitter: Time, Individualism and Power
Ruth Garcia Gavilanes
 
Cikm2011 doallbirdstweetthesame
Cikm2011 doallbirdstweetthesameCikm2011 doallbirdstweetthesame
Cikm2011 doallbirdstweetthesame
Ruth Garcia Gavilanes
 

More from Ruth Garcia Gavilanes (10)

Using Machine Learning in the delivery of ads
Using Machine Learning in the delivery of adsUsing Machine Learning in the delivery of ads
Using Machine Learning in the delivery of ads
 
Assessing Online Ads Beyond Only Clicks
Assessing Online Ads Beyond Only ClicksAssessing Online Ads Beyond Only Clicks
Assessing Online Ads Beyond Only Clicks
 
An Analysis of Human-Generated Friendship Recommendations
An Analysis of Human-Generated Friendship Recommendations An Analysis of Human-Generated Friendship Recommendations
An Analysis of Human-Generated Friendship Recommendations
 
Discovering Culture in Social Media and a Brief Case of Collective Memory
Discovering Culture in Social Media and a Brief Case of Collective MemoryDiscovering Culture in Social Media and a Brief Case of Collective Memory
Discovering Culture in Social Media and a Brief Case of Collective Memory
 
Language, Twitter and Academic Conferences
Language,  Twitter  and Academic Conferences Language,  Twitter  and Academic Conferences
Language, Twitter and Academic Conferences
 
USER BEHAVIOR IN MICROBLOGS WITH A CULTURAL EMPHASIS
USER BEHAVIOR IN MICROBLOGS WITH A CULTURAL EMPHASISUSER BEHAVIOR IN MICROBLOGS WITH A CULTURAL EMPHASIS
USER BEHAVIOR IN MICROBLOGS WITH A CULTURAL EMPHASIS
 
Who are my Audiences? Evolution of Target Audiences in Microblogs
Who are my Audiences? Evolution of Target Audiences in MicroblogsWho are my Audiences? Evolution of Target Audiences in Microblogs
Who are my Audiences? Evolution of Target Audiences in Microblogs
 
Follow My Friends This Friday! An Analysis of Human-generated Friendship Reco...
Follow My Friends This Friday! An Analysis of Human-generated Friendship Reco...Follow My Friends This Friday! An Analysis of Human-generated Friendship Reco...
Follow My Friends This Friday! An Analysis of Human-generated Friendship Reco...
 
Twitter: Time, Individualism and Power
Twitter: Time, Individualism and PowerTwitter: Time, Individualism and Power
Twitter: Time, Individualism and Power
 
Cikm2011 doallbirdstweetthesame
Cikm2011 doallbirdstweetthesameCikm2011 doallbirdstweetthesame
Cikm2011 doallbirdstweetthesame
 

Recently uploaded

一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
Oppotus
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
dwreak4tg
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
2023240532
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
u86oixdj
 

Recently uploaded (20)

一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
 

Using Simple Machine Learning Models in a New Ads Manager

  • 1. Using Simple Machine Learning Models in a New Ads Manager Ruth Garcia The Data Science Summit – Sept 12, 2018 London, UK
  • 2. Online Advertising for Mobile $5.7 $5.4 $6.2 $7.7 $8.1 $8.6 $8.9 $9.9 $8.7 $9.4 $0.7 $1.6 $2.8 $4.4 $8.2 $11.4 $5.7 $5.4 $6.2 $7.7 $8.8 $10.3 $11.7 $14.3 $16.9 $20.8 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 Mobile Non-Mobile 15.4% Overall CAGR 76.8% Mobile CAGR – Source: IAB/PwC Internet Ad Revenue Report, HY 2017 – * CAGR: Compound Annual Growth Rate
  • 3. Find balance: Increase our revenue and engage users Revenue Engaged user
  • 4. Delivery of Ads Data ownership ? Technical difficulties Black box algorithms Skyscanner Ads Manager Solution External Ads Managers
  • 5. Click prediction algorithm Goal: Click prediction model Test: Is it better than random? Rankingalgorithm Candidates
  • 6. Cloud services and tools at Skyscanner Languages AWS services Batch S3
  • 7. Overview of an Advertising System
  • 8. Expectation vs. reality Reality Not flexible but fast and easier to implement. Tensorfiow • Optimization technique • Embeddings • Crossed columns • Hashing Features: • User history, • User features, • Route features , • Ad features with, colors, text
  • 9. Challenges: Which model to use? Model Possibilities (easy to read in node.js): • Logistic regression • Random Forest : gets lost • Neural networks: too slow hard to put it in json Solvers: • Logistic regression: Liblinear, sag • Train data in batches Train all data at once SGDClassifier Saves memory Gridsearch for hyperparameteres
  • 10. Challenges: Categorical values Pros: • No collisions • Inverse mapping Creatives C1 C2 C3 C1 C2 C3 1 0 0 0 1 0 0 0 1 Creatives C1 C2 C3 C4 C1 C2 C3 C4 1 0 0 0 0 1 0 0 0 0 1 0 0 0 0 1 Cons: • Need to know all values in advance • Not good for online learning • Keep dictionary in prod One hot encoding:
  • 11. Challenges: Categorical values id features 123 creative1, advertiser2,mobile, etc. 321 creative2, advertiser4,mobile, etc. id Feat_1 Feat_2 Feat_3 …. Feat_k 123 0.1 0 1 …. 0 321 0.5 0 0 …. 1 Hashing Trick: map data of arbitrary sizes to data of a fixed size Pros: • Memory efficient • Online learning • No dictionary Cons: • No inverse mapping • Hash collisions
  • 12. Machine Learning Performance: offline Precision at 1: based on target groups. Mean Reciprocal Rank: order of ranked ads AUC: if caring about ranking Log-Loss: if caring about the value of CTR Other metrics :
  • 13. Optimizing evaluation metric Updating model based on different sampling methods and training days. 2 3 4 5 6 7 Histogram of training days 6/ 4/ 18 6/ 11/ 18 6/ 18/ 18 6/ 25/ 18 7/ 2/ 18 7/ 9/ 18 7/ 16/ 18 7/ 23/ 18 7/ 30/ 18 8/ 6/ 18 8/ 13/ 18 8/ 20/ 18 8/ 27/ 18 9/ 3/ 18 AUC over time Best AUC Worst AUC
  • 14. Satisficing metric: Precision at 1 Choose best AUC conditioned of precision at 1 better than random 6/ 4/ 18 6/ 11/ 18 6/ 18/ 18 6/ 25/ 18 7/ 2/ 18 7/ 9/ 18 7/ 16/ 18 7/ 23/ 18 7/ 30/ 18 8/ 6/ 18 8/ 13/ 18 8/ 20/ 18 8/ 27/ 18 9/ 3/ 18 Precision at 1: Satisficing metric pr ecisi on_at _1 r andom _pr ecisi on_at _1
  • 15. The road ahead: Balancing exploitation and exploration Choose ad based on ONLY CTR Choose ad based on OTHER criteria – Most common approaches: – ! − #$%%&' – ! − &%($%)*+,#
  • 16. Learnings 1. Start lean to prove the value of your Machine Learning project 2. Speak up front since the beginning about the benefits and requirements of using ML in the product (talk about time and costs) 3. If you have problems with dimensionality, explore different ways of optimizing your resources, e.g., mini batch, hashing trick. 4, Advertising systems are very dynamic so be aware how often you need to update the model. Eng.