SlideShare a Scribd company logo
Using Machine
Learning in the
delivery of ads
Ruth Garcia
NDR Conference – June 07, 2018
Iasi, Romania
What is Skyscanner?
Skyscanner is a leading global travel search site offering a
comprehensive and free flight search service as well as online
comparisons for hotels, car hire and now trains.
Data Science at Skyscanner
Decision Science:1. Ensuring we have the right data; insights and
information to make the most impactful, scientific decisions in every aspect
of our operations.
Building Data Products:2. Leveraging our vast wealth of data to build more
contextual, relevant products for Travellers and Travel Suppliers (our
Partners)
D.S
Eng.
Machine Learning at Skyscanner
• Destination recommendations
• Itinerary mashups
• Advertising
Recommendations
Advertising
Mashups
Ads at Skyscanner
Not always annoying ads
Advertising schemes
Brand awareness Traffic Action
Machine Learning
efforts
Advertising schemes
Brand awareness Traffic Action
Advertising Schemes
Brand awareness Traffic Action
Machine Learning
efforts
Ads at Skyscanner
Ads at Skyscanner
=
No need to worry about
how ads are delivered
?
Make sure,
ads are delivered
Delivery of Ads
Data
ownership
?
Technical
difficulties
Black box
algorithms
Skyscanner Ads
Manager
Solution
Click prediction algorithm
Click prediction algorithm
Rankingalgorithm
Candidates
Cloud services and tools at Skyscanner
Languages
AWS services
Batch
S3
Overview of an Advertising System
Expectation vs. reality
Tensorfiow
Optimization technique•
Embeddings•
Crossed columns•
Hashing•
Expectation
Flexible but with the risk of not
being fast enough.
Features:
User history,•
User features,•
Route features ,•
Ad features with,•
colors, text
Expectation vs. reality
Reality
Not flexible but fast and
easier to implement.
Tensorfiow
Optimization technique•
Embeddings•
Crossed columns•
Hashing•
Features:
User history,•
User features,•
Route features ,•
Ad features with,•
colors, text
Challenges in building the model
Goal: model, that estimates the likelihood of
whether an impression will result in a click or not.
Challenges: Which model to use?
Model Possibilities (easy to read in node.js):
• Logistic regression
• Random Forest : gets lost
• Neural networks: too slow hard to put it in json
Solvers:
• Logistic regression: Liblinear, sag
• SGDClassifier
Train all data at once
Train data in batches
Saves memory
Gridsearch for
hyperparameteres
Challenges: Categorical values
Categorical values:
One hot• encoding
One hot encoding grouping less frequent features•
Hashing trick•
One hot encoding•
• One hot encoding grouping
• Less frequent features
Creatives
C1
C2
C3
C1 C2 C3
1 0 0
0 1 0
0 0 1
Creatives
C1
C2
C3
C4
C1 C2 C3 C4
1 0 0 0
0 1 0 0
0 0 1 0
0 0 0 1
Challenges: Categorical values
id features
123 creative1,
advertiser2,mobile, etc.
321 creative2,
advertiser4,mobile, etc.
id Feat_1 Feat_2 Feat_3 …. Feat_k
123 1 0 1 …. 0
321 1 0 0 …. 1
Hashing Trick: Same hashing function in training, testing and production•
We gain
Machine Learning Performance
Precision at 1: based on
target groups.
Mean Reciprocal Rank
AUC: if caring about ranking
Log-Loss: if caring about the
value of CTR
Other metrics:
Challenges: How to monitor performance?
Updating model based on different
Sampling methods.
The road ahead: Balancing exploitation and exploration
Choose ad based on
ONLY CTR
Choose ad based on
OTHER criteria
The road even further ahead: quality
Post-click, colors, images, text,
landing page
Conclusions
1. Machine Learning can bring many benefits to the product the challenge is to prove it
2. Bringing ML into production could hard at the beginning
3. Cooperation between data scientists and engineers is crucial
D.S
Eng.
Thank you
@ruthygarcia
Questions?

More Related Content

Similar to Using Machine Learning in the delivery of ads

Track 2 Session 5_ 利用 SageMaker 深度學習容器化在廣告推播之應用
Track 2 Session 5_ 利用 SageMaker 深度學習容器化在廣告推播之應用Track 2 Session 5_ 利用 SageMaker 深度學習容器化在廣告推播之應用
Track 2 Session 5_ 利用 SageMaker 深度學習容器化在廣告推播之應用Amazon Web Services
 
Artificial Intelligence for Travel - MyLittleAdventure / Welcome City Lab Dem...
Artificial Intelligence for Travel - MyLittleAdventure / Welcome City Lab Dem...Artificial Intelligence for Travel - MyLittleAdventure / Welcome City Lab Dem...
Artificial Intelligence for Travel - MyLittleAdventure / Welcome City Lab Dem...
Valéry BERNARD
 
#MME15: How Mintigo, the Leading Predictive Marketing Platform for Enterprise...
#MME15: How Mintigo, the Leading Predictive Marketing Platform for Enterprise...#MME15: How Mintigo, the Leading Predictive Marketing Platform for Enterprise...
#MME15: How Mintigo, the Leading Predictive Marketing Platform for Enterprise...
Mintigo1
 
AI for Software Engineering
AI for Software EngineeringAI for Software Engineering
AI for Software Engineering
Miroslaw Staron
 
Big data in marketing at harvard business club nick1 june 15 2013
Big data in marketing at harvard business club nick1 june 15 2013Big data in marketing at harvard business club nick1 june 15 2013
Big data in marketing at harvard business club nick1 june 15 2013
nkabra
 
Kde jsou limity zákaznické 360°?
 Kde jsou limity zákaznické 360°? Kde jsou limity zákaznické 360°?
Kde jsou limity zákaznické 360°?
Taste Medio
 
Streaming Machine learning Distributed Pipeline for Real-Time Uber Data Using...
Streaming Machine learning Distributed Pipeline for Real-Time Uber Data Using...Streaming Machine learning Distributed Pipeline for Real-Time Uber Data Using...
Streaming Machine learning Distributed Pipeline for Real-Time Uber Data Using...
Carol McDonald
 
Analytics what to look for sustaining your growing business-
Analytics   what to look for sustaining your growing business-Analytics   what to look for sustaining your growing business-
Analytics what to look for sustaining your growing business-
Ajay Ohri
 
Imtiaz khan data_science_analytics
Imtiaz khan data_science_analyticsImtiaz khan data_science_analytics
Imtiaz khan data_science_analytics
imtiaz khan
 
Lead confluent HQ Dec 2019
Lead   confluent HQ Dec 2019Lead   confluent HQ Dec 2019
Lead confluent HQ Dec 2019
Sabri Skhiri
 
Big Data in Advertising Industry — Oleksandr Fedirko, Danylo Stepanchuk
Big Data in Advertising Industry — Oleksandr Fedirko, Danylo StepanchukBig Data in Advertising Industry — Oleksandr Fedirko, Danylo Stepanchuk
Big Data in Advertising Industry — Oleksandr Fedirko, Danylo Stepanchuk
GlobalLogic Ukraine
 
Wims2012
Wims2012Wims2012
Wims2012
Elena Simperl
 
Webinar - Know Your Customer - Arya (20160526)
Webinar - Know Your Customer - Arya (20160526)Webinar - Know Your Customer - Arya (20160526)
Webinar - Know Your Customer - Arya (20160526)
Turi, Inc.
 
Pubcon 2015 - Content Management Across International Markets
Pubcon 2015 - Content Management Across International MarketsPubcon 2015 - Content Management Across International Markets
Pubcon 2015 - Content Management Across International Markets
Doug Platts
 
Knowledge Discovery
Knowledge DiscoveryKnowledge Discovery
Knowledge Discovery
André Karpištšenko
 
Leverage the power of machine learning on windows
Leverage the power of machine learning on windowsLeverage the power of machine learning on windows
Leverage the power of machine learning on windows
José António Silva
 
Workshop: Make the Most of Customer Data Platforms - David Raab
Workshop: Make the Most of Customer Data Platforms - David RaabWorkshop: Make the Most of Customer Data Platforms - David Raab
Workshop: Make the Most of Customer Data Platforms - David Raab
Digital Customer Experience (DX) Summit
 
Data Annotation_Cars.pptx
Data Annotation_Cars.pptxData Annotation_Cars.pptx
Data Annotation_Cars.pptx
ssuserfb92ae
 
2024-02-24_Session 1 - PMLE_UPDATED.pptx
2024-02-24_Session 1 - PMLE_UPDATED.pptx2024-02-24_Session 1 - PMLE_UPDATED.pptx
2024-02-24_Session 1 - PMLE_UPDATED.pptx
gdgsurrey
 

Similar to Using Machine Learning in the delivery of ads (20)

Track 2 Session 5_ 利用 SageMaker 深度學習容器化在廣告推播之應用
Track 2 Session 5_ 利用 SageMaker 深度學習容器化在廣告推播之應用Track 2 Session 5_ 利用 SageMaker 深度學習容器化在廣告推播之應用
Track 2 Session 5_ 利用 SageMaker 深度學習容器化在廣告推播之應用
 
Artificial Intelligence for Travel - MyLittleAdventure / Welcome City Lab Dem...
Artificial Intelligence for Travel - MyLittleAdventure / Welcome City Lab Dem...Artificial Intelligence for Travel - MyLittleAdventure / Welcome City Lab Dem...
Artificial Intelligence for Travel - MyLittleAdventure / Welcome City Lab Dem...
 
#MME15: How Mintigo, the Leading Predictive Marketing Platform for Enterprise...
#MME15: How Mintigo, the Leading Predictive Marketing Platform for Enterprise...#MME15: How Mintigo, the Leading Predictive Marketing Platform for Enterprise...
#MME15: How Mintigo, the Leading Predictive Marketing Platform for Enterprise...
 
AI for Software Engineering
AI for Software EngineeringAI for Software Engineering
AI for Software Engineering
 
Big data in marketing at harvard business club nick1 june 15 2013
Big data in marketing at harvard business club nick1 june 15 2013Big data in marketing at harvard business club nick1 june 15 2013
Big data in marketing at harvard business club nick1 june 15 2013
 
Kde jsou limity zákaznické 360°?
 Kde jsou limity zákaznické 360°? Kde jsou limity zákaznické 360°?
Kde jsou limity zákaznické 360°?
 
Streaming Machine learning Distributed Pipeline for Real-Time Uber Data Using...
Streaming Machine learning Distributed Pipeline for Real-Time Uber Data Using...Streaming Machine learning Distributed Pipeline for Real-Time Uber Data Using...
Streaming Machine learning Distributed Pipeline for Real-Time Uber Data Using...
 
AI & AWS DeepComposer
AI & AWS DeepComposerAI & AWS DeepComposer
AI & AWS DeepComposer
 
Analytics what to look for sustaining your growing business-
Analytics   what to look for sustaining your growing business-Analytics   what to look for sustaining your growing business-
Analytics what to look for sustaining your growing business-
 
Imtiaz khan data_science_analytics
Imtiaz khan data_science_analyticsImtiaz khan data_science_analytics
Imtiaz khan data_science_analytics
 
Lead confluent HQ Dec 2019
Lead   confluent HQ Dec 2019Lead   confluent HQ Dec 2019
Lead confluent HQ Dec 2019
 
Big Data in Advertising Industry — Oleksandr Fedirko, Danylo Stepanchuk
Big Data in Advertising Industry — Oleksandr Fedirko, Danylo StepanchukBig Data in Advertising Industry — Oleksandr Fedirko, Danylo Stepanchuk
Big Data in Advertising Industry — Oleksandr Fedirko, Danylo Stepanchuk
 
Wims2012
Wims2012Wims2012
Wims2012
 
Webinar - Know Your Customer - Arya (20160526)
Webinar - Know Your Customer - Arya (20160526)Webinar - Know Your Customer - Arya (20160526)
Webinar - Know Your Customer - Arya (20160526)
 
Pubcon 2015 - Content Management Across International Markets
Pubcon 2015 - Content Management Across International MarketsPubcon 2015 - Content Management Across International Markets
Pubcon 2015 - Content Management Across International Markets
 
Knowledge Discovery
Knowledge DiscoveryKnowledge Discovery
Knowledge Discovery
 
Leverage the power of machine learning on windows
Leverage the power of machine learning on windowsLeverage the power of machine learning on windows
Leverage the power of machine learning on windows
 
Workshop: Make the Most of Customer Data Platforms - David Raab
Workshop: Make the Most of Customer Data Platforms - David RaabWorkshop: Make the Most of Customer Data Platforms - David Raab
Workshop: Make the Most of Customer Data Platforms - David Raab
 
Data Annotation_Cars.pptx
Data Annotation_Cars.pptxData Annotation_Cars.pptx
Data Annotation_Cars.pptx
 
2024-02-24_Session 1 - PMLE_UPDATED.pptx
2024-02-24_Session 1 - PMLE_UPDATED.pptx2024-02-24_Session 1 - PMLE_UPDATED.pptx
2024-02-24_Session 1 - PMLE_UPDATED.pptx
 

More from Ruth Garcia Gavilanes

Using Simple Machine Learning Models in a New Ads Manager
Using Simple Machine Learning Models in a New Ads ManagerUsing Simple Machine Learning Models in a New Ads Manager
Using Simple Machine Learning Models in a New Ads Manager
Ruth Garcia Gavilanes
 
Assessing Online Ads Beyond Only Clicks
Assessing Online Ads Beyond Only ClicksAssessing Online Ads Beyond Only Clicks
Assessing Online Ads Beyond Only Clicks
Ruth Garcia Gavilanes
 
An Analysis of Human-Generated Friendship Recommendations
An Analysis of Human-Generated Friendship Recommendations An Analysis of Human-Generated Friendship Recommendations
An Analysis of Human-Generated Friendship Recommendations
Ruth Garcia Gavilanes
 
Discovering Culture in Social Media and a Brief Case of Collective Memory
Discovering Culture in Social Media and a Brief Case of Collective MemoryDiscovering Culture in Social Media and a Brief Case of Collective Memory
Discovering Culture in Social Media and a Brief Case of Collective Memory
Ruth Garcia Gavilanes
 
Language, Twitter and Academic Conferences
Language,  Twitter  and Academic Conferences Language,  Twitter  and Academic Conferences
Language, Twitter and Academic Conferences
Ruth Garcia Gavilanes
 
USER BEHAVIOR IN MICROBLOGS WITH A CULTURAL EMPHASIS
USER BEHAVIOR IN MICROBLOGS WITH A CULTURAL EMPHASISUSER BEHAVIOR IN MICROBLOGS WITH A CULTURAL EMPHASIS
USER BEHAVIOR IN MICROBLOGS WITH A CULTURAL EMPHASIS
Ruth Garcia Gavilanes
 
Who are my Audiences? Evolution of Target Audiences in Microblogs
Who are my Audiences? Evolution of Target Audiences in MicroblogsWho are my Audiences? Evolution of Target Audiences in Microblogs
Who are my Audiences? Evolution of Target Audiences in Microblogs
Ruth Garcia Gavilanes
 
Follow My Friends This Friday! An Analysis of Human-generated Friendship Reco...
Follow My Friends This Friday! An Analysis of Human-generated Friendship Reco...Follow My Friends This Friday! An Analysis of Human-generated Friendship Reco...
Follow My Friends This Friday! An Analysis of Human-generated Friendship Reco...
Ruth Garcia Gavilanes
 
Twitter: Time, Individualism and Power
Twitter: Time, Individualism and PowerTwitter: Time, Individualism and Power
Twitter: Time, Individualism and Power
Ruth Garcia Gavilanes
 
Cikm2011 doallbirdstweetthesame
Cikm2011 doallbirdstweetthesameCikm2011 doallbirdstweetthesame
Cikm2011 doallbirdstweetthesame
Ruth Garcia Gavilanes
 

More from Ruth Garcia Gavilanes (10)

Using Simple Machine Learning Models in a New Ads Manager
Using Simple Machine Learning Models in a New Ads ManagerUsing Simple Machine Learning Models in a New Ads Manager
Using Simple Machine Learning Models in a New Ads Manager
 
Assessing Online Ads Beyond Only Clicks
Assessing Online Ads Beyond Only ClicksAssessing Online Ads Beyond Only Clicks
Assessing Online Ads Beyond Only Clicks
 
An Analysis of Human-Generated Friendship Recommendations
An Analysis of Human-Generated Friendship Recommendations An Analysis of Human-Generated Friendship Recommendations
An Analysis of Human-Generated Friendship Recommendations
 
Discovering Culture in Social Media and a Brief Case of Collective Memory
Discovering Culture in Social Media and a Brief Case of Collective MemoryDiscovering Culture in Social Media and a Brief Case of Collective Memory
Discovering Culture in Social Media and a Brief Case of Collective Memory
 
Language, Twitter and Academic Conferences
Language,  Twitter  and Academic Conferences Language,  Twitter  and Academic Conferences
Language, Twitter and Academic Conferences
 
USER BEHAVIOR IN MICROBLOGS WITH A CULTURAL EMPHASIS
USER BEHAVIOR IN MICROBLOGS WITH A CULTURAL EMPHASISUSER BEHAVIOR IN MICROBLOGS WITH A CULTURAL EMPHASIS
USER BEHAVIOR IN MICROBLOGS WITH A CULTURAL EMPHASIS
 
Who are my Audiences? Evolution of Target Audiences in Microblogs
Who are my Audiences? Evolution of Target Audiences in MicroblogsWho are my Audiences? Evolution of Target Audiences in Microblogs
Who are my Audiences? Evolution of Target Audiences in Microblogs
 
Follow My Friends This Friday! An Analysis of Human-generated Friendship Reco...
Follow My Friends This Friday! An Analysis of Human-generated Friendship Reco...Follow My Friends This Friday! An Analysis of Human-generated Friendship Reco...
Follow My Friends This Friday! An Analysis of Human-generated Friendship Reco...
 
Twitter: Time, Individualism and Power
Twitter: Time, Individualism and PowerTwitter: Time, Individualism and Power
Twitter: Time, Individualism and Power
 
Cikm2011 doallbirdstweetthesame
Cikm2011 doallbirdstweetthesameCikm2011 doallbirdstweetthesame
Cikm2011 doallbirdstweetthesame
 

Recently uploaded

Getting started with Amazon Bedrock Studio and Control Tower
Getting started with Amazon Bedrock Studio and Control TowerGetting started with Amazon Bedrock Studio and Control Tower
Getting started with Amazon Bedrock Studio and Control Tower
Vladimir Samoylov
 
Bonzo subscription_hjjjjjjjj5hhhhhhh_2024.pdf
Bonzo subscription_hjjjjjjjj5hhhhhhh_2024.pdfBonzo subscription_hjjjjjjjj5hhhhhhh_2024.pdf
Bonzo subscription_hjjjjjjjj5hhhhhhh_2024.pdf
khadija278284
 
María Carolina Martínez - eCommerce Day Colombia 2024
María Carolina Martínez - eCommerce Day Colombia 2024María Carolina Martínez - eCommerce Day Colombia 2024
María Carolina Martínez - eCommerce Day Colombia 2024
eCommerce Institute
 
somanykidsbutsofewfathers-140705000023-phpapp02.pptx
somanykidsbutsofewfathers-140705000023-phpapp02.pptxsomanykidsbutsofewfathers-140705000023-phpapp02.pptx
somanykidsbutsofewfathers-140705000023-phpapp02.pptx
Howard Spence
 
Obesity causes and management and associated medical conditions
Obesity causes and management and associated medical conditionsObesity causes and management and associated medical conditions
Obesity causes and management and associated medical conditions
Faculty of Medicine And Health Sciences
 
Supercharge your AI - SSP Industry Breakout Session 2024-v2_1.pdf
Supercharge your AI - SSP Industry Breakout Session 2024-v2_1.pdfSupercharge your AI - SSP Industry Breakout Session 2024-v2_1.pdf
Supercharge your AI - SSP Industry Breakout Session 2024-v2_1.pdf
Access Innovations, Inc.
 
Bitcoin Lightning wallet and tic-tac-toe game XOXO
Bitcoin Lightning wallet and tic-tac-toe game XOXOBitcoin Lightning wallet and tic-tac-toe game XOXO
Bitcoin Lightning wallet and tic-tac-toe game XOXO
Matjaž Lipuš
 
Acorn Recovery: Restore IT infra within minutes
Acorn Recovery: Restore IT infra within minutesAcorn Recovery: Restore IT infra within minutes
Acorn Recovery: Restore IT infra within minutes
IP ServerOne
 
Doctoral Symposium at the 17th IEEE International Conference on Software Test...
Doctoral Symposium at the 17th IEEE International Conference on Software Test...Doctoral Symposium at the 17th IEEE International Conference on Software Test...
Doctoral Symposium at the 17th IEEE International Conference on Software Test...
Sebastiano Panichella
 
Gregory Harris' Civics Presentation.pptx
Gregory Harris' Civics Presentation.pptxGregory Harris' Civics Presentation.pptx
Gregory Harris' Civics Presentation.pptx
gharris9
 
0x01 - Newton's Third Law: Static vs. Dynamic Abusers
0x01 - Newton's Third Law:  Static vs. Dynamic Abusers0x01 - Newton's Third Law:  Static vs. Dynamic Abusers
0x01 - Newton's Third Law: Static vs. Dynamic Abusers
OWASP Beja
 
Media as a Mind Controlling Strategy In Old and Modern Era
Media as a Mind Controlling Strategy In Old and Modern EraMedia as a Mind Controlling Strategy In Old and Modern Era
Media as a Mind Controlling Strategy In Old and Modern Era
faizulhassanfaiz1670
 
AWANG ANIQKMALBIN AWANG TAJUDIN B22080004 ASSIGNMENT 2 MPU3193 PHILOSOPHY AND...
AWANG ANIQKMALBIN AWANG TAJUDIN B22080004 ASSIGNMENT 2 MPU3193 PHILOSOPHY AND...AWANG ANIQKMALBIN AWANG TAJUDIN B22080004 ASSIGNMENT 2 MPU3193 PHILOSOPHY AND...
AWANG ANIQKMALBIN AWANG TAJUDIN B22080004 ASSIGNMENT 2 MPU3193 PHILOSOPHY AND...
AwangAniqkmals
 
Announcement of 18th IEEE International Conference on Software Testing, Verif...
Announcement of 18th IEEE International Conference on Software Testing, Verif...Announcement of 18th IEEE International Conference on Software Testing, Verif...
Announcement of 18th IEEE International Conference on Software Testing, Verif...
Sebastiano Panichella
 
Presentatie 8. Joost van der Linde & Daniel Anderton - Eliq 28 mei 2024
Presentatie 8. Joost van der Linde & Daniel Anderton - Eliq 28 mei 2024Presentatie 8. Joost van der Linde & Daniel Anderton - Eliq 28 mei 2024
Presentatie 8. Joost van der Linde & Daniel Anderton - Eliq 28 mei 2024
Dutch Power
 
Burning Issue Presentation By Kenmaryon.pdf
Burning Issue Presentation By Kenmaryon.pdfBurning Issue Presentation By Kenmaryon.pdf
Burning Issue Presentation By Kenmaryon.pdf
kkirkland2
 
International Workshop on Artificial Intelligence in Software Testing
International Workshop on Artificial Intelligence in Software TestingInternational Workshop on Artificial Intelligence in Software Testing
International Workshop on Artificial Intelligence in Software Testing
Sebastiano Panichella
 
Tom tresser burning issue.pptx My Burning issue
Tom tresser burning issue.pptx My Burning issueTom tresser burning issue.pptx My Burning issue
Tom tresser burning issue.pptx My Burning issue
amekonnen
 
Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024
Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024
Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024
Dutch Power
 
Competition and Regulation in Professional Services – KLEINER – June 2024 OEC...
Competition and Regulation in Professional Services – KLEINER – June 2024 OEC...Competition and Regulation in Professional Services – KLEINER – June 2024 OEC...
Competition and Regulation in Professional Services – KLEINER – June 2024 OEC...
OECD Directorate for Financial and Enterprise Affairs
 

Recently uploaded (20)

Getting started with Amazon Bedrock Studio and Control Tower
Getting started with Amazon Bedrock Studio and Control TowerGetting started with Amazon Bedrock Studio and Control Tower
Getting started with Amazon Bedrock Studio and Control Tower
 
Bonzo subscription_hjjjjjjjj5hhhhhhh_2024.pdf
Bonzo subscription_hjjjjjjjj5hhhhhhh_2024.pdfBonzo subscription_hjjjjjjjj5hhhhhhh_2024.pdf
Bonzo subscription_hjjjjjjjj5hhhhhhh_2024.pdf
 
María Carolina Martínez - eCommerce Day Colombia 2024
María Carolina Martínez - eCommerce Day Colombia 2024María Carolina Martínez - eCommerce Day Colombia 2024
María Carolina Martínez - eCommerce Day Colombia 2024
 
somanykidsbutsofewfathers-140705000023-phpapp02.pptx
somanykidsbutsofewfathers-140705000023-phpapp02.pptxsomanykidsbutsofewfathers-140705000023-phpapp02.pptx
somanykidsbutsofewfathers-140705000023-phpapp02.pptx
 
Obesity causes and management and associated medical conditions
Obesity causes and management and associated medical conditionsObesity causes and management and associated medical conditions
Obesity causes and management and associated medical conditions
 
Supercharge your AI - SSP Industry Breakout Session 2024-v2_1.pdf
Supercharge your AI - SSP Industry Breakout Session 2024-v2_1.pdfSupercharge your AI - SSP Industry Breakout Session 2024-v2_1.pdf
Supercharge your AI - SSP Industry Breakout Session 2024-v2_1.pdf
 
Bitcoin Lightning wallet and tic-tac-toe game XOXO
Bitcoin Lightning wallet and tic-tac-toe game XOXOBitcoin Lightning wallet and tic-tac-toe game XOXO
Bitcoin Lightning wallet and tic-tac-toe game XOXO
 
Acorn Recovery: Restore IT infra within minutes
Acorn Recovery: Restore IT infra within minutesAcorn Recovery: Restore IT infra within minutes
Acorn Recovery: Restore IT infra within minutes
 
Doctoral Symposium at the 17th IEEE International Conference on Software Test...
Doctoral Symposium at the 17th IEEE International Conference on Software Test...Doctoral Symposium at the 17th IEEE International Conference on Software Test...
Doctoral Symposium at the 17th IEEE International Conference on Software Test...
 
Gregory Harris' Civics Presentation.pptx
Gregory Harris' Civics Presentation.pptxGregory Harris' Civics Presentation.pptx
Gregory Harris' Civics Presentation.pptx
 
0x01 - Newton's Third Law: Static vs. Dynamic Abusers
0x01 - Newton's Third Law:  Static vs. Dynamic Abusers0x01 - Newton's Third Law:  Static vs. Dynamic Abusers
0x01 - Newton's Third Law: Static vs. Dynamic Abusers
 
Media as a Mind Controlling Strategy In Old and Modern Era
Media as a Mind Controlling Strategy In Old and Modern EraMedia as a Mind Controlling Strategy In Old and Modern Era
Media as a Mind Controlling Strategy In Old and Modern Era
 
AWANG ANIQKMALBIN AWANG TAJUDIN B22080004 ASSIGNMENT 2 MPU3193 PHILOSOPHY AND...
AWANG ANIQKMALBIN AWANG TAJUDIN B22080004 ASSIGNMENT 2 MPU3193 PHILOSOPHY AND...AWANG ANIQKMALBIN AWANG TAJUDIN B22080004 ASSIGNMENT 2 MPU3193 PHILOSOPHY AND...
AWANG ANIQKMALBIN AWANG TAJUDIN B22080004 ASSIGNMENT 2 MPU3193 PHILOSOPHY AND...
 
Announcement of 18th IEEE International Conference on Software Testing, Verif...
Announcement of 18th IEEE International Conference on Software Testing, Verif...Announcement of 18th IEEE International Conference on Software Testing, Verif...
Announcement of 18th IEEE International Conference on Software Testing, Verif...
 
Presentatie 8. Joost van der Linde & Daniel Anderton - Eliq 28 mei 2024
Presentatie 8. Joost van der Linde & Daniel Anderton - Eliq 28 mei 2024Presentatie 8. Joost van der Linde & Daniel Anderton - Eliq 28 mei 2024
Presentatie 8. Joost van der Linde & Daniel Anderton - Eliq 28 mei 2024
 
Burning Issue Presentation By Kenmaryon.pdf
Burning Issue Presentation By Kenmaryon.pdfBurning Issue Presentation By Kenmaryon.pdf
Burning Issue Presentation By Kenmaryon.pdf
 
International Workshop on Artificial Intelligence in Software Testing
International Workshop on Artificial Intelligence in Software TestingInternational Workshop on Artificial Intelligence in Software Testing
International Workshop on Artificial Intelligence in Software Testing
 
Tom tresser burning issue.pptx My Burning issue
Tom tresser burning issue.pptx My Burning issueTom tresser burning issue.pptx My Burning issue
Tom tresser burning issue.pptx My Burning issue
 
Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024
Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024
Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024
 
Competition and Regulation in Professional Services – KLEINER – June 2024 OEC...
Competition and Regulation in Professional Services – KLEINER – June 2024 OEC...Competition and Regulation in Professional Services – KLEINER – June 2024 OEC...
Competition and Regulation in Professional Services – KLEINER – June 2024 OEC...
 

Using Machine Learning in the delivery of ads

  • 1. Using Machine Learning in the delivery of ads Ruth Garcia NDR Conference – June 07, 2018 Iasi, Romania
  • 2. What is Skyscanner? Skyscanner is a leading global travel search site offering a comprehensive and free flight search service as well as online comparisons for hotels, car hire and now trains.
  • 3. Data Science at Skyscanner Decision Science:1. Ensuring we have the right data; insights and information to make the most impactful, scientific decisions in every aspect of our operations. Building Data Products:2. Leveraging our vast wealth of data to build more contextual, relevant products for Travellers and Travel Suppliers (our Partners) D.S Eng.
  • 4. Machine Learning at Skyscanner • Destination recommendations • Itinerary mashups • Advertising Recommendations Advertising Mashups
  • 7. Advertising schemes Brand awareness Traffic Action Machine Learning efforts
  • 9. Advertising Schemes Brand awareness Traffic Action Machine Learning efforts
  • 11. Ads at Skyscanner = No need to worry about how ads are delivered ? Make sure, ads are delivered
  • 12. Delivery of Ads Data ownership ? Technical difficulties Black box algorithms Skyscanner Ads Manager Solution
  • 13. Click prediction algorithm Click prediction algorithm Rankingalgorithm Candidates
  • 14. Cloud services and tools at Skyscanner Languages AWS services Batch S3
  • 15. Overview of an Advertising System
  • 16. Expectation vs. reality Tensorfiow Optimization technique• Embeddings• Crossed columns• Hashing• Expectation Flexible but with the risk of not being fast enough. Features: User history,• User features,• Route features ,• Ad features with,• colors, text
  • 17. Expectation vs. reality Reality Not flexible but fast and easier to implement. Tensorfiow Optimization technique• Embeddings• Crossed columns• Hashing• Features: User history,• User features,• Route features ,• Ad features with,• colors, text
  • 18. Challenges in building the model Goal: model, that estimates the likelihood of whether an impression will result in a click or not.
  • 19. Challenges: Which model to use? Model Possibilities (easy to read in node.js): • Logistic regression • Random Forest : gets lost • Neural networks: too slow hard to put it in json Solvers: • Logistic regression: Liblinear, sag • SGDClassifier Train all data at once Train data in batches Saves memory Gridsearch for hyperparameteres
  • 20. Challenges: Categorical values Categorical values: One hot• encoding One hot encoding grouping less frequent features• Hashing trick• One hot encoding• • One hot encoding grouping • Less frequent features Creatives C1 C2 C3 C1 C2 C3 1 0 0 0 1 0 0 0 1 Creatives C1 C2 C3 C4 C1 C2 C3 C4 1 0 0 0 0 1 0 0 0 0 1 0 0 0 0 1
  • 21. Challenges: Categorical values id features 123 creative1, advertiser2,mobile, etc. 321 creative2, advertiser4,mobile, etc. id Feat_1 Feat_2 Feat_3 …. Feat_k 123 1 0 1 …. 0 321 1 0 0 …. 1 Hashing Trick: Same hashing function in training, testing and production• We gain
  • 22. Machine Learning Performance Precision at 1: based on target groups. Mean Reciprocal Rank AUC: if caring about ranking Log-Loss: if caring about the value of CTR Other metrics:
  • 23. Challenges: How to monitor performance? Updating model based on different Sampling methods.
  • 24. The road ahead: Balancing exploitation and exploration Choose ad based on ONLY CTR Choose ad based on OTHER criteria
  • 25. The road even further ahead: quality Post-click, colors, images, text, landing page
  • 26. Conclusions 1. Machine Learning can bring many benefits to the product the challenge is to prove it 2. Bringing ML into production could hard at the beginning 3. Cooperation between data scientists and engineers is crucial D.S Eng.