Using Machine Learning in the delivery of ads

In this presentation, I explained that a simple machine learning model can be implemented in some situations even if we do not have the perfect conditions. We go over the first iteration of a click prediction model used in the delivery of CPC ads.

Using Machine Learning in the delivery of ads
Ruth Garcia
NDR Conference – June 07, 2018
Iasi, Romania

What is Skyscanner?
Skyscanner is a leading global travel search site offering a
comprehensive and free flight search service as well as online
comparisons for hotels, car hire and now trains.

Data Science at Skyscanner
1. Decision Science: Ensuring we have the right data, insights and information to make the most impactful, scientific decisions in every aspect of our operations.
2. Building Data Products: Leveraging our vast wealth of data to build more contextual, relevant products for Travellers and Travel Suppliers (our Partners).
[Figure labels: D.S, Eng.]

Machine Learning at Skyscanner
• Destination recommendations
• Itinerary mashups
• Advertising

Ads at Skyscanner
Make sure ads are delivered = no need to worry about how ads are delivered?

Delivery of Ads
• Data ownership
• Technical difficulties
• Black box algorithms
Solution: Skyscanner Ads Manager

Click prediction algorithm
[Diagram labels: Candidates, Click prediction algorithm, Ranking algorithm]
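
The flow named on this slide (candidate ads scored by a click prediction model, then ordered by a ranking step) might look roughly like the sketch below. The `Candidate` type, the `FAKE_CTR` stand-in for a trained model, and the choice to rank by predicted CTR multiplied by the CPC bid are illustrative assumptions, not details taken from the talk.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Candidate:
    """A hypothetical candidate ad; field names are illustrative only."""
    ad_id: str
    cpc_bid: float  # assumed: what the advertiser pays per click


def rank_candidates(
    candidates: List[Candidate],
    predict_ctr: Callable[[Candidate], float],
) -> List[Candidate]:
    """Order candidates by expected value per impression (pCTR * bid).

    Ranking by pCTR * bid is a common choice for CPC ads and is assumed here.
    """
    return sorted(
        candidates,
        key=lambda c: predict_ctr(c) * c.cpc_bid,
        reverse=True,
    )


# Stand-in for a trained click prediction model (made-up numbers).
FAKE_CTR = {"ad_a": 0.010, "ad_b": 0.030, "ad_c": 0.020}

ranked = rank_candidates(
    [Candidate("ad_a", 2.0), Candidate("ad_b", 0.5), Candidate("ad_c", 1.5)],
    predict_ctr=lambda c: FAKE_CTR[c.ad_id],
)
print([c.ad_id for c in ranked])  # ['ad_c', 'ad_a', 'ad_b']
```

Keeping the prediction function separate from the ranking rule means either one can be swapped without touching the other.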

Cloud services and tools at Skyscanner
Languages
AWS services: Batch, S3

Expectation vs. reality
Expectation: flexible, but with the risk of not being fast enough.
TensorFlow
• Optimization technique
• Embeddings
• Crossed columns
• Hashing
Features:
• User history
• User features
• Route features
• Ad features with colors, text

Expectation vs. reality
Reality: not flexible, but fast and easier to implement.
TensorFlow
• Optimization technique
• Embeddings
• Crossed columns
• Hashing
Features:
• User history
• User features
• Route features
• Ad features with colors, text

Challenges in building the model
Goal: a model that estimates the likelihood that an impression will result in a click.

Challenges: Which model to use?
Model possibilities (easy to read in Node.js):
• Logistic regression
• Random forest: gets lost
• Neural networks: too slow, hard to put in JSON
Solvers:
• Logistic regression (liblinear, sag): train on all data at once
• SGDClassifier: train on data in batches, saves memory
Grid search for hyperparameters
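
To illustrate why training in batches saves memory and why a linear model is easy to hand over as JSON, here is a minimal pure-Python sketch of logistic regression trained with stochastic gradient descent. It is not the talk's code: the class name, the toy data and the learning rate are assumptions. The method name `partial_fit` mirrors the one scikit-learn's `SGDClassifier` offers for incremental training.

```python
import json
import math
import random
from typing import Iterable, List, Tuple

Example = Tuple[List[float], int]  # (feature vector, clicked: 0 or 1)


def sigmoid(z: float) -> float:
    # Numerically safe logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


class BatchLogisticRegression:
    """Logistic regression trained with SGD, one batch at a time."""

    def __init__(self, n_features: int, learning_rate: float = 0.1) -> None:
        self.weights = [0.0] * n_features
        self.bias = 0.0
        self.learning_rate = learning_rate

    def predict_proba(self, x: List[float]) -> float:
        z = self.bias + sum(w * xi for w, xi in zip(self.weights, x))
        return sigmoid(z)

    def partial_fit(self, batch: Iterable[Example]) -> None:
        # Only the current batch has to be held in memory.
        for x, y in batch:
            error = self.predict_proba(x) - y  # gradient of log-loss w.r.t. z
            for i, xi in enumerate(x):
                self.weights[i] -= self.learning_rate * error * xi
            self.bias -= self.learning_rate * error

    def to_json(self) -> str:
        # Weights and bias are all another runtime needs to score an ad.
        return json.dumps({"weights": self.weights, "bias": self.bias})


# Toy data (made up): feature 0 drives clicks, feature 1 is noise.
rng = random.Random(0)
data: List[Example] = []
for _ in range(2000):
    x = [rng.choice([0.0, 1.0]), rng.choice([0.0, 1.0])]
    p_click = 0.6 if x[0] == 1.0 else 0.1
    data.append((x, 1 if rng.random() < p_click else 0))

model = BatchLogisticRegression(n_features=2, learning_rate=0.05)
for _ in range(5):  # a few passes over the data
    for start in range(0, len(data), 100):
        model.partial_fit(data[start:start + 100])

exported = json.loads(model.to_json())
print(exported)
```

A service written in another language only has to load the exported weights and compute one dot product and one sigmoid per candidate ad.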

Challenges: Categorical values
Categorical values:
• One hot encoding
• One hot encoding grouping less frequent features
• Hashing trick

Creatives   C1  C2  C3
C1          1   0   0
C2          0   1   0
C3          0   0   1

Creatives   C1  C2  C3  C4
C1          1   0   0   0
C2          0   1   0   0
C3          0   0   1   0
C4          0   0   0   1
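
One hot encoding with less frequent values grouped together can be sketched as follows. The `min_count` threshold and the `__other__` bucket name are assumptions chosen for illustration; the slides do not say how the grouping was configured.

```python
from collections import Counter
from typing import List

OTHER = "__other__"  # hypothetical name for the shared rare-value bucket


def build_vocabulary(values: List[str], min_count: int) -> List[str]:
    """Keep categories seen at least min_count times; the rest share OTHER."""
    counts = Counter(values)
    frequent = sorted(v for v, c in counts.items() if c >= min_count)
    return frequent + [OTHER]


def one_hot(value: str, vocabulary: List[str]) -> List[int]:
    """Return a vector with a single 1 at the position of value (or OTHER)."""
    index = {v: i for i, v in enumerate(vocabulary)}
    position = index.get(value, index[OTHER])
    return [1 if i == position else 0 for i in range(len(vocabulary))]


creatives = ["C1", "C1", "C1", "C2", "C2", "C3", "C4"]
vocab = build_vocabulary(creatives, min_count=2)
print(vocab)                 # ['C1', 'C2', '__other__']
print(one_hot("C1", vocab))  # [1, 0, 0]
print(one_hot("C4", vocab))  # [0, 0, 1]
print(one_hot("C9", vocab))  # [0, 0, 1], unseen values also map to OTHER
```

Grouping keeps the vector length fixed when a new creative appears, at the cost of treating all rare creatives alike.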

Challenges: Categorical values
id    features
123   creative1, advertiser2, mobile, etc.
321   creative2, advertiser4, mobile, etc.

id    Feat_1  Feat_2  Feat_3  …  Feat_k
123   1       0       1       …  0
321   1       0       0       …  1

• Hashing trick: same hashing function in training, testing and production
We gain
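
A minimal sketch of the hashing trick, under the assumption that each feature is written as a `name=value` string and that 16 buckets are enough for a demo (both are illustrative choices). It uses `hashlib` rather than Python's built-in `hash()`, because the built-in hash of a string is salted per process and would not give the same buckets in training, testing and production.

```python
import hashlib
from typing import List


def feature_index(feature: str, n_buckets: int) -> int:
    """Map a feature string to a bucket with a hash that is stable
    across processes and machines."""
    digest = hashlib.sha256(feature.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % n_buckets


def hash_features(features: List[str], n_buckets: int) -> List[int]:
    """Return a fixed-length 0/1 vector; colliding features share a bucket."""
    vector = [0] * n_buckets
    for feature in features:
        vector[feature_index(feature, n_buckets)] = 1
    return vector


row_123 = hash_features(
    ["creative=creative1", "advertiser=advertiser2", "device=mobile"], 16
)
row_321 = hash_features(
    ["creative=creative2", "advertiser=advertiser4", "device=mobile"], 16
)
print(row_123)
print(row_321)
```

The vector length never changes when new creatives or advertisers appear, so no vocabulary has to be stored or shipped with the model; the trade-off is that two features can collide in one bucket.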

Machine Learning Performance
• AUC: if caring about ranking
• Log-loss: if caring about the value of CTR
Other metrics:
• Precision at 1: based on target groups
• Mean Reciprocal Rank
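
The metrics named on this slide can be written out in a few lines each. The sketch below uses the standard textbook definitions; treating each ranked list as one request (or target group) is an assumption, since the slides do not define the grouping.

```python
import math
from typing import List, Sequence


def log_loss(labels: Sequence[int], probs: Sequence[float], eps: float = 1e-15) -> float:
    """Mean negative log-likelihood; penalises badly calibrated CTR values."""
    total = 0.0
    for y, p in zip(labels, probs):
        p = min(max(p, eps), 1.0 - eps)  # avoid log(0)
        total += -(y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(labels)


def auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Probability that a random click is scored above a random non-click."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    wins = 0.0
    for p in positives:
        for n in negatives:
            wins += 1.0 if p > n else (0.5 if p == n else 0.0)
    return wins / (len(positives) * len(negatives))


def mean_reciprocal_rank(ranked_labels: List[List[int]]) -> float:
    """Each inner list holds click labels in the order the ads were ranked."""
    total = 0.0
    for labels in ranked_labels:
        rank = next((i + 1 for i, y in enumerate(labels) if y == 1), None)
        total += 1.0 / rank if rank else 0.0
    return total / len(ranked_labels)


def precision_at_1(ranked_labels: List[List[int]]) -> float:
    """Fraction of rankings whose top ad was clicked."""
    return sum(labels[0] for labels in ranked_labels) / len(ranked_labels)


rankings = [[0, 1, 0], [1, 0, 0], [0, 0, 0]]  # made-up click labels
print(auc([1, 0, 1, 0], [0.9, 0.8, 0.4, 0.1]))  # 0.75
print(mean_reciprocal_rank(rankings))            # 0.5
print(precision_at_1(rankings))
print(log_loss([1, 0], [0.5, 0.5]))
```

AUC only looks at the ordering of scores, while log-loss changes whenever the predicted probabilities change, which is why the slide ties AUC to ranking and log-loss to the CTR value itself.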

Challenges: How to monitor performance?
Updating the model based on different sampling methods.

The road ahead: Balancing exploitation and exploration
• Choose ad based on ONLY CTR
• Choose ad based on OTHER criteria
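
One simple way to balance the two choices is an epsilon-greedy rule, sketched below. The talk does not name a specific strategy, so epsilon-greedy, the uniform random pick used for exploration, and the value of `epsilon` are all assumptions made for illustration.

```python
import random
from typing import Dict, Optional


def choose_ad(
    predicted_ctr: Dict[str, float],
    epsilon: float,
    rng: Optional[random.Random] = None,
) -> str:
    """Epsilon-greedy: explore a random ad with probability epsilon,
    otherwise exploit the ad with the highest predicted CTR."""
    rng = rng or random.Random()
    if rng.random() < epsilon:
        return rng.choice(sorted(predicted_ctr))  # exploration
    return max(predicted_ctr, key=predicted_ctr.get)  # exploitation


ctr = {"ad_a": 0.010, "ad_b": 0.030, "ad_c": 0.020}  # made-up predictions
rng = random.Random(42)
picks = [choose_ad(ctr, epsilon=0.1, rng=rng) for _ in range(1000)]
print({ad: picks.count(ad) for ad in ctr})
```

Exploration gives ads with little history a chance to collect impressions, which in turn gives the click model data it would never see under pure exploitation.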

The road even further ahead: quality
Post-click, colors, images, text, landing page

Conclusions
1. Machine Learning can bring many benefits to the product; the challenge is to prove it.
2. Bringing ML into production can be hard at the beginning.
3. Cooperation between data scientists and engineers is crucial.
[Figure labels: D.S, Eng.]

Using Machine Learning in the delivery of ads

6. Not always annoying ads

7. Advertising schemes Brand awareness Traffic Action Machine Learning efforts

15. Overview of an Advertising System

27. Thank you @ruthygarcia Questions?
