SlideShare a Scribd company logo
RTB Bid Landscape
survival models
powered by bitmaps
Łukasz Mączewski Senior Data Scientist
Przemysław Piotrowski Senior Data Engineer
AI&Analytics, Adform
14.05.2019
1
Agenda
• Real Time Bidding – RTB
• Bidding Strategy
• Bid Landscape
• Bitmap population selection
• Survival analysis
• Validation
• Product demo
2
Real-Time Bidding (RTB)
RTB allows for fully automated ad
inventory selling / buying in a real-time
3
Advertiser 1
Advertiser 2
Advertiser 3
Advertiser n
AudienceInventory
Auction
Demand Side Platform is to
estimate optimal
bid price according to:
• Targeting (audience)
• Business requirements (KPIs)
• Ad inventory (context, ad slot price)
• Campaign budget and it's spend plan
First price vs. second price auction
First price auction:
- Pay exact amount that was bidded
- Bid price is close to a real impression value
- Seller revenue maximization
4
Second price auction:
- Pay 0.00001$ more than second-highest bid
price
- Bid as high as possible to maximize winning
probability
In the past the second price auction was preferred, but currently
the first price auction becomes more and more popular
RTB ecosystem
5
diverse
audience
web page
Publisher 1 and its
inventory SSP 1
Supply Side
Platform
Publisher n and its
inventory
.
.
. Ad Exchange
DMP
Data
Management
Platform
DSP 1
Demand Side Platform
SSP n DSP n
.
.
.
advertisers
demographics, browsing, interest segments data
cookie profile data
feedback data
ad-slot / banner data
campaign data
Bidding strategy
optimisation
Time to bid is ~100 ms
Bidding strategy optimisation
6
Bidding Strategy
Optimization
Candidate impression valuation with
respect to KPIs and budget
constraints
DSP algorithms
Evaluation of bid request business potential
Bid Landscape
Estimation of winning probability at bid price
Budget throttle
Control budget spend
Optimal bid price
Numbers
7
Adform provides an integrated Software as a Service
platform for the buying, managing and serving of
digital advertising.
Presence in EU, US, APAC
3 millions bid request per second
billions of engagements daily
5 millions scoring of algorithms per second
200 thousands of cookie profiles in DMP
Algorithms
8
Discovering knowledge from massive RTB data
9
Cookie profile/
Targeting:
• device type
• os type
• browser type
• screen size
• country
• regions
• city
• clicks made
• visited log points
• visited domains
• urls
• IAB categories
• ...
Feedback:
• acquisition/conversion
• click
• order status
• revenue
• viewability
• ...
Adslot/ banners/
creatives:
• inventory name
• domain
• URL
• banner position
• banner size
• banner type
• floor price
• video player properties
• ...
Data Management
Platform:
• demography
• interests and preferences
• profession
• ...
Campaign:
• budget
• pricing type
• deals
• targeting
• brand safety
• ...
Discovering knowledge from massive RTB data
10
Huge class
imbalance
Mostly
categorical
data
High
cardinality
Sparsity
High risk of
overfit
Data summary
Reduction of dimensionality
Preselection of features RegularisationData sampling
Cross validationEnsembling
RTB models
11
CPC
Cost per Click
CPL
Cost per Lead
netCPL
Confirmed lead
CPV
Cost per
Viewability
ROAS
Return on Ad
Spend
VCR
Video Completion
Rate
Adform machine learning pipeline
12
Train/Test data
sets preparation
(dedicated ETLs)
Data
preprocessing
and feature
engineering
Feature selection
Building model
and
hyperparameters
optimization
Offline backtests Online A/B tests Deployment Live monitoring
After successful model deployment it is
periodically retrained
RTB Algorithms Pipeline
13
PricingAPI
Train
Data
JAR
JAR
OFFLINE JAR
JAR
ONLINE JAR
ONLINE SPEC ONLINE MONITORING
Backtests
• Offline procedure to check performance of
two competing models in time
• Sequential analysis of historical data:
- Data divided into training and test sets
• Time series analysis of critical performance
KPIs:
- AUC
- Sparsity
- Target rates
• If the backtests succeed the algorithm is
submitted to online A/B testing procedure
14
AUC
SPARSITY
Target rate
Time
A/B test online monitoring
15
Split traffic to
hash(cookie_id)%1024
buckets
For each experiment monitor
• Cost per target
• Target rate
• Target count
• Number of impressions
• ... many more
Bid Landscape
17
User Stories
As a Trafficker I want to know volume of impressions my campaign would win in RTB given
the selected inventory, targeting and budget settings (CPM).
As a DSP I want to enable Advertisers winning impressions for lowest possible price.
18
Bid Shading
Demand Side Platform protects Advertiser from overpaying while
AdTech ecosystem migrating from second-price to first-price RTB auctions.
Forecasting & Bid Landscape
Forecasting
• Total number of matching Bid Requests for
given Campaign Targeting
• Bid Opportunities
19
Bid Landscape
• Predicting probability of winning
auction with a given bid price and
Campaign Settings
• Win rate evolution with a bid price
Impressions
CPM
Bid Landscape
• Building Bid Landscape model using
historical data regarding won and failed
auctions:
For lost auctions win price is unknown
• Right Censored Data
Survival Analysis
The exact moment of failure for majority of
objects is unknown
20
All Bids
Won Bids
Relative Rate
Bid Price Buckets
Population selection for
Bid Landscape modelling
21
The Data
22
Input
Bids and Impressions with features
Impression is a Bid that won the auction
100000 2000 5000 100 4000 3000 ... 4500
Output
cardinality of selected (Campaign Settings)
Bids bucketed by bid price
Impressions bucketed by bid price
Impressions bucketed by win price
100 buckets in our case
How?
Population Selection – select relevant population from high-dimensional
space for survival analysis.
source:biretail.com
Bitmaps (bitset, bitvector)
23
waw
1
0
0
0
1
0
0
1
…
0
iOS
0
0
1
0
1
1
0
1
…
0
11:00
1
1
0
0
1
0
0
1
…
0
Bid
0
1
2
3
4
5
6
7
…
100mln
Matching
bids
0
0
0
0
1
0
0
1
…
0
BidPriceBuckets
9900
790
2000
….
100
Counts
Nested queries on millions of features
24
waw
1
0
0
0
1
0
0
1
…
0
iOS
0
0
1
0
1
1
0
1
…
0
11:00
1
1
0
0
1
0
0
1
…
0
Bid
0
1
2
3
4
5
6
7
…
100mln
Matching
bids
0
0
0
0
1
0
0
1
…
0
BidPriceBuckets
9900
790
2000
….
100
Counts
3 mln other features
AND, OR, NOT, XOR
25
Buckets' cardinality
10000 3000 ... 4500
2000 100 ... 0
1000 800 ... 400
Survival
Analysis
Roaring Bitmap
26
Roaring Bitmap – state of the art compressed bitmap algorithm.
Hybrid approach between sparse and dense bitmaps representation.
roaringbitmap.org/about/
„Use Roaring for bitmap compression whenever possible.
Do not use other bitmap compression methods” [1]
[1] Wang, Jianguo, et al.
"An experimental study of bitmap compression vs. inverted list compression." 2017.
Bitmap Execution Database
27
• Roaring Bitmap – state of the art compressed bitmap algorithm
• Whole 7 days of Internet bid opportunities – 12GB (sample 1/1000) Naive 50TB
• Bids&Impressions – 5GB (sample 1/100)
• Majority of responses under 1 second
Nested set operations language
28
And(
Or("inventory|1", "inventory|2", "inventory|3"),
AndNot(
Universe,
Or("category|moto", "category|fishing")
),
Or(
"hour_of_week|41",
"hour_of_week|42",
"hour_of_week|43",
"hour_of_week|44"
),
Or("tld|com", "website|example.com"),
And(
"country|poland",
AndNot(
Universe,
"city|warsaw"
)
),
...
)
Survival analysis
29
Principles of survival analysis
• The basic assumption in survival analysis is that the risk of failure changes with time
• Two major approaches for survival probability modelling can be distinguished:
- Parametric – can be extrapolated for new experiences (Weibull)
- Non-parametric – very flexible but can't be extended for new experience (Kaplan-Maier)
30
FailureRate
Data censoring in survival analysis
31
Weibull based survival probability estimation
• Weibull distribution is derived from hazard function:
where:
- S(t) - survival function (cdf)
- F(t) = 1 – S(t) - failure function (cdf)
- f(t) = dF(t)/dt - pdf
• Weibull distribution is defined by 2 parameters:
- Shape parameter p - failure mode type
- Scale parameter l - position
32
Time
Survive .
0 t ∆t
Fail .
Weibull based survival probability estimation
• One can distinguish 3 classes of failure types:
- Infant mortality p < 1
- Random failures p = 1
- Ware out p > 1
33
Weibull examples
Parametric approach easy to extrapolate
age
Hazard
Kaplan-Maier based survival probability estimation
• In Kaplan-Meier method we are to predict chance of surviving time ∆t after surviving time t:
34
No parameters to be estimated, very flexible and easy in implementation
S(t,Δt) = S(t) • S(Δt|t)
Time
Survive .
0 t ∆t
Survive.
Kaplan-Meier curve calculation
35
S(t,Δt) = S(t) • S(Δt|t)
Time (weeks)
KM Curve for Group 1 (treatment)
Elements of survival analysis in
Bid Landscape modelling
36
RTB auctions modelling with survival methods
From survival models to Bid Landscape:
- time -> cost per 1k bought impressions (CPM)
- died -> won bid (left censored, known win price)
- alive -> lost bid (right censored, unknown exact win price)
37
our win price (second price auction)
competitor outbid price (unknown)
our bid price
Bid Landscape vs. Survival Analysis
Survival analysis
• Risk of failure evolves with time
• Incomplete knowledge regarding failure
time, right censoring:
- Majority of object survive
• Multiple competing failure modes
• Very high disproportion between positive
(failures) and negative (survived) class
representatives
• Usually very limited data
• Wrong predictions may have catastrophic
consequences – quality
monitoring required
38
Bid Landscape analysis
• Winning probability evolves with price
• Incomplete knowledge regarding win price, right
censoring:
- Only small fraction of bids are won
• Different inventory sources
• Very high disproportion between positive (won)
and negative (lost) class representatives
• Usually huge data sets – subsampling
• Wrong predictions can lead to not efficient
campaign money spend – quality monitoring
required
Survival model for Bid Landscape modelling
39
• Let's apply survival techniques to auction data:
- Instead of time a bid price is to be used
• Legend:
• number of lost auctions
• number of won impressions
- win price CDF
- Kaplan-Maier estimate
- Weibull estimate
Adform Bid Landscape exploits
Kaplan-Maier estimate exclusively
Bid Price (CPM)
Multiplicity
WinRate
Bid Landscape validation in
Adform
40
Bid Landscape validation – qualitative
41
Win rate evolution with bid price Win rate evolution over day
The bid landscape predictions are in line with the intuition:
• The win rate increases with bid price
• The win rate changes periodically over a day
Bid Landscape validation – quantitative (real RTB data)
42
Real auction data versus Bid Landscape predictions
Bid Landscape as Adform
product
43
Interactive tool used in UI
44
Hyperlocal precision
45
Floor Price detection
46
Bid Multipliers optimisation
47
Polish accountants'
portal
Jobs for professionals
Video oriented website
Impressions
Source of traffic:
• Campaign targeting Warsaw people to be
run on three different domains
• Global CPM makes no sense, needs to be
adjusted
Bid Landscape dynamic CPM algorithm
48
Given the Campaign Settings including hourly budget maximize
bought relevant impressions by using lowest CPM.
First tests are showing multiplied Viewable Impressions,
Clicks comparing to same Campaign using static price.
Summary and conclusions
• The Adform Bid Landscape is composed of two base elements ensuring speed and quality:
- Bitmaps – fast and efficient population selection
- Kaplan-Maier win rates – simple and precise estimate
• The Adform Bid Landscape is in production for more than one year
- It is mostly used by Adform customers optimising the CPM campaigns
- Currently we develop new RTB algorithm that is based on BL predictions
- In the short time perspective we are to extend usage of BL in other RTB models
49
5
RTB Bid Landscape in Adform

More Related Content

What's hot

Side by Side with Elasticsearch & Solr, Part 2
Side by Side with Elasticsearch & Solr, Part 2Side by Side with Elasticsearch & Solr, Part 2
Side by Side with Elasticsearch & Solr, Part 2
Sematext Group, Inc.
 
Advanced Stream Processing with Flink and Pulsar - Pulsar Summit NA 2021 Keynote
Advanced Stream Processing with Flink and Pulsar - Pulsar Summit NA 2021 KeynoteAdvanced Stream Processing with Flink and Pulsar - Pulsar Summit NA 2021 Keynote
Advanced Stream Processing with Flink and Pulsar - Pulsar Summit NA 2021 Keynote
StreamNative
 
Big data real time architectures
Big data real time architecturesBig data real time architectures
Big data real time architectures
Daniel Marcous
 
NoSQL
NoSQLNoSQL
NoSQL
Radu Potop
 
Elasticsearch와 Python을 이용하여 맨땅에서 데이터 분석하기
Elasticsearch와 Python을 이용하여 맨땅에서 데이터 분석하기Elasticsearch와 Python을 이용하여 맨땅에서 데이터 분석하기
Elasticsearch와 Python을 이용하여 맨땅에서 데이터 분석하기
흥래 김
 
Cloud dw benchmark using tpd-ds( Snowflake vs Redshift vs EMR Hive )
Cloud dw benchmark using tpd-ds( Snowflake vs Redshift vs EMR Hive )Cloud dw benchmark using tpd-ds( Snowflake vs Redshift vs EMR Hive )
Cloud dw benchmark using tpd-ds( Snowflake vs Redshift vs EMR Hive )
SANG WON PARK
 
[NDC18] 야생의 땅 듀랑고의 데이터 엔지니어링 이야기: 로그 시스템 구축 경험 공유
[NDC18] 야생의 땅 듀랑고의 데이터 엔지니어링 이야기: 로그 시스템 구축 경험 공유[NDC18] 야생의 땅 듀랑고의 데이터 엔지니어링 이야기: 로그 시스템 구축 경험 공유
[NDC18] 야생의 땅 듀랑고의 데이터 엔지니어링 이야기: 로그 시스템 구축 경험 공유
Hyojun Jeon
 
데브시스터즈 데이터 레이크 구축 이야기 : Data Lake architecture case study (박주홍 데이터 분석 및 인프라 팀...
데브시스터즈 데이터 레이크 구축 이야기 : Data Lake architecture case study (박주홍 데이터 분석 및 인프라 팀...데브시스터즈 데이터 레이크 구축 이야기 : Data Lake architecture case study (박주홍 데이터 분석 및 인프라 팀...
데브시스터즈 데이터 레이크 구축 이야기 : Data Lake architecture case study (박주홍 데이터 분석 및 인프라 팀...
Amazon Web Services Korea
 
Auto Scalable 한 Deep Learning Production 을 위한 AI Serving Infra 구성 및 AI DevOps...
Auto Scalable 한 Deep Learning Production 을 위한 AI Serving Infra 구성 및 AI DevOps...Auto Scalable 한 Deep Learning Production 을 위한 AI Serving Infra 구성 및 AI DevOps...
Auto Scalable 한 Deep Learning Production 을 위한 AI Serving Infra 구성 및 AI DevOps...
hoondong kim
 
로그 기깔나게 잘 디자인하는 법
로그 기깔나게 잘 디자인하는 법로그 기깔나게 잘 디자인하는 법
로그 기깔나게 잘 디자인하는 법
Jeongsang Baek
 
Soma search
Soma searchSoma search
Soma search
DaeMyung Kang
 
백억개의 로그를 모아 검색하고 분석하고 학습도 시켜보자 : 로기스
백억개의 로그를 모아 검색하고 분석하고 학습도 시켜보자 : 로기스백억개의 로그를 모아 검색하고 분석하고 학습도 시켜보자 : 로기스
백억개의 로그를 모아 검색하고 분석하고 학습도 시켜보자 : 로기스
NAVER D2
 
ElasticSearch
ElasticSearchElasticSearch
ElasticSearch
Volodymyr Kraietskyi
 
Apache Beam and Google Cloud Dataflow - IDG - final
Apache Beam and Google Cloud Dataflow - IDG - finalApache Beam and Google Cloud Dataflow - IDG - final
Apache Beam and Google Cloud Dataflow - IDG - finalSub Szabolcs Feczak
 
[팝콘 시즌1] 이윤희 : 다짜고짜 배워보는 인과추론
[팝콘 시즌1] 이윤희 : 다짜고짜 배워보는 인과추론[팝콘 시즌1] 이윤희 : 다짜고짜 배워보는 인과추론
[팝콘 시즌1] 이윤희 : 다짜고짜 배워보는 인과추론
PAP (Product Analytics Playground)
 
Recurrent Neural Networks for Recommendations and Personalization with Nick P...
Recurrent Neural Networks for Recommendations and Personalization with Nick P...Recurrent Neural Networks for Recommendations and Personalization with Nick P...
Recurrent Neural Networks for Recommendations and Personalization with Nick P...
Databricks
 
게임을 위한 DynamoDB 사례 및 팁 - 김일호 솔루션즈 아키텍트:: AWS Cloud Track 3 Gaming
게임을 위한 DynamoDB 사례 및 팁 - 김일호 솔루션즈 아키텍트:: AWS Cloud Track 3 Gaming게임을 위한 DynamoDB 사례 및 팁 - 김일호 솔루션즈 아키텍트:: AWS Cloud Track 3 Gaming
게임을 위한 DynamoDB 사례 및 팁 - 김일호 솔루션즈 아키텍트:: AWS Cloud Track 3 Gaming
Amazon Web Services Korea
 
So You Want to Write a Connector?
So You Want to Write a Connector? So You Want to Write a Connector?
So You Want to Write a Connector?
confluent
 
Sql vs NoSQL
Sql vs NoSQLSql vs NoSQL
Sql vs NoSQL
RTigger
 

What's hot (20)

Side by Side with Elasticsearch & Solr, Part 2
Side by Side with Elasticsearch & Solr, Part 2Side by Side with Elasticsearch & Solr, Part 2
Side by Side with Elasticsearch & Solr, Part 2
 
Advanced Stream Processing with Flink and Pulsar - Pulsar Summit NA 2021 Keynote
Advanced Stream Processing with Flink and Pulsar - Pulsar Summit NA 2021 KeynoteAdvanced Stream Processing with Flink and Pulsar - Pulsar Summit NA 2021 Keynote
Advanced Stream Processing with Flink and Pulsar - Pulsar Summit NA 2021 Keynote
 
Big data real time architectures
Big data real time architecturesBig data real time architectures
Big data real time architectures
 
NoSQL
NoSQLNoSQL
NoSQL
 
Elasticsearch와 Python을 이용하여 맨땅에서 데이터 분석하기
Elasticsearch와 Python을 이용하여 맨땅에서 데이터 분석하기Elasticsearch와 Python을 이용하여 맨땅에서 데이터 분석하기
Elasticsearch와 Python을 이용하여 맨땅에서 데이터 분석하기
 
Cloud dw benchmark using tpd-ds( Snowflake vs Redshift vs EMR Hive )
Cloud dw benchmark using tpd-ds( Snowflake vs Redshift vs EMR Hive )Cloud dw benchmark using tpd-ds( Snowflake vs Redshift vs EMR Hive )
Cloud dw benchmark using tpd-ds( Snowflake vs Redshift vs EMR Hive )
 
[NDC18] 야생의 땅 듀랑고의 데이터 엔지니어링 이야기: 로그 시스템 구축 경험 공유
[NDC18] 야생의 땅 듀랑고의 데이터 엔지니어링 이야기: 로그 시스템 구축 경험 공유[NDC18] 야생의 땅 듀랑고의 데이터 엔지니어링 이야기: 로그 시스템 구축 경험 공유
[NDC18] 야생의 땅 듀랑고의 데이터 엔지니어링 이야기: 로그 시스템 구축 경험 공유
 
데브시스터즈 데이터 레이크 구축 이야기 : Data Lake architecture case study (박주홍 데이터 분석 및 인프라 팀...
데브시스터즈 데이터 레이크 구축 이야기 : Data Lake architecture case study (박주홍 데이터 분석 및 인프라 팀...데브시스터즈 데이터 레이크 구축 이야기 : Data Lake architecture case study (박주홍 데이터 분석 및 인프라 팀...
데브시스터즈 데이터 레이크 구축 이야기 : Data Lake architecture case study (박주홍 데이터 분석 및 인프라 팀...
 
Auto Scalable 한 Deep Learning Production 을 위한 AI Serving Infra 구성 및 AI DevOps...
Auto Scalable 한 Deep Learning Production 을 위한 AI Serving Infra 구성 및 AI DevOps...Auto Scalable 한 Deep Learning Production 을 위한 AI Serving Infra 구성 및 AI DevOps...
Auto Scalable 한 Deep Learning Production 을 위한 AI Serving Infra 구성 및 AI DevOps...
 
로그 기깔나게 잘 디자인하는 법
로그 기깔나게 잘 디자인하는 법로그 기깔나게 잘 디자인하는 법
로그 기깔나게 잘 디자인하는 법
 
Soma search
Soma searchSoma search
Soma search
 
백억개의 로그를 모아 검색하고 분석하고 학습도 시켜보자 : 로기스
백억개의 로그를 모아 검색하고 분석하고 학습도 시켜보자 : 로기스백억개의 로그를 모아 검색하고 분석하고 학습도 시켜보자 : 로기스
백억개의 로그를 모아 검색하고 분석하고 학습도 시켜보자 : 로기스
 
ElasticSearch
ElasticSearchElasticSearch
ElasticSearch
 
Apache Beam and Google Cloud Dataflow - IDG - final
Apache Beam and Google Cloud Dataflow - IDG - finalApache Beam and Google Cloud Dataflow - IDG - final
Apache Beam and Google Cloud Dataflow - IDG - final
 
[팝콘 시즌1] 이윤희 : 다짜고짜 배워보는 인과추론
[팝콘 시즌1] 이윤희 : 다짜고짜 배워보는 인과추론[팝콘 시즌1] 이윤희 : 다짜고짜 배워보는 인과추론
[팝콘 시즌1] 이윤희 : 다짜고짜 배워보는 인과추론
 
Graph databases
Graph databasesGraph databases
Graph databases
 
Recurrent Neural Networks for Recommendations and Personalization with Nick P...
Recurrent Neural Networks for Recommendations and Personalization with Nick P...Recurrent Neural Networks for Recommendations and Personalization with Nick P...
Recurrent Neural Networks for Recommendations and Personalization with Nick P...
 
게임을 위한 DynamoDB 사례 및 팁 - 김일호 솔루션즈 아키텍트:: AWS Cloud Track 3 Gaming
게임을 위한 DynamoDB 사례 및 팁 - 김일호 솔루션즈 아키텍트:: AWS Cloud Track 3 Gaming게임을 위한 DynamoDB 사례 및 팁 - 김일호 솔루션즈 아키텍트:: AWS Cloud Track 3 Gaming
게임을 위한 DynamoDB 사례 및 팁 - 김일호 솔루션즈 아키텍트:: AWS Cloud Track 3 Gaming
 
So You Want to Write a Connector?
So You Want to Write a Connector? So You Want to Write a Connector?
So You Want to Write a Connector?
 
Sql vs NoSQL
Sql vs NoSQLSql vs NoSQL
Sql vs NoSQL
 

Similar to RTB Bid Landscape in Adform

RTBMA ECIR 2016 tutorial
RTBMA ECIR 2016 tutorialRTBMA ECIR 2016 tutorial
RTBMA ECIR 2016 tutorial
Shuai Yuan
 
Machine learning for profit: Computational advertising landscape
Machine learning for profit: Computational advertising landscapeMachine learning for profit: Computational advertising landscape
Machine learning for profit: Computational advertising landscape
Sharat Chikkerur
 
Fast Data Driving Personalization - Nick Gorski
Fast Data Driving Personalization - Nick GorskiFast Data Driving Personalization - Nick Gorski
Fast Data Driving Personalization - Nick Gorski
Hakka Labs
 
Computational Advertising in Yelp Local Ads
Computational Advertising in Yelp Local AdsComputational Advertising in Yelp Local Ads
Computational Advertising in Yelp Local Ads
soupsranjan
 
Reel bid insightd-eproject
Reel bid   insightd-eprojectReel bid   insightd-eproject
Reel bid insightd-eproject
Mudit Uppal
 
Weinan Zhang's KDD15 Talk: Statistical Arbitrage Mining for Display Advertising
Weinan Zhang's KDD15 Talk: Statistical Arbitrage Mining for Display AdvertisingWeinan Zhang's KDD15 Talk: Statistical Arbitrage Mining for Display Advertising
Weinan Zhang's KDD15 Talk: Statistical Arbitrage Mining for Display Advertising
Jun Wang
 
A History of Programmatic Media
A History of Programmatic MediaA History of Programmatic Media
A History of Programmatic Media
The Media Kitchen
 
Machine Learning for Computational Advertising
Machine Learning for Computational AdvertisingMachine Learning for Computational Advertising
Machine Learning for Computational Advertising
Paris Women in Machine Learning and Data Science
 
TripleLift: Preparing for a New Programmatic Ad-Tech World
TripleLift: Preparing for a New Programmatic Ad-Tech WorldTripleLift: Preparing for a New Programmatic Ad-Tech World
TripleLift: Preparing for a New Programmatic Ad-Tech World
VoltDB
 
Wsdm2015
Wsdm2015Wsdm2015
Wsdm2015
Jun Wang
 
Ad science bid simulator (public ver)
Ad science bid simulator (public ver)Ad science bid simulator (public ver)
Ad science bid simulator (public ver)
Marsan Ma
 
Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Predi...
Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Predi...Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Predi...
Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Predi...
MLconf
 
Big Data in Advertising Industry — Oleksandr Fedirko, Danylo Stepanchuk
Big Data in Advertising Industry — Oleksandr Fedirko, Danylo StepanchukBig Data in Advertising Industry — Oleksandr Fedirko, Danylo Stepanchuk
Big Data in Advertising Industry — Oleksandr Fedirko, Danylo Stepanchuk
GlobalLogic Ukraine
 
Machine Learning for Performance Advertising
Machine Learning for Performance AdvertisingMachine Learning for Performance Advertising
Machine Learning for Performance Advertising
Eustache Diemert
 
Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...
Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...
Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...
Databricks
 
Programmatic advertising - overview
Programmatic advertising - overviewProgrammatic advertising - overview
Programmatic advertising - overviewJim Murray Jones
 
Int'l Conference on Predictive APIs: RTB Optimizer presentation
Int'l Conference on Predictive APIs: RTB Optimizer presentationInt'l Conference on Predictive APIs: RTB Optimizer presentation
Int'l Conference on Predictive APIs: RTB Optimizer presentation
Datacratic
 
Understanding Natural Language Instructions for Fetching Daily Objects Using ...
Understanding Natural Language Instructions for Fetching Daily Objects Using ...Understanding Natural Language Instructions for Fetching Daily Objects Using ...
Understanding Natural Language Instructions for Fetching Daily Objects Using ...
Aly Magassouba
 
Nicolas Kruchten @ Datacratic
Nicolas Kruchten @ DatacraticNicolas Kruchten @ Datacratic
Nicolas Kruchten @ Datacratic
PAPIs.io
 
A Brief Introduction of Real-time Bidding Display Advertising and Evaluation ...
A Brief Introduction of Real-time Bidding Display Advertising and Evaluation ...A Brief Introduction of Real-time Bidding Display Advertising and Evaluation ...
A Brief Introduction of Real-time Bidding Display Advertising and Evaluation ...Jun Wang
 

Similar to RTB Bid Landscape in Adform (20)

RTBMA ECIR 2016 tutorial
RTBMA ECIR 2016 tutorialRTBMA ECIR 2016 tutorial
RTBMA ECIR 2016 tutorial
 
Machine learning for profit: Computational advertising landscape
Machine learning for profit: Computational advertising landscapeMachine learning for profit: Computational advertising landscape
Machine learning for profit: Computational advertising landscape
 
Fast Data Driving Personalization - Nick Gorski
Fast Data Driving Personalization - Nick GorskiFast Data Driving Personalization - Nick Gorski
Fast Data Driving Personalization - Nick Gorski
 
Computational Advertising in Yelp Local Ads
Computational Advertising in Yelp Local AdsComputational Advertising in Yelp Local Ads
Computational Advertising in Yelp Local Ads
 
Reel bid insightd-eproject
Reel bid   insightd-eprojectReel bid   insightd-eproject
Reel bid insightd-eproject
 
Weinan Zhang's KDD15 Talk: Statistical Arbitrage Mining for Display Advertising
Weinan Zhang's KDD15 Talk: Statistical Arbitrage Mining for Display AdvertisingWeinan Zhang's KDD15 Talk: Statistical Arbitrage Mining for Display Advertising
Weinan Zhang's KDD15 Talk: Statistical Arbitrage Mining for Display Advertising
 
A History of Programmatic Media
A History of Programmatic MediaA History of Programmatic Media
A History of Programmatic Media
 
Machine Learning for Computational Advertising
Machine Learning for Computational AdvertisingMachine Learning for Computational Advertising
Machine Learning for Computational Advertising
 
TripleLift: Preparing for a New Programmatic Ad-Tech World
TripleLift: Preparing for a New Programmatic Ad-Tech WorldTripleLift: Preparing for a New Programmatic Ad-Tech World
TripleLift: Preparing for a New Programmatic Ad-Tech World
 
Wsdm2015
Wsdm2015Wsdm2015
Wsdm2015
 
Ad science bid simulator (public ver)
Ad science bid simulator (public ver)Ad science bid simulator (public ver)
Ad science bid simulator (public ver)
 
Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Predi...
Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Predi...Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Predi...
Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Predi...
 
Big Data in Advertising Industry — Oleksandr Fedirko, Danylo Stepanchuk
Big Data in Advertising Industry — Oleksandr Fedirko, Danylo StepanchukBig Data in Advertising Industry — Oleksandr Fedirko, Danylo Stepanchuk
Big Data in Advertising Industry — Oleksandr Fedirko, Danylo Stepanchuk
 
Machine Learning for Performance Advertising
Machine Learning for Performance AdvertisingMachine Learning for Performance Advertising
Machine Learning for Performance Advertising
 
Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...
Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...
Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...
 
Programmatic advertising - overview
Programmatic advertising - overviewProgrammatic advertising - overview
Programmatic advertising - overview
 
Int'l Conference on Predictive APIs: RTB Optimizer presentation
Int'l Conference on Predictive APIs: RTB Optimizer presentationInt'l Conference on Predictive APIs: RTB Optimizer presentation
Int'l Conference on Predictive APIs: RTB Optimizer presentation
 
Understanding Natural Language Instructions for Fetching Daily Objects Using ...
Understanding Natural Language Instructions for Fetching Daily Objects Using ...Understanding Natural Language Instructions for Fetching Daily Objects Using ...
Understanding Natural Language Instructions for Fetching Daily Objects Using ...
 
Nicolas Kruchten @ Datacratic
Nicolas Kruchten @ DatacraticNicolas Kruchten @ Datacratic
Nicolas Kruchten @ Datacratic
 
A Brief Introduction of Real-time Bidding Display Advertising and Evaluation ...
A Brief Introduction of Real-time Bidding Display Advertising and Evaluation ...A Brief Introduction of Real-time Bidding Display Advertising and Evaluation ...
A Brief Introduction of Real-time Bidding Display Advertising and Evaluation ...
 

Recently uploaded

Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Product School
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
CatarinaPereira64715
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
Product School
 

Recently uploaded (20)

Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 

RTB Bid Landscape in Adform

  • 1. RTB Bid Landscape survival models powered by bitmaps Łukasz Mączewski Senior Data Scientist Przemysław Piotrowski Senior Data Engineer AI&Analytics, Adform 14.05.2019 1
  • 2. Agenda • Real Time Bidding – RTB • Bidding Strategy • Bid Landscape • Bitmap population selection • Survival analysis • Validation • Product demo 2
  • 3. Real-Time Bidding (RTB) RTB allows for fully automated ad inventory selling / buying in a real-time 3 Advertiser 1 Advertiser 2 Advertiser 3 Advertiser n AudienceInventory Auction Demand Side Platform is to estimate optimal bid price according to: • Targeting (audience) • Business requirements (KPIs) • Ad inventory (context, ad slot price) • Campaign budget and it's spend plan
  • 4. First price vs. second price auction First price auction: - Pay exact amount that was bidded - Bid price is close to a real impression value - Seller revenue maximization 4 Second price auction: - Pay 0.00001$ more than second-highest bid price - Bid as high as possible to maximize winning probability In the past the second price auction was preferred, but currently the first price auction becomes more and more popular
  • 5. RTB ecosystem 5 diverse audience web page Publisher 1 and its inventory SSP 1 Supply Side Platform Publisher n and its inventory . . . Ad Exchange DMP Data Management Platform DSP 1 Demand Side Platform SSP n DSP n . . . advertisers demographics, browsing, interest segments data cookie profile data feedback data ad-slot / banner data campaign data Bidding strategy optimisation Time to bid is ~100 ms
  • 6. Bidding strategy optimisation 6 Bidding Strategy Optimization Candidate impression valuation with respect to KPIs and budget constraints DSP algorithms Evaluation of bid request business potential Bid Landscape Estimation of winning probability at bid price Budget throttle Control budget spend Optimal bid price
  • 7. Numbers 7 Adform provides an integrated Software as a Service platform for the buying, managing and serving of digital advertising. Presence in EU, US, APAC 3 millions bid request per second billions of engagements daily 5 millions scoring of algorithms per second 200 thousands of cookie profiles in DMP
  • 9. Discovering knowledge from massive RTB data 9 Cookie profile/ Targeting: • device type • os type • browser type • screen size • country • regions • city • clicks made • visited log points • visited domains • urls • IAB categories • ... Feedback: • acquisition/conversion • click • order status • revenue • viewability • ... Adslot/ banners/ creatives: • inventory name • domain • URL • banner position • banner size • banner type • floor price • video player properties • ... Data Management Platform: • demography • interests and preferences • profession • ... Campaign: • budget • pricing type • deals • targeting • brand safety • ...
  • 10. Discovering knowledge from massive RTB data 10 Huge class imbalance Mostly categorical data High cardinality Sparsity High risk of overfit Data summary Reduction of dimensionality Preselection of features RegularisationData sampling Cross validationEnsembling
  • 11. RTB models 11 CPC Cost per Click CPL Cost per Lead netCPL Confirmed lead CPV Cost per Viewability ROAS Return on Ad Spend VCR Video Completion Rate
  • 12. Adform machine learning pipeline 12 Train/Test data sets preparation (dedicated ETLs) Data preprocessing and feature engineering Feature selection Building model and hyperparameters optimization Offline backtests Online A/B tests Deployment Live monitoring After successful model deployment it is periodically retrained
  • 13. RTB Algorithms Pipeline 13 PricingAPI Train Data JAR JAR OFFLINE JAR JAR ONLINE JAR ONLINE SPEC ONLINE MONITORING
  • 14. Backtests • Offline procedure to check performance of two competing models in time • Sequential analysis of historical data: - Data divided into training and test sets • Time series analysis of critical performance KPIs: - AUC - Sparsity - Target rates • If the backtests succeed the algorithm is submitted to online A/B testing procedure 14 AUC SPARSITY Target rate Time
  • 15. A/B test online monitoring 15 Split traffic to hash(cookie_id)%1024 buckets For each experiment monitor • Cost per target • Target rate • Target count • Number of impressions • ... many more
  • 16.
  • 18. User Stories As a Trafficker I want to know volume of impressions my campaign would win in RTB given the selected inventory, targeting and budget settings (CPM). As a DSP I want to enable Advertisers winning impressions for lowest possible price. 18 Bid Shading Demand Side Platform protects Advertiser from overpaying while AdTech ecosystem migrating from second-price to first-price RTB auctions.
  • 19. Forecasting & Bid Landscape Forecasting • Total number of matching Bid Requests for given Campaign Targeting • Bid Opportunities 19 Bid Landscape • Predicting probability of winning auction with a given bid price and Campaign Settings • Win rate evolution with a bid price Impressions CPM
  • 20. Bid Landscape • Building Bid Landscape model using historical data regarding won and failed auctions: For lost auctions win price is unknown • Right Censored Data Survival Analysis The exact moment of failure for majority of objects is unknown 20 All Bids Won Bids Relative Rate Bid Price Buckets
  • 21. Population selection for Bid Landscape modelling 21
  • 22. The Data 22 Input Bids and Impressions with features Impression is a Bid that won the auction 100000 2000 5000 100 4000 3000 ... 4500 Output cardinality of selected (Campaign Settings) Bids bucketed by bid price Impressions bucketed by bid price Impressions bucketed by win price 100 buckets in our case How? Population Selection – select relevant population from high-dimensional space for survival analysis. source:biretail.com
  • 24. Nested queries on millions of features 24 waw 1 0 0 0 1 0 0 1 … 0 iOS 0 0 1 0 1 1 0 1 … 0 11:00 1 1 0 0 1 0 0 1 … 0 Bid 0 1 2 3 4 5 6 7 … 100mln Matching bids 0 0 0 0 1 0 0 1 … 0 BidPriceBuckets 9900 790 2000 …. 100 Counts 3 mln other features AND, OR, NOT, XOR
  • 25. 25 Buckets' cardinality 10000 3000 ... 4500 2000 100 ... 0 1000 800 ... 400 Survival Analysis
  • 26. Roaring Bitmap 26 Roaring Bitmap – state of the art compressed bitmap algorithm. Hybrid approach between sparse and dense bitmaps representation. roaringbitmap.org/about/ „Use Roaring for bitmap compression whenever possible. Do not use other bitmap compression methods” [1] [1] Wang, Jianguo, et al. "An experimental study of bitmap compression vs. inverted list compression." 2017.
  • 27. Bitmap Execution Database 27 • Roaring Bitmap – state of the art compressed bitmap algorithm • Whole 7 days of Internet bid opportunities – 12GB (sample 1/1000) Naive 50TB • Bids&Impressions – 5GB (sample 1/100) • Majority of responses under 1 second
  • 28. Nested set operations language 28 And( Or("inventory|1", "inventory|2", "inventory|3"), AndNot( Universe, Or("category|moto", "category|fishing") ), Or( "hour_of_week|41", "hour_of_week|42", "hour_of_week|43", "hour_of_week|44" ), Or("tld|com", "website|example.com"), And( "country|poland", AndNot( Universe, "city|warsaw" ) ), ... )
  • 30. Principles of survival analysis • The basic assumption in survival analysis is that the risk of failure changes with time • Two major approaches for survival probability modelling can be distinguished: - Parametric – can be extrapolated for new experiences (Weibull) - Non-parametric – very flexible but can't be extended for new experience (Kaplan-Maier) 30 FailureRate
  • 31. Data censoring in survival analysis 31
  • 32. Weibull based survival probability estimation • Weibull distribution is derived from hazard function: where: - S(t) - survival function (cdf) - F(t) = 1 – S(t) - failure function (cdf) - f(t) = dF(t)/dt - pdf • Weibull distribution is defined by 2 parameters: - Shape parameter p - failure mode type - Scale parameter l - position 32 Time Survive . 0 t ∆t Fail .
  • 33. Weibull based survival probability estimation • One can distinguish 3 classes of failure types: - Infant mortality p < 1 - Random failures p = 1 - Ware out p > 1 33 Weibull examples Parametric approach easy to extrapolate age Hazard
  • 34. Kaplan-Maier based survival probability estimation • In Kaplan-Meier method we are to predict chance of surviving time ∆t after surviving time t: 34 No parameters to be estimated, very flexible and easy in implementation S(t,Δt) = S(t) • S(Δt|t) Time Survive . 0 t ∆t Survive.
  • 35. Kaplan-Meier curve calculation 35 S(t,Δt) = S(t) • S(Δt|t) Time (weeks) KM Curve for Group 1 (treatment)
  • 36. Elements of survival analysis in Bid Landscape modelling 36
  • 37. RTB auctions modelling with survival methods From survival models to Bid Landscape: - time -> cost per 1k bought impressions (CPM) - died -> won bid (left censored, known win price) - alive -> lost bid (right censored, unknown exact win price) 37 our win price (second price auction) competitor outbid price (unknown) our bid price
  • 38. Bid Landscape vs. Survival Analysis Survival analysis • Risk of failure evolves with time • Incomplete knowledge regarding failure time, right censoring: - Majority of object survive • Multiple competing failure modes • Very high disproportion between positive (failures) and negative (survived) class representatives • Usually very limited data • Wrong predictions may have catastrophic consequences – quality monitoring required 38 Bid Landscape analysis • Winning probability evolves with price • Incomplete knowledge regarding win price, right censoring: - Only small fraction of bids are won • Different inventory sources • Very high disproportion between positive (won) and negative (lost) class representatives • Usually huge data sets – subsampling • Wrong predictions can lead to not efficient campaign money spend – quality monitoring required
  • 39. Survival model for Bid Landscape modelling 39 • Let's apply survival techniques to auction data: - Instead of time a bid price is to be used • Legend: • number of lost auctions • number of won impressions - win price CDF - Kaplan-Maier estimate - Weibull estimate Adform Bid Landscape exploits Kaplan-Maier estimate exclusively Bid Price (CPM) Multiplicity WinRate
  • 40. Bid Landscape validation in Adform 40
  • 41. Bid Landscape validation – qualitative 41 Win rate evolution with bid price Win rate evolution over day The bid landscape predictions are in line with the intuition: • The win rate increases with bid price • The win rate changes periodically over a day
  • 42. Bid Landscape validation – quantitative (real RTB data) 42 Real auction data versus Bid Landscape predictions
  • 43. Bid Landscape as Adform product 43
  • 47. Bid Multipliers optimisation 47 Polish accountants' portal Jobs for professionals Video oriented website Impressions Source of traffic: • Campaign targeting Warsaw people to be run on three different domains • Global CPM makes no sense, needs to be adjusted
  • 48. Bid Landscape dynamic CPM algorithm 48 Given the Campaign Settings including hourly budget maximize bought relevant impressions by using lowest CPM. First tests are showing multiplied Viewable Impressions, Clicks comparing to same Campaign using static price.
  • 49. Summary and conclusions • The Adform Bid Landscape is composed of two base elements ensuring speed and quality: - Bitmaps – fast and efficient population selection - Kaplan-Maier win rates – simple and precise estimate • The Adform Bid Landscape is in production for more than one year - It is mostly used by Adform customers optimising the CPM campaigns - Currently we develop new RTB algorithm that is based on BL predictions - In the short time perspective we are to extend usage of BL in other RTB models 49
  • 50. 5