SlideShare a Scribd company logo
1 of 101
Download to read offline
Real-Time Bidding based Display Advertising:
Mechanisms and Algorithms
Shuai Yuan, MediaGamma Ltd
Weinan Zhang, UCL
Jun Wang, UCL
Shuai.yuan@mediagamma.com
{w.zhang, j.wang}@cs.ucl.ac.uk
ECIR16 Tutorial
Table of contents
• RTB system
• Auction mechanisms
• User response estimation
• Conversion attribution
• Learning to bid
• Data Management Platform (DMP) techniques
• Floor price optimisation
• Fighting against fraud
Table of contents
• RTB system
• Auction mechanisms
• User response estimation
• Conversion attribution
• Learning to bid
• Data Management Platform (DMP) techniques
• Floor price optimisation
• Fighting against fraud
Advertising
• Make the best match between and
with
- John Wanamaker
(1838-1922)
Father of modern advertising
and a pioneer in marketing
Wasteful Traditional Advertising
Computational Advertising
• Design algorithms to make the best match between the
advertisers and Internet users with economic constraints
Sponsored Search
Search: iphone 6s case
Sponsored Search
• Advertiser sets a bid price for the keyword
• User searches the keyword
• Search engine hosts the auction to ranking the ads
Display Advertising
http://www.nytimes.com/
Internet Advertising Frontier:
Real-Time Bidding (RTB) based Display Advertising
What is Real-Time Bidding?
• Every online ad view can be evaluated, bought,
and sold, all individually, and all instantaneously.
• Instead of buying keywords or a bundle of ad
views, advertisers are now buying users directly.
DSP/Exchange daily traffic
Advertising iPinYou, China 18 billion impressions
YOYI, China 5 billion impressions
Fikisu, US 32 billon impressions
Finance New York Stock Exchange 12 billion shares daily
Shanghai Stock Exchange 14 billion shares daily
Query per second
Turn DSP 1.6 million
Google 40,000 search
Shen, Jianqiang, et al. "From 0.5 Million to 2.5 Million: Efficiently Scaling up Real-Time Bidding." Data Mining (ICDM), 2015 IEEE International
Conference on. IEEE, 2015.
Suppose a student regularly reads articles on emarketer.com
Content-related ads
He recently checked the London hotels
(In fact, no login is required)
Relevant ads on facebook.com
Even on supervisor’s homepage!
(User targeting dominates the context)
RTB Display Advertising Mechanism
• Buying ads via real-time bidding (RTB), 10B per day
RTB
Ad
Exchange
Demand-Side
Platform
Advertiser
Data
Management
Platform
0. Ad Request
1. Bid Request
(user, page, context)
2. Bid Response
(ad, bid price)
3. Ad Auction
4. Win Notice
(charged price)
5. Ad
(with tracking)
6. User Feedback
(click, conversion)
User Information
User Demography:
Male, 26, Student
User Segmentations:
London, travelling
Page
User
<100 ms
Table of contents
• RTB system
• Auction mechanisms
• User response estimation
• Conversion attribution
• Learning to bid
• Data Management Platform (DMP) techniques
• Floor price optimisation
• Fighting against fraud
Auctions scheme
v1
v2
v3
v4
b1
b2
b3
b4
private values bids
winner
payments $$$
Modeling
• n bidders
• Each bidder has value vi for the item
– “willingness to pay”
– Known only to him – “private value”
• If bidder i wins and pays pi, his utility is vi – pi
– In addition, the utility is 0 when the bidder loses.
• Note: bidders prefer losing than paying more than their
value.
Strategy
• A strategy for each bidder
– how to bid given your intrinsic, private value?
– a strategy here is a function, a plan for the game.
Not just a bid.
• Examples for strategies:
– bi(vi) = vi (truthful)
– bi(vi) = vi /2
– bi(vi) = vi /n
– If v<50, bi(vi) = vi
otherwise, bi(vi) = vi +17
• Can be modeled as normal form game, where these
strategies are the pure strategies.
• Example for a game with incomplete information.
B(v)=v B(v)=v
/2
B(v)=v
/n
….
B(v)=v
…
Strategies and equilibrium
• An equilibrium in the auction is a profile of
strategies B1,B2,…,Bn such that:
– Dominant strategy equilibrium: each strategy is optimal
whatever the other strategies are.
– Nash equilibrium: each strategy is a best response to the
other strategies.
B(v)=v B(v)=v/2 B(v)=v/n ….
B(v)=v
…
Bayes-Nash equilibrium
• Recall a set of bidding strategies is a Nash
equilibrium if each bidder’s strategy
maximizes his payoff given the optimal
strategies of the others.
– In auctions: bidders do not know their opponent’s
values, i.e., there is incomplete information.
– Each bidder’s strategy must maximize her
expected payoff accounting for the uncertainty
about opponent values.
1st price auctions
• Truthful(bi = vi)?
$30 $100
$31
NO!
Equilibrium in 2rd-price auctions
Reserve Prices and Entry Fees
• Reserve Prices: the seller is assumed to have
committed to not selling below the reserve
– Reserve prices are assumed to be known to all bidders
– The reserve prices = the minimum bids
• Entry Fees: those bidders who enter have to pay
the entry fee to the seller
• They reduce bidders’ incentives to participate,
but they might increase revenue as 1) the seller
collects extra revenues 2) bidders might bid more
aggressively
Table of contents
• RTB system
• Auction mechanisms
• User response estimation
• Conversion attribution
• Learning to bid
• Data Management Platform (DMP) techniques
• Floor price optimisation
• Fighting against fraud
RTB Display Advertising Mechanism
• Buying ads via real-time bidding (RTB), 10B per day
RTB
Ad
Exchange
Demand-Side
Platform
Advertiser
Data
Management
Platform
0. Ad Request
1. Bid Request
(user, page, context)
2. Bid Response
(ad, bid price)
3. Ad Auction
4. Win Notice
(charged price)
5. Ad
(with tracking)
6. User Feedback
(click, conversion)
User Information
User Demography:
Male, 26, Student
User Segmentations:
London, travelling
Page
User
<100 ms
Predict how likely the user is going to
click the displayed ad.
User response estimation problem
• Click-through rate estimation as an example
• Date: 20160320
• Hour: 14
• Weekday: 7
• IP: 119.163.222.*
• Region: England
• City: London
• Country: UK
• Ad Exchange: Google
• Domain: yahoo.co.uk
• URL: http://www.yahoo.co.uk/abc/xyz.html
• OS: Windows
• Browser: Chrome
• Ad size: 300*250
• Ad ID: a1890
• User tags: Sports, Electronics
Click (1) or not (0)?
Predicted CTR (0.15)
Feature Representation
• Binary one-hot encoding of categorical data
x=[Weekday=Wednesday, Gender=Male, City=London]
x=[0,0,1,0,0,0,0 0,1 0,0,1,0…0]
High dimensional sparse binary feature vector
Linear Models
• Logistic Regression
– With SGD learning
– Sparse solution
• Online Bayesian Profit Regression
ML Framework of CTR Estimation
• A binary regression problem
– Large binary feature space (>10 millions)
• Bloom filter to detect and add new features (e.g., > 5 instances)
– Large data instance number (>10 millions daily)
– A seriously unbalanced label
• Normally, #click/#non-click = 0.3%
• Negative down sampling
• Calibration
Logistic Regression
• Prediction
• Cross Entropy Loss
• Stochastic Gradient Descent Learning
[Lee et al. Estimating Conversion Rate in Display Advertising from Past Performance Data. KDD 12]
Logistic Regression with SGD
• Pros
– Standardised, easily understood and implemented
– Easy to be parallelised
• Cons
– Learning rate η initialisation
– Uniform learning rate against different binary
features
Logistic Regression with FTRL
• In practice, we need a sparse solution as >10 million feature dimensions
• Follow-The-Regularised-Leader (FTRL) online Learning
[McMahan et al. Ad Click Prediction : a View from the Trenches. KDD 13]
s.t.
• Online closed-form update of FTRL
t: current example index
gs: gradient for example t
adaptively selects
regularization functions
[Xiao, Lin. "Dual averaging method for regularized stochastic learning and online optimization." Advances in Neural Information Processing Systems. 2009]
Online Bayesian Probit Regression
to approximate message passing. In order
full factorial structure of the likelihood,
wo latent variables , and consider the
density function which
(5)
n can be understood in terms of the
ative process, which is also reflected in
in Figure 1.
Sample weights from the Gaussian
alculate the score for x as the inner
, such that .
dd zero-mean Gaussian noise to obtain
h that .
etermine by a threshold on the noisy
ro, such that .
skills in TrueSkill after a hypothetical m
team with known skill of zero. Given the
Figure 1 together with Table 1 in the a
update equations can be derived.
3.3.1. UPDATE EQUATIONS FOR ONLINE
The update equations represent a mappin
posterior parameter values based o
pairs ̃ ̃ . In terms of
calculation can viewed as following the m
schedule towards the weights . We d
variance for a given input as
The update for the posterior parameters is
̃ (
̃ * (
Linear Prediction Models
• Pros
– Highly efficient and scalable
– Explore larger feature space and training data
• Cons
– Modelling limit: feature independence assumption
– Cannot capture feature interactions unless defining
high order combination features
• E.g., hour=10AM & city=London & browser=Chrome
Non-linear Models
• Gradient Boosting Decision Trees
• Factorisation Machines
• Combined Models
• Deep Neural Networks
Factorisation Machines
• Prediction based on feature embedding
– Explicitly model feature interactions
• Second order, third order etc.
– Empirically better than logistic regression
– A new way for user profiling
[Oentaryo et al. Predicting response in mobile advertising with hierarchical importance-
aware factorization machine. WSDM 14]
[Rendle. Factorization machines. ICDM 2010.]
Gradient Boosting Decision Trees
• Additive decision trees for prediction
• Each decision tree
[Chen and He. Higgs Boson Discovery with Boosted Trees . HEPML 2014.]
Gradient Boosting Decision Trees
• Learning
[Chen and He. Higgs Boson Discovery with Boosted Trees . HEPML 2014.]
Combined Models: GBDT + LR
[He et al. Practical Lessons from Predicting Clicks on Ads at Facebook . ADKDD 2014.]
Combined Models: GBDT + FM
[http://www.csie.ntu.edu.tw/~r01922136/kaggle-2014-criteo.pdf]
[Zhang et al. Deep Learning over Multi-field Categorical Data – A Case Study on User
Response Prediction. ECIR 16] in Monday Machine Learning Track
Table of contents
• RTB System
• Auction Mechanisms
• CTR Estimation
• Conversion Attribution
• Learning to Bid
• Data Management Platform (DMP) techniques
• Floor price optimisation
• Fighting against fraud
Conversion Attribution
• Assign credit% to each channel according to contribution
• Current industrial solution: last-touch attribution
[Shao et al. Data-driven multi-touch attribution models. KDD 11]
Heuristics-based Attribution
[Kee. Attribution playbook – google analytics. Online access.]
A Good Attribution Model
• Fairness
– Reward an individual channel in accordance with
its ability to affect the likelihood of conversion
• Data driven
– Using ad touch and conversion data for each
campaign to build its model
• Interpretability
– Generally accepted by all parties
[Dalessandro et al. Casually Motivated Attribution for Online Advertising. ADKDD 11]
Bagged Logistic Regression
• For M iterations
– Sample 50% data instances and 50% features
– Train a logistic regression and record the weights
• Average the feature weights
[Shao et al. Data-driven multi-touch attribution models. KDD 11]
Display Search Mobile Email Social Convert?
1 1 0 0 1 1
1 0 1 1 1 0
0 1 0 1 0 1
0 0 1 1 1 0
Bagged Logistic Regression
• For M iterations
– Sample 50% data instances and 50% features
– Train a logistic regression and record the weights
• Average the feature weights
[Shao et al. Data-driven multi-touch attribution models. KDD 11]
Display Search Mobile Email Social Convert?
1 1 0 0 1 1
1 0 1 1 1 0
0 1 0 1 0 1
0 0 1 1 1 0
Bagged Logistic Regression
• For M iterations
– Sample 50% data instances and 50% features
– Train a logistic regression and record the weights
• Average the feature weights
[Shao et al. Data-driven multi-touch attribution models. KDD 11]
Display Search Mobile Email Social Convert?
1 1 0 0 1 1
1 0 1 1 1 0
0 1 0 1 0 1
0 0 1 1 1 0
Shapley Value based Attribution
• Coalition game
– How much does a player contribute in the game
[Fig source: https://pjdelta.wordpress.com/2014/08/10/group-project-how-much-did-i-contribute/]
Shapley Value based Attribution
• Coalition game
A Probabilistic Attribution Model
• Conditional probabilities
[Shao et al. Data-driven multi-touch attribution models. KDD 11]
• Attributed contribution
[Shao et al. Data-driven multi-touch attribution models. KDD 11]
[Shao et al. Data-driven multi-touch attribution models. KDD 11]
Data-Driven Probabilistic Models
• The “relatively heuristic” data-driven model
[Shao et al. Data-driven multi-touch attribution models. KDD 11]
• A more generalized and data-driven model
[Dalessandro et al. Casually Motivated Attribution for Online Advertising. ADKDD 11]
– : the probability that the sequence begin with
Attribution Comparison
• Help find some “cookie bombing” channels
Other Attribution Models
• Survival models
with time
[Zhang et al. Multi-Touch Attribution in Online
Advertising with Survival Theory. ICDM 2014]
• Markov graph
[Anderl et al. Mapping the customer journey: A graph-based framework for online attribution modeling. SSRN 2014]
Table of contents
• RTB System
• Auction Mechanisms
• CTR Estimation
• Conversion Attribution
• Learning to Bid
• Data Management Platform (DMP) techniques
• Floor price optimisation
• Fighting against fraud
RTB Display Advertising Mechanism
• Buying ads via real-time bidding (RTB), 10B per day
RTB
Ad
Exchange
Demand-Side
Platform
Advertiser
Data
Management
Platform
0. Ad Request
1. Bid Request
(user, page, context)
2. Bid Response
(ad, bid price)
3. Ad Auction
4. Win Notice
(charged price)
5. Ad
(with tracking)
6. User Feedback
(click, conversion)
User Information
User Demography:
Male, 26, Student
User Segmentations:
London, travelling
Page
User
<100 ms
Data of Learning to Bid
– Bid request features: High dimensional sparse binary vector
– Bid: Non-negative real or integer value
– Win: Boolean
– Cost: Non-negative real or integer value
– Feedback: Binary
• Data
Problem Definition of Learning to Bid
• How much to bid for each bid request?
– Find an optimal bidding function b(x)
• Bid to optimise the KPI with budget constraint
Bid Request
(user, ad, page, context)
Bid Price
Bidding
Strategy
Bidding Strategy in Practice
Bid Request
(user, ad,
page, context)
Bid Price
Bidding Strategy
Feature Eng. Whitelist /
Blacklist
Retargeting
Budget
Pacing
Bid
Landscape
Bid
Calculation
Frequency
Capping CTR / CVR
Estimation
Campaign
Pricing
Scheme
64
Bidding Strategy in Practice:
A Quantitative Perspective
Bid Request
(user, ad,
page, context)
Bid Price
Bidding Strategy
Utility
Estimation
Cost
Estimation
Preprocessing
Bidding Function
CTR,
CVR,
revenue
Bid landscape
65
Bid Landscape Forecasting
Auction
Winning
Probability
Win probability: Expected cost:
Count
Win bid
Bid Landscape Forecasting
• Log-Normal Distribution
Auction
Winning
Probability
[Cui et al. Bid Landscape Forecasting in Online Ad Exchange Marketplace. KDD 11]
Bid Landscape Forecasting
• Price Prediction via Linear Regression
– Modelling censored data in lost bid requests
[Wu et al. Predicting Winning Price in Real Time Bidding with Censored Data. KDD 15]
Bidding Strategies
• How much to bid for each bid request?
• Bid to optimise the KPI with budget constraint
Bid Request
(user, ad, page, context)
Bid Price
Bidding
Strategy
Classic Second Price Auctions
• Single item, second price (i.e. pay market price)
Reward given a bid:
Optimal bid:
Bid true value
Truth-telling Bidding Strategies
• Truthful bidding in second-price auction
– Bid the true value of the impression
– Impression true value =
– Averaged impression value = value of click * CTR
– Truth-telling bidding:
[Chen et al. Real-time bidding algorithms for performance-based display ad allocation. KDD 11]
Value of click, if clicked
0, if not clicked
Truth-telling Bidding Strategies
• Pros
– Theoretic soundness
– Easy implementation (very widely used)
• Cons
– Not considering the constraints of
• Campaign lifetime auction volume
• Campaign budget
– Case 1: $1000 budget, 1 auction
– Case 2: $1 budget, 1000 auctions
[Chen et al. Real-time bidding algorithms for performance-based display ad allocation. KDD 11]
Non-truthful Linear Bidding
• Non-truthful linear bidding
– Tune base_bid parameter to maximise KPI
– Bid landscape, campaign volume and budget
indirectly considered
[Perlich et al. Bid Optimizing and Inventory Scoring in Targeted Online Advertising. KDD 12]
ORTB Bidding Strategies
• Direct functional optimisation
CTRwinning function
bidding function
budget
Est. volume cost upperbound
74
[Zhang et al. Optimal real-time bidding for display advertising. KDD 14]
• Solution: Calculus of variations
Optimal Bidding Strategy Solution
75
[Zhang et al. Optimal real-time bidding for display advertising. KDD 14]
Bidding in Multi-Touch Attribution
Mechanism
• Current bidding strategy
– Driven by last-touch attribution b(CVR)
• A new bidding strategy
– Driven by multi-touch attribution
[Xu et al. Lift-Based Bidding in Ad Selection. AAAI 2016.]
Table of contents
• RTB System
• Auction Mechanisms
• CTR Estimation
• Conversion Attribution
• Learning to Bid
• Data Management Platform (DMP) techniques
• Floor price optimisation
• Fighting against fraud
DMP Summary
• What is data management platform
• Cook sync
• Browser fingerprinting
• CF and Lookalike model
What is DMP
(Data Management Platform)
• A data warehouse that stores, merges, and sorts,
and labels it out in a way that’s useful for
marketers, publishers and other businesses.
RTB
Ad
Exchange
Demand-Side
Platform
Advertiser
Data
Management
Platform
0. Ad Request
1. Bid Request
(user, page, context)
2. Bid Response
(ad, bid price)
3. Ad Auction
4. Win Notice
(charged price)
5. Ad
(with tracking)
6. User Feedback
(click, conversion)
User Information
User Demography:
Male, 26, Student
User Segmentations:
London, travelling
User
<100 ms
WebPage
Cookie sync: merging audience data
https://freedom-to-tinker.com/blog/englehardt/the-hidden-perils-of-cookie-syncing/
When a user visits a site (e.g. ABC.com) including A.com as a third-party tracker.
(1) The browser makes a request to A.com, and included in this request is the tracking
cookie set by A.com.
(2) A.com retrieves its tracking ID from the cookie, and redirects the browser to B.com,
encoding the tracking ID into the URL.
(3) The browser then makes a request to B.com, which includes the full URL A.com
redirected to as well as B.com’s tracking cookie.
(4) B.com can then link its ID for the user to A.com’s ID for the user2
Browser fingerprinting
• A device fingerprint or
browser fingerprint is
information collected
about the remote
computing device for the
purpose of identifying the
user
• Fingerprints can be used to
fully or partially identify
individual users or devices
even when cookies are
turned off.
Eckersley, Peter. "How unique is your web browser?." Privacy Enhancing Technologies. Springer Berlin
Heidelberg, 2010.
Acar, Gunes, et al. "The web never forgets: Persistent tracking mechanisms in the wild." Proceedings of
the 2014 ACM SIGSAC Conference on Computer and Communications Security. ACM, 2014.
94.2% of browsers with Flash or
Java were unique in a study
User segmentation and Behavioural
Targeting
• Behavioural targeting helps online advertising
• From user – documents to user – topics
– Latent Semantic Analysis / Latent Dirichlet
Allocation
J Yan, et al., How much can behavioral targeting help online advertising? WWW 2009
X Wu, et al., Probabilistic latent semantic user segmentation for behavioral targeted advertising,
Intelligence for Advertising 2009
User Topic Term
Lookalike modelling
• Lookalike modeling: finding new people who
behave like current customers (converted)
Zhang, Weinan, Lingxi Chen, and Jun Wang. "Implicit Look-alike Modelling in Display Ads:
Transfer Collaborative Filtering to CTR Estimation." ECIR (2016).
Transferred lookalike
Using web browsing data, which is largely available, to infer the ad clicks
Zhang, Weinan, Lingxi Chen, and Jun Wang. "Implicit Look-alike Modelling in Display Ads: Transfer
Collaborative Filtering to CTR Estimation." ECIR (2016). In Wednesday Information Filtering Track
Table of contents
• RTB System
• Auction Mechanisms
• CTR Estimation
• Conversion Attribution
• Learning to Bid
• Data Management Platform (DMP) techniques
• Floor price optimisation
• Fighting against fraud
Reserve price optimisation
The task:
• To find the optimal reserve prices
The challenge:
• Practical constraints v.s common assumptions (bids’ distribution, bidding private values,
etc.)
S Yuan et al., An Empirical Study of Reserve Price Optimisation in Display Advertising, 2014
Why
• Suppose it is second price auction
– Normal case: 𝑏2 ≥ 𝛼
– Preferable case: 𝑏1 ≥ 𝛼 > 𝑏2 (it increases the revenue)
– Undesirable case: 𝛼 > 𝑏1 (but there is risk)
• Suppose: two bidders, private values drawn from
Uniform[0, 1]
• Without a reserve price (or 𝑎 = 0), the payoff 𝑟 is:
• With 𝑎 = 0.2:
• With 𝑎 = 0.5:
• With 𝑎 = 0.6:
An example
Ostrovsky and Schwarz , Reserve prices in internet advertising auctions: A field experiment, 2011
𝑟 = 𝐸 min 𝑏1, 𝑏2 = 0.33
𝑟 = 𝐸 min 𝑏1, 𝑏2 𝑏1 > 0.5, 𝑏2 > 0.5 + 0.5 × 0.5 = 0.42
𝑟 = 𝐸 min 𝑏1, 𝑏2 𝑏1 > 0.2, 𝑏2 > 0.2 + 0.32 × 0.2 = 0.36
𝑟 = 𝐸 min 𝑏1, 𝑏2 𝑏1 > 0.6, 𝑏2 > 0.6 + 0.6 × 0.4 × 2 × 0.6 = 0.405
Paying the second highest price Paying the reserve price
The optimal auction theory
• In the second price auctions, advertisers bid their private
values [𝑏1, … , 𝑏 𝐾]
• Private values -> Bids’ distributions
– Uniform
– Log-normal
• The publisher also has a private value 𝑉𝑝
• The optimal reserve price is given by:
Levin and Smith, Optimal Reservation Prices in Auctions, 1996
𝐹 𝒃 = 𝐹1 𝑏1 × ⋯ × 𝐹 𝐾(𝑏 𝐾)
𝛼 −
1 − 𝐹 𝒃
𝐹′ 𝒃
− 𝑉𝑝 = 0
Results from a field experiment
• On Yahoo! Sponsored search
• Using the Optimal Auction Theory
Mixed results
Ostrovsky and Schwarz , Reserve prices in internet advertising auctions: A field experiment, 2011
1) Expected payoff of advertiser, publisher
2) Payoff for the advertiser could be negative if one
has been bidding the max price
(𝑎 𝑤1: to increase 𝑏1 so that 𝑏1 ≥ 𝛼) 3) One won’t do that,
so discounted publisher’s payoff
The continuous bidding activity
The unchanged bidding pattern
The unchanged budget allocation
Anoutlier
(Triggeredbysomerandomaction)
S Yuan et al., An Empirical Study of Reserve Price Optimisation in Display Advertising, 2014
Table of contents
• RTB System
• Auction Mechanisms
• CTR Estimation
• Conversion Attribution
• Learning to Bid
• Data Management Platform (DMP) techniques
• Floor price optimisation
• Fighting against fraud
Fighting publisher fraud
• Non intentional traffic (NIT) / Non human traffic
– Web scrapers / crawlers
– Hacking tools
– Botnet
– Much of the spurious traffic is created by human
but without users’ knowledge
A Serious Problem
Dave Jakubowski, Head of Ad Tech, Facebook, March 2016
The Old Fashion Way
– Put the police on the street
– Manually eyeball the webpage
– Verify the address on the Google map
– Follow how the money flows
– This approach just can’t scale and is not
sustainable
Possible Solutions
– Rules
– Anomaly detection
– Classification algorithm
• Tricky to obtain negative samples
– Clustering algorithm
• Bots could display dramatically different behavior
– Content Analysis
• Fraudulent websites often scrape content from each
other or legit websites
Co-Visitation Networks
– Key observation:
• Even the major sites only share at most 20% cookieID
within a few hours, let alone those long tail sites.
– Define a graph:
• Node: site
• Weighted edge:
user overlap ratio of
two sites
– Cluster this weighted undirected graph
– Fraud: big cluster with long tail sites
O Stitelman, et al., Using Co-Visitation Networks For Classifying Non-Intentional Traffic, KDD 2013
O Stitelman, et al., Using Co-Visitation Networks For Classifying Non-Intentional Traffic, KDD 2013
December 2011 Co-visitation Network where and edge indicates at least 50%
overlap be- tween the browsers of both websites
O Stitelman, et al., Using Co-Visitation Networks For Classifying Non-Intentional Traffic, KDD 2013
Thank You
• RTB system
• Auction mechanisms
• User response estimation
• Conversion attribution
• Learning to bid
• Data Management Platform (DMP) techniques
• Floor price optimisation
• Fighting against fraud
Real-Time Bidding based Display Advertising:
Mechanisms and Algorithms

More Related Content

What's hot

Real time bidding (DSP, SSP, DMP, ATD, ITD)
Real time bidding (DSP, SSP, DMP, ATD, ITD)Real time bidding (DSP, SSP, DMP, ATD, ITD)
Real time bidding (DSP, SSP, DMP, ATD, ITD)Mutlu Dogus Yildirim
 
What is Real-Time Bidding (RTB)?
What is Real-Time Bidding (RTB)?What is Real-Time Bidding (RTB)?
What is Real-Time Bidding (RTB)?Search Laboratory
 
Project Lotus Intro Deck Aug
Project Lotus Intro Deck AugProject Lotus Intro Deck Aug
Project Lotus Intro Deck Augmartinsunwenhua
 
6 RTB secrets no-one tells you about
6 RTB secrets no-one tells you about6 RTB secrets no-one tells you about
6 RTB secrets no-one tells you aboutHUBRUS
 
Computational Advertising by Icha
Computational Advertising by IchaComputational Advertising by Icha
Computational Advertising by IchaAgate Studio
 
Data-Driven Reserve Prices for Social Advertising Auctions at LinkedIn
Data-Driven Reserve Prices for Social Advertising Auctions at LinkedInData-Driven Reserve Prices for Social Advertising Auctions at LinkedIn
Data-Driven Reserve Prices for Social Advertising Auctions at LinkedInKun Liu
 
The Programmatic Jargon Buster, 3rd Edition
The Programmatic Jargon Buster, 3rd EditionThe Programmatic Jargon Buster, 3rd Edition
The Programmatic Jargon Buster, 3rd EditionSociomantic Labs
 
Demand Side Platforms: Silver Bullet or Fog of War?
Demand Side Platforms: Silver Bullet or Fog of War?Demand Side Platforms: Silver Bullet or Fog of War?
Demand Side Platforms: Silver Bullet or Fog of War?Matt Hunter
 
The History of Advertising Technology
The History of Advertising TechnologyThe History of Advertising Technology
The History of Advertising TechnologyClearcode
 
Analytics, SAS, and 20+ years of optimal marketing decisions
Analytics, SAS, and 20+ years of optimal marketing decisionsAnalytics, SAS, and 20+ years of optimal marketing decisions
Analytics, SAS, and 20+ years of optimal marketing decisionspjdavis67
 
The Programmatic (R)evolution
The Programmatic (R)evolutionThe Programmatic (R)evolution
The Programmatic (R)evolutionTailWindEMEA
 
Display Advertising Basics
Display Advertising BasicsDisplay Advertising Basics
Display Advertising BasicsBidGear Inc.
 
CityAds Affiliate Marketing
CityAds Affiliate MarketingCityAds Affiliate Marketing
CityAds Affiliate MarketingNikolay Khokhlov
 
Programmatic Ad Buying: What Every Marketer Needs to Know Now.
Programmatic Ad Buying: What Every Marketer Needs to Know Now. Programmatic Ad Buying: What Every Marketer Needs to Know Now.
Programmatic Ad Buying: What Every Marketer Needs to Know Now. Kristen Whitney
 
Publisher Trading Desk
Publisher Trading DeskPublisher Trading Desk
Publisher Trading DeskMercy Setiawan
 
Programmatic ad buying in 3 slides
Programmatic ad buying in 3 slidesProgrammatic ad buying in 3 slides
Programmatic ad buying in 3 slidesOne Marketing Ltd
 
Programmatic buying
Programmatic buyingProgrammatic buying
Programmatic buyingmahajanvd28
 

What's hot (20)

Overview RTB ecosystem
Overview RTB ecosystemOverview RTB ecosystem
Overview RTB ecosystem
 
Cheat sheetmonetization1
Cheat sheetmonetization1Cheat sheetmonetization1
Cheat sheetmonetization1
 
Real time bidding (DSP, SSP, DMP, ATD, ITD)
Real time bidding (DSP, SSP, DMP, ATD, ITD)Real time bidding (DSP, SSP, DMP, ATD, ITD)
Real time bidding (DSP, SSP, DMP, ATD, ITD)
 
What is Real-Time Bidding (RTB)?
What is Real-Time Bidding (RTB)?What is Real-Time Bidding (RTB)?
What is Real-Time Bidding (RTB)?
 
Project Lotus Intro Deck Aug
Project Lotus Intro Deck AugProject Lotus Intro Deck Aug
Project Lotus Intro Deck Aug
 
6 RTB secrets no-one tells you about
6 RTB secrets no-one tells you about6 RTB secrets no-one tells you about
6 RTB secrets no-one tells you about
 
Computational Advertising by Icha
Computational Advertising by IchaComputational Advertising by Icha
Computational Advertising by Icha
 
Data-Driven Reserve Prices for Social Advertising Auctions at LinkedIn
Data-Driven Reserve Prices for Social Advertising Auctions at LinkedInData-Driven Reserve Prices for Social Advertising Auctions at LinkedIn
Data-Driven Reserve Prices for Social Advertising Auctions at LinkedIn
 
The Programmatic Jargon Buster, 3rd Edition
The Programmatic Jargon Buster, 3rd EditionThe Programmatic Jargon Buster, 3rd Edition
The Programmatic Jargon Buster, 3rd Edition
 
Demand Side Platforms: Silver Bullet or Fog of War?
Demand Side Platforms: Silver Bullet or Fog of War?Demand Side Platforms: Silver Bullet or Fog of War?
Demand Side Platforms: Silver Bullet or Fog of War?
 
The History of Advertising Technology
The History of Advertising TechnologyThe History of Advertising Technology
The History of Advertising Technology
 
Analytics, SAS, and 20+ years of optimal marketing decisions
Analytics, SAS, and 20+ years of optimal marketing decisionsAnalytics, SAS, and 20+ years of optimal marketing decisions
Analytics, SAS, and 20+ years of optimal marketing decisions
 
The Programmatic (R)evolution
The Programmatic (R)evolutionThe Programmatic (R)evolution
The Programmatic (R)evolution
 
Display Advertising Basics
Display Advertising BasicsDisplay Advertising Basics
Display Advertising Basics
 
CityAds Affiliate Marketing
CityAds Affiliate MarketingCityAds Affiliate Marketing
CityAds Affiliate Marketing
 
TradeAds Gen.Info
TradeAds Gen.InfoTradeAds Gen.Info
TradeAds Gen.Info
 
Programmatic Ad Buying: What Every Marketer Needs to Know Now.
Programmatic Ad Buying: What Every Marketer Needs to Know Now. Programmatic Ad Buying: What Every Marketer Needs to Know Now.
Programmatic Ad Buying: What Every Marketer Needs to Know Now.
 
Publisher Trading Desk
Publisher Trading DeskPublisher Trading Desk
Publisher Trading Desk
 
Programmatic ad buying in 3 slides
Programmatic ad buying in 3 slidesProgrammatic ad buying in 3 slides
Programmatic ad buying in 3 slides
 
Programmatic buying
Programmatic buyingProgrammatic buying
Programmatic buying
 

Viewers also liked

RTBkit Introduction & Best Practices
RTBkit Introduction & Best PracticesRTBkit Introduction & Best Practices
RTBkit Introduction & Best PracticesDatacratic
 
Developing Real Time Bidding Solutions with RTBkit Webinar On Demand
Developing Real Time Bidding Solutions with RTBkit Webinar On DemandDeveloping Real Time Bidding Solutions with RTBkit Webinar On Demand
Developing Real Time Bidding Solutions with RTBkit Webinar On DemandDatacratic
 
Lucie Sperkova - Pioneering multi-channel attribution for the lack of compreh...
Lucie Sperkova - Pioneering multi-channel attribution for the lack of compreh...Lucie Sperkova - Pioneering multi-channel attribution for the lack of compreh...
Lucie Sperkova - Pioneering multi-channel attribution for the lack of compreh...Marketing Festival
 
Attribution Modeling - Case Study
Attribution Modeling - Case StudyAttribution Modeling - Case Study
Attribution Modeling - Case StudyConcur
 
Advanced attribution model
Advanced attribution model Advanced attribution model
Advanced attribution model Aspa Lekka
 
High availability, real-time and scalable architectures
High availability, real-time and scalable architecturesHigh availability, real-time and scalable architectures
High availability, real-time and scalable architecturesJampp
 
Logistic Regression
Logistic RegressionLogistic Regression
Logistic RegressionMinHyeok Kim
 
Leveraging high level and low-level features for multimedia event detection.2...
Leveraging high level and low-level features for multimedia event detection.2...Leveraging high level and low-level features for multimedia event detection.2...
Leveraging high level and low-level features for multimedia event detection.2...Lu Jiang
 
Logistic Regression Demystified (Hopefully)
Logistic Regression Demystified (Hopefully)Logistic Regression Demystified (Hopefully)
Logistic Regression Demystified (Hopefully)Gabriele Tolomei
 
A Technical Introduction to RTBkit
A Technical Introduction to RTBkitA Technical Introduction to RTBkit
A Technical Introduction to RTBkitDatacratic
 
RTBkit Meetup - Developer Spotlight, Behind the Scenes of RTBkit and Intro to...
RTBkit Meetup - Developer Spotlight, Behind the Scenes of RTBkit and Intro to...RTBkit Meetup - Developer Spotlight, Behind the Scenes of RTBkit and Intro to...
RTBkit Meetup - Developer Spotlight, Behind the Scenes of RTBkit and Intro to...Datacratic
 
RTBkit 2.0 Roadmap Preview
RTBkit 2.0 Roadmap PreviewRTBkit 2.0 Roadmap Preview
RTBkit 2.0 Roadmap PreviewDatacratic
 
Domoti Corporate Presentation (eng)
Domoti Corporate Presentation (eng)Domoti Corporate Presentation (eng)
Domoti Corporate Presentation (eng)Domoti S.A.S
 
Domoti products presentation_eng
Domoti products presentation_engDomoti products presentation_eng
Domoti products presentation_engDomoti S.A.S
 
Tutorial 10 (computational advertising)
Tutorial 10 (computational advertising)Tutorial 10 (computational advertising)
Tutorial 10 (computational advertising)Kira
 
Digital Leaders Dinner (Sydney, Australia) - 6 February, 2013
Digital Leaders Dinner (Sydney, Australia) - 6 February, 2013Digital Leaders Dinner (Sydney, Australia) - 6 February, 2013
Digital Leaders Dinner (Sydney, Australia) - 6 February, 2013Exponential Interactive
 
How to Build an Attribution Solution in 1 Day
How to Build an Attribution Solution in 1 DayHow to Build an Attribution Solution in 1 Day
How to Build an Attribution Solution in 1 DayPhillip Law
 
Attribution theory
Attribution theoryAttribution theory
Attribution theoryImran Ayaz
 
Attribution theory (1)
Attribution theory (1)Attribution theory (1)
Attribution theory (1)Jayme Grohe
 

Viewers also liked (20)

RTBkit Introduction & Best Practices
RTBkit Introduction & Best PracticesRTBkit Introduction & Best Practices
RTBkit Introduction & Best Practices
 
Developing Real Time Bidding Solutions with RTBkit Webinar On Demand
Developing Real Time Bidding Solutions with RTBkit Webinar On DemandDeveloping Real Time Bidding Solutions with RTBkit Webinar On Demand
Developing Real Time Bidding Solutions with RTBkit Webinar On Demand
 
Lucie Sperkova - Pioneering multi-channel attribution for the lack of compreh...
Lucie Sperkova - Pioneering multi-channel attribution for the lack of compreh...Lucie Sperkova - Pioneering multi-channel attribution for the lack of compreh...
Lucie Sperkova - Pioneering multi-channel attribution for the lack of compreh...
 
Attribution Modeling - Case Study
Attribution Modeling - Case StudyAttribution Modeling - Case Study
Attribution Modeling - Case Study
 
Advanced attribution model
Advanced attribution model Advanced attribution model
Advanced attribution model
 
High availability, real-time and scalable architectures
High availability, real-time and scalable architecturesHigh availability, real-time and scalable architectures
High availability, real-time and scalable architectures
 
Logistic Regression
Logistic RegressionLogistic Regression
Logistic Regression
 
Leveraging high level and low-level features for multimedia event detection.2...
Leveraging high level and low-level features for multimedia event detection.2...Leveraging high level and low-level features for multimedia event detection.2...
Leveraging high level and low-level features for multimedia event detection.2...
 
Logistic Regression Demystified (Hopefully)
Logistic Regression Demystified (Hopefully)Logistic Regression Demystified (Hopefully)
Logistic Regression Demystified (Hopefully)
 
A Technical Introduction to RTBkit
A Technical Introduction to RTBkitA Technical Introduction to RTBkit
A Technical Introduction to RTBkit
 
RTBkit Meetup - Developer Spotlight, Behind the Scenes of RTBkit and Intro to...
RTBkit Meetup - Developer Spotlight, Behind the Scenes of RTBkit and Intro to...RTBkit Meetup - Developer Spotlight, Behind the Scenes of RTBkit and Intro to...
RTBkit Meetup - Developer Spotlight, Behind the Scenes of RTBkit and Intro to...
 
RTBkit 2.0 Roadmap Preview
RTBkit 2.0 Roadmap PreviewRTBkit 2.0 Roadmap Preview
RTBkit 2.0 Roadmap Preview
 
Domoti Corporate Presentation (eng)
Domoti Corporate Presentation (eng)Domoti Corporate Presentation (eng)
Domoti Corporate Presentation (eng)
 
Domoti products presentation_eng
Domoti products presentation_engDomoti products presentation_eng
Domoti products presentation_eng
 
Tutorial 10 (computational advertising)
Tutorial 10 (computational advertising)Tutorial 10 (computational advertising)
Tutorial 10 (computational advertising)
 
Digital Leaders Dinner (Sydney, Australia) - 6 February, 2013
Digital Leaders Dinner (Sydney, Australia) - 6 February, 2013Digital Leaders Dinner (Sydney, Australia) - 6 February, 2013
Digital Leaders Dinner (Sydney, Australia) - 6 February, 2013
 
How to Build an Attribution Solution in 1 Day
How to Build an Attribution Solution in 1 DayHow to Build an Attribution Solution in 1 Day
How to Build an Attribution Solution in 1 Day
 
Attribution theory
Attribution theoryAttribution theory
Attribution theory
 
Attribution theory (1)
Attribution theory (1)Attribution theory (1)
Attribution theory (1)
 
Attribution Theory
Attribution TheoryAttribution Theory
Attribution Theory
 

Similar to RTBMA ECIR 2016 tutorial

Machine learning for profit: Computational advertising landscape
Machine learning for profit: Computational advertising landscapeMachine learning for profit: Computational advertising landscape
Machine learning for profit: Computational advertising landscapeSharat Chikkerur
 
Response prediction for display advertising - WSDM 2014
Response prediction for display advertising - WSDM 2014Response prediction for display advertising - WSDM 2014
Response prediction for display advertising - WSDM 2014Olivier Chapelle
 
Fast Data Driving Personalization - Nick Gorski
Fast Data Driving Personalization - Nick GorskiFast Data Driving Personalization - Nick Gorski
Fast Data Driving Personalization - Nick GorskiHakka Labs
 
Computational Advertising in Yelp Local Ads
Computational Advertising in Yelp Local AdsComputational Advertising in Yelp Local Ads
Computational Advertising in Yelp Local Adssoupsranjan
 
Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...
Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...
Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...Databricks
 
Int'l Conference on Predictive APIs: RTB Optimizer presentation
Int'l Conference on Predictive APIs: RTB Optimizer presentationInt'l Conference on Predictive APIs: RTB Optimizer presentation
Int'l Conference on Predictive APIs: RTB Optimizer presentationDatacratic
 
COM 5401Advertising Production and ManagementAdvertisi.docx
COM 5401Advertising Production and ManagementAdvertisi.docxCOM 5401Advertising Production and ManagementAdvertisi.docx
COM 5401Advertising Production and ManagementAdvertisi.docxmccormicknadine86
 
Nicolas Kruchten @ Datacratic
Nicolas Kruchten @ DatacraticNicolas Kruchten @ Datacratic
Nicolas Kruchten @ DatacraticPAPIs.io
 
Ed Snelson. Counterfactual Analysis
Ed Snelson. Counterfactual AnalysisEd Snelson. Counterfactual Analysis
Ed Snelson. Counterfactual AnalysisVolha Banadyseva
 
Deepak-Computational Advertising-The LinkedIn Way
Deepak-Computational Advertising-The LinkedIn WayDeepak-Computational Advertising-The LinkedIn Way
Deepak-Computational Advertising-The LinkedIn Wayyingfeng
 
New challenges for scalable machine learning in online advertising
New challenges for scalable machine learning in online advertisingNew challenges for scalable machine learning in online advertising
New challenges for scalable machine learning in online advertisingOlivier Koch
 
Programmatic Selling Ebay case study CEO dinner Madrid
Programmatic Selling Ebay case study CEO dinner MadridProgrammatic Selling Ebay case study CEO dinner Madrid
Programmatic Selling Ebay case study CEO dinner MadridJoep Hutschemakers
 
Nicholas Gorski: Real-time revenue science at Twitter
Nicholas Gorski: Real-time revenue science at TwitterNicholas Gorski: Real-time revenue science at Twitter
Nicholas Gorski: Real-time revenue science at TwitterDavid Garrison
 
A History of Programmatic Media
A History of Programmatic MediaA History of Programmatic Media
A History of Programmatic MediaThe Media Kitchen
 
Why is programmatic taking off? What is this revolution all about?
Why is programmatic taking off?  What is this revolution all about?Why is programmatic taking off?  What is this revolution all about?
Why is programmatic taking off? What is this revolution all about?Datacratic
 
Programmatic advertising - overview
Programmatic advertising - overviewProgrammatic advertising - overview
Programmatic advertising - overviewJim Murray Jones
 
Chandra ebusiness applications
Chandra ebusiness applicationsChandra ebusiness applications
Chandra ebusiness applicationsSuresh Chandra
 
Data Science at Flurry
Data Science at FlurryData Science at Flurry
Data Science at Flurrysoupsranjan
 
Constraint and Qualitative Preference Specification in Multi-Attribute Revers...
Constraint and Qualitative Preference Specification in Multi-Attribute Revers...Constraint and Qualitative Preference Specification in Multi-Attribute Revers...
Constraint and Qualitative Preference Specification in Multi-Attribute Revers...Shubhashis Shil
 

Similar to RTBMA ECIR 2016 tutorial (20)

RTB Bid Landscape in Adform
RTB Bid Landscape in AdformRTB Bid Landscape in Adform
RTB Bid Landscape in Adform
 
Machine learning for profit: Computational advertising landscape
Machine learning for profit: Computational advertising landscapeMachine learning for profit: Computational advertising landscape
Machine learning for profit: Computational advertising landscape
 
Response prediction for display advertising - WSDM 2014
Response prediction for display advertising - WSDM 2014Response prediction for display advertising - WSDM 2014
Response prediction for display advertising - WSDM 2014
 
Fast Data Driving Personalization - Nick Gorski
Fast Data Driving Personalization - Nick GorskiFast Data Driving Personalization - Nick Gorski
Fast Data Driving Personalization - Nick Gorski
 
Computational Advertising in Yelp Local Ads
Computational Advertising in Yelp Local AdsComputational Advertising in Yelp Local Ads
Computational Advertising in Yelp Local Ads
 
Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...
Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...
Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...
 
Int'l Conference on Predictive APIs: RTB Optimizer presentation
Int'l Conference on Predictive APIs: RTB Optimizer presentationInt'l Conference on Predictive APIs: RTB Optimizer presentation
Int'l Conference on Predictive APIs: RTB Optimizer presentation
 
COM 5401Advertising Production and ManagementAdvertisi.docx
COM 5401Advertising Production and ManagementAdvertisi.docxCOM 5401Advertising Production and ManagementAdvertisi.docx
COM 5401Advertising Production and ManagementAdvertisi.docx
 
Nicolas Kruchten @ Datacratic
Nicolas Kruchten @ DatacraticNicolas Kruchten @ Datacratic
Nicolas Kruchten @ Datacratic
 
Ed Snelson. Counterfactual Analysis
Ed Snelson. Counterfactual AnalysisEd Snelson. Counterfactual Analysis
Ed Snelson. Counterfactual Analysis
 
Deepak-Computational Advertising-The LinkedIn Way
Deepak-Computational Advertising-The LinkedIn WayDeepak-Computational Advertising-The LinkedIn Way
Deepak-Computational Advertising-The LinkedIn Way
 
New challenges for scalable machine learning in online advertising
New challenges for scalable machine learning in online advertisingNew challenges for scalable machine learning in online advertising
New challenges for scalable machine learning in online advertising
 
Programmatic Selling Ebay case study CEO dinner Madrid
Programmatic Selling Ebay case study CEO dinner MadridProgrammatic Selling Ebay case study CEO dinner Madrid
Programmatic Selling Ebay case study CEO dinner Madrid
 
Nicholas Gorski: Real-time revenue science at Twitter
Nicholas Gorski: Real-time revenue science at TwitterNicholas Gorski: Real-time revenue science at Twitter
Nicholas Gorski: Real-time revenue science at Twitter
 
A History of Programmatic Media
A History of Programmatic MediaA History of Programmatic Media
A History of Programmatic Media
 
Why is programmatic taking off? What is this revolution all about?
Why is programmatic taking off?  What is this revolution all about?Why is programmatic taking off?  What is this revolution all about?
Why is programmatic taking off? What is this revolution all about?
 
Programmatic advertising - overview
Programmatic advertising - overviewProgrammatic advertising - overview
Programmatic advertising - overview
 
Chandra ebusiness applications
Chandra ebusiness applicationsChandra ebusiness applications
Chandra ebusiness applications
 
Data Science at Flurry
Data Science at FlurryData Science at Flurry
Data Science at Flurry
 
Constraint and Qualitative Preference Specification in Multi-Attribute Revers...
Constraint and Qualitative Preference Specification in Multi-Attribute Revers...Constraint and Qualitative Preference Specification in Multi-Attribute Revers...
Constraint and Qualitative Preference Specification in Multi-Attribute Revers...
 

Recently uploaded

TOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsTOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsssuserddc89b
 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
‏‏VIRUS - 123455555555555555555555555555555555555555
‏‏VIRUS -  123455555555555555555555555555555555555555‏‏VIRUS -  123455555555555555555555555555555555555555
‏‏VIRUS - 123455555555555555555555555555555555555555kikilily0909
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxyaramohamed343013
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensorsonawaneprad
 
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |aasikanpl
 
Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)DHURKADEVIBASKAR
 
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPirithiRaju
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxBerniceCayabyab1
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)riyaescorts54
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxpriyankatabhane
 
Manassas R - Parkside Middle School 🌎🏫
Manassas R - Parkside Middle School 🌎🏫Manassas R - Parkside Middle School 🌎🏫
Manassas R - Parkside Middle School 🌎🏫qfactory1
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfSwapnil Therkar
 
BREEDING FOR RESISTANCE TO BIOTIC STRESS.pptx
BREEDING FOR RESISTANCE TO BIOTIC STRESS.pptxBREEDING FOR RESISTANCE TO BIOTIC STRESS.pptx
BREEDING FOR RESISTANCE TO BIOTIC STRESS.pptxPABOLU TEJASREE
 

Recently uploaded (20)

TOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsTOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physics
 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdf
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
 
‏‏VIRUS - 123455555555555555555555555555555555555555
‏‏VIRUS -  123455555555555555555555555555555555555555‏‏VIRUS -  123455555555555555555555555555555555555555
‏‏VIRUS - 123455555555555555555555555555555555555555
 
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort ServiceHot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docx
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensor
 
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
 
Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)
 
Volatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -IVolatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -I
 
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptx
 
Manassas R - Parkside Middle School 🌎🏫
Manassas R - Parkside Middle School 🌎🏫Manassas R - Parkside Middle School 🌎🏫
Manassas R - Parkside Middle School 🌎🏫
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
 
BREEDING FOR RESISTANCE TO BIOTIC STRESS.pptx
BREEDING FOR RESISTANCE TO BIOTIC STRESS.pptxBREEDING FOR RESISTANCE TO BIOTIC STRESS.pptx
BREEDING FOR RESISTANCE TO BIOTIC STRESS.pptx
 

RTBMA ECIR 2016 tutorial

  • 1. Real-Time Bidding based Display Advertising: Mechanisms and Algorithms Shuai Yuan, MediaGamma Ltd Weinan Zhang, UCL Jun Wang, UCL Shuai.yuan@mediagamma.com {w.zhang, j.wang}@cs.ucl.ac.uk ECIR16 Tutorial
  • 2. Table of contents • RTB system • Auction mechanisms • User response estimation • Conversion attribution • Learning to bid • Data Management Platform (DMP) techniques • Floor price optimisation • Fighting against fraud
  • 3. Table of contents • RTB system • Auction mechanisms • User response estimation • Conversion attribution • Learning to bid • Data Management Platform (DMP) techniques • Floor price optimisation • Fighting against fraud
  • 4. Advertising • Make the best match between and with
  • 5. - John Wanamaker (1838-1922) Father of modern advertising and a pioneer in marketing
  • 7. Computational Advertising • Design algorithms to make the best match between the advertisers and Internet users with economic constraints
  • 9. Sponsored Search • Advertiser sets a bid price for the keyword • User searches the keyword • Search engine hosts the auction to ranking the ads
  • 11. Internet Advertising Frontier: Real-Time Bidding (RTB) based Display Advertising What is Real-Time Bidding? • Every online ad view can be evaluated, bought, and sold, all individually, and all instantaneously. • Instead of buying keywords or a bundle of ad views, advertisers are now buying users directly. DSP/Exchange daily traffic Advertising iPinYou, China 18 billion impressions YOYI, China 5 billion impressions Fikisu, US 32 billon impressions Finance New York Stock Exchange 12 billion shares daily Shanghai Stock Exchange 14 billion shares daily Query per second Turn DSP 1.6 million Google 40,000 search Shen, Jianqiang, et al. "From 0.5 Million to 2.5 Million: Efficiently Scaling up Real-Time Bidding." Data Mining (ICDM), 2015 IEEE International Conference on. IEEE, 2015.
  • 12. Suppose a student regularly reads articles on emarketer.com Content-related ads
  • 13. He recently checked the London hotels (In fact, no login is required)
  • 14. Relevant ads on facebook.com
  • 15. Even on supervisor’s homepage! (User targeting dominates the context)
  • 16. RTB Display Advertising Mechanism • Buying ads via real-time bidding (RTB), 10B per day RTB Ad Exchange Demand-Side Platform Advertiser Data Management Platform 0. Ad Request 1. Bid Request (user, page, context) 2. Bid Response (ad, bid price) 3. Ad Auction 4. Win Notice (charged price) 5. Ad (with tracking) 6. User Feedback (click, conversion) User Information User Demography: Male, 26, Student User Segmentations: London, travelling Page User <100 ms
  • 17. Table of contents • RTB system • Auction mechanisms • User response estimation • Conversion attribution • Learning to bid • Data Management Platform (DMP) techniques • Floor price optimisation • Fighting against fraud
  • 19. Modeling • n bidders • Each bidder has value vi for the item – “willingness to pay” – Known only to him – “private value” • If bidder i wins and pays pi, his utility is vi – pi – In addition, the utility is 0 when the bidder loses. • Note: bidders prefer losing than paying more than their value.
  • 20. Strategy • A strategy for each bidder – how to bid given your intrinsic, private value? – a strategy here is a function, a plan for the game. Not just a bid. • Examples for strategies: – bi(vi) = vi (truthful) – bi(vi) = vi /2 – bi(vi) = vi /n – If v<50, bi(vi) = vi otherwise, bi(vi) = vi +17 • Can be modeled as normal form game, where these strategies are the pure strategies. • Example for a game with incomplete information. B(v)=v B(v)=v /2 B(v)=v /n …. B(v)=v …
  • 21. Strategies and equilibrium • An equilibrium in the auction is a profile of strategies B1,B2,…,Bn such that: – Dominant strategy equilibrium: each strategy is optimal whatever the other strategies are. – Nash equilibrium: each strategy is a best response to the other strategies. B(v)=v B(v)=v/2 B(v)=v/n …. B(v)=v …
  • 22. Bayes-Nash equilibrium • Recall a set of bidding strategies is a Nash equilibrium if each bidder’s strategy maximizes his payoff given the optimal strategies of the others. – In auctions: bidders do not know their opponent’s values, i.e., there is incomplete information. – Each bidder’s strategy must maximize her expected payoff accounting for the uncertainty about opponent values.
  • 23. 1st price auctions • Truthful(bi = vi)? $30 $100 $31 NO!
  • 25. Reserve Prices and Entry Fees • Reserve Prices: the seller is assumed to have committed to not selling below the reserve – Reserve prices are assumed to be known to all bidders – The reserve prices = the minimum bids • Entry Fees: those bidders who enter have to pay the entry fee to the seller • They reduce bidders’ incentives to participate, but they might increase revenue as 1) the seller collects extra revenues 2) bidders might bid more aggressively
  • 26. Table of contents • RTB system • Auction mechanisms • User response estimation • Conversion attribution • Learning to bid • Data Management Platform (DMP) techniques • Floor price optimisation • Fighting against fraud
  • 27. RTB Display Advertising Mechanism • Buying ads via real-time bidding (RTB), 10B per day RTB Ad Exchange Demand-Side Platform Advertiser Data Management Platform 0. Ad Request 1. Bid Request (user, page, context) 2. Bid Response (ad, bid price) 3. Ad Auction 4. Win Notice (charged price) 5. Ad (with tracking) 6. User Feedback (click, conversion) User Information User Demography: Male, 26, Student User Segmentations: London, travelling Page User <100 ms
  • 28. Predict how likely the user is going to click the displayed ad.
  • 29. User response estimation problem • Click-through rate estimation as an example • Date: 20160320 • Hour: 14 • Weekday: 7 • IP: 119.163.222.* • Region: England • City: London • Country: UK • Ad Exchange: Google • Domain: yahoo.co.uk • URL: http://www.yahoo.co.uk/abc/xyz.html • OS: Windows • Browser: Chrome • Ad size: 300*250 • Ad ID: a1890 • User tags: Sports, Electronics Click (1) or not (0)? Predicted CTR (0.15)
  • 30. Feature Representation • Binary one-hot encoding of categorical data x=[Weekday=Wednesday, Gender=Male, City=London] x=[0,0,1,0,0,0,0 0,1 0,0,1,0…0] High dimensional sparse binary feature vector
  • 31. Linear Models • Logistic Regression – With SGD learning – Sparse solution • Online Bayesian Profit Regression
  • 32. ML Framework of CTR Estimation • A binary regression problem – Large binary feature space (>10 millions) • Bloom filter to detect and add new features (e.g., > 5 instances) – Large data instance number (>10 millions daily) – A seriously unbalanced label • Normally, #click/#non-click = 0.3% • Negative down sampling • Calibration
  • 33. Logistic Regression • Prediction • Cross Entropy Loss • Stochastic Gradient Descent Learning [Lee et al. Estimating Conversion Rate in Display Advertising from Past Performance Data. KDD 12]
  • 34. Logistic Regression with SGD • Pros – Standardised, easily understood and implemented – Easy to be parallelised • Cons – Learning rate η initialisation – Uniform learning rate against different binary features
  • 35. Logistic Regression with FTRL • In practice, we need a sparse solution as >10 million feature dimensions • Follow-The-Regularised-Leader (FTRL) online Learning [McMahan et al. Ad Click Prediction : a View from the Trenches. KDD 13] s.t. • Online closed-form update of FTRL t: current example index gs: gradient for example t adaptively selects regularization functions [Xiao, Lin. "Dual averaging method for regularized stochastic learning and online optimization." Advances in Neural Information Processing Systems. 2009]
  • 36. Online Bayesian Probit Regression to approximate message passing. In order full factorial structure of the likelihood, wo latent variables , and consider the density function which (5) n can be understood in terms of the ative process, which is also reflected in in Figure 1. Sample weights from the Gaussian alculate the score for x as the inner , such that . dd zero-mean Gaussian noise to obtain h that . etermine by a threshold on the noisy ro, such that . skills in TrueSkill after a hypothetical m team with known skill of zero. Given the Figure 1 together with Table 1 in the a update equations can be derived. 3.3.1. UPDATE EQUATIONS FOR ONLINE The update equations represent a mappin posterior parameter values based o pairs ̃ ̃ . In terms of calculation can viewed as following the m schedule towards the weights . We d variance for a given input as The update for the posterior parameters is ̃ ( ̃ * (
  • 37. Linear Prediction Models • Pros – Highly efficient and scalable – Explore larger feature space and training data • Cons – Modelling limit: feature independence assumption – Cannot capture feature interactions unless defining high order combination features • E.g., hour=10AM & city=London & browser=Chrome
  • 38. Non-linear Models • Gradient Boosting Decision Trees • Factorisation Machines • Combined Models • Deep Neural Networks
  • 39. Factorisation Machines • Prediction based on feature embedding – Explicitly model feature interactions • Second order, third order etc. – Empirically better than logistic regression – A new way for user profiling [Oentaryo et al. Predicting response in mobile advertising with hierarchical importance- aware factorization machine. WSDM 14] [Rendle. Factorization machines. ICDM 2010.]
  • 40. Gradient Boosting Decision Trees • Additive decision trees for prediction • Each decision tree [Chen and He. Higgs Boson Discovery with Boosted Trees . HEPML 2014.]
  • 41. Gradient Boosting Decision Trees • Learning [Chen and He. Higgs Boson Discovery with Boosted Trees . HEPML 2014.]
  • 42. Combined Models: GBDT + LR [He et al. Practical Lessons from Predicting Clicks on Ads at Facebook . ADKDD 2014.]
  • 43. Combined Models: GBDT + FM [http://www.csie.ntu.edu.tw/~r01922136/kaggle-2014-criteo.pdf]
  • 44. [Zhang et al. Deep Learning over Multi-field Categorical Data – A Case Study on User Response Prediction. ECIR 16] in Monday Machine Learning Track
  • 45. Table of contents • RTB System • Auction Mechanisms • CTR Estimation • Conversion Attribution • Learning to Bid • Data Management Platform (DMP) techniques • Floor price optimisation • Fighting against fraud
  • 46. Conversion Attribution • Assign credit% to each channel according to contribution • Current industrial solution: last-touch attribution [Shao et al. Data-driven multi-touch attribution models. KDD 11]
  • 47. Heuristics-based Attribution [Kee. Attribution playbook – google analytics. Online access.]
  • 48. A Good Attribution Model • Fairness – Reward an individual channel in accordance with its ability to affect the likelihood of conversion • Data driven – Using ad touch and conversion data for each campaign to build its model • Interpretability – Generally accepted by all parties [Dalessandro et al. Casually Motivated Attribution for Online Advertising. ADKDD 11]
  • 49. Bagged Logistic Regression • For M iterations – Sample 50% data instances and 50% features – Train a logistic regression and record the weights • Average the feature weights [Shao et al. Data-driven multi-touch attribution models. KDD 11] Display Search Mobile Email Social Convert? 1 1 0 0 1 1 1 0 1 1 1 0 0 1 0 1 0 1 0 0 1 1 1 0
  • 50. Bagged Logistic Regression • For M iterations – Sample 50% data instances and 50% features – Train a logistic regression and record the weights • Average the feature weights [Shao et al. Data-driven multi-touch attribution models. KDD 11] Display Search Mobile Email Social Convert? 1 1 0 0 1 1 1 0 1 1 1 0 0 1 0 1 0 1 0 0 1 1 1 0
  • 51. Bagged Logistic Regression • For M iterations – Sample 50% data instances and 50% features – Train a logistic regression and record the weights • Average the feature weights [Shao et al. Data-driven multi-touch attribution models. KDD 11] Display Search Mobile Email Social Convert? 1 1 0 0 1 1 1 0 1 1 1 0 0 1 0 1 0 1 0 0 1 1 1 0
  • 52. Shapley Value based Attribution • Coalition game – How much does a player contribute in the game [Fig source: https://pjdelta.wordpress.com/2014/08/10/group-project-how-much-did-i-contribute/]
  • 53. Shapley Value based Attribution • Coalition game
  • 54. A Probabilistic Attribution Model • Conditional probabilities [Shao et al. Data-driven multi-touch attribution models. KDD 11] • Attributed contribution
  • 55. [Shao et al. Data-driven multi-touch attribution models. KDD 11]
  • 56. [Shao et al. Data-driven multi-touch attribution models. KDD 11]
  • 57. Data-Driven Probabilistic Models • The “relatively heuristic” data-driven model [Shao et al. Data-driven multi-touch attribution models. KDD 11] • A more generalized and data-driven model [Dalessandro et al. Casually Motivated Attribution for Online Advertising. ADKDD 11] – : the probability that the sequence begin with
  • 58. Attribution Comparison • Help find some “cookie bombing” channels
  • 59. Other Attribution Models • Survival models with time [Zhang et al. Multi-Touch Attribution in Online Advertising with Survival Theory. ICDM 2014] • Markov graph [Anderl et al. Mapping the customer journey: A graph-based framework for online attribution modeling. SSRN 2014]
  • 60. Table of contents • RTB System • Auction Mechanisms • CTR Estimation • Conversion Attribution • Learning to Bid • Data Management Platform (DMP) techniques • Floor price optimisation • Fighting against fraud
  • 61. RTB Display Advertising Mechanism • Buying ads via real-time bidding (RTB), 10B per day RTB Ad Exchange Demand-Side Platform Advertiser Data Management Platform 0. Ad Request 1. Bid Request (user, page, context) 2. Bid Response (ad, bid price) 3. Ad Auction 4. Win Notice (charged price) 5. Ad (with tracking) 6. User Feedback (click, conversion) User Information User Demography: Male, 26, Student User Segmentations: London, travelling Page User <100 ms
  • 62. Data of Learning to Bid – Bid request features: High dimensional sparse binary vector – Bid: Non-negative real or integer value – Win: Boolean – Cost: Non-negative real or integer value – Feedback: Binary • Data
  • 63. Problem Definition of Learning to Bid • How much to bid for each bid request? – Find an optimal bidding function b(x) • Bid to optimise the KPI with budget constraint Bid Request (user, ad, page, context) Bid Price Bidding Strategy
  • 64. Bidding Strategy in Practice Bid Request (user, ad, page, context) Bid Price Bidding Strategy Feature Eng. Whitelist / Blacklist Retargeting Budget Pacing Bid Landscape Bid Calculation Frequency Capping CTR / CVR Estimation Campaign Pricing Scheme 64
  • 65. Bidding Strategy in Practice: A Quantitative Perspective Bid Request (user, ad, page, context) Bid Price Bidding Strategy Utility Estimation Cost Estimation Preprocessing Bidding Function CTR, CVR, revenue Bid landscape 65
  • 66. Bid Landscape Forecasting Auction Winning Probability Win probability: Expected cost: Count Win bid
  • 67. Bid Landscape Forecasting • Log-Normal Distribution Auction Winning Probability [Cui et al. Bid Landscape Forecasting in Online Ad Exchange Marketplace. KDD 11]
  • 68. Bid Landscape Forecasting • Price Prediction via Linear Regression – Modelling censored data in lost bid requests [Wu et al. Predicting Winning Price in Real Time Bidding with Censored Data. KDD 15]
  • 69. Bidding Strategies • How much to bid for each bid request? • Bid to optimise the KPI with budget constraint Bid Request (user, ad, page, context) Bid Price Bidding Strategy
  • 70. Classic Second Price Auctions • Single item, second price (i.e. pay market price) Reward given a bid: Optimal bid: Bid true value
  • 71. Truth-telling Bidding Strategies • Truthful bidding in second-price auction – Bid the true value of the impression – Impression true value = – Averaged impression value = value of click * CTR – Truth-telling bidding: [Chen et al. Real-time bidding algorithms for performance-based display ad allocation. KDD 11] Value of click, if clicked 0, if not clicked
  • 72. Truth-telling Bidding Strategies • Pros – Theoretic soundness – Easy implementation (very widely used) • Cons – Not considering the constraints of • Campaign lifetime auction volume • Campaign budget – Case 1: $1000 budget, 1 auction – Case 2: $1 budget, 1000 auctions [Chen et al. Real-time bidding algorithms for performance-based display ad allocation. KDD 11]
  • 73. Non-truthful Linear Bidding • Non-truthful linear bidding – Tune base_bid parameter to maximise KPI – Bid landscape, campaign volume and budget indirectly considered [Perlich et al. Bid Optimizing and Inventory Scoring in Targeted Online Advertising. KDD 12]
  • 74. ORTB Bidding Strategies • Direct functional optimisation CTRwinning function bidding function budget Est. volume cost upperbound 74 [Zhang et al. Optimal real-time bidding for display advertising. KDD 14] • Solution: Calculus of variations
  • 75. Optimal Bidding Strategy Solution 75 [Zhang et al. Optimal real-time bidding for display advertising. KDD 14]
  • 76. Bidding in Multi-Touch Attribution Mechanism • Current bidding strategy – Driven by last-touch attribution b(CVR) • A new bidding strategy – Driven by multi-touch attribution [Xu et al. Lift-Based Bidding in Ad Selection. AAAI 2016.]
  • 77. Table of contents • RTB System • Auction Mechanisms • CTR Estimation • Conversion Attribution • Learning to Bid • Data Management Platform (DMP) techniques • Floor price optimisation • Fighting against fraud
  • 78. DMP Summary • What is data management platform • Cook sync • Browser fingerprinting • CF and Lookalike model
  • 79. What is DMP (Data Management Platform) • A data warehouse that stores, merges, and sorts, and labels it out in a way that’s useful for marketers, publishers and other businesses. RTB Ad Exchange Demand-Side Platform Advertiser Data Management Platform 0. Ad Request 1. Bid Request (user, page, context) 2. Bid Response (ad, bid price) 3. Ad Auction 4. Win Notice (charged price) 5. Ad (with tracking) 6. User Feedback (click, conversion) User Information User Demography: Male, 26, Student User Segmentations: London, travelling User <100 ms WebPage
  • 80. Cookie sync: merging audience data https://freedom-to-tinker.com/blog/englehardt/the-hidden-perils-of-cookie-syncing/ When a user visits a site (e.g. ABC.com) including A.com as a third-party tracker. (1) The browser makes a request to A.com, and included in this request is the tracking cookie set by A.com. (2) A.com retrieves its tracking ID from the cookie, and redirects the browser to B.com, encoding the tracking ID into the URL. (3) The browser then makes a request to B.com, which includes the full URL A.com redirected to as well as B.com’s tracking cookie. (4) B.com can then link its ID for the user to A.com’s ID for the user2
  • 81. Browser fingerprinting • A device fingerprint or browser fingerprint is information collected about the remote computing device for the purpose of identifying the user • Fingerprints can be used to fully or partially identify individual users or devices even when cookies are turned off. Eckersley, Peter. "How unique is your web browser?." Privacy Enhancing Technologies. Springer Berlin Heidelberg, 2010. Acar, Gunes, et al. "The web never forgets: Persistent tracking mechanisms in the wild." Proceedings of the 2014 ACM SIGSAC Conference on Computer and Communications Security. ACM, 2014. 94.2% of browsers with Flash or Java were unique in a study
  • 82. User segmentation and Behavioural Targeting • Behavioural targeting helps online advertising • From user – documents to user – topics – Latent Semantic Analysis / Latent Dirichlet Allocation J Yan, et al., How much can behavioral targeting help online advertising? WWW 2009 X Wu, et al., Probabilistic latent semantic user segmentation for behavioral targeted advertising, Intelligence for Advertising 2009 User Topic Term
  • 83. Lookalike modelling • Lookalike modeling: finding new people who behave like current customers (converted) Zhang, Weinan, Lingxi Chen, and Jun Wang. "Implicit Look-alike Modelling in Display Ads: Transfer Collaborative Filtering to CTR Estimation." ECIR (2016).
  • 84. Transferred lookalike Using web browsing data, which is largely available, to infer the ad clicks Zhang, Weinan, Lingxi Chen, and Jun Wang. "Implicit Look-alike Modelling in Display Ads: Transfer Collaborative Filtering to CTR Estimation." ECIR (2016). In Wednesday Information Filtering Track
  • 85. Table of contents • RTB System • Auction Mechanisms • CTR Estimation • Conversion Attribution • Learning to Bid • Data Management Platform (DMP) techniques • Floor price optimisation • Fighting against fraud
  • 86. Reserve price optimisation The task: • To find the optimal reserve prices The challenge: • Practical constraints v.s common assumptions (bids’ distribution, bidding private values, etc.) S Yuan et al., An Empirical Study of Reserve Price Optimisation in Display Advertising, 2014
  • 87. Why • Suppose it is second price auction – Normal case: 𝑏2 ≥ 𝛼 – Preferable case: 𝑏1 ≥ 𝛼 > 𝑏2 (it increases the revenue) – Undesirable case: 𝛼 > 𝑏1 (but there is risk)
  • 88. • Suppose: two bidders, private values drawn from Uniform[0, 1] • Without a reserve price (or 𝑎 = 0), the payoff 𝑟 is: • With 𝑎 = 0.2: • With 𝑎 = 0.5: • With 𝑎 = 0.6: An example Ostrovsky and Schwarz , Reserve prices in internet advertising auctions: A field experiment, 2011 𝑟 = 𝐸 min 𝑏1, 𝑏2 = 0.33 𝑟 = 𝐸 min 𝑏1, 𝑏2 𝑏1 > 0.5, 𝑏2 > 0.5 + 0.5 × 0.5 = 0.42 𝑟 = 𝐸 min 𝑏1, 𝑏2 𝑏1 > 0.2, 𝑏2 > 0.2 + 0.32 × 0.2 = 0.36 𝑟 = 𝐸 min 𝑏1, 𝑏2 𝑏1 > 0.6, 𝑏2 > 0.6 + 0.6 × 0.4 × 2 × 0.6 = 0.405 Paying the second highest price Paying the reserve price
  • 89. The optimal auction theory • In the second price auctions, advertisers bid their private values [𝑏1, … , 𝑏 𝐾] • Private values -> Bids’ distributions – Uniform – Log-normal • The publisher also has a private value 𝑉𝑝 • The optimal reserve price is given by: Levin and Smith, Optimal Reservation Prices in Auctions, 1996 𝐹 𝒃 = 𝐹1 𝑏1 × ⋯ × 𝐹 𝐾(𝑏 𝐾) 𝛼 − 1 − 𝐹 𝒃 𝐹′ 𝒃 − 𝑉𝑝 = 0
  • 90. Results from a field experiment • On Yahoo! Sponsored search • Using the Optimal Auction Theory Mixed results Ostrovsky and Schwarz , Reserve prices in internet advertising auctions: A field experiment, 2011
  • 91. 1) Expected payoff of advertiser, publisher 2) Payoff for the advertiser could be negative if one has been bidding the max price (𝑎 𝑤1: to increase 𝑏1 so that 𝑏1 ≥ 𝛼) 3) One won’t do that, so discounted publisher’s payoff
  • 92. The continuous bidding activity The unchanged bidding pattern The unchanged budget allocation Anoutlier (Triggeredbysomerandomaction) S Yuan et al., An Empirical Study of Reserve Price Optimisation in Display Advertising, 2014
  • 93. Table of contents • RTB System • Auction Mechanisms • CTR Estimation • Conversion Attribution • Learning to Bid • Data Management Platform (DMP) techniques • Floor price optimisation • Fighting against fraud
  • 94. Fighting publisher fraud • Non intentional traffic (NIT) / Non human traffic – Web scrapers / crawlers – Hacking tools – Botnet – Much of the spurious traffic is created by human but without users’ knowledge
  • 95. A Serious Problem Dave Jakubowski, Head of Ad Tech, Facebook, March 2016
  • 96. The Old Fashion Way – Put the police on the street – Manually eyeball the webpage – Verify the address on the Google map – Follow how the money flows – This approach just can’t scale and is not sustainable
  • 97. Possible Solutions – Rules – Anomaly detection – Classification algorithm • Tricky to obtain negative samples – Clustering algorithm • Bots could display dramatically different behavior – Content Analysis • Fraudulent websites often scrape content from each other or legit websites
  • 98. Co-Visitation Networks – Key observation: • Even the major sites only share at most 20% cookieID within a few hours, let alone those long tail sites. – Define a graph: • Node: site • Weighted edge: user overlap ratio of two sites – Cluster this weighted undirected graph – Fraud: big cluster with long tail sites O Stitelman, et al., Using Co-Visitation Networks For Classifying Non-Intentional Traffic, KDD 2013
  • 99. O Stitelman, et al., Using Co-Visitation Networks For Classifying Non-Intentional Traffic, KDD 2013 December 2011 Co-visitation Network where and edge indicates at least 50% overlap be- tween the browsers of both websites
  • 100. O Stitelman, et al., Using Co-Visitation Networks For Classifying Non-Intentional Traffic, KDD 2013
  • 101. Thank You • RTB system • Auction mechanisms • User response estimation • Conversion attribution • Learning to bid • Data Management Platform (DMP) techniques • Floor price optimisation • Fighting against fraud Real-Time Bidding based Display Advertising: Mechanisms and Algorithms