Criteo AI Lab: from applied to fundamental AI

Jeremie Mary, 17/09/18
From applied to fundamental
research
Copyright © 2018 Criteo
AI applied to Criteo Dynamic Retargeting since 2008
Universal Match One user profile across
all devices
Product
Recommendations
Kinetic Design
Predictive Bidding
Chooses the right
products to display
Chooses the right look
and feel for the banners
in real time
Personalized Ads
Optimized
Performance
Chooses the right users /
advertiser / publisher
to display
 eCPM = CPC*pCTR*pCR*pOV
1
3
2
4
Optimized on
CTR
+
CR
+
Order Value
Copyright © 2018 Criteo
Outline
1. Fusion of modalities
2. Auction theory meets Machine Learning
3. Hot topics
Copyright © 2018 Criteo
Fusion of heterogeneous data
Problem How to build a predictor based on completly different kind of data ?
e.g. pictures and texts and you want to predict the interest of the user for the item.
Your favorite neural network
for pictures (Resnet?)
Some description text or
tags
Your favorite neural network
for this (BiGRU with GA?)
E
m
b
e
d
d
i
n
g
E
m
b
e
d
d
i
n
g
Prediction 1
Prediction 2
Vote!
or average
Copyright © 2018 Criteo
Fusion of heterogeneous data
Problem How to build a predictor based on completly different kind of data ?
e.g. pictures and texts and you want to predict the interest of the user for the item.
Your favorite neural network
for pictures (Resnet?)
What is the color of the
cat?
Your favorite neural network
for this (BiGRU with GA?)
E
m
b
e
d
d
i
n
g
E
m
b
e
d
d
i
n
g
Prediction
M
e
r
g
e
Is it actually good to build the
embeddings independantly ?
Copyright © 2018 Criteo
Idea
Batch Norm Parameters
In a good network activation of neurons thought the data should be similar [1].
This was introduced as a reparametrization trick to ensure faster convergence
[1] I. Sergey and S. Christian. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. ICML, 2015.
Copyright © 2018 Criteo
Few parameters but…
While Number of batch norms parameters is usually 0.2 to 5% of the net, their impact on the output is huge [2]
[2] V. Dumoulin, J. Shlens, and M. Kudlur. A Learned Representation For Artistic Style. In Proc. of ICLR, 2017.
Copyright © 2018 Criteo
An alternative way to fuse modalities
Image Text
Copyright © 2018 Criteo
… and this work well on VQA
[13] Modulating early visual processing by language. H De Vries, F Strub, J Mary, H Larochelle, O Pietquin, AC Courville, NIPS’17
Copyright © 2018 Criteo
And actually change the embedding construction from
Copyright © 2018 Criteo
And actually change the embedding construction to
Copyright © 2018 Criteo
Doing it using several states of the RNN
Copyright © 2018 Criteo
ReferIt / Guesswhat oracle problem
Copyright © 2018 Criteo
ReferIt / Guesswhat oracle
Visual Reasoning with a Multi-hop FiLM Generator
Florian Strub, Mathieu Seurin, Ethan Perez, Harm De Vries, Jeremie Mary, Philippe Preux, Aaron Courville, Olivier Pietquin
Copyright © 2018 Criteo
Cherry Picking
Copyright © 2018 Criteo
Cherry Picking Failures
Copyright © 2018 Criteo
Outline
1. Fusion of modalities
2. Auction theory meets Machine Learning
3. Hot topics
Copyright © 2018 Criteo
We are a bidding company
More than 300 billion of bids a day. Less than 10ms to make a price.
1 seller with 1 item
n bidders, bidder i has private valuation vi
“valuation” = maximum willingness-to-pay
“private” = initially known only to bidder i
Second-price auction
collect bid bi from each bidder i
winner = highest bidder
price = second-highest bid
Very often our price is way higher than the competion.
Theorem:
renders truthful bidding a
dominant strategy
Problem
Copyright © 2018 Criteo
Reserve Prices (Seller point of view)
Will extract more $$$ at the cost of not selling some displays
How to choose it ?
Assumptions:
•Bidder’s valuation v drawn from distribution F. (F known to seller, v unknown)
•Seller aims to maximize expected revenue (w.r.t. v~F)
Solution: offer r* = argmaxr≥0 r  (1-F(r))
revenue
of a sale
probability of
a sale
Copyright © 2018 Criteo
Reserve price with several bidders
Theorem : [Myyerson 81] With n symmetric iid bidders, for second price auction with
reserve contributing to revenue, the revenue maximizing reserve price is independant
of the number of bidders
Theorem: [Bulow-Klemperer 96]: for every n:
expected revenue ≥ expected revenue
of reserve price 0 of monopoly reserve
[with (n+1) i.i.d. bidders] [with n i.i.d. bidders]
Copyright © 2018 Criteo
Personalized reserves1…
Theorem [Hartline/Roughgarden 09]: for any valuation distributions F1,...,Fn:
≥
expected revenue with
monopoly reserves
(ri = monopoly price for Fi)
50% of expected revenue of
Myerson’s optimal auction
for F1,...,Fn
1 Yes the bidder can loose the auction while having the highest bid
Copyright © 2018 Criteo
In real bidding
F is unknown and is estimated from the bids.
Done by [Ostrovsky/Schwarz 09] at Yahoo
Analysis leads to some finite time ML style bounds by [Morgenstern/Roughgarden
15,16].
Typically requires O(n log n) samples in the multiple bidders setting to achieve
expected revenue within ε of best possible.
This assume the bidders to reveal their true value
Copyright © 2018 Criteo
One strategic bidder setting
A two stage game.
First day: the seller receives billions of bids from the bidders. (we do not consider any
approximation error).
Second day: she sets for each bidder their reserve price as the exact monopoly price
computed on the bids she received during the first stage.
we denote by F1, ..., FN the distribution of the bidders. We assume bidder 1 is
strategic and the others continue to bid truthfully.
G is the distribution of the maximum value
of the competitors of bidder 1.
On all illustration true distribution of values is U[0;1]
Copyright © 2018 Criteo
Myerson lemma
Defining virtual values
Suppose bidder i has values Xi with distribution Fi and associated density fi . fi is
assumed to be positive on the support of Xi . For any incentive compatible auction,
when G represents the distribution of the bids faced by user i, we have, if r is the
reserve price set by the seller,
regardless of whether ψi is increasing.
Copyright © 2018 Criteo
Visualization of Myerson’s lemma
Copyright © 2018 Criteo
ß shading
The payoff of the strategic bidder using the strategy β (ψB denotes the virtual value
associated to the new distribution of bid) is:
And we can remark:
find a « good » ψB and then the corresponding β.
Copyright © 2018 Criteo
Which is the nicest ?
Copyright © 2018 Criteo
Thresholded virtual value
Just solve
On the uniform example this is
And identity for >0.5
Copyright © 2018 Criteo
Comparision of revenue
• the strategic bidder payoff
increases from 0.083 to 0.132 (a
59% increase !!
• the payoff of the truthful bidder
remains unchanged.
• item the payoff of the seller
remains unchanged.
• In particular, the seller does not
lose money.
• welfare increases from 0.583 to
0.632. (a 8% increase!!)
Copyright © 2018 Criteo
More on the topic
Does it cost something to the strategic bidder during the learning stage of the
auctioneer: No ! Since the strategy only changes bids below the reserve price, the
strategic bidders pay nothing to try to convince the seller to decrease the reserve price.
Can we do better Yes! We only presented the simplest way to improve a bidding
strategy. There exist some better strategies that lead to even higher payoffs.
In this setting, can we find a Nash equilibrium when all the bidders become
strategic? : Yes!
Are our proposed strategies stable against some approximation error of the
seller? Yes!
Thresholding the virtual value: a simple method to increase welfare and lower reserve prices in online auction systems
Thomas Nedelec, Marc Abeille, Clément Calauzènes, Noureddine El Karoui, Benjamin Heymann, Vianney Perchet
Explicit shading strategies for repeated truthful auctions. arXiv preprint arXiv:1805.00256, 2018
Marc Abeille, Clement Calauzenes, Noureddine El Karoui, Thomas Nedelec, Vianney Perchet.
Copyright © 2018 Criteo
Outline
1. Fusion of modalities
2. Auction theory meets Machine Learning
3. Hot topics
Copyright © 2018 Criteo
3
Recommend
er
Systems
• Users can get bored seeing similar movies over and over
• Getting to know a new system can takes time and increase curiosity
at first and then decrease it after a while
Task
scheduling
• It might take a while to master a new task so performance increase
after being repeated
• Repeating always the same task can reduce productivity because of
weariness
Resource
balancing
• Always exploiting the same area can diminish returns if population can
not growth again
A B A B B
B A A B A B
Alternating Recommender Systems
Copyright © 2018 Criteo
3
|
state click probability on A
[A,A,B,B,A,A,A,B,B,A] 8.53%
[A,B,B,A,B,B,A,B,A,B] 9.12%
[B,B,B,B,A,A,A,B,B,A] 8.91%
• We use a real-world A/B testing dataset where our model assumptions are no longer satisfied. Users have been exposed to both A and B. We investigate how a long-
term policy alternating A and B on the basis of past choices can outperform each solution individually.
• simulator: measure click rate probability on a version based on the
last w = 10 pulled versions.
𝒔𝒔𝒔𝒔 𝒔𝒔 𝒔𝒔𝒔𝒔 𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔 𝒗𝒗𝒗𝒗𝒗𝒗𝒗𝒗𝒗𝒗𝒗𝒗𝒗𝒗, 𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔 = 𝑩𝑩𝑩𝑩𝑩𝑩𝑩𝑩𝑩𝑩𝑩𝑩𝑩𝑩𝑩𝑩𝑩𝑩(𝒑𝒑)
Compared algorithms
• Oracle optimal optimal policy given the true parameters
• Oracle greedy greedy policy given the true parameters
• UCRL (Auer, Jaksch, and Ortner 2009) considering each action
and state independently
• linUCRL our algorithm
• Only B always play B (click rate of state [B, …, B])
• Only A always play A (click rate of state [A, …, A])
Avg reward on the T steps
Avg reward after T=1600
On Criteo’s A/B tests (NIPS’18)
Romain Warlop , Alessandro Lazaric, Jeremie Mary
Copyright © 2018 Criteo
More
• DPPs for basket completion (look at work of Mike Gartrell)
• Exploration / Exploration under brownian evolution of the world
• GANs
• RNNs (and approximations) for session modelization
• Causality, Incrementality and offline A/B tests.
Copyright © 2018 Criteo
Thank you !
j.mary@criteo.com
https://aiaheadofusbycriteoailab.splashthat.com/
1 of 35

Recommended

ChatGPT and the Future of Work - Clark Boyd by
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
26.2K views69 slides
Getting into the tech field. what next by
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
6.3K views22 slides
Google's Just Not That Into You: Understanding Core Updates & Search Intent by
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
6.7K views99 slides
How to have difficult conversations by
How to have difficult conversations How to have difficult conversations
How to have difficult conversations Rajiv Jayarajah, MAppComm, ACC
5.4K views19 slides
Introduction to Data Science by
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceChristy Abraham Joy
82.5K views51 slides
Time Management & Productivity - Best Practices by
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
169.8K views42 slides

More Related Content

Recently uploaded

Network Source of Truth and Infrastructure as Code revisited by
Network Source of Truth and Infrastructure as Code revisitedNetwork Source of Truth and Infrastructure as Code revisited
Network Source of Truth and Infrastructure as Code revisitedNetwork Automation Forum
49 views45 slides
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue by
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlueElevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlueShapeBlue
149 views7 slides
Webinar : Desperately Seeking Transformation - Part 2: Insights from leading... by
Webinar : Desperately Seeking Transformation - Part 2:  Insights from leading...Webinar : Desperately Seeking Transformation - Part 2:  Insights from leading...
Webinar : Desperately Seeking Transformation - Part 2: Insights from leading...The Digital Insurer
40 views52 slides
Future of AR - Facebook Presentation by
Future of AR - Facebook PresentationFuture of AR - Facebook Presentation
Future of AR - Facebook PresentationRob McCarty
54 views27 slides
What’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlue by
What’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlueWhat’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlue
What’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlueShapeBlue
191 views23 slides
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue by
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlueCloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlueShapeBlue
63 views15 slides

Recently uploaded(20)

Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue by ShapeBlue
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlueElevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue
ShapeBlue149 views
Webinar : Desperately Seeking Transformation - Part 2: Insights from leading... by The Digital Insurer
Webinar : Desperately Seeking Transformation - Part 2:  Insights from leading...Webinar : Desperately Seeking Transformation - Part 2:  Insights from leading...
Webinar : Desperately Seeking Transformation - Part 2: Insights from leading...
Future of AR - Facebook Presentation by Rob McCarty
Future of AR - Facebook PresentationFuture of AR - Facebook Presentation
Future of AR - Facebook Presentation
Rob McCarty54 views
What’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlue by ShapeBlue
What’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlueWhat’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlue
What’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlue
ShapeBlue191 views
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue by ShapeBlue
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlueCloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue
ShapeBlue63 views
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive by Network Automation Forum
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLiveAutomating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R... by ShapeBlue
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...
ShapeBlue105 views
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or... by ShapeBlue
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
ShapeBlue128 views
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit... by ShapeBlue
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...
ShapeBlue86 views
GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N... by James Anderson
GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N...GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N...
GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N...
James Anderson142 views
"Surviving highload with Node.js", Andrii Shumada by Fwdays
"Surviving highload with Node.js", Andrii Shumada "Surviving highload with Node.js", Andrii Shumada
"Surviving highload with Node.js", Andrii Shumada
Fwdays49 views
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha... by ShapeBlue
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...
ShapeBlue113 views
Business Analyst Series 2023 - Week 4 Session 7 by DianaGray10
Business Analyst Series 2023 -  Week 4 Session 7Business Analyst Series 2023 -  Week 4 Session 7
Business Analyst Series 2023 - Week 4 Session 7
DianaGray10110 views
Migrating VMware Infra to KVM Using CloudStack - Nicolas Vazquez - ShapeBlue by ShapeBlue
Migrating VMware Infra to KVM Using CloudStack - Nicolas Vazquez - ShapeBlueMigrating VMware Infra to KVM Using CloudStack - Nicolas Vazquez - ShapeBlue
Migrating VMware Infra to KVM Using CloudStack - Nicolas Vazquez - ShapeBlue
ShapeBlue147 views
Hypervisor Agnostic DRS in CloudStack - Brief overview & demo - Vishesh Jinda... by ShapeBlue
Hypervisor Agnostic DRS in CloudStack - Brief overview & demo - Vishesh Jinda...Hypervisor Agnostic DRS in CloudStack - Brief overview & demo - Vishesh Jinda...
Hypervisor Agnostic DRS in CloudStack - Brief overview & demo - Vishesh Jinda...
ShapeBlue93 views
VNF Integration and Support in CloudStack - Wei Zhou - ShapeBlue by ShapeBlue
VNF Integration and Support in CloudStack - Wei Zhou - ShapeBlueVNF Integration and Support in CloudStack - Wei Zhou - ShapeBlue
VNF Integration and Support in CloudStack - Wei Zhou - ShapeBlue
ShapeBlue134 views
Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ... by ShapeBlue
Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ...Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ...
Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ...
ShapeBlue52 views
Keynote Talk: Open Source is Not Dead - Charles Schulz - Vates by ShapeBlue
Keynote Talk: Open Source is Not Dead - Charles Schulz - VatesKeynote Talk: Open Source is Not Dead - Charles Schulz - Vates
Keynote Talk: Open Source is Not Dead - Charles Schulz - Vates
ShapeBlue178 views

Featured

Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present... by
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools
55.5K views138 slides
12 Ways to Increase Your Influence at Work by
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at WorkGetSmarter
401.7K views64 slides
ChatGPT webinar slides by
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slidesAlireza Esmikhani
30.4K views36 slides
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G... by
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...DevGAMM Conference
3.6K views12 slides
Barbie - Brand Strategy Presentation by
Barbie - Brand Strategy PresentationBarbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationErica Santiago
25.1K views46 slides

Featured(20)

Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present... by Applitools
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Applitools55.5K views
12 Ways to Increase Your Influence at Work by GetSmarter
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
GetSmarter401.7K views
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G... by DevGAMM Conference
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
DevGAMM Conference3.6K views
Barbie - Brand Strategy Presentation by Erica Santiago
Barbie - Brand Strategy PresentationBarbie - Brand Strategy Presentation
Barbie - Brand Strategy Presentation
Erica Santiago25.1K views
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them well by Saba Software
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them wellGood Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
Saba Software25.3K views
Introduction to C Programming Language by Simplilearn
Introduction to C Programming LanguageIntroduction to C Programming Language
Introduction to C Programming Language
Simplilearn8.4K views
The Pixar Way: 37 Quotes on Developing and Maintaining a Creative Company (fr... by Palo Alto Software
The Pixar Way: 37 Quotes on Developing and Maintaining a Creative Company (fr...The Pixar Way: 37 Quotes on Developing and Maintaining a Creative Company (fr...
The Pixar Way: 37 Quotes on Developing and Maintaining a Creative Company (fr...
Palo Alto Software88.4K views
9 Tips for a Work-free Vacation by Weekdone.com
9 Tips for a Work-free Vacation9 Tips for a Work-free Vacation
9 Tips for a Work-free Vacation
Weekdone.com7.2K views
How to Map Your Future by SlideShop.com
How to Map Your FutureHow to Map Your Future
How to Map Your Future
SlideShop.com275.1K views
Beyond Pride: Making Digital Marketing & SEO Authentically LGBTQ+ Inclusive -... by AccuraCast
Beyond Pride: Making Digital Marketing & SEO Authentically LGBTQ+ Inclusive -...Beyond Pride: Making Digital Marketing & SEO Authentically LGBTQ+ Inclusive -...
Beyond Pride: Making Digital Marketing & SEO Authentically LGBTQ+ Inclusive -...
AccuraCast3.4K views
Exploring ChatGPT for Effective Teaching and Learning.pptx by Stan Skrabut, Ed.D.
Exploring ChatGPT for Effective Teaching and Learning.pptxExploring ChatGPT for Effective Teaching and Learning.pptx
Exploring ChatGPT for Effective Teaching and Learning.pptx
Stan Skrabut, Ed.D.57.7K views
How to train your robot (with Deep Reinforcement Learning) by Lucas García, PhD
How to train your robot (with Deep Reinforcement Learning)How to train your robot (with Deep Reinforcement Learning)
How to train your robot (with Deep Reinforcement Learning)
Lucas García, PhD42.5K views
4 Strategies to Renew Your Career Passion by Daniel Goleman
4 Strategies to Renew Your Career Passion4 Strategies to Renew Your Career Passion
4 Strategies to Renew Your Career Passion
Daniel Goleman122K views
The Student's Guide to LinkedIn by LinkedIn
The Student's Guide to LinkedInThe Student's Guide to LinkedIn
The Student's Guide to LinkedIn
LinkedIn88.1K views
Different Roles in Machine Learning Career by Intellipaat
Different Roles in Machine Learning CareerDifferent Roles in Machine Learning Career
Different Roles in Machine Learning Career
Intellipaat12.4K views
Defining a Tech Project Vision in Eight Quick Steps pdf by TechSoup
Defining a Tech Project Vision in Eight Quick Steps pdfDefining a Tech Project Vision in Eight Quick Steps pdf
Defining a Tech Project Vision in Eight Quick Steps pdf
TechSoup 9.7K views

Criteo AI Lab: from applied to fundamental AI

  • 1. Jeremie Mary, 17/09/18 From applied to fundamental research
  • 2. Copyright © 2018 Criteo AI applied to Criteo Dynamic Retargeting since 2008 Universal Match One user profile across all devices Product Recommendations Kinetic Design Predictive Bidding Chooses the right products to display Chooses the right look and feel for the banners in real time Personalized Ads Optimized Performance Chooses the right users / advertiser / publisher to display  eCPM = CPC*pCTR*pCR*pOV 1 3 2 4 Optimized on CTR + CR + Order Value
  • 3. Copyright © 2018 Criteo Outline 1. Fusion of modalities 2. Auction theory meets Machine Learning 3. Hot topics
  • 4. Copyright © 2018 Criteo Fusion of heterogeneous data Problem How to build a predictor based on completly different kind of data ? e.g. pictures and texts and you want to predict the interest of the user for the item. Your favorite neural network for pictures (Resnet?) Some description text or tags Your favorite neural network for this (BiGRU with GA?) E m b e d d i n g E m b e d d i n g Prediction 1 Prediction 2 Vote! or average
  • 5. Copyright © 2018 Criteo Fusion of heterogeneous data Problem How to build a predictor based on completly different kind of data ? e.g. pictures and texts and you want to predict the interest of the user for the item. Your favorite neural network for pictures (Resnet?) What is the color of the cat? Your favorite neural network for this (BiGRU with GA?) E m b e d d i n g E m b e d d i n g Prediction M e r g e Is it actually good to build the embeddings independantly ?
  • 6. Copyright © 2018 Criteo Idea Batch Norm Parameters In a good network activation of neurons thought the data should be similar [1]. This was introduced as a reparametrization trick to ensure faster convergence [1] I. Sergey and S. Christian. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. ICML, 2015.
  • 7. Copyright © 2018 Criteo Few parameters but… While Number of batch norms parameters is usually 0.2 to 5% of the net, their impact on the output is huge [2] [2] V. Dumoulin, J. Shlens, and M. Kudlur. A Learned Representation For Artistic Style. In Proc. of ICLR, 2017.
  • 8. Copyright © 2018 Criteo An alternative way to fuse modalities Image Text
  • 9. Copyright © 2018 Criteo … and this work well on VQA [13] Modulating early visual processing by language. H De Vries, F Strub, J Mary, H Larochelle, O Pietquin, AC Courville, NIPS’17
  • 10. Copyright © 2018 Criteo And actually change the embedding construction from
  • 11. Copyright © 2018 Criteo And actually change the embedding construction to
  • 12. Copyright © 2018 Criteo Doing it using several states of the RNN
  • 13. Copyright © 2018 Criteo ReferIt / Guesswhat oracle problem
  • 14. Copyright © 2018 Criteo ReferIt / Guesswhat oracle Visual Reasoning with a Multi-hop FiLM Generator Florian Strub, Mathieu Seurin, Ethan Perez, Harm De Vries, Jeremie Mary, Philippe Preux, Aaron Courville, Olivier Pietquin
  • 15. Copyright © 2018 Criteo Cherry Picking
  • 16. Copyright © 2018 Criteo Cherry Picking Failures
  • 17. Copyright © 2018 Criteo Outline 1. Fusion of modalities 2. Auction theory meets Machine Learning 3. Hot topics
  • 18. Copyright © 2018 Criteo We are a bidding company More than 300 billion of bids a day. Less than 10ms to make a price. 1 seller with 1 item n bidders, bidder i has private valuation vi “valuation” = maximum willingness-to-pay “private” = initially known only to bidder i Second-price auction collect bid bi from each bidder i winner = highest bidder price = second-highest bid Very often our price is way higher than the competion. Theorem: renders truthful bidding a dominant strategy Problem
  • 19. Copyright © 2018 Criteo Reserve Prices (Seller point of view) Will extract more $$$ at the cost of not selling some displays How to choose it ? Assumptions: •Bidder’s valuation v drawn from distribution F. (F known to seller, v unknown) •Seller aims to maximize expected revenue (w.r.t. v~F) Solution: offer r* = argmaxr≥0 r  (1-F(r)) revenue of a sale probability of a sale
  • 20. Copyright © 2018 Criteo Reserve price with several bidders Theorem : [Myyerson 81] With n symmetric iid bidders, for second price auction with reserve contributing to revenue, the revenue maximizing reserve price is independant of the number of bidders Theorem: [Bulow-Klemperer 96]: for every n: expected revenue ≥ expected revenue of reserve price 0 of monopoly reserve [with (n+1) i.i.d. bidders] [with n i.i.d. bidders]
  • 21. Copyright © 2018 Criteo Personalized reserves1… Theorem [Hartline/Roughgarden 09]: for any valuation distributions F1,...,Fn: ≥ expected revenue with monopoly reserves (ri = monopoly price for Fi) 50% of expected revenue of Myerson’s optimal auction for F1,...,Fn 1 Yes the bidder can loose the auction while having the highest bid
  • 22. Copyright © 2018 Criteo In real bidding F is unknown and is estimated from the bids. Done by [Ostrovsky/Schwarz 09] at Yahoo Analysis leads to some finite time ML style bounds by [Morgenstern/Roughgarden 15,16]. Typically requires O(n log n) samples in the multiple bidders setting to achieve expected revenue within ε of best possible. This assume the bidders to reveal their true value
  • 23. Copyright © 2018 Criteo One strategic bidder setting A two stage game. First day: the seller receives billions of bids from the bidders. (we do not consider any approximation error). Second day: she sets for each bidder their reserve price as the exact monopoly price computed on the bids she received during the first stage. we denote by F1, ..., FN the distribution of the bidders. We assume bidder 1 is strategic and the others continue to bid truthfully. G is the distribution of the maximum value of the competitors of bidder 1. On all illustration true distribution of values is U[0;1]
  • 24. Copyright © 2018 Criteo Myerson lemma Defining virtual values Suppose bidder i has values Xi with distribution Fi and associated density fi . fi is assumed to be positive on the support of Xi . For any incentive compatible auction, when G represents the distribution of the bids faced by user i, we have, if r is the reserve price set by the seller, regardless of whether ψi is increasing.
  • 25. Copyright © 2018 Criteo Visualization of Myerson’s lemma
  • 26. Copyright © 2018 Criteo ß shading The payoff of the strategic bidder using the strategy β (ψB denotes the virtual value associated to the new distribution of bid) is: And we can remark: find a « good » ψB and then the corresponding β.
  • 27. Copyright © 2018 Criteo Which is the nicest ?
  • 28. Copyright © 2018 Criteo Thresholded virtual value Just solve On the uniform example this is And identity for >0.5
  • 29. Copyright © 2018 Criteo Comparision of revenue • the strategic bidder payoff increases from 0.083 to 0.132 (a 59% increase !! • the payoff of the truthful bidder remains unchanged. • item the payoff of the seller remains unchanged. • In particular, the seller does not lose money. • welfare increases from 0.583 to 0.632. (a 8% increase!!)
  • 30. Copyright © 2018 Criteo More on the topic Does it cost something to the strategic bidder during the learning stage of the auctioneer: No ! Since the strategy only changes bids below the reserve price, the strategic bidders pay nothing to try to convince the seller to decrease the reserve price. Can we do better Yes! We only presented the simplest way to improve a bidding strategy. There exist some better strategies that lead to even higher payoffs. In this setting, can we find a Nash equilibrium when all the bidders become strategic? : Yes! Are our proposed strategies stable against some approximation error of the seller? Yes! Thresholding the virtual value: a simple method to increase welfare and lower reserve prices in online auction systems Thomas Nedelec, Marc Abeille, Clément Calauzènes, Noureddine El Karoui, Benjamin Heymann, Vianney Perchet Explicit shading strategies for repeated truthful auctions. arXiv preprint arXiv:1805.00256, 2018 Marc Abeille, Clement Calauzenes, Noureddine El Karoui, Thomas Nedelec, Vianney Perchet.
  • 31. Copyright © 2018 Criteo Outline 1. Fusion of modalities 2. Auction theory meets Machine Learning 3. Hot topics
  • 32. Copyright © 2018 Criteo 3 Recommend er Systems • Users can get bored seeing similar movies over and over • Getting to know a new system can takes time and increase curiosity at first and then decrease it after a while Task scheduling • It might take a while to master a new task so performance increase after being repeated • Repeating always the same task can reduce productivity because of weariness Resource balancing • Always exploiting the same area can diminish returns if population can not growth again A B A B B B A A B A B Alternating Recommender Systems
  • 33. Copyright © 2018 Criteo 3 | state click probability on A [A,A,B,B,A,A,A,B,B,A] 8.53% [A,B,B,A,B,B,A,B,A,B] 9.12% [B,B,B,B,A,A,A,B,B,A] 8.91% • We use a real-world A/B testing dataset where our model assumptions are no longer satisfied. Users have been exposed to both A and B. We investigate how a long- term policy alternating A and B on the basis of past choices can outperform each solution individually. • simulator: measure click rate probability on a version based on the last w = 10 pulled versions. 𝒔𝒔𝒔𝒔 𝒔𝒔 𝒔𝒔𝒔𝒔 𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔 𝒗𝒗𝒗𝒗𝒗𝒗𝒗𝒗𝒗𝒗𝒗𝒗𝒗𝒗, 𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔 = 𝑩𝑩𝑩𝑩𝑩𝑩𝑩𝑩𝑩𝑩𝑩𝑩𝑩𝑩𝑩𝑩𝑩𝑩(𝒑𝒑) Compared algorithms • Oracle optimal optimal policy given the true parameters • Oracle greedy greedy policy given the true parameters • UCRL (Auer, Jaksch, and Ortner 2009) considering each action and state independently • linUCRL our algorithm • Only B always play B (click rate of state [B, …, B]) • Only A always play A (click rate of state [A, …, A]) Avg reward on the T steps Avg reward after T=1600 On Criteo’s A/B tests (NIPS’18) Romain Warlop , Alessandro Lazaric, Jeremie Mary
  • 34. Copyright © 2018 Criteo More • DPPs for basket completion (look at work of Mike Gartrell) • Exploration / Exploration under brownian evolution of the world • GANs • RNNs (and approximations) for session modelization • Causality, Incrementality and offline A/B tests.
  • 35. Copyright © 2018 Criteo Thank you ! j.mary@criteo.com https://aiaheadofusbycriteoailab.splashthat.com/