Reward Innovation for long-term member satisfaction

•Download as PPTX, PDF•

0 likes•1,053 views

Overview of reward innovation for long-term member satisfaction at Netflix. Paper: https://dl.acm.org/doi/10.1145/3604915.3608873

Technology

Reward innovation for
long-term member
satisfaction
Gary Tang, Jiangwei Pan, Henry
Wang and Justin Basilico

Goal
Create a personalized homepage to help
members find content to watch and enjoy
that
maximizes long-term member satisfaction.

Member enjoys watching Netflix
So continues the subscription
and
tells their friends about it
Long-term satisfaction for Netflix

Batch learning from bandit feedback
Production policy
● show recommendations to the member
Member gives feedbacks on recommendations
● immediate: skip/play a show
● long-term: cancel/renew subscription
Goal
● train a policy to maximize the long-term reward

Challenges with long-term retention
Noisy signal
Influenced by external
factors
Nonsensitive signal
Only sensitive for
“borderline” members
Delayed signal
Need to wait a long time &
hard to attribute

Proxy reward
Train policy to optimize proxy reward
● highly correlated with long-term
satisfaction
● sensitive to individual
recommendations
Immediate feedback as proxy
● e.g. start to play a show
Delayed feedback could be more aligned
● e.g. completing a show

Delayed proxy reward
Need to wait a long time to observe the
delayed reward
● can not be used in training
immediately
● can hurt coldstarting of model
Don’t want to wait? predict delayed reward
● use all user actions up to policy training
time
Train policy to maximize predicted reward
Note: predicted reward can not be used
online as it uses post-recommendation user
actions

The ideas are not new
Related work:
● Long-term optimization in recommenders
● Reward shaping in reinforcement learning
● Online reward optimization
We focus on reward innovation as an important
product development workstream at Netflix

Integrating reward in bandit
Reward component
provides the objective
of bandit policy
training data

Reward innovation
● Ideation: what aspect of long-term satisfaction hasn’t been
captured as a reward?
○ Requires balancing perspectives of ML, business, and
psychology. Not easy!
● Development: how do we compute this new reward for every
recommended item?
○ Collecting immediate feedbacks, predicting delayed feedbacks
● Evaluation: whether this is a good reward for the bandit
policy to maximize?

Reward infrastructure
Common development patterns
● Computing immediate proxy
rewards
● Predicting delayed proxy rewards
● Combining multiple rewards
● Sharing rewards across multiple
recommender components

Reward evaluation
● Online testing is expensive (time, resources)
● Use offline evaluation to determine promising
reward candidates for online testing
● Challenge: hard to compare policies trained with
different rewards

Offline reward evaluation
● compare policies along multiple
reward axes using OPE
● choose small number of candidate
policies on pareto front
● compare candidate policies online
using long-term user satisfaction
metrics

Practical learnings
● Reward normalization: dynamic range of reward
can affect SGD training dynamics
● Reward features: pairing a reward with a correlated
feature tends to improve the model’s ability to
optimize that reward
● Reward alignment: make different parts of the
overall recommender system “point in the same
direction”

Summary
● Proxy rewards: we can train bandit policies using
proxy rewards to optimize long-term member
satisfaction
● Art and science: to come up with good reward
hypotheses
● Supporting infrastructure: we develop
infrastructure to help iterate on new hypotheses
quickly

Open challenges
● Proxy rewards: how can we identify proxy rewards that are
aligned with long-term satisfaction in a more principle
way?
● Reinforcement learning: can we use reinforcement
learning to optimize long-term reward directly for
recommendations?

Thank you!
Questions?
Jiangwei Pan
jpan@netflix.com

by Harald Steck (Netflix Inc., US), Roelof van Zwol (Netflix Inc., US) and Chris Johnson (Spotify Inc., US) Slides of the tutorial on interactive recommender systems at the 2015 conference on Recommender Systems (RecSys). Interactive recommender systems enable the user to steer the received recommendations in the desired direction through explicit interaction with the system. In the larger ecosystem of recommender systems used on a website, it is positioned between a lean-back recommendation experience and an active search for a specific piece of content. Besides this aspect, we will discuss several parts that are especially important for interactive recommender systems, including the following: design of the user interface and its tight integration with the algorithm in the back-end; computational efficiency of the recommender algorithm; as well as choosing the right balance between exploiting the feedback from the user as to provide relevant recommendations, and enabling the user to explore the catalog and steer the recommendations in the desired direction. In particular, we will explore the field of interactive video and music recommendations and their application at Netflix and Spotify. We outline some of the user-experiences built, and discuss the approaches followed to tackle the various aspects of interactive recommendations. We present our insights from user studies and A/B tests. The tutorial targets researchers and practitioners in the field of recommender systems, and will give the participants a unique opportunity to learn about the various aspects of interactive recommender systems in the video and music domain. The tutorial assumes familiarity with the common methods of recommender systems. DATE: Wednesday, Sept 16, 2015, 11:00-12:30

Artwork Personalization at Netflix

Justin Basilico

This document summarizes an presentation about personalizing artwork selection on Netflix using multi-armed bandit algorithms. Bandit algorithms were applied to choose representative, informative and engaging artwork for each title to maximize member satisfaction and retention. Contextual bandits were used to personalize artwork selection based on member preferences and context. Netflix deployed a system that precomputes personalized artwork using bandit models and caches the results to serve images quickly at scale. The system was able to lift engagement metrics based on A/B tests of the personalized artwork selection models.

Recommendations for Building Machine Learning Software

Justin Basilico

Time, Context and Causality in Recommender Systems

Yves Raimond

Artwork Personalization at Netflix Fernando Amat RecSys2018

Fernando Amat

For many years, the main goal of the Netflix personalized recommendation system has been to get the right titles in front of our members at the right time. But the job of recommendation does not end there. The homepage should be able to convey to the member enough evidence of why a title may be good for her, especially for shows that the member has never heard of. One way to address this challenge is to personalize the way we portray the titles on our service. An important aspect of how to portray titles is through the artwork or imagery we display to visually represent each title. The artwork may highlight an actor that you recognize, capture an exciting moment like a car chase, or contain a dramatic scene that conveys the essence of a movie or show. It is important to select good artwork because it may be the first time a member becomes aware of a title (and sometimes the only time), so it must speak to them in a meaningful way. In this talk, we will present an approach for personalizing the artwork we use on the Netflix homepage. The system selects an image for each member and video to give better visual evidence for why the title might be appealing to that particular member.

Déjà Vu: The Importance of Time and Causality in Recommender Systems

Justin Basilico

This document discusses the importance of time and causality in recommender systems. It summarizes that (1) time and causality are critical aspects that must be considered in data collection, experiment design, algorithms, and system design. (2) Recommender systems operate within a feedback loop where the recommendations influence future user behavior and data, so effects like reinforcement of biases can occur. (3) Both offline and online experimentation are needed to properly evaluate systems and generalization over time.

Learning to Personalize

Justin Basilico

In this talk, we will provide an overview of Deep Learning methods applied to personalization and search at Netflix. We will set the stage by describing the unique challenges faced at Netflix in the areas of recommendations and information retrieval. Then we will delve into how we leverage a blend of traditional algorithms and emergent deep learning methods and new types of embeddings, especially hyperbolic space embeddings, to address these challenges.

Data council SF 2020 Building a Personalized Messaging System at Netflix

Grace T. Huang

This document discusses building a personalized messaging system at Netflix to recommend content to users. It covers four key considerations: 1) Personalizing messaging decisions using classification techniques like logistic regression on outcome features. 2) Removing bias from the system using techniques like Thompson sampling, exploration-exploitation, and propensity correction. 3) Maximizing causal impact by explicitly modeling past actions and comparing member satisfaction with and without messages. 4) Balancing reward against cost by imposing a volume constraint like an incrementality threshold and using reinforcement learning approaches.

Correlation, causation and incrementally recommendation problems at netflix ...

Roelof van Zwol

Within Netflix, personalization is a key differentiator, helping members to quickly discover new content that matches their taste. Done well, it creates an immersive user experience, however when the recommendation is out of tune, it is immediately noticed by our members. During this presentation I will cover some of the personalization and recommendation tasks that jointly define the Netflix user experience that entertains more that 130M members world wide. In particular, I will focus on several of the algorithmic challenges related to the launch of new Netflix originals in the service, and go over concepts such as causality, incrementality and explore-exploit strategies. The research presented in this talk represents the collaborative efforts of a team of research scientists and engineers at Netflix on our journey to create best in class user experiences.

Learning a Personalized Homepage

Justin Basilico

Deep Learning for Recommender Systems

Justin Basilico

Talk with Yves Raimond at the GPU Tech Conference on Marth 28, 2018 in San Jose, CA. Abstract: In this talk, we will survey how Deep Learning methods can be applied to personalization and recommendations. We will cover why standard Deep Learning approaches don't perform better than typical collaborative filtering techniques. Then we will survey we will go over recently published research at the intersection of Deep Learning and recommender systems, looking at how they integrate new types of data, explore new models, or change the recommendation problem statement. We will also highlight some of the ways that neural networks are used at Netflix and how we can use GPUs to train recommender systems. Finally, we will highlight promising new directions in this space.

Past, Present & Future of Recommender Systems: An Industry Perspective

Justin Basilico

Missing values in recommender models

Parmeshwar Khurd

The document summarizes techniques for handling missing values in recommender models. It discusses how gradient boosted decision trees (GBDTs) and neural networks (NNs) can deal with missing features during training without imputing values. For GBDTs, XGBoost and R's GBM handle missing values differently, with XGBoost sending examples left or right and GBM using a ternary split. NNs can handle missing features via techniques like dropout, imputing averages, or including a "missing" embedding value. The document concludes that the optimal approach depends on the dataset.

Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...

MLconf

Recent Trends in Personalization at Netflix

Justin Basilico

Context Aware Recommendations at Netflix

Linas Baltrunas

At Netflix, we try to provide the best personalized video recommendations to our members. To do this, we need to adapt our recommendations for each contextual situation, which depends on information such as time or device. In this talk, I will describe how state of the art Contextual Recommendations are used at Netflix. A first example of contextual adaptation is the model that powers the Continue Watching row. It uses a feature-based approach with a carefully constructed training set to learn how to adapt to the context of the member. Next, I will dive into more modern approaches such as Tensor Factorization and LSTMs and share some results from deployments of these methods. I will highlight lessons learned and some common pitfalls of using these powerful methods in industrial scale systems. Finally, I will touch upon system reliability, choice of optimization metrics, hidden costs, risks and benefits of using highly adaptive systems.

Homepage Personalization at Spotify

Oguz Semerci

The document discusses homepage personalization at Spotify. It begins by noting that the homepage is an important discovery, personalization, and marketplace tool. It then describes how the homepage is organized into shelves and cards containing content like albums and playlists. It discusses how a ranking algorithm and bandit policy are used to serve personalized recommendations while introducing exploration to avoid feedback loops. Finally, it provides examples of sanity checks used in production to validate that the policy and models are working as intended.

Making Netflix Machine Learning Algorithms Reliable

Justin Basilico

This document discusses making Netflix machine learning algorithms reliable. It describes how Netflix uses machine learning for tasks like personalized ranking and recommendation. The goals are to maximize member satisfaction and retention. The models and algorithms used include regression, matrix factorization, neural networks, and bandits. The key aspects of making the models reliable discussed are: automated retraining of models, testing training pipelines, checking models and inputs online for anomalies, responding gracefully to failures, and training models to be resilient to different conditions and failures.

Deep Learning for Recommender Systems

Yves Raimond

(Presented at the Deep Learning Re-Work SF Summit on 01/25/2018) In this talk, we go through the traditional recommendation systems set-up, and show that deep learning approaches in that set-up don't bring a lot of extra value. We then focus on different ways to leverage these techniques, most of which relying on breaking away from that traditional set-up; through providing additional data to your recommendation algorithm, modeling different facets of user/item interactions, and most importantly re-framing the recommendation problem itself. In particular we show a few results obtained by casting the problem as a contextual sequence prediction task, and using it to model time (a very important dimension in most recommendation systems).

Recent Trends in Personalization at Netflix

Förderverein Technische Fakultät

Recommendation systems today are widely used across many applications such as in multimedia content platforms, social networks, and ecommerce, to provide suggestions to users that are most likely to fulfill their needs, thereby improving the user experience. Academic research, to date, largely focuses on the performance of recommendation models in terms of ranking quality or accuracy measures, which often don’t directly translate into improvements in the real-world. In this talk, we present some of the most interesting challenges that we face in the personalization efforts at Netflix. The goal of this talk is to sunshine challenging research problems in industrial recommendation systems and start a conversation about exciting areas of future research.

Personalized Page Generation for Browsing Recommendations

Justin Basilico

Recent Trends in Personalization: A Netflix Perspective

Justin Basilico

Personalizing "The Netflix Experience" with Deep Learning

Anoop Deoras

Tutorial on People Recommendations in Social Networks - ACM RecSys 2013,Hong...

Anmol Bhasin

The document summarizes a presentation on people recommender systems and social networks. It discusses key concepts in social recommenders like reciprocity and multiple objectives. It provides examples of recommender systems at LinkedIn including People You May Know, talent matching, and endorsements. It also covers special topics like intent understanding using techniques like survival analysis, and evaluation challenges for social recommenders.

Lessons Learned from Building Machine Learning Software at Netflix

Justin Basilico

Talk from Software Engineering for Machine Learning Workshop (SW4ML) at the Neural Information Processing Systems (NIPS) 2014 conference in Montreal, Canada on 2014-12-13. Abstract: Building a real system that incorporates machine learning as a part can be a difficult effort, both in terms of the algorithmic and engineering challenges involved. In this talk I will focus on the engineering side and discuss some of the practical issues we’ve encountered in developing real machine learning systems at Netflix and some of the lessons we’ve learned over time. I will describe our approach for building machine learning systems and how it comes from a desire to balance many different, and sometimes conflicting, requirements such as handling large volumes of data, choosing and adapting good algorithms, keeping recommendations fresh and accurate, remaining responsive to user actions, and also being flexible to accommodate research and experimentation. I will focus on what it takes to put machine learning into a real system that works in a feedback loop with our users and how that imposes different requirements and a different focus than doing machine learning only within a lab environment. I will address the particular software engineering challenges that we’ve faced in running our algorithms at scale in the cloud. I will also mention some simple design patterns that we’ve fond to be useful across a wide variety of machine-learned systems.

VIP Program Creation

Jenny Weigle

2016 5-5 technology association of georgia - tag - sales force engagement (2)...

Erin Bush

What's hot

A Multi-Armed Bandit Framework For Recommendations at Netflix

Jaya Kawale

Deeper Things: How Netflix Leverages Deep Learning in Recommendations and Se...

Sudeep Das, Ph.D.

Data council SF 2020 Building a Personalized Messaging System at Netflix

Grace T. Huang

Correlation, causation and incrementally recommendation problems at netflix ...

Roelof van Zwol

Learning a Personalized Homepage

Justin Basilico

Deep Learning for Recommender Systems

Justin Basilico

Past, Present & Future of Recommender Systems: An Industry Perspective

Justin Basilico

Missing values in recommender models

Parmeshwar Khurd

Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...

MLconf

Recent Trends in Personalization at Netflix

Justin Basilico

Context Aware Recommendations at Netflix

Linas Baltrunas

Homepage Personalization at Spotify

Oguz Semerci

Making Netflix Machine Learning Algorithms Reliable

Justin Basilico

Deep Learning for Recommender Systems

Yves Raimond

Recent Trends in Personalization at Netflix

Förderverein Technische Fakultät

Personalized Page Generation for Browsing Recommendations

Justin Basilico

Recent Trends in Personalization: A Netflix Perspective

Justin Basilico

Personalizing "The Netflix Experience" with Deep Learning

Anoop Deoras

Tutorial on People Recommendations in Social Networks - ACM RecSys 2013,Hong...

Anmol Bhasin

Lessons Learned from Building Machine Learning Software at Netflix

Justin Basilico

What's hot (20)

A Multi-Armed Bandit Framework For Recommendations at Netflix

Deeper Things: How Netflix Leverages Deep Learning in Recommendations and Se...

Data council SF 2020 Building a Personalized Messaging System at Netflix

Correlation, causation and incrementally recommendation problems at netflix ...

Learning a Personalized Homepage

Deep Learning for Recommender Systems

Past, Present & Future of Recommender Systems: An Industry Perspective

Missing values in recommender models

Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...

Recent Trends in Personalization at Netflix

Context Aware Recommendations at Netflix

Homepage Personalization at Spotify

Making Netflix Machine Learning Algorithms Reliable

Deep Learning for Recommender Systems

Recent Trends in Personalization at Netflix

Personalized Page Generation for Browsing Recommendations

Recent Trends in Personalization: A Netflix Perspective

Personalizing "The Netflix Experience" with Deep Learning

Tutorial on People Recommendations in Social Networks - ACM RecSys 2013,Hong...

Lessons Learned from Building Machine Learning Software at Netflix

Similar to Reward Innovation for long-term member satisfaction

VIP Program Creation

Jenny Weigle

2016 5-5 technology association of georgia - tag - sales force engagement (2)...

Erin Bush

Headspace apm growth project

Dhiren Patel

Analytics Academy 2017 Presentation Slides

HarvardComms

Aligning with partners on customer success

Matthew Klassen

Ensuring client outcomes is difficult enough when there are only two sides to the relationship: the vendor and the customer. When you add partners into the mix, it can make aligning the goals of all parties much more complicated. Allison Pickens, CCO of Gainsight and Chris Doell, VP of Customer Success for Cisco's Cloud Security guide you through the complexities of managing Customer Success in your partner ecosystem.

Case Study: How Our B2B Tech Company Amplified Demand Gen with Podcast Advert...

Kiwi Creative

Learn how Veeva Systems, a leader in cloud-based software for the global life sciences industry, amplified demand generation through an often overlooked platform: podcast advertising. Abbie Hosta, Associate Director of Marketing, will share what worked (and what didn't!) as the tech company: ● Identified the right shows and metrics for success ● Built relationships and trust with podcasters ● Determined tactics and activities The result? 1,000+ product demo views on LinkedIn and YouTube, a 225% increase in traffic from social media and 35 new customers…all within six months! If you’re looking to breathe new life into your marketing strategy, stand out from competitors and do more with fewer resources, read this presentation to determine if podcast advertising might work for you. - - - This is the slide deck from the September 2023 HubSpot User Group (HUG) for B2B Technology USA. View the webinar recording at https://youtu.be/ipU3qmSb2lU Sign up for future HUG events at https://events.hubspot.com/b2b-technology-usa/

Sequential Decision Making in Recommendations

Jaya Kawale

Places for People - Web Strategy Presentation

Michael Sutcliffe (CIM, BA Hons)

The document outlines a 4 stage process for developing a digital web strategy: 1) Research & Planning - Conduct stakeholder interviews, competitor analysis, SWOT analysis and define objectives. 2) Aims & Objectives - Establish a long term vision and set measurable short and long term goals. 3) Implementation - Develop a project plan and roadmap, assign roles, and implement the strategy through regular meetings. 4) Monitor & Improve - Promote the new strategy using SEO, PPC, and digital PR and continually monitor success metrics to refine the strategy.

How to Gauge Engagement in Moodle Using NPS

Lambda Solutions

The document discusses using Net Promoter Score (NPS) in Moodle to gauge learner engagement, including how to set up an NPS feedback activity, analyze the learning analytics, and take action based on the feedback; NPS is typically used to measure customer loyalty and satisfaction on a scale of how likely they are to recommend a product or service to others; the presentation demonstrates how to build NPS reports in Moodle to collect feedback and monitor scores over time in order to make improvements.

Dev's Guide to Feedback Driven Development

Marty Haught

FbDD is a technique for product development that relies on customer feedback to guide decisions. It emphasizes building minimum viable products and testing hypotheses through techniques like A/B testing, tracking usage metrics, net promoter scores, and direct feedback. The goal is to continuously learn what customers need through iterative releases and adjusting the product vision based on validated learning from customer interactions and data.

Panel: Making responsible gambling work within the industry

Horizons RG

This document summarizes a panel discussion on responsible gambling efforts within the gaming industry. The panel was moderated by Paul Smith and included panelists Yasmine Roulleau de La Roussiere, Tammi Barlow, and Mark Vander Linden. Some of the key points discussed include: - Establishing a mission statement and drivers to promote responsible gaming for employees, management, patrons and the community. - Operationalizing responsible gaming through frameworks like Game Sense and integrating best practices. - Partnerships with organizations like BCLC and training programs for employees on responsible gaming. - Pilot programs and initiatives to promote responsible gaming and enhance procedures. - Metrics and goals to be industry leaders in responsible

Net Promoter Camp Training

IPRA conference

The document discusses using the Net Promoter Score (NPS) methodology to measure customer satisfaction and loyalty at camps. It explains how to calculate the NPS, which segments customers into promoters, passives, and detractors. It also discusses using NPS data to drive improvements, ensuring organizational alignment around NPS, and implementing systems like regular data collection and action planning.

Serious Games + Learning Science = Win: How to Teach Product Knowledge, Polic...

Bottom-Line Performance

Serious games have the potential to engage and motivate your learners. But what about driving long-term retention of business-critical knowledge? In this session, the speakers will share how four organizations have put theory into practice and implemented games as part of their training programs. Then, they will explore how serious games can be linked to learning science to increase learner retention of product knowledge, policies, procedures, and basic facts. This session includes seven practical tips for implementing serious games in an organization. Application on the Job: Discover how learning science and games can be linked to drive retention of topics such as product knowledge, policies, and procedures. Access case studies and research you can use to make the case for serious games or gamification in your organization.

Mkt2 building youronlinemktgpresence

brock55

This document discusses building an online marketing presence through affiliate marketing. It defines affiliate marketing and explains that it is part of a larger marketing strategy including pay per click advertising, media buys, social media, search engine optimization, and email marketing. It provides details on each of these marketing channels and emphasizes measuring return on investment to optimize campaigns. The document also provides considerations for running a successful affiliate program, such as building relationships with affiliates and providing tools, promotions, and tracking/reporting. It stresses evaluating an existing affiliate program to improve performance.

The New World of Channel Incentives

CCI - An E2open Company

5 Ways to Increase the Effectiveness of Your Compensation Programs

Human Capital Media

Escalating competition for talent has made it substantially more challenging to attract, retain and motivate employees, while competitive markets have made it more important to control compensation costs. During this spotlight webinar, Dow Scott, professor of human resources in the Graduate School of Business at Loyola University Chicago, will examine five ways to increase the effectiveness of your compensation programs: Align your business strategy and compensation programs. Understand employee pay preferences. Communicate pay information effectively. Utilize incentive pay strategically. Evaluate pay programs to achieve continuous improvement.

How to Keep Up Your Creative Testing After Budget Cuts

Tinuiti

Zero- and first-party data isn’t going anywhere – and while you know that, you may not have a comprehensive strategy for it yet. You’ve probably spotted inefficiencies across your messaging channels and are keen to convert that into opportunities to optimize for conversions – but you haven’t gotten there yet. And, like all marketers, you’re also likely highly concerned with getting the most performance out of every dollar you spend. All of the above elements of lifecycle marketing are no longer just nice-to-have, but a need-to-have in order to succeed now and future proof your strategy. The good news is that our experts know how to navigate these challenges and opportunities and partnered with some of our favorite industry experts to share their insights.

Necessary Elements of Digital Marketing to Grow Your Business

Digital Vidya

First 30 days of Your CRO Program

VWO

Are you about to launch a new CRO program? The first 30 days are pivotal for your long-term success. There’s a lot of pressure to demonstrate value quickly, which isn’t easy. The process is far more complicated than it seems and can only be tackled with a well-thought-out plan. Our expert for this session, Joe Johnston, is the Head of Conversion at Launch, and he understands the common challenges that often arise when you kickstart a CRO program. Fortunately, he also knows how to mitigate them. In this webinar, he will share methods for gaining trust from key stakeholders and laying the foundations for collecting robust (valid) data, which is crucial in the first phase.

Closing the Loop: Enhancing User Experience with Monetization | Tal Shoham

Jessica Tams

This document discusses how mobile game developers can improve user engagement and monetization through rewarded ad placements. It analyzes key metrics for in-app purchases (IAPs) and rewarded ads, such as engagement rate and usage rate. The document provides case studies of games that increased these metrics and monetization by optimizing ad placement, reward schemes, user flows, and targeting high-value player segments. Placing rewarded ads at optimal points, offering more valuable rewards, and tailoring ads to individual players led to higher engagement, retention, and average revenue per daily active user in the case studies.

Similar to Reward Innovation for long-term member satisfaction (20)

VIP Program Creation

2016 5-5 technology association of georgia - tag - sales force engagement (2)...

Headspace apm growth project

Analytics Academy 2017 Presentation Slides

Aligning with partners on customer success

Case Study: How Our B2B Tech Company Amplified Demand Gen with Podcast Advert...

Sequential Decision Making in Recommendations

Places for People - Web Strategy Presentation

How to Gauge Engagement in Moodle Using NPS

Dev's Guide to Feedback Driven Development

Panel: Making responsible gambling work within the industry

Net Promoter Camp Training

Serious Games + Learning Science = Win: How to Teach Product Knowledge, Polic...

Mkt2 building youronlinemktgpresence

The New World of Channel Incentives

5 Ways to Increase the Effectiveness of Your Compensation Programs

How to Keep Up Your Creative Testing After Budget Cuts

Necessary Elements of Digital Marketing to Grow Your Business

First 30 days of Your CRO Program

Closing the Loop: Enhancing User Experience with Monetization | Tal Shoham

Recently uploaded

A Comprehensive Guide to DeFi Development Services in 2024

Intelisync

DeFi represents a paradigm shift in the financial industry. Instead of relying on traditional, centralized institutions like banks, DeFi leverages blockchain technology to create a decentralized network of financial services. This means that financial transactions can occur directly between parties, without intermediaries, using smart contracts on platforms like Ethereum. In 2024, we are witnessing an explosion of new DeFi projects and protocols, each pushing the boundaries of what’s possible in finance. In summary, DeFi in 2024 is not just a trend; it’s a revolution that democratizes finance, enhances security and transparency, and fosters continuous innovation. As we proceed through this presentation, we'll explore the various components and services of DeFi in detail, shedding light on how they are transforming the financial landscape. At Intelisync, we specialize in providing comprehensive DeFi development services tailored to meet the unique needs of our clients. From smart contract development to dApp creation and security audits, we ensure that your DeFi project is built with innovation, security, and scalability in mind. Trust Intelisync to guide you through the intricate landscape of decentralized finance and unlock the full potential of blockchain technology. Ready to take your DeFi project to the next level? Partner with Intelisync for expert DeFi development services today!

Skybuffer SAM4U tool for SAP license adoption

Tatiana Kojar

Manage and optimize your license adoption and consumption with SAM4U, an SAP free customer software asset management tool. SAM4U, an SAP complimentary software asset management tool for customers, delivers a detailed and well-structured overview of license inventory and usage with a user-friendly interface. We offer a hosted, cost-effective, and performance-optimized SAM4U setup in the Skybuffer Cloud environment. You retain ownership of the system and data, while we manage the ABAP 7.58 infrastructure, ensuring fixed Total Cost of Ownership (TCO) and exceptional services through the SAP Fiori interface.

HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU

panagenda

Webinar Recording: https://www.panagenda.com/webinars/hcl-notes-und-domino-lizenzkostenreduzierung-in-der-welt-von-dlau/ DLAU und die Lizenzen nach dem CCB- und CCX-Modell sind für viele in der HCL-Community seit letztem Jahr ein heißes Thema. Als Notes- oder Domino-Kunde haben Sie vielleicht mit unerwartet hohen Benutzerzahlen und Lizenzgebühren zu kämpfen. Sie fragen sich vielleicht, wie diese neue Art der Lizenzierung funktioniert und welchen Nutzen sie Ihnen bringt. Vor allem wollen Sie sicherlich Ihr Budget einhalten und Kosten sparen, wo immer möglich. Das verstehen wir und wir möchten Ihnen dabei helfen! Wir erklären Ihnen, wie Sie häufige Konfigurationsprobleme lösen können, die dazu führen können, dass mehr Benutzer gezählt werden als nötig, und wie Sie überflüssige oder ungenutzte Konten identifizieren und entfernen können, um Geld zu sparen. Es gibt auch einige Ansätze, die zu unnötigen Ausgaben führen können, z. B. wenn ein Personendokument anstelle eines Mail-Ins für geteilte Mailboxen verwendet wird. Wir zeigen Ihnen solche Fälle und deren Lösungen. Und natürlich erklären wir Ihnen das neue Lizenzmodell. Nehmen Sie an diesem Webinar teil, bei dem HCL-Ambassador Marc Thomas und Gastredner Franz Walder Ihnen diese neue Welt näherbringen. Es vermittelt Ihnen die Tools und das Know-how, um den Überblick zu bewahren. Sie werden in der Lage sein, Ihre Kosten durch eine optimierte Domino-Konfiguration zu reduzieren und auch in Zukunft gering zu halten. Diese Themen werden behandelt - Reduzierung der Lizenzkosten durch Auffinden und Beheben von Fehlkonfigurationen und überflüssigen Konten - Wie funktionieren CCB- und CCX-Lizenzen wirklich? - Verstehen des DLAU-Tools und wie man es am besten nutzt - Tipps für häufige Problembereiche, wie z. B. Team-Postfächer, Funktions-/Testbenutzer usw. - Praxisbeispiele und Best Practices zum sofortigen Umsetzen

How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf

Chart Kalyan

UI5 Controls simplified - UI5con2024 presentation

Wouter Lemaire

Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...

saastr

AWS Cloud Cost Optimization Presentation.pptx

HarisZaheer8

This presentation provides valuable insights into effective cost-saving techniques on AWS. Learn how to optimize your AWS resources by rightsizing, increasing elasticity, picking the right storage class, and choosing the best pricing model. Additionally, discover essential governance mechanisms to ensure continuous cost efficiency. Whether you are new to AWS or an experienced user, this presentation provides clear and practical tips to help you reduce your cloud costs and get the most out of your budget.

Finale of the Year: Apply for Next One!

GDSC PJATK

Serial Arm Control in Real Time Presentation

tolgahangng

Introduction of Cybersecurity with OSS at Code Europe 2024

Hiroshi SHIBATA

I develop the Ruby programming language, RubyGems, and Bundler, which are package managers for Ruby. Today, I will introduce how to enhance the security of your application using open-source software (OSS) examples from Ruby and RubyGems. The first topic is CVE (Common Vulnerabilities and Exposures). I have published CVEs many times. But what exactly is a CVE? I'll provide a basic understanding of CVEs and explain how to detect and handle vulnerabilities in OSS. Next, let's discuss package managers. Package managers play a critical role in the OSS ecosystem. I'll explain how to manage library dependencies in your application. I'll share insights into how the Ruby and RubyGems core team works to keep our ecosystem safe. By the end of this talk, you'll have a better understanding of how to safeguard your code.

dbms calicut university B. sc Cs 4th sem.pdf

Shinana2

Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...

Tatiana Kojar

Skybuffer AI, built on the robust SAP Business Technology Platform (SAP BTP), is the latest and most advanced version of our AI development, reaffirming our commitment to delivering top-tier AI solutions. Skybuffer AI harnesses all the innovative capabilities of the SAP BTP in the AI domain, from Conversational AI to cutting-edge Generative AI and Retrieval-Augmented Generation (RAG). It also helps SAP customers safeguard their investments into SAP Conversational AI and ensure a seamless, one-click transition to SAP Business AI. With Skybuffer AI, various AI models can be integrated into a single communication channel such as Microsoft Teams. This integration empowers business users with insights drawn from SAP backend systems, enterprise documents, and the expansive knowledge of Generative AI. And the best part of it is that it is all managed through our intuitive no-code Action Server interface, requiring no extensive coding knowledge and making the advanced AI accessible to more users.

Recommendation System using RAG Architecture

fredae14

Presentation of the OECD Artificial Intelligence Review of Germany

innovationoecd

GraphRAG for Life Science to increase LLM accuracy

Tomaz Bratanic

Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf

Malak Abu Hammad

Discover how MongoDB Atlas and vector search technology can revolutionize your application's search capabilities. This comprehensive presentation covers: * What is Vector Search? * Importance and benefits of vector search * Practical use cases across various industries * Step-by-step implementation guide * Live demos with code snippets * Enhancing LLM capabilities with vector search * Best practices and optimization strategies Perfect for developers, AI enthusiasts, and tech leaders. Learn how to leverage MongoDB Atlas to deliver highly relevant, context-aware search results, transforming your data retrieval process. Stay ahead in tech innovation and maximize the potential of your applications. #MongoDB #VectorSearch #AI #SemanticSearch #TechInnovation #DataScience #LLM #MachineLearning #SearchTechnology

Best 20 SEO Techniques To Improve Website Visibility In SERP

Pixlogix Infotech

Driving Business Innovation: Latest Generative AI Advancements & Success Story

Safe Software

Are you ready to revolutionize how you handle data? Join us for a webinar where we’ll bring you up to speed with the latest advancements in Generative AI technology and discover how leveraging FME with tools from giants like Google Gemini, Amazon, and Microsoft OpenAI can supercharge your workflow efficiency. During the hour, we’ll take you through: Guest Speaker Segment with Hannah Barrington: Dive into the world of dynamic real estate marketing with Hannah, the Marketing Manager at Workspace Group. Hear firsthand how their team generates engaging descriptions for thousands of office units by integrating diverse data sources—from PDF floorplans to web pages—using FME transformers, like OpenAIVisionConnector and AnthropicVisionConnector. This use case will show you how GenAI can streamline content creation for marketing across the board. Ollama Use Case: Learn how Scenario Specialist Dmitri Bagh has utilized Ollama within FME to input data, create custom models, and enhance security protocols. This segment will include demos to illustrate the full capabilities of FME in AI-driven processes. Custom AI Models: Discover how to leverage FME to build personalized AI models using your data. Whether it’s populating a model with local data for added security or integrating public AI tools, find out how FME facilitates a versatile and secure approach to AI. We’ll wrap up with a live Q&A session where you can engage with our experts on your specific use cases, and learn more about optimizing your data workflows with AI. This webinar is ideal for professionals seeking to harness the power of AI within their data management systems while ensuring high levels of customization and security. Whether you're a novice or an expert, gain actionable insights and strategies to elevate your data processes. Join us to see how FME and AI can revolutionize how you work with data!

Energy Efficient Video Encoding for Cloud and Edge Computing Instances

Alpen-Adria-Universität

Trusted Execution Environment for Decentralized Process Mining

LucaBarbaro3

Recently uploaded (20)

A Comprehensive Guide to DeFi Development Services in 2024

Skybuffer SAM4U tool for SAP license adoption

HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU

How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf

UI5 Controls simplified - UI5con2024 presentation

Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...

AWS Cloud Cost Optimization Presentation.pptx

Finale of the Year: Apply for Next One!

Serial Arm Control in Real Time Presentation

Introduction of Cybersecurity with OSS at Code Europe 2024

dbms calicut university B. sc Cs 4th sem.pdf

Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...

Recommendation System using RAG Architecture

Presentation of the OECD Artificial Intelligence Review of Germany

GraphRAG for Life Science to increase LLM accuracy

Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf

Best 20 SEO Techniques To Improve Website Visibility In SERP

Driving Business Innovation: Latest Generative AI Advancements & Success Story

Energy Efficient Video Encoding for Cloud and Edge Computing Instances

Trusted Execution Environment for Decentralized Process Mining

Reward Innovation for long-term member satisfaction

1. Reward innovation for long-term member satisfaction Gary Tang, Jiangwei Pan, Henry Wang and Justin Basilico

2. Goal Create a personalized homepage to help members find content to watch and enjoy that maximizes long-term member satisfaction.

3. Member enjoys watching Netflix So continues the subscription and tells their friends about it Long-term satisfaction for Netflix

4. Batch learning from bandit feedback Production policy ● show recommendations to the member Member gives feedbacks on recommendations ● immediate: skip/play a show ● long-term: cancel/renew subscription Goal ● train a policy to maximize the long-term reward

5. Batch learning from bandit feedback Production policy ● show recommendations to the member Member gives feedbacks on recommendations ● immediate: skip/play a show ● long-term: cancel/renew subscription Goal ● train a policy to maximize the long-term reward

6. Challenges with long-term retention Noisy signal Influenced by external factors Nonsensitive signal Only sensitive for “borderline” members Delayed signal Need to wait a long time & hard to attribute

7. Proxy reward Train policy to optimize proxy reward ● highly correlated with long-term satisfaction ● sensitive to individual recommendations Immediate feedback as proxy ● e.g. start to play a show Delayed feedback could be more aligned ● e.g. completing a show

8. Delayed proxy reward Need to wait a long time to observe the delayed reward ● can not be used in training immediately ● can hurt coldstarting of model Don’t want to wait? predict delayed reward ● use all user actions up to policy training time Train policy to maximize predicted reward Note: predicted reward can not be used online as it uses post-recommendation user actions

9. The ideas are not new Related work: ● Long-term optimization in recommenders ● Reward shaping in reinforcement learning ● Online reward optimization We focus on reward innovation as an important product development workstream at Netflix

10. Integrating reward in bandit Reward component provides the objective of bandit policy training data

11. Reward innovation ● Ideation: what aspect of long-term satisfaction hasn’t been captured as a reward? ○ Requires balancing perspectives of ML, business, and psychology. Not easy! ● Development: how do we compute this new reward for every recommended item? ○ Collecting immediate feedbacks, predicting delayed feedbacks ● Evaluation: whether this is a good reward for the bandit policy to maximize?

12. Reward infrastructure Common development patterns ● Computing immediate proxy rewards ● Predicting delayed proxy rewards ● Combining multiple rewards ● Sharing rewards across multiple recommender components

13. Reward evaluation ● Online testing is expensive (time, resources) ● Use offline evaluation to determine promising reward candidates for online testing ● Challenge: hard to compare policies trained with different rewards

14. Offline reward evaluation ● compare policies along multiple reward axes using OPE ● choose small number of candidate policies on pareto front ● compare candidate policies online using long-term user satisfaction metrics

15. Practical learnings ● Reward normalization: dynamic range of reward can affect SGD training dynamics ● Reward features: pairing a reward with a correlated feature tends to improve the model’s ability to optimize that reward ● Reward alignment: make different parts of the overall recommender system “point in the same direction”

16. Summary ● Proxy rewards: we can train bandit policies using proxy rewards to optimize long-term member satisfaction ● Art and science: to come up with good reward hypotheses ● Supporting infrastructure: we develop infrastructure to help iterate on new hypotheses quickly

17. Open challenges ● Proxy rewards: how can we identify proxy rewards that are aligned with long-term satisfaction in a more principle way? ● Reinforcement learning: can we use reinforcement learning to optimize long-term reward directly for recommendations?

18. Thank you! Questions? Jiangwei Pan jpan@netflix.com

Reward Innovation for long-term member satisfaction

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Reward Innovation for long-term member satisfaction

Similar to Reward Innovation for long-term member satisfaction (20)

Recently uploaded

Recently uploaded (20)

Reward Innovation for long-term member satisfaction