SlideShare a Scribd company logo
1 of 18
Download to read offline
Understand your customers deeply ... Engage with a personal touch! #ItsPersonal
Debdoot Mukherjee
Personalization & Customer Insights @
Modeling
Personalized Customer
Engagement
Data Driven Retail Functions
Product Listings Handpicked For Me
Notifications Fashion Feed
Offers &
Promotions
Re-targeting
Marketing
Research
Campaign Targeting Audience Monetization
Category
Planning
Brand
Benchmarking
Content Design
• Ephemeral and non-identifiable items
unlike Books and Movies
–Extremely sparse user/item matrix
–Bias of products with higher inventory
–Exploration versus Exploitation trade-off
• Diversity and Serendipity
• Closest domain: News Articles
Fashion – What s diffe e t?
• Recommend based on user profiles stored as preference /
weight vectors on item features, learnt from relevance
feedback on items.
• Good vector representation for items?
– Bag of product attributes does not work! Too many features,
s a e o se histo y fo a si gle use , so p ofiles do t
generalize.
– Learning preferences along latent factor / topical dimensions or
product groups (clusters) helps.
• Maintain two user profiles: long term (taste), short term
(intent)
– Incorporate time decay into browse history
– Degree of personalization depend on the amount of browse data
• BUT, att i ute ele a e does t suffi e. The e is so ethi g
e a t aptu e ia att i utes a out so e fashio ite s
that make them popular, others not. May be aesthetics.
Feature / Content based Approach
• Simple user-user, item-ite CF te h i ues do t o k ell e ause of
extremely sparse user-item matrix
• Matrix factorization:
• Regularization is tricky and severe cold start. In practice, models are
trained specific to each category of product, so maintaining separate
models for cold start and warm start becomes difficult.
• Recent advances in Feature based Matrix Factorization address this -
SVDFeature, Factorization Machines, RLFM, FOBM, fLDA, UFSM …
• Train model on snapshot of active products for recommendations
Collaborative Filtering Approaches
• A good vector representation for items would make
si ila ite s eigh o s i the e to spa e. #di e sio s
should be not very high.
• Co-browse of items in a session is (weakly) indicative of
si ila ity . “u h a sig al ei fo ed a oss a y sessio s
becomes strong.
• Inferring substitutable and complementary products –
Leskovic et al. KDD15
– Train a logistic regressor with features defined on the similarity
of item vectors represented as topics to predict whether two
products are similar. LDA using the analogy (Item Document,
Item Attribute Word)
– Core Idea: Joint training of logistic regressor and item topics by
simultaneously optimizing both topic distributions and logistic
parameters to maximize the joint likelihood of topic
memberships and product similarity.
Vector Representation for Items
• We use this analogy so that existing models for finding
representations in text / IR become applicable:
– Browse Session Document, Items Clicked Sentences, Item
Attributes Words
• Evaluate LDA, Word2Vec, GloVe …
– Yields varying levels of topical and functional similarities along
dimensions of the item vector
• “ea h fo si ila te s fo nike :
– Topical Similarity: adidas, puma, sports, dry-fit, polyeste … spo ts elated te s
– Functional Similarity: adidas, puma, fila, merrell, hrx …
– Mining interesting relationships between entities of interest viz.
brand, price band, pattern, item collection etc.
• Spherical clustering to create product groups – a better unit of
analyses than individual products.
• Create user profiles by aggregating their preferences on such item
dimensions and product groups across all browsing sessions.
Vector Representation for Items (2)
• Explore/Exploit trade off
– Popularity scoring of items (normalized for each
category / product group)
• Use Thomson Sampling in a context free bandit formulation
that assumes Gaussian reward (CTR)
• Adjust CTR with rank to formulate reward
– Contextual bandits can help in choosing the right
recommendation strategy given page and session
context
• Use of LSH to ensure diversity of
recommendations
Explore / Exploit, Diversity ..
Personalized Product Listings
Devarshi – A Football Enthusiast Anand – Appreciates Value for Money
Personalized Product Listings
Devarshi – A Football Enthusiast Anand – Appreciates Value for Money
Handpicked For Me
Personalized page
with different kinds
of recommendations:
• Taste, Intent based
• Cross sell based on
last purchase
Notifications
Drive a contextual conversation y i i g the use s shoppi g
intent in real time
Customer Insights Platform
A platform to slice / dice mined customer profiles has over 50K
different dimensions
Create a segment of loyal customers in Delhi who wear heels
Affinity toward heels
Highly loyal
From Delhi
Delhi women have a greater affinity for taller heel heights than Chennai women
A woman from Delhi is 2x more likely to be
interested in stilettos than someone from
Chennai
Brand A
Brand CBrand B
48.5% - 26 yr
27% - 28 yr
15% - 26 yr
1.8%-29y
Loyalist Distribution
0.4% - 28 yr
Comparing 3 Men Shirt Brands and their loyalists
Compared to an average Myntra customer
What else do the loyalists buy?
Less
likely
More
likely
Personalization ServicesMyntra.com
Event
Distributions
Event
Processor
S3 Event
Storage
Cassandra
(Clickstream Aggregates)
Model
Training
Serving Caches
Near-line
Personalization
Online
Personalization
Customer
Profile ETL
Customer-
wise Event
Aggregator
Mongo
(Customer
Insights)
Insights Platform
Architecture
Transactional
DW
Product
Knowledge Graph
Ariana Grande
M perso al st le is a i ture of, like, girl ,
throwback, like retro '50s pin-ups, floral, like
hippies, like a thi g fe i i e, a d like flirt .
Perso al “t le is
about having a
sense of yourself
what you believe
i ever da
Ralph Lauren
Ever o e looks at your
watch and it represents
who you are, your values
and your personal st le
Kobe Bryant
And You Still
Think I Would
Know About
Personal Style ?!!
Read more at http://sartorialscience.myntrablogs.com

More Related Content

Similar to meetup-talk

Boosting Personalization In SaaS Using Machine Learning
Boosting Personalization  In SaaS Using Machine LearningBoosting Personalization  In SaaS Using Machine Learning
Boosting Personalization In SaaS Using Machine LearningGlorium Tech
 
Webinar: Increase Conversion With Better Search
Webinar: Increase Conversion With Better SearchWebinar: Increase Conversion With Better Search
Webinar: Increase Conversion With Better SearchLucidworks
 
Data Science for Digital Commerce
Data Science for Digital CommerceData Science for Digital Commerce
Data Science for Digital CommerceManish Gupta, Ph.D.
 
case based recommendation approach for market basket data
case based recommendation approach for market basket datacase based recommendation approach for market basket data
case based recommendation approach for market basket datamniranjanmurthy
 
Practical ways to use dynamic recommendations
Practical ways to use dynamic recommendationsPractical ways to use dynamic recommendations
Practical ways to use dynamic recommendationsYesLifecycleMarketing
 
Personalisation Product Roadmap
Personalisation Product RoadmapPersonalisation Product Roadmap
Personalisation Product RoadmapAnkit Gupta
 
Digitale verlage by Günther Haslbeck / Ovenga Media
Digitale verlage  by Günther Haslbeck / Ovenga MediaDigitale verlage  by Günther Haslbeck / Ovenga Media
Digitale verlage by Günther Haslbeck / Ovenga MediaGünther Haslbeck
 
Big Data and Product Affinity
Big Data and Product Affinity Big Data and Product Affinity
Big Data and Product Affinity Michel Bruley
 
Business model canvas notes
Business model canvas notesBusiness model canvas notes
Business model canvas notesTucker Shouse
 
Recommender systems for E-commerce
Recommender systems for E-commerceRecommender systems for E-commerce
Recommender systems for E-commerceAlexander Konduforov
 
Recommendation Systems Basics
Recommendation Systems BasicsRecommendation Systems Basics
Recommendation Systems BasicsJarin Tasnim Khan
 
High Noon at the Pricing Model Corral
High Noon at the Pricing Model CorralHigh Noon at the Pricing Model Corral
High Noon at the Pricing Model CorralCharleston Conference
 
How to Build Your Product Manager Toolbox by former Microsoft PM
How to Build Your Product Manager Toolbox by former Microsoft PMHow to Build Your Product Manager Toolbox by former Microsoft PM
How to Build Your Product Manager Toolbox by former Microsoft PMProduct School
 
Demystifying Recommendation Systems
Demystifying Recommendation SystemsDemystifying Recommendation Systems
Demystifying Recommendation SystemsRumman Chowdhury
 
DataEngConf 2017 - Machine Learning Models in Production
DataEngConf 2017 - Machine Learning Models in ProductionDataEngConf 2017 - Machine Learning Models in Production
DataEngConf 2017 - Machine Learning Models in ProductionSharath Rao
 
Big data certification training mumbai
Big data certification training mumbaiBig data certification training mumbai
Big data certification training mumbaiTejaspathiLV
 
Best data science courses in pune
Best data science courses in puneBest data science courses in pune
Best data science courses in puneprathyusha1234
 
Top data science institutes in hyderabad
Top data science institutes in hyderabadTop data science institutes in hyderabad
Top data science institutes in hyderabadprathyusha1234
 
best online data science courses
best online data science coursesbest online data science courses
best online data science coursesprathyusha1234
 

Similar to meetup-talk (20)

Boosting Personalization In SaaS Using Machine Learning
Boosting Personalization  In SaaS Using Machine LearningBoosting Personalization  In SaaS Using Machine Learning
Boosting Personalization In SaaS Using Machine Learning
 
Webinar: Increase Conversion With Better Search
Webinar: Increase Conversion With Better SearchWebinar: Increase Conversion With Better Search
Webinar: Increase Conversion With Better Search
 
Data Science for Digital Commerce
Data Science for Digital CommerceData Science for Digital Commerce
Data Science for Digital Commerce
 
case based recommendation approach for market basket data
case based recommendation approach for market basket datacase based recommendation approach for market basket data
case based recommendation approach for market basket data
 
Practical ways to use dynamic recommendations
Practical ways to use dynamic recommendationsPractical ways to use dynamic recommendations
Practical ways to use dynamic recommendations
 
Class 9
Class 9Class 9
Class 9
 
Personalisation Product Roadmap
Personalisation Product RoadmapPersonalisation Product Roadmap
Personalisation Product Roadmap
 
Digitale verlage by Günther Haslbeck / Ovenga Media
Digitale verlage  by Günther Haslbeck / Ovenga MediaDigitale verlage  by Günther Haslbeck / Ovenga Media
Digitale verlage by Günther Haslbeck / Ovenga Media
 
Big Data and Product Affinity
Big Data and Product Affinity Big Data and Product Affinity
Big Data and Product Affinity
 
Business model canvas notes
Business model canvas notesBusiness model canvas notes
Business model canvas notes
 
Recommender systems for E-commerce
Recommender systems for E-commerceRecommender systems for E-commerce
Recommender systems for E-commerce
 
Recommendation Systems Basics
Recommendation Systems BasicsRecommendation Systems Basics
Recommendation Systems Basics
 
High Noon at the Pricing Model Corral
High Noon at the Pricing Model CorralHigh Noon at the Pricing Model Corral
High Noon at the Pricing Model Corral
 
How to Build Your Product Manager Toolbox by former Microsoft PM
How to Build Your Product Manager Toolbox by former Microsoft PMHow to Build Your Product Manager Toolbox by former Microsoft PM
How to Build Your Product Manager Toolbox by former Microsoft PM
 
Demystifying Recommendation Systems
Demystifying Recommendation SystemsDemystifying Recommendation Systems
Demystifying Recommendation Systems
 
DataEngConf 2017 - Machine Learning Models in Production
DataEngConf 2017 - Machine Learning Models in ProductionDataEngConf 2017 - Machine Learning Models in Production
DataEngConf 2017 - Machine Learning Models in Production
 
Big data certification training mumbai
Big data certification training mumbaiBig data certification training mumbai
Big data certification training mumbai
 
Best data science courses in pune
Best data science courses in puneBest data science courses in pune
Best data science courses in pune
 
Top data science institutes in hyderabad
Top data science institutes in hyderabadTop data science institutes in hyderabad
Top data science institutes in hyderabad
 
best online data science courses
best online data science coursesbest online data science courses
best online data science courses
 

More from Debdoot Mukherjee

Serving Information Needs of Knowledge Workers
Serving Information Needs of Knowledge WorkersServing Information Needs of Knowledge Workers
Serving Information Needs of Knowledge WorkersDebdoot Mukherjee
 
Determining QoS of WS-BPEL Compositions
Determining QoS of WS-BPEL CompositionsDetermining QoS of WS-BPEL Compositions
Determining QoS of WS-BPEL CompositionsDebdoot Mukherjee
 
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...Debdoot Mukherjee
 
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...Debdoot Mukherjee
 
Which Work-Item Updates Need Your Response?
Which Work-Item Updates Need Your Response?Which Work-Item Updates Need Your Response?
Which Work-Item Updates Need Your Response?Debdoot Mukherjee
 

More from Debdoot Mukherjee (6)

Serving Information Needs of Knowledge Workers
Serving Information Needs of Knowledge WorkersServing Information Needs of Knowledge Workers
Serving Information Needs of Knowledge Workers
 
Determining QoS of WS-BPEL Compositions
Determining QoS of WS-BPEL CompositionsDetermining QoS of WS-BPEL Compositions
Determining QoS of WS-BPEL Compositions
 
Scc talk
Scc talkScc talk
Scc talk
 
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
 
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
 
Which Work-Item Updates Need Your Response?
Which Work-Item Updates Need Your Response?Which Work-Item Updates Need Your Response?
Which Work-Item Updates Need Your Response?
 

meetup-talk

  • 1. Understand your customers deeply ... Engage with a personal touch! #ItsPersonal Debdoot Mukherjee Personalization & Customer Insights @
  • 2. Modeling Personalized Customer Engagement Data Driven Retail Functions Product Listings Handpicked For Me Notifications Fashion Feed Offers & Promotions Re-targeting Marketing Research Campaign Targeting Audience Monetization Category Planning Brand Benchmarking Content Design
  • 3. • Ephemeral and non-identifiable items unlike Books and Movies –Extremely sparse user/item matrix –Bias of products with higher inventory –Exploration versus Exploitation trade-off • Diversity and Serendipity • Closest domain: News Articles Fashion – What s diffe e t?
  • 4. • Recommend based on user profiles stored as preference / weight vectors on item features, learnt from relevance feedback on items. • Good vector representation for items? – Bag of product attributes does not work! Too many features, s a e o se histo y fo a si gle use , so p ofiles do t generalize. – Learning preferences along latent factor / topical dimensions or product groups (clusters) helps. • Maintain two user profiles: long term (taste), short term (intent) – Incorporate time decay into browse history – Degree of personalization depend on the amount of browse data • BUT, att i ute ele a e does t suffi e. The e is so ethi g e a t aptu e ia att i utes a out so e fashio ite s that make them popular, others not. May be aesthetics. Feature / Content based Approach
  • 5. • Simple user-user, item-ite CF te h i ues do t o k ell e ause of extremely sparse user-item matrix • Matrix factorization: • Regularization is tricky and severe cold start. In practice, models are trained specific to each category of product, so maintaining separate models for cold start and warm start becomes difficult. • Recent advances in Feature based Matrix Factorization address this - SVDFeature, Factorization Machines, RLFM, FOBM, fLDA, UFSM … • Train model on snapshot of active products for recommendations Collaborative Filtering Approaches
  • 6. • A good vector representation for items would make si ila ite s eigh o s i the e to spa e. #di e sio s should be not very high. • Co-browse of items in a session is (weakly) indicative of si ila ity . “u h a sig al ei fo ed a oss a y sessio s becomes strong. • Inferring substitutable and complementary products – Leskovic et al. KDD15 – Train a logistic regressor with features defined on the similarity of item vectors represented as topics to predict whether two products are similar. LDA using the analogy (Item Document, Item Attribute Word) – Core Idea: Joint training of logistic regressor and item topics by simultaneously optimizing both topic distributions and logistic parameters to maximize the joint likelihood of topic memberships and product similarity. Vector Representation for Items
  • 7. • We use this analogy so that existing models for finding representations in text / IR become applicable: – Browse Session Document, Items Clicked Sentences, Item Attributes Words • Evaluate LDA, Word2Vec, GloVe … – Yields varying levels of topical and functional similarities along dimensions of the item vector • “ea h fo si ila te s fo nike : – Topical Similarity: adidas, puma, sports, dry-fit, polyeste … spo ts elated te s – Functional Similarity: adidas, puma, fila, merrell, hrx … – Mining interesting relationships between entities of interest viz. brand, price band, pattern, item collection etc. • Spherical clustering to create product groups – a better unit of analyses than individual products. • Create user profiles by aggregating their preferences on such item dimensions and product groups across all browsing sessions. Vector Representation for Items (2)
  • 8. • Explore/Exploit trade off – Popularity scoring of items (normalized for each category / product group) • Use Thomson Sampling in a context free bandit formulation that assumes Gaussian reward (CTR) • Adjust CTR with rank to formulate reward – Contextual bandits can help in choosing the right recommendation strategy given page and session context • Use of LSH to ensure diversity of recommendations Explore / Exploit, Diversity ..
  • 9. Personalized Product Listings Devarshi – A Football Enthusiast Anand – Appreciates Value for Money
  • 10. Personalized Product Listings Devarshi – A Football Enthusiast Anand – Appreciates Value for Money
  • 11. Handpicked For Me Personalized page with different kinds of recommendations: • Taste, Intent based • Cross sell based on last purchase
  • 12. Notifications Drive a contextual conversation y i i g the use s shoppi g intent in real time
  • 13. Customer Insights Platform A platform to slice / dice mined customer profiles has over 50K different dimensions
  • 14. Create a segment of loyal customers in Delhi who wear heels Affinity toward heels Highly loyal From Delhi
  • 15. Delhi women have a greater affinity for taller heel heights than Chennai women A woman from Delhi is 2x more likely to be interested in stilettos than someone from Chennai
  • 16. Brand A Brand CBrand B 48.5% - 26 yr 27% - 28 yr 15% - 26 yr 1.8%-29y Loyalist Distribution 0.4% - 28 yr Comparing 3 Men Shirt Brands and their loyalists Compared to an average Myntra customer What else do the loyalists buy? Less likely More likely
  • 17. Personalization ServicesMyntra.com Event Distributions Event Processor S3 Event Storage Cassandra (Clickstream Aggregates) Model Training Serving Caches Near-line Personalization Online Personalization Customer Profile ETL Customer- wise Event Aggregator Mongo (Customer Insights) Insights Platform Architecture Transactional DW Product Knowledge Graph
  • 18. Ariana Grande M perso al st le is a i ture of, like, girl , throwback, like retro '50s pin-ups, floral, like hippies, like a thi g fe i i e, a d like flirt . Perso al “t le is about having a sense of yourself what you believe i ever da Ralph Lauren Ever o e looks at your watch and it represents who you are, your values and your personal st le Kobe Bryant And You Still Think I Would Know About Personal Style ?!! Read more at http://sartorialscience.myntrablogs.com