SlideShare a Scribd company logo
Fashion and Data
Suman Bhattacharya
Senior Data Scientist @ Uber
Astronomy
Health
Care
retail
social media
e commerce
About me
2003-2014
2014-2017
2017—
Ride
sharing
Fashion Industry
•  Rose lo bertin (July 2, 1747 – September 22, 1813) was the
dressmaker named bill to Marie Antoinette, Queen of France, and a
high public profile. Sometimes called sarcastically the "Minister of
Fashion"
Ref: Wikipedia
• 1900— Golden age of fashion
• $1.2 Trillion annual revenue as of 2016
• 240 Billion USA only
A new breed of fashion companies
• 2005—
• online only
• mostly do not own the brands themselves
• leverages the idea of “shared economy”
• uses data and technology as a leverage
When Data meet Fashion
Customer uses the app
Get personalized list of
items to choose from
Receives a box delivered
send back after usage
or hold back the bought item
, send the rest
do it over
Customer uses the app
app flow
data gathered
user interaction data
Provides feedback
provides feedback
ratings / experience data
warehouse
warehouse operations/ quality control data
Data science project cycle
Business problem Exploratory data
analysis
build some prediction system
customer interact
experiment
Prediction system
• Recommender system: given past preference of the customer,
recommend clothes that the customer want to buy or get in the next
box
• Fit algorithm: predict the right size for the clothes that the customer
wants to receive
• Experimental methodology to test algorithms
What Makes Me Likely to Buy?
Provide me products that improve my life…
…found in a place convenient to me
…in a store where the products are easy to find
…where they are provided by friendly people
…who are able to anticipate my needs
…at a time convenient for me
…for a price I am willing to pay*.
Given our product set, which products are customers demonstrating the
most interest in? Which ones are they likely to be interested in next
season?
recommendation system landscape
explicit
(e.g. rating) implicit
(e.g. click/not)
feedback data
algorithms
factorization based e.g.
SVD, latent
factors
neighborhood based
e.g. kNN
optimization based
e.g. gradient descent
data
profile data
e.g. user/ item
information
Bayesian
recommendation
System
top-N
e.g., predict top 10
items
predict ratings
e.g. for a user-item
pair
accuracy measure
AUC/ROC
precision/recall
MSE
end result
ordered
e.g., 1st>2nd
unordered
e.g., anywhere in
top 10
ratings
approach
frequentist
No user data:
cold start problem->
Content based
The Netflix Prize
Problem:
predict ratings
that users give to movies
optimize
time-SVD++
Koren 2009
data:
rotten or not
ratings
reviews
or/and
+ may be meta data
user info:
age/sex/profession/friends with
item info:
genre, dir, actors
more available
implicit feedback
less available
explicit feedback
becoming available
available
relatively less available
12
• historically, focus has been on models that predict the ratings accurately
• need explicit feedback, less available, less commercial use
• lately, focus is more on top-N recommendation
• implicit feedback ok, much more data, more commercial interests (top 5 books , top 10 movies)
end goal of the recommendation system:
predict stars predict top 5 itemsor
13
accuracy metrics:
AUC/ROC
if positions of items
matters
Precision/Recall
if only top-N matters
mean square error
if predicting ratings
14
algorithms:
neighborhood optimization factorization
collaborative filtering
find items that are similar to the items used
or
find users that use items similar to yours, then recommend
the items they have used and you haven’t
kNN
very fast
less accurate for predicting
ratings
reasonable accuracy for
top-N
SLIM
fast
less accurate for predicting
ratings
best accuracy for top-N
matrix factorization to find latent
factors
relatively slow
best accuracy for ratings
predictions
less accurate for top-N
Can also be
used for content based
recommendation
Recommendations - Signals we use
🔑🔑🔑
• Past Ratings
• Purchases
• Browse history
• Brand, category, color & style types usage in the past.
• Contexts: occasion, temperature/ weather of your location next 7
days
• Find similar customers -> find what they’ve liked in the past -> if she
hasn’t received it yet, we recommend it.
• Need future prediction in real world! Most open source solutions are not
16
Matrix Factorization
item
user
1
0
0
1
.
.
1
1
0
1
.
.
0
0
0
1
.
.
…….
…….
……..
……..
•# factors typically << # items or # of users
•User -factors: users taste given the past usage
•Item- factors: item attributes
•We use Factorization machine for our work
E.g.: collaborative filtering using implicit feedback
Factors
user X
= Factors
Items
Factorization machine
Rendle 2012
temporal information:
Past ratings
works better in sparse data
Visual recommendation
• Convolution neural net
visual features
Visual
Features
K nearest neighbor
find similar items to each item
Visual recommendations
• Extract features from an image to find similar products that are visually
similar
~80% similar
~40% similar
~80% similar
~40% similar
Visual Recommendations
• Visual recommendations based on images in her closet
Her closet
Visual recommendations - extract hidden visual attributes
• Similar items in other categories:
Because you chose this
———————————————————————
Predicting Fit
Customer
style
Customer
Fit
• for each customer find the style, and the one that fit
• Past ratings- what fit/ didn’t fit, customer body measurements, garment
measurements, body-to-mass index
• predict for each customer, top styles that will fit
Customer see this
How it works in production
Data
in HDFS
FM training
on Spark using ratings
train image data using
CNN in GPU k nearest neighbor
additional
Features
combine fit and like
score of each
product for each user
list of products users
like
to app
Shuffle the list in real time
as users interacts using Multi-Arm-Bandit
train to predict fit
Data science project cycle
Business problem Exploratory data
analysis
build some prediction system
customer interact
experiment
Treat data science as… science!
The big question –
especially with complex
predictive analytics, is did
it work?
The
Scientific
Method
Define
Questio
n
Gather
Informati
on
Form
Hypothe
sis
Test
Hypothe
sis
Analyze
Results
Draw
Conclusi
ons
Publish
Results
Retest
Define
Questio
n
Gather
Informati
on
Form
Hypothe
sis
Test
Hypothe
sis
Analyze
Results
Draw
Conclusi
ons
Publish
Results
Retest
The
Scientific
Method:
5/8ths of the steps in
the scientific method
are about testing our
hypothesis and doing
something with it.
A/ B testing
• simplest form of testing the causality
• can have multiple treatment groups, so A/B is really A/B1/B2…/BN
• randomly split traffic into two groups- A (control) and B(treatment)
=> e.g., random split: flip a coin to decide which one of your party guests goes to
control or treatment
=> e.g. non-random: family members in treatment , else in control
• Goal is to draw causal inference from the treatment group
Feature: changed
tab color
first A/ B test in recorded history
• Scurvy was an epidemic in 18th century
• In 1740, Anson’s circumnavigation voyage-
->1900 sailors joined
-> 1400 died most likely from Scurvy
• Lind’s hypothesis: Scurvy could be cured by acids, include
dietary supplements of acids
• Lind’s approach: divide 12 sailors into six groups of two, give
everyone same diet, but supplement that
group 1: cider, 2: vitriol 3: vinegar , 4: seawater, and 5:
oranges and lemon, 6: barley water
• Result: group 5 recovered within a week
James Lind
Circa 1716- 94
How to experiment? Steps
Hypothesis
how much desired lift
with what type-I and II error
split data into
two groups
expose treatment to one
of the groups until you reach the
required sample size
Analyze results
calculate required sample
size
How to experiment? Example
Hypothesis
how much desired lift
with what type-I and II error
split data into
two groups
expose treatment to one
of the groups until you reach the
required sample size
Analyze results
Calculate sample size
required
a new feature will
Increase CTR
Increase CTR by 10%
with 5/100 false positive
20% type 2 error
need pre experiment
data to calculate sample size
Limitations of A/ B testing-> MAB
• cannot adaptively allocate user to treatment groups
• cannot analyze results mid way through the experiment
• if the lift seen is less than the desired lift , cannot confirm its significance
• A/ B - explore first, then exploit
• Multi arm Bandit- explore- exploit simultaneously, adaptively drive traffic
towards winning variations without waiting for final results
Recommendations and MAB
• machine learning models have parameters to tune, search space is big
• showing similar items sorted by score often looses the discovery aspect of the
recommender system
• split users into 10 groups- group 1: sees 1,2,3,4,5; group 2: 1,3,5,..; group3: 1,
5, 9,…
• Explore - which combinations gives the best ratings- Exploit: choose that
combinations for that user
• We will know for which combinations works for each users, after the explore -
exploit
sorted by score:
1 2 3 4 5
Takeaways
• A new breed of fashion companies are changing the industry leveraging data
and personalizing shoppers’ experience
• Deep learning especially computer vision allow use to photos , this was
previously untapped data.
• Combining standard signals (ratings, clicks ) with signals from photos, improve
the recommendation accuracy significantly
• Experimentation is just as important as the prediction of the recommender
system
• Combining experimentation and recommender prediction further optimize the
personalized experience of shoppers

More Related Content

What's hot

Information Retrieval Models for Recommender Systems - PhD slides
Information Retrieval Models for Recommender Systems - PhD slidesInformation Retrieval Models for Recommender Systems - PhD slides
Information Retrieval Models for Recommender Systems - PhD slides
Daniel Valcarce
 
Recommendation system based on adaptive ontological graphs and weighted ranking
Recommendation system based on adaptive ontological graphs and weighted rankingRecommendation system based on adaptive ontological graphs and weighted ranking
Recommendation system based on adaptive ontological graphs and weighted ranking
vikramadityajakkula
 
Data Mining and Recommendation Systems
Data Mining and Recommendation SystemsData Mining and Recommendation Systems
Data Mining and Recommendation Systems
Salil Navgire
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
Carlos Castillo (ChaTo)
 
Introduction to Recommendation System
Introduction to Recommendation SystemIntroduction to Recommendation System
Introduction to Recommendation System
Minha Hwang
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
Girish Khanzode
 
Impersonal Recommendation system on top of Hadoop
Impersonal Recommendation system on top of HadoopImpersonal Recommendation system on top of Hadoop
Impersonal Recommendation system on top of Hadoop
Kostiantyn Kudriavtsev
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
Lior Rokach
 
Social Recommender Systems Tutorial - WWW 2011
Social Recommender Systems Tutorial - WWW 2011Social Recommender Systems Tutorial - WWW 2011
Social Recommender Systems Tutorial - WWW 2011
idoguy
 
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Xavier Amatriain
 
Movies Recommendation System
Movies Recommendation SystemMovies Recommendation System
Movies Recommendation System
Shubham Patil
 
Restaurant recommender
Restaurant recommenderRestaurant recommender
Restaurant recommender
Annie Thomas
 
Recommender system a-introduction
Recommender system a-introductionRecommender system a-introduction
Recommender system a-introduction
zh3f
 
Survey of Recommendation Systems
Survey of Recommendation SystemsSurvey of Recommendation Systems
Survey of Recommendation Systems
youalab
 
Past, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectivePast, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspective
Xavier Amatriain
 
Retail products - machine learning recommendation engine
Retail products   - machine learning recommendation engineRetail products   - machine learning recommendation engine
Retail products - machine learning recommendation engine
hkbhadraa
 
[Final]collaborative filtering and recommender systems
[Final]collaborative filtering and recommender systems[Final]collaborative filtering and recommender systems
[Final]collaborative filtering and recommender systems
Falitokiniaina Rabearison
 
Recommendation Systems Basics
Recommendation Systems BasicsRecommendation Systems Basics
Recommendation Systems Basics
Jarin Tasnim Khan
 
Recsys2021_slides_sato
Recsys2021_slides_satoRecsys2021_slides_sato
Recsys2021_slides_sato
Masahiro Sato
 
Active Learning in Collaborative Filtering Recommender Systems : a Survey
Active Learning in Collaborative Filtering Recommender Systems : a SurveyActive Learning in Collaborative Filtering Recommender Systems : a Survey
Active Learning in Collaborative Filtering Recommender Systems : a Survey
University of Bergen
 

What's hot (20)

Information Retrieval Models for Recommender Systems - PhD slides
Information Retrieval Models for Recommender Systems - PhD slidesInformation Retrieval Models for Recommender Systems - PhD slides
Information Retrieval Models for Recommender Systems - PhD slides
 
Recommendation system based on adaptive ontological graphs and weighted ranking
Recommendation system based on adaptive ontological graphs and weighted rankingRecommendation system based on adaptive ontological graphs and weighted ranking
Recommendation system based on adaptive ontological graphs and weighted ranking
 
Data Mining and Recommendation Systems
Data Mining and Recommendation SystemsData Mining and Recommendation Systems
Data Mining and Recommendation Systems
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Introduction to Recommendation System
Introduction to Recommendation SystemIntroduction to Recommendation System
Introduction to Recommendation System
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Impersonal Recommendation system on top of Hadoop
Impersonal Recommendation system on top of HadoopImpersonal Recommendation system on top of Hadoop
Impersonal Recommendation system on top of Hadoop
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Social Recommender Systems Tutorial - WWW 2011
Social Recommender Systems Tutorial - WWW 2011Social Recommender Systems Tutorial - WWW 2011
Social Recommender Systems Tutorial - WWW 2011
 
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
 
Movies Recommendation System
Movies Recommendation SystemMovies Recommendation System
Movies Recommendation System
 
Restaurant recommender
Restaurant recommenderRestaurant recommender
Restaurant recommender
 
Recommender system a-introduction
Recommender system a-introductionRecommender system a-introduction
Recommender system a-introduction
 
Survey of Recommendation Systems
Survey of Recommendation SystemsSurvey of Recommendation Systems
Survey of Recommendation Systems
 
Past, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectivePast, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspective
 
Retail products - machine learning recommendation engine
Retail products   - machine learning recommendation engineRetail products   - machine learning recommendation engine
Retail products - machine learning recommendation engine
 
[Final]collaborative filtering and recommender systems
[Final]collaborative filtering and recommender systems[Final]collaborative filtering and recommender systems
[Final]collaborative filtering and recommender systems
 
Recommendation Systems Basics
Recommendation Systems BasicsRecommendation Systems Basics
Recommendation Systems Basics
 
Recsys2021_slides_sato
Recsys2021_slides_satoRecsys2021_slides_sato
Recsys2021_slides_sato
 
Active Learning in Collaborative Filtering Recommender Systems : a Survey
Active Learning in Collaborative Filtering Recommender Systems : a SurveyActive Learning in Collaborative Filtering Recommender Systems : a Survey
Active Learning in Collaborative Filtering Recommender Systems : a Survey
 

Similar to Fashiondatasc

Overview of recommender system
Overview of recommender systemOverview of recommender system
Overview of recommender system
Stanley Wang
 
Lecture Notes on Recommender System Introduction
Lecture Notes on Recommender System IntroductionLecture Notes on Recommender System Introduction
Lecture Notes on Recommender System Introduction
PerumalPitchandi
 
Mini-training: Personalization & Recommendation Demystified
Mini-training: Personalization & Recommendation DemystifiedMini-training: Personalization & Recommendation Demystified
Mini-training: Personalization & Recommendation Demystified
Betclic Everest Group Tech Team
 
Recommender systems for E-commerce
Recommender systems for E-commerceRecommender systems for E-commerce
Recommender systems for E-commerce
Alexander Konduforov
 
Chapter 02 collaborative recommendation
Chapter 02   collaborative recommendationChapter 02   collaborative recommendation
Chapter 02 collaborative recommendation
Aravindharamanan S
 
Chapter 02 collaborative recommendation
Chapter 02   collaborative recommendationChapter 02   collaborative recommendation
Chapter 02 collaborative recommendation
Aravindharamanan S
 
Collaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemCollaborative Filtering Recommendation System
Collaborative Filtering Recommendation System
Milind Gokhale
 
Recommendation system
Recommendation system Recommendation system
Recommendation system
Vikrant Arya
 
recommendationsystem-140410131156-phpapp01 (1).pdf
recommendationsystem-140410131156-phpapp01 (1).pdfrecommendationsystem-140410131156-phpapp01 (1).pdf
recommendationsystem-140410131156-phpapp01 (1).pdf
ssuserff0096
 
Recommender System _Module 1_Introduction to Recommender System.pptx
Recommender System _Module 1_Introduction to Recommender System.pptxRecommender System _Module 1_Introduction to Recommender System.pptx
Recommender System _Module 1_Introduction to Recommender System.pptx
Satyam Sharma
 
Recommendation Systems Roadtrip
Recommendation Systems RoadtripRecommendation Systems Roadtrip
Recommendation Systems Roadtrip
The Real Dyl
 
Big data certification training mumbai
Big data certification training mumbaiBig data certification training mumbai
Big data certification training mumbai
TejaspathiLV
 
Best data science courses in pune
Best data science courses in puneBest data science courses in pune
Best data science courses in pune
prathyusha1234
 
best online data science courses
best online data science coursesbest online data science courses
best online data science courses
prathyusha1234
 
Top data science institutes in hyderabad
Top data science institutes in hyderabadTop data science institutes in hyderabad
Top data science institutes in hyderabad
prathyusha1234
 
Demystifying Recommendation Systems
Demystifying Recommendation SystemsDemystifying Recommendation Systems
Demystifying Recommendation Systems
Rumman Chowdhury
 
IntroductionRecommenderSystems_Petroni.pdf
IntroductionRecommenderSystems_Petroni.pdfIntroductionRecommenderSystems_Petroni.pdf
IntroductionRecommenderSystems_Petroni.pdf
AlphaIssaghaDiallo
 
Modern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in MendeleyModern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in Mendeley
Kris Jack
 
Survey analytics conjointanalysis_1
Survey analytics conjointanalysis_1Survey analytics conjointanalysis_1
Survey analytics conjointanalysis_1
QuestionPro
 
IRJET- Hybrid Book Recommendation System
IRJET- Hybrid Book Recommendation SystemIRJET- Hybrid Book Recommendation System
IRJET- Hybrid Book Recommendation System
IRJET Journal
 

Similar to Fashiondatasc (20)

Overview of recommender system
Overview of recommender systemOverview of recommender system
Overview of recommender system
 
Lecture Notes on Recommender System Introduction
Lecture Notes on Recommender System IntroductionLecture Notes on Recommender System Introduction
Lecture Notes on Recommender System Introduction
 
Mini-training: Personalization & Recommendation Demystified
Mini-training: Personalization & Recommendation DemystifiedMini-training: Personalization & Recommendation Demystified
Mini-training: Personalization & Recommendation Demystified
 
Recommender systems for E-commerce
Recommender systems for E-commerceRecommender systems for E-commerce
Recommender systems for E-commerce
 
Chapter 02 collaborative recommendation
Chapter 02   collaborative recommendationChapter 02   collaborative recommendation
Chapter 02 collaborative recommendation
 
Chapter 02 collaborative recommendation
Chapter 02   collaborative recommendationChapter 02   collaborative recommendation
Chapter 02 collaborative recommendation
 
Collaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemCollaborative Filtering Recommendation System
Collaborative Filtering Recommendation System
 
Recommendation system
Recommendation system Recommendation system
Recommendation system
 
recommendationsystem-140410131156-phpapp01 (1).pdf
recommendationsystem-140410131156-phpapp01 (1).pdfrecommendationsystem-140410131156-phpapp01 (1).pdf
recommendationsystem-140410131156-phpapp01 (1).pdf
 
Recommender System _Module 1_Introduction to Recommender System.pptx
Recommender System _Module 1_Introduction to Recommender System.pptxRecommender System _Module 1_Introduction to Recommender System.pptx
Recommender System _Module 1_Introduction to Recommender System.pptx
 
Recommendation Systems Roadtrip
Recommendation Systems RoadtripRecommendation Systems Roadtrip
Recommendation Systems Roadtrip
 
Big data certification training mumbai
Big data certification training mumbaiBig data certification training mumbai
Big data certification training mumbai
 
Best data science courses in pune
Best data science courses in puneBest data science courses in pune
Best data science courses in pune
 
best online data science courses
best online data science coursesbest online data science courses
best online data science courses
 
Top data science institutes in hyderabad
Top data science institutes in hyderabadTop data science institutes in hyderabad
Top data science institutes in hyderabad
 
Demystifying Recommendation Systems
Demystifying Recommendation SystemsDemystifying Recommendation Systems
Demystifying Recommendation Systems
 
IntroductionRecommenderSystems_Petroni.pdf
IntroductionRecommenderSystems_Petroni.pdfIntroductionRecommenderSystems_Petroni.pdf
IntroductionRecommenderSystems_Petroni.pdf
 
Modern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in MendeleyModern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in Mendeley
 
Survey analytics conjointanalysis_1
Survey analytics conjointanalysis_1Survey analytics conjointanalysis_1
Survey analytics conjointanalysis_1
 
IRJET- Hybrid Book Recommendation System
IRJET- Hybrid Book Recommendation SystemIRJET- Hybrid Book Recommendation System
IRJET- Hybrid Book Recommendation System
 

Recently uploaded

原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
ihavuls
 
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Kaxil Naik
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
Walaa Eldin Moustafa
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
Sachin Paul
 
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
a9qfiubqu
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
apvysm8
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
Timothy Spann
 
Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024
ElizabethGarrettChri
 
Monthly Management report for the Month of May 2024
Monthly Management report for the Month of May 2024Monthly Management report for the Month of May 2024
Monthly Management report for the Month of May 2024
facilitymanager11
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
nuttdpt
 
UofT毕业证如何办理
UofT毕业证如何办理UofT毕业证如何办理
UofT毕业证如何办理
exukyp
 
writing report business partner b1+ .pdf
writing report business partner b1+ .pdfwriting report business partner b1+ .pdf
writing report business partner b1+ .pdf
VyNguyen709676
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
Lars Albertsson
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Aggregage
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
Social Samosa
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
Social Samosa
 
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
hyfjgavov
 
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
xclpvhuk
 
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
mkkikqvo
 

Recently uploaded (20)

原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
 
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
 
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
 
Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024
 
Monthly Management report for the Month of May 2024
Monthly Management report for the Month of May 2024Monthly Management report for the Month of May 2024
Monthly Management report for the Month of May 2024
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
 
UofT毕业证如何办理
UofT毕业证如何办理UofT毕业证如何办理
UofT毕业证如何办理
 
writing report business partner b1+ .pdf
writing report business partner b1+ .pdfwriting report business partner b1+ .pdf
writing report business partner b1+ .pdf
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
 
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
 
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
 
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
 

Fashiondatasc

  • 1. Fashion and Data Suman Bhattacharya Senior Data Scientist @ Uber
  • 2. Astronomy Health Care retail social media e commerce About me 2003-2014 2014-2017 2017— Ride sharing
  • 3. Fashion Industry •  Rose lo bertin (July 2, 1747 – September 22, 1813) was the dressmaker named bill to Marie Antoinette, Queen of France, and a high public profile. Sometimes called sarcastically the "Minister of Fashion" Ref: Wikipedia • 1900— Golden age of fashion • $1.2 Trillion annual revenue as of 2016 • 240 Billion USA only
  • 4. A new breed of fashion companies • 2005— • online only • mostly do not own the brands themselves • leverages the idea of “shared economy” • uses data and technology as a leverage
  • 5. When Data meet Fashion Customer uses the app Get personalized list of items to choose from Receives a box delivered send back after usage or hold back the bought item , send the rest do it over Customer uses the app app flow data gathered user interaction data Provides feedback provides feedback ratings / experience data warehouse warehouse operations/ quality control data
  • 6. Data science project cycle Business problem Exploratory data analysis build some prediction system customer interact experiment
  • 7. Prediction system • Recommender system: given past preference of the customer, recommend clothes that the customer want to buy or get in the next box • Fit algorithm: predict the right size for the clothes that the customer wants to receive • Experimental methodology to test algorithms
  • 8. What Makes Me Likely to Buy? Provide me products that improve my life… …found in a place convenient to me …in a store where the products are easy to find …where they are provided by friendly people …who are able to anticipate my needs …at a time convenient for me …for a price I am willing to pay*. Given our product set, which products are customers demonstrating the most interest in? Which ones are they likely to be interested in next season?
  • 9. recommendation system landscape explicit (e.g. rating) implicit (e.g. click/not) feedback data algorithms factorization based e.g. SVD, latent factors neighborhood based e.g. kNN optimization based e.g. gradient descent data profile data e.g. user/ item information Bayesian recommendation System top-N e.g., predict top 10 items predict ratings e.g. for a user-item pair accuracy measure AUC/ROC precision/recall MSE end result ordered e.g., 1st>2nd unordered e.g., anywhere in top 10 ratings approach frequentist No user data: cold start problem-> Content based
  • 10. The Netflix Prize Problem: predict ratings that users give to movies optimize time-SVD++ Koren 2009
  • 11. data: rotten or not ratings reviews or/and + may be meta data user info: age/sex/profession/friends with item info: genre, dir, actors more available implicit feedback less available explicit feedback becoming available available relatively less available
  • 12. 12 • historically, focus has been on models that predict the ratings accurately • need explicit feedback, less available, less commercial use • lately, focus is more on top-N recommendation • implicit feedback ok, much more data, more commercial interests (top 5 books , top 10 movies) end goal of the recommendation system: predict stars predict top 5 itemsor
  • 13. 13 accuracy metrics: AUC/ROC if positions of items matters Precision/Recall if only top-N matters mean square error if predicting ratings
  • 14. 14 algorithms: neighborhood optimization factorization collaborative filtering find items that are similar to the items used or find users that use items similar to yours, then recommend the items they have used and you haven’t kNN very fast less accurate for predicting ratings reasonable accuracy for top-N SLIM fast less accurate for predicting ratings best accuracy for top-N matrix factorization to find latent factors relatively slow best accuracy for ratings predictions less accurate for top-N Can also be used for content based recommendation
  • 15. Recommendations - Signals we use 🔑🔑🔑 • Past Ratings • Purchases • Browse history • Brand, category, color & style types usage in the past. • Contexts: occasion, temperature/ weather of your location next 7 days • Find similar customers -> find what they’ve liked in the past -> if she hasn’t received it yet, we recommend it. • Need future prediction in real world! Most open source solutions are not
  • 16. 16 Matrix Factorization item user 1 0 0 1 . . 1 1 0 1 . . 0 0 0 1 . . ……. ……. …….. …….. •# factors typically << # items or # of users •User -factors: users taste given the past usage •Item- factors: item attributes •We use Factorization machine for our work E.g.: collaborative filtering using implicit feedback Factors user X = Factors Items
  • 17. Factorization machine Rendle 2012 temporal information: Past ratings works better in sparse data
  • 18. Visual recommendation • Convolution neural net visual features Visual Features K nearest neighbor find similar items to each item
  • 19. Visual recommendations • Extract features from an image to find similar products that are visually similar ~80% similar ~40% similar
  • 21. Visual Recommendations • Visual recommendations based on images in her closet Her closet
  • 22. Visual recommendations - extract hidden visual attributes • Similar items in other categories: Because you chose this ———————————————————————
  • 23. Predicting Fit Customer style Customer Fit • for each customer find the style, and the one that fit • Past ratings- what fit/ didn’t fit, customer body measurements, garment measurements, body-to-mass index • predict for each customer, top styles that will fit Customer see this
  • 24. How it works in production Data in HDFS FM training on Spark using ratings train image data using CNN in GPU k nearest neighbor additional Features combine fit and like score of each product for each user list of products users like to app Shuffle the list in real time as users interacts using Multi-Arm-Bandit train to predict fit
  • 25. Data science project cycle Business problem Exploratory data analysis build some prediction system customer interact experiment
  • 26. Treat data science as… science! The big question – especially with complex predictive analytics, is did it work?
  • 29. A/ B testing • simplest form of testing the causality • can have multiple treatment groups, so A/B is really A/B1/B2…/BN • randomly split traffic into two groups- A (control) and B(treatment) => e.g., random split: flip a coin to decide which one of your party guests goes to control or treatment => e.g. non-random: family members in treatment , else in control • Goal is to draw causal inference from the treatment group Feature: changed tab color
  • 30. first A/ B test in recorded history • Scurvy was an epidemic in 18th century • In 1740, Anson’s circumnavigation voyage- ->1900 sailors joined -> 1400 died most likely from Scurvy • Lind’s hypothesis: Scurvy could be cured by acids, include dietary supplements of acids • Lind’s approach: divide 12 sailors into six groups of two, give everyone same diet, but supplement that group 1: cider, 2: vitriol 3: vinegar , 4: seawater, and 5: oranges and lemon, 6: barley water • Result: group 5 recovered within a week James Lind Circa 1716- 94
  • 31. How to experiment? Steps Hypothesis how much desired lift with what type-I and II error split data into two groups expose treatment to one of the groups until you reach the required sample size Analyze results calculate required sample size
  • 32. How to experiment? Example Hypothesis how much desired lift with what type-I and II error split data into two groups expose treatment to one of the groups until you reach the required sample size Analyze results Calculate sample size required a new feature will Increase CTR Increase CTR by 10% with 5/100 false positive 20% type 2 error need pre experiment data to calculate sample size
  • 33. Limitations of A/ B testing-> MAB • cannot adaptively allocate user to treatment groups • cannot analyze results mid way through the experiment • if the lift seen is less than the desired lift , cannot confirm its significance • A/ B - explore first, then exploit • Multi arm Bandit- explore- exploit simultaneously, adaptively drive traffic towards winning variations without waiting for final results
  • 34. Recommendations and MAB • machine learning models have parameters to tune, search space is big • showing similar items sorted by score often looses the discovery aspect of the recommender system • split users into 10 groups- group 1: sees 1,2,3,4,5; group 2: 1,3,5,..; group3: 1, 5, 9,… • Explore - which combinations gives the best ratings- Exploit: choose that combinations for that user • We will know for which combinations works for each users, after the explore - exploit sorted by score: 1 2 3 4 5
  • 35. Takeaways • A new breed of fashion companies are changing the industry leveraging data and personalizing shoppers’ experience • Deep learning especially computer vision allow use to photos , this was previously untapped data. • Combining standard signals (ratings, clicks ) with signals from photos, improve the recommendation accuracy significantly • Experimentation is just as important as the prediction of the recommender system • Combining experimentation and recommender prediction further optimize the personalized experience of shoppers