SlideShare a Scribd company logo
Recommendation algorithms
in fashion
Rehan Ali, Y.J. Kim and Nik Anestev
2
I am Javier
DAWN: Deep Learning to Analyze Webb-detected
Nascent Galaxies
Machine Learning in Retail (jaggu.com)
B2B NLP services (growthintel.com)
Machine Learning in Insurance
RecoTour
◉ Similarity-based recommendations
◉ KNN CF
◉ GBM-based recommendations
◉ LibLinear, FM, FFM
◉ NMF
◉ Deep Learning
https://github.com/jrzaurin/RecoTour
Recommendation
algorithms in
Fashion
◉ Computer Vision
◉ NLP
◉ User Behaviour
◉ Production Pipeline
1 Computer Vision
Computer Vision is a field that includes methods for acquiring, processing, analysing and
understanding images to produce numerical information that can be interpreted by a
computer.
Item similarity: shape,color, pattern
Shape: Shape Context
1. Per point in the contour:
1.1. Find, within a given radius, the number of
other points at a distance d and angle .
1.2. Fill the corresponding bucket in a “Shape
Context” Matrix
2. Repeat
0 0 2 6 5 12 11 0 0 23 0 0
0 0 0 0 34 1 4 9 10 17 0 0
0 1 5 14 0 0 6 45 1 23 0 1
4 8 0 9 21 9 0 0 6 12 9 0
d
https://github.com/jrzaurin/Shoe-Shape-Classifier
cv2.ShapeContextDistanceExtractor()
Color: color histograms
In the RGB space colors range from 0 to 255. Simply
divide each channel into N bins and count the number
of pixels within each color-bin.
cv2.calcHist()
Color: color naming
We “all” see different colors.
Color: color naming algorithm
1. Prepare color naming table: download table and transform RGB into LAB color space.
2. Remove Image background
3. Compute a LAB color histogram of the image without background and extract those
bins that have nonzero counts
4. Compute Color distances : calculate the color distance between each bin and the LAB
values in the table from Step 1. The color distance should be computed using an
adequate metric, e.g. deltaE functions.
5. Count and assign color : let's say we have 100 nonzero bins. Based on the distances
computed at Step 4 we know that The LAB values from those bins are:
{bin1 : blue, bin2: blue, bin3: blue, ..., bin45: red, bin46: red, ..., bin100: green}
{bin1 : 10 pixels, bin2: 24, bin3: 2, ..., bin45: 20, bin46: 15, ..., bin100: 26}
{blue: 1293, red: 67, green: 325}
Content of the image is "blue" or "blue and green”
Pattern: Gabor Filters
cv2.getGaborKernel( ksize, σ, θ, λ, γ, ψ, ktype)
Where
Real Scenes
fashion scene
Automatically detect
and segment clothing
Real Scenes: clothe detection and segmentation algorithm
The algorithm splits an image I into n regions L = l1
,l1
,...,ln
. It then uses a pose estimator and a
skin detector to determine which of the regions are clothing ( C ) or not clothing (¬C)
◉ Pose estimator generates a probability map: pbody
◉ Skin detector generates a map pskin
◉ Edge detector generates L = l1
,l1
,...,ln
.
◉ We then classify each pixel in each region I(i,j)
○
○
◉ Classifies each region li
with pixel count ti
:
○
○
◉ Non-Maximal Suppression to retrieve largest clothing region
Real Scenes: clothe detection and segmentation algorithm
Dollár, Piotr, and C. Lawrence Zitnick. "Structured forests for fast edge detection." Computer Vision (ICCV), 2013
IEEE International Conference on. IEEE, 2013.
Real Scenes: clothe detection and segmentation algorithm
Background Torso Similar Regions Skin Clothing
2 NLP
Natural Language Processing is a field of computer science, artificial intelligence, and
computational linguistics concerned with the interactions between computers and
human (natural) languages
Item similarity: text mining+tf-idf
[…] It's crafted from coated canvas and decked out in the designer's famous
geometric pattern in tones of black, white and brown. There's plenty of room
inside to hold daily or overnight essentials. Carry it by the black leather handles
or over the shoulder with the detachablestrap. Black, white and brown
cube-print, coated-canvas Black leather top handless and trim detail Front and
back slip-pockets Gold-tone hook-fastening and zip top-closure Detachabel
and adjustabel […]
Text mining: spell corrector
norvig.com/spell-correct.html
For a given misspelled word w, we are trying to find the correction c, out of all possible
candidate corrections, that maximizes the probability that c is the intended correction:
◉ Language Model: P(c)
Probability that c appears as a word of English text
◉ Error Model: P(c|w)
Probability that w would be typed in a text when the author meant c
tf-idf
Term Frequency tf(t,d)
Can simply be the raw counts of a term in a document or a modified version
Inverse document Frequency idf(t,d)
Is a measure of how much information a word provide
tf-idf searcher:
Similarity Score = w1
*simproduct_title
+ w2
*simprod_description
Content Based Recommender
CV
Content Based Recommender
NLP
Shape Vector
Color Vector
Pattern Vector
Title Vector
Description Vector
love your users
User behaviour
u1
u2
u3
u4
u5
u6
u7
u8
it1
0 0 1 0 2 0 0 0
it2
0 1 0 3 0 0 0 0
it3
0 0 0 0 1 0 0 1
it4
4 1 0 0 0 0 0 0
it5
0 0 2 0 0 1 5 0
KNN item-based collaborative filtering
Using the interaction matrix we recommend items that
are similar based on how users interact with them
Matrix Factorization
Our goal is to find a representation of our users and
items based on interactions rather than feature-based
definitions
Where R is the MxN interaction matrix (M items and N
users) and I and U are MxK and NxK matrices. K are
called latent factors, and are representations of our
users and items.
Interaction matrix based on
user’s behaviour in the site
https://github.com/jrzaurin/RecoTour
ML in Production
Click, scroll, tap, add
to basket, open, etc
Recommendations
Redshift and CloudSearch
in sync
EC2
Http requests and queue
messages
User Interface
Data Collection and
recommendation service
ML in production
Data processing and ML
Any questions ?
You can find me at
◉ jrzaurin@gmail.com
◉ javier.rodriguez@simplybusiness.co.uk
Thanks!

More Related Content

What's hot

Hufman coding basic
Hufman coding basicHufman coding basic
Hufman coding basic
radthees
 
Small world effect
Small world effectSmall world effect
Small world effect
Zvi Lotker
 
Frame buffer
Frame bufferFrame buffer
Frame buffer
Aparna Joshi
 
Hybrid Technique for Copy-Move Forgery Detection Using L*A*B* Color Space
Hybrid Technique for Copy-Move Forgery Detection Using L*A*B* Color Space Hybrid Technique for Copy-Move Forgery Detection Using L*A*B* Color Space
Hybrid Technique for Copy-Move Forgery Detection Using L*A*B* Color Space
IJEEE
 
AI On Your Phone: CoreML and Tensorflow Lite
AI On Your Phone: CoreML and Tensorflow LiteAI On Your Phone: CoreML and Tensorflow Lite
AI On Your Phone: CoreML and Tensorflow Lite
Ray Deck
 
Machine Learning Basics
Machine Learning BasicsMachine Learning Basics
Machine Learning Basics
Humberto Marchezi
 
Chapter 3 data representations
Chapter 3 data representationsChapter 3 data representations
Chapter 3 data representations
ABDUmomo
 
Attributes of Output Primitives
Attributes of Output PrimitivesAttributes of Output Primitives
Attributes of Output Primitives
Renita Santhmayora
 
Multimedia digital images
 Multimedia  digital images Multimedia  digital images
Multimedia digital images
Mohammad Dwikat
 
Attributes of output primitive(line attributes)
Attributes of output primitive(line attributes)Attributes of output primitive(line attributes)
Attributes of output primitive(line attributes)
shalinikarunakaran1
 
Attributes of output primitives unit ii
Attributes of output primitives unit iiAttributes of output primitives unit ii
Attributes of output primitives unit ii
Balamurugan M
 
Image compression
Image compressionImage compression
Image compression
Ishucs
 

What's hot (12)

Hufman coding basic
Hufman coding basicHufman coding basic
Hufman coding basic
 
Small world effect
Small world effectSmall world effect
Small world effect
 
Frame buffer
Frame bufferFrame buffer
Frame buffer
 
Hybrid Technique for Copy-Move Forgery Detection Using L*A*B* Color Space
Hybrid Technique for Copy-Move Forgery Detection Using L*A*B* Color Space Hybrid Technique for Copy-Move Forgery Detection Using L*A*B* Color Space
Hybrid Technique for Copy-Move Forgery Detection Using L*A*B* Color Space
 
AI On Your Phone: CoreML and Tensorflow Lite
AI On Your Phone: CoreML and Tensorflow LiteAI On Your Phone: CoreML and Tensorflow Lite
AI On Your Phone: CoreML and Tensorflow Lite
 
Machine Learning Basics
Machine Learning BasicsMachine Learning Basics
Machine Learning Basics
 
Chapter 3 data representations
Chapter 3 data representationsChapter 3 data representations
Chapter 3 data representations
 
Attributes of Output Primitives
Attributes of Output PrimitivesAttributes of Output Primitives
Attributes of Output Primitives
 
Multimedia digital images
 Multimedia  digital images Multimedia  digital images
Multimedia digital images
 
Attributes of output primitive(line attributes)
Attributes of output primitive(line attributes)Attributes of output primitive(line attributes)
Attributes of output primitive(line attributes)
 
Attributes of output primitives unit ii
Attributes of output primitives unit iiAttributes of output primitives unit ii
Attributes of output primitives unit ii
 
Image compression
Image compressionImage compression
Image compression
 

Similar to Data Summer Conf 2018, “From the math to the business value: machine learning in the real world (ENG)” — Javier Rodriguez Zaurin, Data Scientist at Simply Business

ML Basic Concepts.pdf
ML Basic Concepts.pdfML Basic Concepts.pdf
ML Basic Concepts.pdf
ManishaS49
 
Java and Deep Learning
Java and Deep LearningJava and Deep Learning
Java and Deep Learning
Oswald Campesato
 
Machine Learning ebook.pdf
Machine Learning ebook.pdfMachine Learning ebook.pdf
Machine Learning ebook.pdf
HODIT12
 
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 11_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
MostafaHazemMostafaa
 
Multimodal Learning Analytics
Multimodal Learning AnalyticsMultimodal Learning Analytics
Multimodal Learning Analytics
Xavier Ochoa
 
Hand Written Digit Classification
Hand Written Digit ClassificationHand Written Digit Classification
Hand Written Digit Classification
ijtsrd
 
A Pointing Gesture-based Signal to Text Communication System Using OpenCV in ...
A Pointing Gesture-based Signal to Text Communication System Using OpenCV in ...A Pointing Gesture-based Signal to Text Communication System Using OpenCV in ...
A Pointing Gesture-based Signal to Text Communication System Using OpenCV in ...
IRJET Journal
 
Machine learning for document analysis and understanding
Machine learning for document analysis and understandingMachine learning for document analysis and understanding
Machine learning for document analysis and understanding
Seiichi Uchida
 
Overblik over kunstig intelligens og digital billedanalyse
Overblik over kunstig intelligens og digital billedanalyseOverblik over kunstig intelligens og digital billedanalyse
Overblik over kunstig intelligens og digital billedanalyse
LFF - Landsforeningen til bevaring af foto og film
 
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Jonathan Stray
 
CG.pptx
CG.pptxCG.pptx
CG.pptx
AdityaBisht34
 
MachinaFiesta: A Vision into Machine Learning 🚀
MachinaFiesta: A Vision into Machine Learning 🚀MachinaFiesta: A Vision into Machine Learning 🚀
MachinaFiesta: A Vision into Machine Learning 🚀
GDSCNiT
 
物件偵測與辨識技術
物件偵測與辨識技術物件偵測與辨識技術
物件偵測與辨識技術
CHENHuiMei
 
AI - Media Art. 인공지능과 미디어아트
AI - Media Art. 인공지능과 미디어아트AI - Media Art. 인공지능과 미디어아트
AI - Media Art. 인공지능과 미디어아트
Tae wook kang
 
know Machine Learning Basic Concepts.pdf
know Machine Learning Basic Concepts.pdfknow Machine Learning Basic Concepts.pdf
know Machine Learning Basic Concepts.pdf
hemangppatel
 
Color Based Object Tracking with OpenCV A Survey
Color Based Object Tracking with OpenCV A SurveyColor Based Object Tracking with OpenCV A Survey
Color Based Object Tracking with OpenCV A Survey
YogeshIJTSRD
 
Content Based Image Retrieval (CBIR)
Content Based Image Retrieval (CBIR)Content Based Image Retrieval (CBIR)
Content Based Image Retrieval (CBIR)
Behzad Shomali
 
People counting in low density video sequences2
People counting in low density video sequences2People counting in low density video sequences2
People counting in low density video sequences2
Ahmed Tememe
 
Meetup 29042015
Meetup 29042015Meetup 29042015
Meetup 29042015lbishal
 

Similar to Data Summer Conf 2018, “From the math to the business value: machine learning in the real world (ENG)” — Javier Rodriguez Zaurin, Data Scientist at Simply Business (20)

ML Basic Concepts.pdf
ML Basic Concepts.pdfML Basic Concepts.pdf
ML Basic Concepts.pdf
 
Java and Deep Learning
Java and Deep LearningJava and Deep Learning
Java and Deep Learning
 
Machine Learning ebook.pdf
Machine Learning ebook.pdfMachine Learning ebook.pdf
Machine Learning ebook.pdf
 
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 11_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
 
Multimodal Learning Analytics
Multimodal Learning AnalyticsMultimodal Learning Analytics
Multimodal Learning Analytics
 
Hand Written Digit Classification
Hand Written Digit ClassificationHand Written Digit Classification
Hand Written Digit Classification
 
A Pointing Gesture-based Signal to Text Communication System Using OpenCV in ...
A Pointing Gesture-based Signal to Text Communication System Using OpenCV in ...A Pointing Gesture-based Signal to Text Communication System Using OpenCV in ...
A Pointing Gesture-based Signal to Text Communication System Using OpenCV in ...
 
Machine learning for document analysis and understanding
Machine learning for document analysis and understandingMachine learning for document analysis and understanding
Machine learning for document analysis and understanding
 
mini prjt
mini prjtmini prjt
mini prjt
 
Overblik over kunstig intelligens og digital billedanalyse
Overblik over kunstig intelligens og digital billedanalyseOverblik over kunstig intelligens og digital billedanalyse
Overblik over kunstig intelligens og digital billedanalyse
 
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
 
CG.pptx
CG.pptxCG.pptx
CG.pptx
 
MachinaFiesta: A Vision into Machine Learning 🚀
MachinaFiesta: A Vision into Machine Learning 🚀MachinaFiesta: A Vision into Machine Learning 🚀
MachinaFiesta: A Vision into Machine Learning 🚀
 
物件偵測與辨識技術
物件偵測與辨識技術物件偵測與辨識技術
物件偵測與辨識技術
 
AI - Media Art. 인공지능과 미디어아트
AI - Media Art. 인공지능과 미디어아트AI - Media Art. 인공지능과 미디어아트
AI - Media Art. 인공지능과 미디어아트
 
know Machine Learning Basic Concepts.pdf
know Machine Learning Basic Concepts.pdfknow Machine Learning Basic Concepts.pdf
know Machine Learning Basic Concepts.pdf
 
Color Based Object Tracking with OpenCV A Survey
Color Based Object Tracking with OpenCV A SurveyColor Based Object Tracking with OpenCV A Survey
Color Based Object Tracking with OpenCV A Survey
 
Content Based Image Retrieval (CBIR)
Content Based Image Retrieval (CBIR)Content Based Image Retrieval (CBIR)
Content Based Image Retrieval (CBIR)
 
People counting in low density video sequences2
People counting in low density video sequences2People counting in low density video sequences2
People counting in low density video sequences2
 
Meetup 29042015
Meetup 29042015Meetup 29042015
Meetup 29042015
 

More from Provectus

Choosing the right IDP Solution
Choosing the right IDP SolutionChoosing the right IDP Solution
Choosing the right IDP Solution
Provectus
 
Intelligent Document Processing in Healthcare. Choosing the Right Solutions.
Intelligent Document Processing in Healthcare. Choosing the Right Solutions.Intelligent Document Processing in Healthcare. Choosing the Right Solutions.
Intelligent Document Processing in Healthcare. Choosing the Right Solutions.
Provectus
 
Choosing the Right Document Processing Solution for Healthcare Organizations
Choosing the Right Document Processing Solution for Healthcare OrganizationsChoosing the Right Document Processing Solution for Healthcare Organizations
Choosing the Right Document Processing Solution for Healthcare Organizations
Provectus
 
MLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionMLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in Production
Provectus
 
AI Stack on AWS: Amazon SageMaker and Beyond
AI Stack on AWS: Amazon SageMaker and BeyondAI Stack on AWS: Amazon SageMaker and Beyond
AI Stack on AWS: Amazon SageMaker and Beyond
Provectus
 
Feature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine LearningFeature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine Learning
Provectus
 
MLOps and Reproducible ML on AWS with Kubeflow and SageMaker
MLOps and Reproducible ML on AWS with Kubeflow and SageMakerMLOps and Reproducible ML on AWS with Kubeflow and SageMaker
MLOps and Reproducible ML on AWS with Kubeflow and SageMaker
Provectus
 
Cost Optimization for Apache Hadoop/Spark Workloads with Amazon EMR
Cost Optimization for Apache Hadoop/Spark Workloads with Amazon EMRCost Optimization for Apache Hadoop/Spark Workloads with Amazon EMR
Cost Optimization for Apache Hadoop/Spark Workloads with Amazon EMR
Provectus
 
ODSC webinar "Kubeflow, MLFlow and Beyond — augmenting ML delivery" Stepan Pu...
ODSC webinar "Kubeflow, MLFlow and Beyond — augmenting ML delivery" Stepan Pu...ODSC webinar "Kubeflow, MLFlow and Beyond — augmenting ML delivery" Stepan Pu...
ODSC webinar "Kubeflow, MLFlow and Beyond — augmenting ML delivery" Stepan Pu...
Provectus
 
"Building a Modern Data platform in the Cloud", Alex Casalboni, AWS Dev Day K...
"Building a Modern Data platform in the Cloud", Alex Casalboni, AWS Dev Day K..."Building a Modern Data platform in the Cloud", Alex Casalboni, AWS Dev Day K...
"Building a Modern Data platform in the Cloud", Alex Casalboni, AWS Dev Day K...
Provectus
 
"How to build a global serverless service", Alex Casalboni, AWS Dev Day Kyiv ...
"How to build a global serverless service", Alex Casalboni, AWS Dev Day Kyiv ..."How to build a global serverless service", Alex Casalboni, AWS Dev Day Kyiv ...
"How to build a global serverless service", Alex Casalboni, AWS Dev Day Kyiv ...
Provectus
 
"Automating AWS Infrastructure with PowerShell", Martin Beeby, AWS Dev Day Ky...
"Automating AWS Infrastructure with PowerShell", Martin Beeby, AWS Dev Day Ky..."Automating AWS Infrastructure with PowerShell", Martin Beeby, AWS Dev Day Ky...
"Automating AWS Infrastructure with PowerShell", Martin Beeby, AWS Dev Day Ky...
Provectus
 
"Analyzing your web and application logs", Javier Ramirez, AWS Dev Day Kyiv 2...
"Analyzing your web and application logs", Javier Ramirez, AWS Dev Day Kyiv 2..."Analyzing your web and application logs", Javier Ramirez, AWS Dev Day Kyiv 2...
"Analyzing your web and application logs", Javier Ramirez, AWS Dev Day Kyiv 2...
Provectus
 
"Resiliency and Availability Design Patterns for the Cloud", Sebastien Storma...
"Resiliency and Availability Design Patterns for the Cloud", Sebastien Storma..."Resiliency and Availability Design Patterns for the Cloud", Sebastien Storma...
"Resiliency and Availability Design Patterns for the Cloud", Sebastien Storma...
Provectus
 
"Architecting SaaS solutions on AWS", Oleksandr Mykhalchuk, AWS Dev Day Kyiv ...
"Architecting SaaS solutions on AWS", Oleksandr Mykhalchuk, AWS Dev Day Kyiv ..."Architecting SaaS solutions on AWS", Oleksandr Mykhalchuk, AWS Dev Day Kyiv ...
"Architecting SaaS solutions on AWS", Oleksandr Mykhalchuk, AWS Dev Day Kyiv ...
Provectus
 
"Developing with .NET Core on AWS", Martin Beeby, AWS Dev Day Kyiv 2019
"Developing with .NET Core on AWS", Martin Beeby, AWS Dev Day Kyiv 2019"Developing with .NET Core on AWS", Martin Beeby, AWS Dev Day Kyiv 2019
"Developing with .NET Core on AWS", Martin Beeby, AWS Dev Day Kyiv 2019
Provectus
 
"How to build real-time backends", Martin Beeby, AWS Dev Day Kyiv 2019
"How to build real-time backends", Martin Beeby, AWS Dev Day Kyiv 2019"How to build real-time backends", Martin Beeby, AWS Dev Day Kyiv 2019
"How to build real-time backends", Martin Beeby, AWS Dev Day Kyiv 2019
Provectus
 
"Integrate your front end apps with serverless backend in the cloud", Sebasti...
"Integrate your front end apps with serverless backend in the cloud", Sebasti..."Integrate your front end apps with serverless backend in the cloud", Sebasti...
"Integrate your front end apps with serverless backend in the cloud", Sebasti...
Provectus
 
"Scaling ML from 0 to millions of users", Julien Simon, AWS Dev Day Kyiv 2019
"Scaling ML from 0 to millions of users", Julien Simon, AWS Dev Day Kyiv 2019"Scaling ML from 0 to millions of users", Julien Simon, AWS Dev Day Kyiv 2019
"Scaling ML from 0 to millions of users", Julien Simon, AWS Dev Day Kyiv 2019
Provectus
 
How to implement authorization in your backend with AWS IAM
How to implement authorization in your backend with AWS IAMHow to implement authorization in your backend with AWS IAM
How to implement authorization in your backend with AWS IAM
Provectus
 

More from Provectus (20)

Choosing the right IDP Solution
Choosing the right IDP SolutionChoosing the right IDP Solution
Choosing the right IDP Solution
 
Intelligent Document Processing in Healthcare. Choosing the Right Solutions.
Intelligent Document Processing in Healthcare. Choosing the Right Solutions.Intelligent Document Processing in Healthcare. Choosing the Right Solutions.
Intelligent Document Processing in Healthcare. Choosing the Right Solutions.
 
Choosing the Right Document Processing Solution for Healthcare Organizations
Choosing the Right Document Processing Solution for Healthcare OrganizationsChoosing the Right Document Processing Solution for Healthcare Organizations
Choosing the Right Document Processing Solution for Healthcare Organizations
 
MLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionMLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in Production
 
AI Stack on AWS: Amazon SageMaker and Beyond
AI Stack on AWS: Amazon SageMaker and BeyondAI Stack on AWS: Amazon SageMaker and Beyond
AI Stack on AWS: Amazon SageMaker and Beyond
 
Feature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine LearningFeature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine Learning
 
MLOps and Reproducible ML on AWS with Kubeflow and SageMaker
MLOps and Reproducible ML on AWS with Kubeflow and SageMakerMLOps and Reproducible ML on AWS with Kubeflow and SageMaker
MLOps and Reproducible ML on AWS with Kubeflow and SageMaker
 
Cost Optimization for Apache Hadoop/Spark Workloads with Amazon EMR
Cost Optimization for Apache Hadoop/Spark Workloads with Amazon EMRCost Optimization for Apache Hadoop/Spark Workloads with Amazon EMR
Cost Optimization for Apache Hadoop/Spark Workloads with Amazon EMR
 
ODSC webinar "Kubeflow, MLFlow and Beyond — augmenting ML delivery" Stepan Pu...
ODSC webinar "Kubeflow, MLFlow and Beyond — augmenting ML delivery" Stepan Pu...ODSC webinar "Kubeflow, MLFlow and Beyond — augmenting ML delivery" Stepan Pu...
ODSC webinar "Kubeflow, MLFlow and Beyond — augmenting ML delivery" Stepan Pu...
 
"Building a Modern Data platform in the Cloud", Alex Casalboni, AWS Dev Day K...
"Building a Modern Data platform in the Cloud", Alex Casalboni, AWS Dev Day K..."Building a Modern Data platform in the Cloud", Alex Casalboni, AWS Dev Day K...
"Building a Modern Data platform in the Cloud", Alex Casalboni, AWS Dev Day K...
 
"How to build a global serverless service", Alex Casalboni, AWS Dev Day Kyiv ...
"How to build a global serverless service", Alex Casalboni, AWS Dev Day Kyiv ..."How to build a global serverless service", Alex Casalboni, AWS Dev Day Kyiv ...
"How to build a global serverless service", Alex Casalboni, AWS Dev Day Kyiv ...
 
"Automating AWS Infrastructure with PowerShell", Martin Beeby, AWS Dev Day Ky...
"Automating AWS Infrastructure with PowerShell", Martin Beeby, AWS Dev Day Ky..."Automating AWS Infrastructure with PowerShell", Martin Beeby, AWS Dev Day Ky...
"Automating AWS Infrastructure with PowerShell", Martin Beeby, AWS Dev Day Ky...
 
"Analyzing your web and application logs", Javier Ramirez, AWS Dev Day Kyiv 2...
"Analyzing your web and application logs", Javier Ramirez, AWS Dev Day Kyiv 2..."Analyzing your web and application logs", Javier Ramirez, AWS Dev Day Kyiv 2...
"Analyzing your web and application logs", Javier Ramirez, AWS Dev Day Kyiv 2...
 
"Resiliency and Availability Design Patterns for the Cloud", Sebastien Storma...
"Resiliency and Availability Design Patterns for the Cloud", Sebastien Storma..."Resiliency and Availability Design Patterns for the Cloud", Sebastien Storma...
"Resiliency and Availability Design Patterns for the Cloud", Sebastien Storma...
 
"Architecting SaaS solutions on AWS", Oleksandr Mykhalchuk, AWS Dev Day Kyiv ...
"Architecting SaaS solutions on AWS", Oleksandr Mykhalchuk, AWS Dev Day Kyiv ..."Architecting SaaS solutions on AWS", Oleksandr Mykhalchuk, AWS Dev Day Kyiv ...
"Architecting SaaS solutions on AWS", Oleksandr Mykhalchuk, AWS Dev Day Kyiv ...
 
"Developing with .NET Core on AWS", Martin Beeby, AWS Dev Day Kyiv 2019
"Developing with .NET Core on AWS", Martin Beeby, AWS Dev Day Kyiv 2019"Developing with .NET Core on AWS", Martin Beeby, AWS Dev Day Kyiv 2019
"Developing with .NET Core on AWS", Martin Beeby, AWS Dev Day Kyiv 2019
 
"How to build real-time backends", Martin Beeby, AWS Dev Day Kyiv 2019
"How to build real-time backends", Martin Beeby, AWS Dev Day Kyiv 2019"How to build real-time backends", Martin Beeby, AWS Dev Day Kyiv 2019
"How to build real-time backends", Martin Beeby, AWS Dev Day Kyiv 2019
 
"Integrate your front end apps with serverless backend in the cloud", Sebasti...
"Integrate your front end apps with serverless backend in the cloud", Sebasti..."Integrate your front end apps with serverless backend in the cloud", Sebasti...
"Integrate your front end apps with serverless backend in the cloud", Sebasti...
 
"Scaling ML from 0 to millions of users", Julien Simon, AWS Dev Day Kyiv 2019
"Scaling ML from 0 to millions of users", Julien Simon, AWS Dev Day Kyiv 2019"Scaling ML from 0 to millions of users", Julien Simon, AWS Dev Day Kyiv 2019
"Scaling ML from 0 to millions of users", Julien Simon, AWS Dev Day Kyiv 2019
 
How to implement authorization in your backend with AWS IAM
How to implement authorization in your backend with AWS IAMHow to implement authorization in your backend with AWS IAM
How to implement authorization in your backend with AWS IAM
 

Recently uploaded

insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
anitaento25
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
Lokesh Patil
 
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptxBody fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
muralinath2
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
pablovgd
 
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCINGRNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
AADYARAJPANDEY1
 
Cancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayCancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate Pathway
AADYARAJPANDEY1
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
ChetanK57
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
Sérgio Sacani
 
extra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdfextra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdf
DiyaBiswas10
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
muralinath2
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Sérgio Sacani
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
yusufzako14
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
Nistarini College, Purulia (W.B) India
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
muralinath2
 
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Ana Luísa Pinho
 
Structural Classification Of Protein (SCOP)
Structural Classification Of Protein  (SCOP)Structural Classification Of Protein  (SCOP)
Structural Classification Of Protein (SCOP)
aishnasrivastava
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
ossaicprecious19
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
IqrimaNabilatulhusni
 
Hemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptxHemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptx
muralinath2
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
moosaasad1975
 

Recently uploaded (20)

insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
 
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptxBody fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
 
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCINGRNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
 
Cancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayCancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate Pathway
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
 
extra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdfextra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdf
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
 
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
 
Structural Classification Of Protein (SCOP)
Structural Classification Of Protein  (SCOP)Structural Classification Of Protein  (SCOP)
Structural Classification Of Protein (SCOP)
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
 
Hemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptxHemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptx
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
 

Data Summer Conf 2018, “From the math to the business value: machine learning in the real world (ENG)” — Javier Rodriguez Zaurin, Data Scientist at Simply Business

  • 1. Recommendation algorithms in fashion Rehan Ali, Y.J. Kim and Nik Anestev
  • 2. 2 I am Javier DAWN: Deep Learning to Analyze Webb-detected Nascent Galaxies Machine Learning in Retail (jaggu.com) B2B NLP services (growthintel.com) Machine Learning in Insurance
  • 3. RecoTour ◉ Similarity-based recommendations ◉ KNN CF ◉ GBM-based recommendations ◉ LibLinear, FM, FFM ◉ NMF ◉ Deep Learning https://github.com/jrzaurin/RecoTour
  • 4. Recommendation algorithms in Fashion ◉ Computer Vision ◉ NLP ◉ User Behaviour ◉ Production Pipeline
  • 5. 1 Computer Vision Computer Vision is a field that includes methods for acquiring, processing, analysing and understanding images to produce numerical information that can be interpreted by a computer.
  • 7. Shape: Shape Context 1. Per point in the contour: 1.1. Find, within a given radius, the number of other points at a distance d and angle . 1.2. Fill the corresponding bucket in a “Shape Context” Matrix 2. Repeat 0 0 2 6 5 12 11 0 0 23 0 0 0 0 0 0 34 1 4 9 10 17 0 0 0 1 5 14 0 0 6 45 1 23 0 1 4 8 0 9 21 9 0 0 6 12 9 0 d https://github.com/jrzaurin/Shoe-Shape-Classifier cv2.ShapeContextDistanceExtractor()
  • 8. Color: color histograms In the RGB space colors range from 0 to 255. Simply divide each channel into N bins and count the number of pixels within each color-bin. cv2.calcHist()
  • 9. Color: color naming We “all” see different colors.
  • 10. Color: color naming algorithm 1. Prepare color naming table: download table and transform RGB into LAB color space. 2. Remove Image background 3. Compute a LAB color histogram of the image without background and extract those bins that have nonzero counts 4. Compute Color distances : calculate the color distance between each bin and the LAB values in the table from Step 1. The color distance should be computed using an adequate metric, e.g. deltaE functions. 5. Count and assign color : let's say we have 100 nonzero bins. Based on the distances computed at Step 4 we know that The LAB values from those bins are: {bin1 : blue, bin2: blue, bin3: blue, ..., bin45: red, bin46: red, ..., bin100: green} {bin1 : 10 pixels, bin2: 24, bin3: 2, ..., bin45: 20, bin46: 15, ..., bin100: 26} {blue: 1293, red: 67, green: 325} Content of the image is "blue" or "blue and green”
  • 11. Pattern: Gabor Filters cv2.getGaborKernel( ksize, σ, θ, λ, γ, ψ, ktype) Where
  • 12. Real Scenes fashion scene Automatically detect and segment clothing
  • 13. Real Scenes: clothe detection and segmentation algorithm The algorithm splits an image I into n regions L = l1 ,l1 ,...,ln . It then uses a pose estimator and a skin detector to determine which of the regions are clothing ( C ) or not clothing (¬C) ◉ Pose estimator generates a probability map: pbody ◉ Skin detector generates a map pskin ◉ Edge detector generates L = l1 ,l1 ,...,ln . ◉ We then classify each pixel in each region I(i,j) ○ ○ ◉ Classifies each region li with pixel count ti : ○ ○ ◉ Non-Maximal Suppression to retrieve largest clothing region
  • 14. Real Scenes: clothe detection and segmentation algorithm Dollár, Piotr, and C. Lawrence Zitnick. "Structured forests for fast edge detection." Computer Vision (ICCV), 2013 IEEE International Conference on. IEEE, 2013.
  • 15. Real Scenes: clothe detection and segmentation algorithm Background Torso Similar Regions Skin Clothing
  • 16. 2 NLP Natural Language Processing is a field of computer science, artificial intelligence, and computational linguistics concerned with the interactions between computers and human (natural) languages Item similarity: text mining+tf-idf […] It's crafted from coated canvas and decked out in the designer's famous geometric pattern in tones of black, white and brown. There's plenty of room inside to hold daily or overnight essentials. Carry it by the black leather handles or over the shoulder with the detachablestrap. Black, white and brown cube-print, coated-canvas Black leather top handless and trim detail Front and back slip-pockets Gold-tone hook-fastening and zip top-closure Detachabel and adjustabel […]
  • 17. Text mining: spell corrector norvig.com/spell-correct.html For a given misspelled word w, we are trying to find the correction c, out of all possible candidate corrections, that maximizes the probability that c is the intended correction: ◉ Language Model: P(c) Probability that c appears as a word of English text ◉ Error Model: P(c|w) Probability that w would be typed in a text when the author meant c
  • 18. tf-idf Term Frequency tf(t,d) Can simply be the raw counts of a term in a document or a modified version Inverse document Frequency idf(t,d) Is a measure of how much information a word provide tf-idf searcher: Similarity Score = w1 *simproduct_title + w2 *simprod_description
  • 20. CV Content Based Recommender NLP Shape Vector Color Vector Pattern Vector Title Vector Description Vector
  • 22. User behaviour u1 u2 u3 u4 u5 u6 u7 u8 it1 0 0 1 0 2 0 0 0 it2 0 1 0 3 0 0 0 0 it3 0 0 0 0 1 0 0 1 it4 4 1 0 0 0 0 0 0 it5 0 0 2 0 0 1 5 0 KNN item-based collaborative filtering Using the interaction matrix we recommend items that are similar based on how users interact with them Matrix Factorization Our goal is to find a representation of our users and items based on interactions rather than feature-based definitions Where R is the MxN interaction matrix (M items and N users) and I and U are MxK and NxK matrices. K are called latent factors, and are representations of our users and items. Interaction matrix based on user’s behaviour in the site https://github.com/jrzaurin/RecoTour
  • 24. Click, scroll, tap, add to basket, open, etc Recommendations Redshift and CloudSearch in sync EC2 Http requests and queue messages User Interface Data Collection and recommendation service ML in production Data processing and ML
  • 25. Any questions ? You can find me at ◉ jrzaurin@gmail.com ◉ javier.rodriguez@simplybusiness.co.uk Thanks!