Product matching is the challenge of examining two different representations of retail products (think items that you see on e-commerce websites) and determining whether they both refer to the same product. Tackling this problem requires a mix of NLP (to deal with text data), computer vision (to deal with product images), ontology management and more (to ingest a host of other signals on offer).
I’ve been working on this problem in various capacities for a few years now at Semantics3. During this period, I’ve made a fair number of mistakes which in turn have taught me useful lessons about applying deep/machine learning in an industry setting.
During this talk, I’d like to walk you through 5 scenarios in which I set out to achieve a particular goal in the context of product matching, but ran into an unexpected problem that threw a spanner in the works. For each one, I’ll discuss the root cause behind the problem and the lesson I learned from uncovering it. Where relevant, I’ll bring in examples from outside the retail domain to broaden the perspective offered.
The goal of the talk isn’t to provide a guidebook for solving the product matching problem; rather, it is to give you insight into the ups and downs of working through a specific data-science problem and, in the process, to deliver packaged lessons that you can draw on in your own field of work.
9. Build a good dataset for training and validation
● Matches (1): Manually curated by humans
● Non-Matches (0): Semi-automated heuristic-based generation
Goal
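One common way to generate non-matches semi-automatically is to pair distinct products that share a category, yielding hard negatives rather than trivially dissimilar pairs. The sketch below is illustrative, not the talk’s actual pipeline; the field names and pairing rule are assumptions.

```python
# Sketch of heuristic non-match generation (field names are illustrative).
# Positives come from human curation; negatives are synthesized by pairing
# each product with a *different* product from the same category, so the
# model sees hard negatives rather than trivially dissimilar pairs.
import itertools

products = [
    {"id": "a1", "category": "shoes", "title": "Nike Air Zoom Pegasus 38"},
    {"id": "a2", "category": "shoes", "title": "Adidas Ultraboost 21"},
    {"id": "b1", "category": "laptops", "title": "MacBook Air M1 256GB"},
    {"id": "b2", "category": "laptops", "title": "Dell XPS 13 512GB"},
]

def generate_non_matches(products):
    """Pair distinct products within a category and label them 0 (non-match)."""
    pairs = []
    for p, q in itertools.combinations(products, 2):
        if p["category"] == q["category"] and p["id"] != q["id"]:
            pairs.append((p["id"], q["id"], 0))  # label 0 = non-match
    return pairs

print(generate_non_matches(products))
```

In practice such heuristics need auditing: a pairing rule that is too easy (e.g. cross-category pairs) teaches the model a shortcut rather than the matching task itself, which is exactly the kind of dataset quirk discussed next.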
15. ➔ Watch out for quirks in your training dataset, especially causal vs. incidental relationships.
➔ Models don’t care about your problem; they only care about minimizing loss.
➔ When working on your own custom problems, you can’t assume your dataset is flawless (unlike peer-reviewed, standardized datasets).
Lessons
16. ➔ “Automated Inference on Criminality using Face Images” [Wu & Zhang, Nov 2016]
➔ Reported ~90% accuracy in identifying “criminality” (AlexNet)
“[…] the angle θ from nose tip to two mouth corners is on average 19.6% smaller for criminals
than for non-criminals and has a larger variance. Also, the upper lip curvature ρ is on average
23.4% larger for criminals than for noncriminals. On the other hand, the distance d between
two eye inner corners for criminals is slightly narrower (5.6%) than for non-criminals.”
Aside
17. ➔ Teardown (Link)
◆ Bias towards collared shirts?
◆ Bias against younger people?
◆ Bias towards likelihood of conviction or criminality?
Aside
21. Find the odd one out:
1. Map of Arizona
2. Map of AR
3. Map of Arkansas
No underlying rule here
Cause
22. ➔ Sift out knowledge-based tasks from logic-based tasks.
➔ “Never mind a neural network; can a human with no prior knowledge, educated on nothing but a diet of your training dataset, solve the problem?”
➔ Spending hours poring over your dataset can be rewarding.
Lessons
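The Arizona/AR/Arkansas example is knowledge-based: no amount of training on surface strings can teach a model that “AR” denotes Arkansas. A hedged sketch of injecting that knowledge directly, with a deliberately truncated lookup table (the table and function names are illustrative):

```python
# Minimal sketch: injecting domain knowledge the training data cannot supply.
# A model seeing only surface strings has no way to learn that "AR" denotes
# Arkansas; a lookup table resolves the ambiguity before matching.
STATE_ABBREVIATIONS = {"AZ": "arizona", "AR": "arkansas"}  # truncated for brevity

def normalize_title(title):
    """Lowercase tokens and expand known state abbreviations."""
    tokens = title.split()
    expanded = [STATE_ABBREVIATIONS.get(t, t.lower()) for t in tokens]
    return " ".join(expanded)

# "Map of AR" now normalizes to the same string as "Map of Arkansas",
# while "Map of Arizona" stays distinct.
print(normalize_title("Map of AR"))  # map of arkansas
```

After normalization, the odd one out becomes a plain string-equality question, i.e. a logic-based task a model (or a human with no prior knowledge) can actually solve.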
24. Combine multiple models built on individual signals into a single multimodal model
Goal
25. Problem
Combined model only slightly better than the best individual model

Model         Accuracy
Text only     X%
Image only    Y%
Image + Text  max(X, Y) + ε%
26. ➔ Combined model had learned to only consider unimodal features / the stronger of the two signals.
➔ It had failed to learn correlations between images and text.
➔ Since our text and image models had been pre-trained separately, they’d learned isolated, unrelated representations.
Cause
How do we learn shared representations?
29. ➔ Check if your multimodal models have been able to learn meaningful correlations / shared representations.
➔ If you want your network to develop a characteristic, explicitly set an objective to achieve this goal (autoencoder example).
Lessons
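One cheap way to run that check is a modality-ablation probe: zero out one modality’s features and measure how much the prediction moves. The sketch below uses a dummy model (all names and values are illustrative) that deliberately ignores its image features, reproducing the failure mode described above.

```python
# Diagnostic sketch: does a "multimodal" model actually use both modalities?
# Ablate (zero out) each modality in turn and measure the prediction shift.

def dummy_combined_model(text_feats, image_feats):
    # A degenerate fusion model that learned to rely on text alone.
    return sum(text_feats) / len(text_feats)

def modality_sensitivity(model, text_feats, image_feats):
    """Return how much the prediction moves when each modality is zeroed."""
    base = model(text_feats, image_feats)
    no_text = model([0.0] * len(text_feats), image_feats)
    no_image = model(text_feats, [0.0] * len(image_feats))
    return {"text": abs(base - no_text), "image": abs(base - no_image)}

sens = modality_sensitivity(dummy_combined_model, [0.8, 0.6], [0.9, 0.1])
print(sens)  # image sensitivity is 0.0 -> the image pathway is dead weight
```

A near-zero sensitivity for one modality is a red flag that the network has collapsed onto the stronger signal; an auxiliary objective (e.g. reconstructing one modality from the other, autoencoder-style) is one way to force a shared representation.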
31. ➔ Make a case to the team for replacing our hand-crafted, heuristic-based model with a machine-learning model.
➔ But in benchmark tests, for certain pockets of data, the simplistic heuristic-based approach performed better!
Goal & Problem
32. For these pockets of data, one or more of the following was at play:
➔ Our training data wasn’t rich enough.
➔ Our model hadn’t been perfectly tuned.
➔ Our older hand-crafted features were surprisingly good.
Cause
34. Lessons
➔ Hand-crafted feature engineering is a potent tool. Critical for best-in-class solutions for image retrieval, tagging and more.
➔ It can be cheaper and quicker than architecture engineering. You can’t deep-learn your way out of everything.
➔ A good way to think up features is to retrace your own intermediary cognitive steps.
➔ Find data scientists who are willing to do last-mile grunt work.
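Retracing your own cognitive steps might look like the sketch below: when a human compares two product listings, they check identifiers, scan title overlap, and sanity-check prices. These features are illustrative assumptions, not the talk’s actual feature set.

```python
# Hedged sketch of hand-crafted features for product matching, mirroring the
# intermediary steps a human annotator takes (identifier check, title overlap,
# price sanity check). Field names are illustrative.

def handcrafted_features(a, b):
    # 1. Do both listings carry the same UPC (when present)?
    upc_match = 1.0 if a.get("upc") and a.get("upc") == b.get("upc") else 0.0
    # 2. How much do the titles overlap? (Jaccard similarity of tokens)
    ta, tb = set(a["title"].lower().split()), set(b["title"].lower().split())
    title_jaccard = len(ta & tb) / len(ta | tb) if ta | tb else 0.0
    # 3. Are the prices in the same ballpark? (ratio of cheaper to pricier)
    lo, hi = sorted([a["price"], b["price"]])
    price_ratio = lo / hi if hi else 0.0
    return {"upc_match": upc_match,
            "title_jaccard": title_jaccard,
            "price_ratio": price_ratio}

a = {"upc": "0123", "title": "Nike Pegasus 38 Black", "price": 120.0}
b = {"upc": "0123", "title": "Nike Air Pegasus 38", "price": 110.0}
print(handcrafted_features(a, b))
```

Features like these can feed a simple classifier alongside (or instead of) learned embeddings, which is often cheaper than engineering a new architecture to learn the same cues.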
36. Goal & Problem
➔ Package our model as an AI-as-a-service product offering.
➔ Load the model behind a metered firewall, and we’re good to go. Right ...?
➔ The service worked well for some customers, but failed miserably for others.
39. ➔ Moving from ML model → ML product isn’t easy. Algorithmic APIs are “non-deterministic” (unlike Stripe/Facebook APIs).
➔ Product design is crucial; PMs take note.
➔ Setting customer expectations is crucial; UX designers take note.
➔ Building (multiple) models resilient to different types of data in the last mile is crucial; data scientists take note.
Lessons
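One product-design pattern that helps set customer expectations is confidence gating: instead of always returning a hard yes/no, the API returns an explicit "needs review" outcome when the model is unsure. The thresholds and response shape below are illustrative assumptions, not a real API.

```python
# Sketch: gating an ML-backed API on model confidence so customers see an
# explicit "needs_review" outcome instead of a silently wrong answer.
# Thresholds and the response shape are illustrative.

def match_response(score, threshold_match=0.9, threshold_no_match=0.1):
    """Map a raw match score to an API decision with an abstention band."""
    if score >= threshold_match:
        return {"decision": "match", "score": score}
    if score <= threshold_no_match:
        return {"decision": "no_match", "score": score}
    return {"decision": "needs_review", "score": score}

print(match_response(0.95))  # confident match
print(match_response(0.5))   # routed to human review / fallback heuristic
```

The abstention band is also where fallback heuristics or specialist models for awkward pockets of data can be plugged in, addressing the last-mile resilience lesson above.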