Betabrand presentation

•Download as PPTX, PDF•

0 likes•132 views

This document discusses using data science techniques to build a content recommendation engine and predict customer preferences for the clothing company Betabrand. It analyzes data on Betabrand product descriptions, Facebook likes of customers, and uses vectorization, clustering, and cross-tabulation to develop recommendations. Key findings include interesting correlations between Facebook likes and product types, and that the content recommendation engine and cross-tabulation provided more useful results than k-means clustering with the available data. Recommendations include using the results for upselling and targeted advertising.

Data & Analytics

Understanding Betabrand
Using Data Science to Develop a Content
Recommendation Engine and Predict Customer
Preferences

Retail Clothing and Crowd-Funding Platform
• Highly Social
• Unusual Clothing Items
• Interesting Customer Base
• Goofy Marketing Campaigns

Problem: Can We Make Recommendations for
“Similar Items” based on Story Descriptions?

The Data
• Around 1600 Clothing Products and Story Descriptions for Each
Product in excel from Betabrand
• Facebook Likes (along with “Category” of Likes) of Users Who
Purchased Betabrand items
• Types of Analysis
• Content Recommendation Engine
• Cross tab in Pandas for raw counts
• K Means clustering analysis

Code: Remove Duplicates and Reset the Index

Code: Top Facebook Like Categories for
Executive Ponte Top

Dataframe: Top Facebook Like Categories for
Executive Ponte Top
• Science, Medical, Health
• School
• Shopping & Retail
• Education
• Society/Culture
• Professional Services
• Health/Wellness

Compare this to the Toaster!
• Aerospace/Defense
• Performance Venue
• Song, Concert, Record Label, Musical INstrument
• Food
• Computers/Internet Webiste
• Internet/Software

What About K-Means Clustering?
• Analyze Category of Facebook Likes to develop User Personas
• Map those Personas to Clothing Preferences

Plot to find the best number of clusters and
identify labels

Results
• K means had too small of a sample size to identify any meaningful
trends in persona clustering
• Content Recommendation Engine delivered useful results once
duplicates were removed. It might be helpful to do additional NLP
analysis on designer profiles to remove items that are similar
because of the designer from the analysis
• Cross Tab- simplest analysis of raw counts, but perhaps most
informative

Impact and Future Directions
• Results of Content Recommendation Engine can be used for upsell
opportunities in Betabrand – identifying products that are similar to
suggest to users at the point of check out on the Betabrand website
• Pandas Crosstab can be used for better Facebook advertisement
targeting and we can better market certain products, via email
campaigns or other channels for certain customer segments
• K Means will need to be refined to identify meaningful user clusters for
collaborative filtering. In combination with other methods, Betabrand
could do some powerful targeting for key demographics and encourage
designers to design for certain audiences.

Recently uploaded

9654467111 Call Girls In Munirka Hotel And Home ServiceSapana Sha

EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster

Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ

From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck

Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics

How we prevented account sharing with MFAAndrei Kaleshka

9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort

Call Girls in Saket 99530🔝 56974 Escort Service9953056974 Low Rate Call Girls In Saket, Delhi NCR

Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics

20240419 - Measurecamp Amsterdam - SAM.pdfHuman37

IMA MSN - Medical Students Network (2).pptxdolaknnilon

INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman

2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSINGmarianagonzalez07

Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2

DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett

RadioAdProWritingCinderellabyButleri.pdfgstagge

Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly

Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster

MK KOMUNIKASI DATA (TI)komdat komdat.docxUnduhUnggah1

High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh

Recently uploaded (20)

9654467111 Call Girls In Munirka Hotel And Home Service

EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx

Advanced Machine Learning for Business Professionals

From idea to production in a day – Leveraging Azure ML and Streamlit to build...

Heart Disease Classification Report: A Data Analysis Project

How we prevented account sharing with MFA

9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service

Call Girls in Saket 99530🔝 56974 Escort Service

Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT

20240419 - Measurecamp Amsterdam - SAM.pdf

IMA MSN - Medical Students Network (2).pptx

INTERNSHIP ON PURBASHA COMPOSITE TEX LTD

2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING

Identifying Appropriate Test Statistics Involving Population Mean

DBA Basics: Getting Started with Performance Tuning.pdf

RadioAdProWritingCinderellabyButleri.pdf

Generative AI for Social Good at Open Data Science East 2024

Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024

MK KOMUNIKASI DATA (TI)komdat komdat.docx

High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...

Betabrand presentation

1. Understanding Betabrand Using Data Science to Develop a Content Recommendation Engine and Predict Customer Preferences

2. What is ?

3. Retail Clothing and Crowd-Funding Platform • Highly Social • Unusual Clothing Items • Interesting Customer Base • Goofy Marketing Campaigns

4. Every item has a unique “Story”

5. Problem: Can We Make Recommendations for “Similar Items” based on Story Descriptions?

6. The Data • Around 1600 Clothing Products and Story Descriptions for Each Product in excel from Betabrand • Facebook Likes (along with “Category” of Likes) of Users Who Purchased Betabrand items • Types of Analysis • Content Recommendation Engine • Cross tab in Pandas for raw counts • K Means clustering analysis

7. Code: Remove Duplicates and Reset the Index

8. Use of Vectorizer and Cosine in NLP

9. Recommendations and Scores

10. Top Facebook Like Categories of Users Who Bought a Particular Product? • Use of Cross Tab • Use of Lift When Facebook Likes are too “generalized” across different products • Results were interesting! They were actually informative with different results per product. • Can see the how the Category of Facebook Like for a User who bought a product made sense based on the designer’s profile • *FUTURE EXPLORATION: NLP analysis of designer profile and whether story description text of product can be correlated with Facebook Likes? It would be interesting to examine the linkage.

11. Code: Top Facebook Like Categories for Executive Ponte Top

12. Dataframe: Top Facebook Like Categories for Executive Ponte Top • Science, Medical, Health • School • Shopping & Retail • Education • Society/Culture • Professional Services • Health/Wellness

13. Compare this to the Toaster! • Aerospace/Defense • Performance Venue • Song, Concert, Record Label, Musical INstrument • Food • Computers/Internet Webiste • Internet/Software

14. What About K-Means Clustering? • Analyze Category of Facebook Likes to develop User Personas • Map those Personas to Clothing Preferences

15. K Means Analysis

16. Plot to find the best number of clusters and identify labels

17. Identified 5 Clusters

18. Results • K means had too small of a sample size to identify any meaningful trends in persona clustering • Content Recommendation Engine delivered useful results once duplicates were removed. It might be helpful to do additional NLP analysis on designer profiles to remove items that are similar because of the designer from the analysis • Cross Tab- simplest analysis of raw counts, but perhaps most informative

19. Impact and Future Directions • Results of Content Recommendation Engine can be used for upsell opportunities in Betabrand – identifying products that are similar to suggest to users at the point of check out on the Betabrand website • Pandas Crosstab can be used for better Facebook advertisement targeting and we can better market certain products, via email campaigns or other channels for certain customer segments • K Means will need to be refined to identify meaningful user clusters for collaborative filtering. In combination with other methods, Betabrand could do some powerful targeting for key demographics and encourage designers to design for certain audiences.

Betabrand presentation

Recommended

Recommended

More Related Content

Similar to Betabrand presentation

Similar to Betabrand presentation (20)

Recently uploaded

Recently uploaded (20)

Betabrand presentation