SlideShare a Scribd company logo
Angie Wang
Customer Segmentation & Predictive Modeling Project
The Dataset
A rich data set with over 226,000 records, reflecting over 137,000 orders from 100,000 random
U.S. customers (representative of all customers) who make purchases between 12/15/2004 and
09/17/2012. Based upon every single order line, this database records a wide range of historical
sale information including customer ID number, zip code, order date, cancel date, shipping date,
price, cost, channel, payment method and etc.
IDENTIFY MEANINGFUL CUSTOMER SEGMENTS01
PROVIDE MANAGERIAL IMPLICATIONS02
DEVELOP PREDICTIVE MODELING OF TOTAL
PROFIT
03
PURPOSES OF THE
PROJECT
IDENTIFY MEANGINGFUL SEGMENTS
PART ONE
TO UNDERSTAND CUSTOMER BEHAVIORS TO HELP
BARNEYS GENERATE MORE PROFITS
ASSUMPTION IN CUSTOMER SEGMENTATION
The RFM Model identifies meaningful customer segments
Recency Frequency Monetary
How recently a customer
makes a purchase
How much a customer
spends
How often a customer
makes a purchase
METHODOLOGY
Segment a large sample of customers into distinct groups of homogeneous customers
Aggregate transactional
data to customer data
Identify critical variables
related to RFM Model
(Profit, the time between
the first and last orders,
and number of orders)
Use SPSS Hierarchical and
K-Means Cluster Analysis to
identify meaningful customer
segments
1 2 3
SPSS Customer Segmentation Results
100K customers are segmented into six clusters. Customers in Cluster 3 (Middle-class shoppers)
and Cluster 6 (Upper-class shoppers) are identified as the most valuable customers to Barneys
based on Frequency and Monetary in the RFM model.
Payment Method
1. Amex
2. Discover
3. MasterCard
4. Visa
Channel
1. Phone
2. In- Store
3. Website
Highlights of the SPSS Customer Segmentation
Results
Cluster 1,
$5,128,492.86
, 50%
Cluster 2,
$1,094,995.41
, 11%
Cluster 3,
$1,118,466.09
, 11%
Cluster 4,
$1,884,696.28
, 18%
Cluster 5,
$791,138.06 ,
8%
Cluster 6,
$176,439.06 ,
2%
TOTAL PROFIT DISTRIBUTION
CLUSTER 1,
$65.09 , 1%
CLUSTER 2,
$217.43 , 4%
CLUSTER 3,
$621.03 , 10%
CLUSTER 4,
$143.62 , 2%
CLUSTER 5,
$716.61 , 12%
CLUSTER 6,
$4,303.39 ,
71%
AVERAGE PROFIT OF EACH CUSTOMER
Highlights of the SPSS Customer Segmentation
Results
1
1.5
2
2.5
3
CLUSTER 1
CLUSTER 2
CLUSTER 3
CLUSTER 4
CLUSTER 5
CLUSTER 6
Most Popular Shopping Channel
0
1
2
3
4
CLUSTER 1
CLUSTER 2
CLUSTER 3
CLUSTER 4
CLUSTER 5
CLUSTER 6
Most Popular Payment Method
1. Amex
2. Discover
3. MasterCard
4. Visa
1. Phone
2. In- Store
3. Website
Highlights of the SPSS Customer Segmentation
Results
0.0
0.9
3.7
3.0
1.7
3.2
0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0
CLUSTER 1
CLUSTER 2
CLUSTER 3
CLUSTER 4
CLUSTER 5
CLUSTER 6
ANNUAL PURCHASE FREQUENCY
MANAGERIAL IMPLICATIONS
PART TWO
The Most Valuable Customer Segments: Cluster 3 & 6
Managerial Implications for Cluster 3
Characteristics of an
average customer
• ≈ 4 purchases per year
• $621 in average profit
• Visa Payment
• In-Store shoppers
• 8 months in service
Recommendations
• Referral Program
• Cash-Back rewards
• Customer Knowledge -
feedback
Managerial Implications for Cluster 6
Characteristics of an
average customer
• ≈ 3 purchases per year
• $4303 in average profit
• American Express
• In-Store shoppers
• 45 months in service
Recommendations
• Increase Customer Lifetime
Value
- VIP in-store services , birthday gifts
• Increase Customer Influencer
Value
- Customer satisfaction , word-of-mouth
• Partner with American Express
PREDICTIVE MODELING OF TOTAL
PROFIT
PART THREE
The two selected customer segments with little similarity:
Cluster 1 & 2
METHODOLOGY
Develop Predictive Modeling to Forecast Total Profit for the Two Selected Customer Segments
Select variables that are
relevant to total profit
Run Multiple Linear Regression in SPSS
with Calibration sample (60% of random
sample from a cluster) and then validate the
predictive modeling with Validation sample
(40%), and identify outliers.
21
ASSUMPTIONS IN PREDICTIVE MODELING
TOTAL
PROFIT
NUMBER OF
ORDERS
NUMBER OF
ITEMS
ONLINE
PURCHASE
RETURN
QUANTITY
VISA
PAYMENT
Significance Criteria
If the significance level of a
variable is less than 0.05 in
Coefficients Table, that variable
will have impact on Total Profit.
.
Multicollinearity Criteria
If toleration is greater than 0.1
or 0.25 and VIF is less than 10
or 4 in Coefficients Table, there
is no multicollinearity effect.
.
.
PREDICTIVE MODELING OF TOTAL PROFIT – CLUSTER
1
TotalProfit = 106.424 – 76.543*OrderNumber + 6.079*ReturnQuantity + 23.202*Quantity– 1.270*WEB - 2.075*VISA
REGRESSION RESULTS FROM SPSS (Note: Web and Visa are dummy coded, 1 or 0)
24.9% of the variation in total profit in cluster 1 can be estimated by the selected five variables
(Number of orders, number of items, return quantity, online purchase, and visa payment). The remaining
75.1% is unexplained by this model, due to other variables.
01
02
03
• Negatively related to total profit
• The mode of OrderNumber
 one-time shopper
• Only 116 customers out of the
total 78,792 customers in
Cluster 1 shop at Barneys twice
NUMBER OF ORDERS (-)
VISA PAYMENT (-)
ONLINE PURCHASE (-
)
CONSUMER INSIGHTS – CLUSTER 1
• A customer who places orders
online creates $1.27 less in total
profit than through other
channels (in-store and by phone)
• Only customers in Cluster 1
prefer online shopping.
• No human interaction in-store or
over the phone  less profit
• A customer who places orders
by Visa creates $2.075 less in
total profit than by other
payment methods.
• Fees are charged by Visa
provider.
TotalProfit = 106.424 – 76.543*OrderNumber + 6.079*ReturnQuantity + 23.202*Quantity– 1.270*WEB - 2.075*VISA
PREDICTIVE MODELING OF TOTAL PROFIT – CLUSTER
2REGRESSION RESULTS FROM SPSS
The two variables, WEB and VISA, are deleted from the regression model stepwise because the significance
levels are greater than 0.05, indicating no significant relationships with total profit.
NEW PREDICTIVE MODEL OF TOTAL PROFIT – CLUSTER
2REGRESSION RESULTS FROM SPSS
40.9% of variation in total profit in cluster 2 can be estimated by the THREE independent variables
(Number of orders, number of items and return quantity). The remaining 59.1% is unexplained by this model,
due to other variables.
TotalProfit = 42.822+14.859*OrderNumber+23.193*Quantity+8.210*ReturnQuantity
01
02 (SAME CONSUMER INSIGHTS IN CLUSTER 1)
03 (SAME IN CLUSTER
1)
• Positively related to total profit.
• An average customer in Cluster 2
shops at Barneys for 3 times over
45 months in service.
 greater than one time in Cluster 1
• For each additional order, total
profit increases by $14.859.
NUMBER OF ORDERS (+)
RETURN QUANTITY (+)
NUMBER OF ITEMS
(+)
CONSUMER INSIGHTS – CLUSTER 2
• Positively related to total profit.
• The more items a customer
purchases
 higher total profit
• For every additional item that a
customer purchases, there is an
increase of $23.193 in total
profit.
• Positively related to total profit.
• The free return policy and 100%
money back guarantee
 Customers buy more and return more
 Higher total profit
TotalProfit = 42.822+14.859*OrderNumber+23.193*Quantity+8.210*ReturnQuantity
THANK YOU
Angie Wang

More Related Content

Viewers also liked

[Startup Nations Summit 2014] Competition - Ireland
[Startup Nations Summit 2014] Competition - Ireland[Startup Nations Summit 2014] Competition - Ireland
[Startup Nations Summit 2014] Competition - Ireland
StartupNations
 
The Content Continuum, part two
The Content Continuum, part twoThe Content Continuum, part two
The Content Continuum, part two
agencyside
 
How to Build a Winning Campaign with Strategic Content - Target X CRM Summit ...
How to Build a Winning Campaign with Strategic Content - Target X CRM Summit ...How to Build a Winning Campaign with Strategic Content - Target X CRM Summit ...
How to Build a Winning Campaign with Strategic Content - Target X CRM Summit ...
Converge Consulting
 
Data Quality Services in SQL Server 2012
Data Quality Services in SQL Server 2012Data Quality Services in SQL Server 2012
Data Quality Services in SQL Server 2012
Stéphane Fréchette
 
Customer Segmentation Masterclass - IIR 2010
Customer Segmentation Masterclass - IIR 2010Customer Segmentation Masterclass - IIR 2010
Customer Segmentation Masterclass - IIR 2010
Vladimir Dimitroff
 
Message in a Digital Bottle: Finding the Right Audience By Marla Johnson - #S...
Message in a Digital Bottle: Finding the Right Audience By Marla Johnson - #S...Message in a Digital Bottle: Finding the Right Audience By Marla Johnson - #S...
Message in a Digital Bottle: Finding the Right Audience By Marla Johnson - #S...
Search Engine Journal
 
Scenario Planning Linking Scenarios to Strategy
Scenario Planning Linking Scenarios to StrategyScenario Planning Linking Scenarios to Strategy
Scenario Planning Linking Scenarios to Strategy
Awais e Siraj
 
#INBOUND13 - Harnessing the Power of Segmentation for Marketing Results
#INBOUND13 - Harnessing the Power of Segmentation for Marketing Results#INBOUND13 - Harnessing the Power of Segmentation for Marketing Results
#INBOUND13 - Harnessing the Power of Segmentation for Marketing Results
Ellie Mirman
 
Social customer segmentation overcomes the limits of traditional segmentation
Social customer segmentation overcomes the limits of traditional segmentationSocial customer segmentation overcomes the limits of traditional segmentation
Social customer segmentation overcomes the limits of traditional segmentation
tracx
 
Guide To Segmentation
Guide To SegmentationGuide To Segmentation
Guide To Segmentation
The House of Marketing
 
A Smarter Customer Segmentation Approach for Utilities
A Smarter Customer Segmentation Approach for UtilitiesA Smarter Customer Segmentation Approach for Utilities
A Smarter Customer Segmentation Approach for Utilities
Black & Veatch
 
Customer Segmentation
Customer SegmentationCustomer Segmentation
Customer Segmentation
Tuhin AI Advisory
 
Object Based Image Analysis
Object Based Image Analysis Object Based Image Analysis
Object Based Image Analysis
Kabir Uddin
 
Babelfish: Ad Agency Model Disruption 15 3 09 J
Babelfish: Ad Agency Model Disruption 15 3 09 JBabelfish: Ad Agency Model Disruption 15 3 09 J
Babelfish: Ad Agency Model Disruption 15 3 09 J
Brian Crotty
 
Go-to-Market Customer Segmentation
Go-to-Market Customer SegmentationGo-to-Market Customer Segmentation
Go-to-Market Customer Segmentation
Ashley Greene
 
Customer Event Hub - the modern Customer 360° view
Customer Event Hub - the modern Customer 360° viewCustomer Event Hub - the modern Customer 360° view
Customer Event Hub - the modern Customer 360° view
Swiss Data Forum Swiss Data Forum
 
How to leverage loyalty data to generate deep customer segmentation?
How to leverage loyalty data to generate deep customer segmentation?How to leverage loyalty data to generate deep customer segmentation?
How to leverage loyalty data to generate deep customer segmentation?
Comarch
 
segmentation, positioning and targeting
segmentation, positioning and targetingsegmentation, positioning and targeting
segmentation, positioning and targeting
Monika Maciuliene
 

Viewers also liked (18)

[Startup Nations Summit 2014] Competition - Ireland
[Startup Nations Summit 2014] Competition - Ireland[Startup Nations Summit 2014] Competition - Ireland
[Startup Nations Summit 2014] Competition - Ireland
 
The Content Continuum, part two
The Content Continuum, part twoThe Content Continuum, part two
The Content Continuum, part two
 
How to Build a Winning Campaign with Strategic Content - Target X CRM Summit ...
How to Build a Winning Campaign with Strategic Content - Target X CRM Summit ...How to Build a Winning Campaign with Strategic Content - Target X CRM Summit ...
How to Build a Winning Campaign with Strategic Content - Target X CRM Summit ...
 
Data Quality Services in SQL Server 2012
Data Quality Services in SQL Server 2012Data Quality Services in SQL Server 2012
Data Quality Services in SQL Server 2012
 
Customer Segmentation Masterclass - IIR 2010
Customer Segmentation Masterclass - IIR 2010Customer Segmentation Masterclass - IIR 2010
Customer Segmentation Masterclass - IIR 2010
 
Message in a Digital Bottle: Finding the Right Audience By Marla Johnson - #S...
Message in a Digital Bottle: Finding the Right Audience By Marla Johnson - #S...Message in a Digital Bottle: Finding the Right Audience By Marla Johnson - #S...
Message in a Digital Bottle: Finding the Right Audience By Marla Johnson - #S...
 
Scenario Planning Linking Scenarios to Strategy
Scenario Planning Linking Scenarios to StrategyScenario Planning Linking Scenarios to Strategy
Scenario Planning Linking Scenarios to Strategy
 
#INBOUND13 - Harnessing the Power of Segmentation for Marketing Results
#INBOUND13 - Harnessing the Power of Segmentation for Marketing Results#INBOUND13 - Harnessing the Power of Segmentation for Marketing Results
#INBOUND13 - Harnessing the Power of Segmentation for Marketing Results
 
Social customer segmentation overcomes the limits of traditional segmentation
Social customer segmentation overcomes the limits of traditional segmentationSocial customer segmentation overcomes the limits of traditional segmentation
Social customer segmentation overcomes the limits of traditional segmentation
 
Guide To Segmentation
Guide To SegmentationGuide To Segmentation
Guide To Segmentation
 
A Smarter Customer Segmentation Approach for Utilities
A Smarter Customer Segmentation Approach for UtilitiesA Smarter Customer Segmentation Approach for Utilities
A Smarter Customer Segmentation Approach for Utilities
 
Customer Segmentation
Customer SegmentationCustomer Segmentation
Customer Segmentation
 
Object Based Image Analysis
Object Based Image Analysis Object Based Image Analysis
Object Based Image Analysis
 
Babelfish: Ad Agency Model Disruption 15 3 09 J
Babelfish: Ad Agency Model Disruption 15 3 09 JBabelfish: Ad Agency Model Disruption 15 3 09 J
Babelfish: Ad Agency Model Disruption 15 3 09 J
 
Go-to-Market Customer Segmentation
Go-to-Market Customer SegmentationGo-to-Market Customer Segmentation
Go-to-Market Customer Segmentation
 
Customer Event Hub - the modern Customer 360° view
Customer Event Hub - the modern Customer 360° viewCustomer Event Hub - the modern Customer 360° view
Customer Event Hub - the modern Customer 360° view
 
How to leverage loyalty data to generate deep customer segmentation?
How to leverage loyalty data to generate deep customer segmentation?How to leverage loyalty data to generate deep customer segmentation?
How to leverage loyalty data to generate deep customer segmentation?
 
segmentation, positioning and targeting
segmentation, positioning and targetingsegmentation, positioning and targeting
segmentation, positioning and targeting
 

Similar to Customer Segmentation and Predictive Modeling

Business and Data Analytics Collaborative April Meetup
Business and Data Analytics Collaborative April MeetupBusiness and Data Analytics Collaborative April Meetup
Business and Data Analytics Collaborative April Meetup
Ken Tucker
 
TransactionBasedAnalytics2010
TransactionBasedAnalytics2010TransactionBasedAnalytics2010
TransactionBasedAnalytics2010
Vijay Desai
 
Retail Energy Analytics_Marketelligent
Retail Energy Analytics_MarketelligentRetail Energy Analytics_Marketelligent
Retail Energy Analytics_Marketelligent
Marketelligent
 
Payments Pulse Survey: Small Business Edition
Payments Pulse Survey: Small Business EditionPayments Pulse Survey: Small Business Edition
Payments Pulse Survey: Small Business Edition
Payments Canada
 
19. saa s kp is and profitability analysis (deb sahoo)
19. saa s kp is and profitability analysis (deb sahoo)19. saa s kp is and profitability analysis (deb sahoo)
19. saa s kp is and profitability analysis (deb sahoo)
Deb Sahoo, MBA(Finance), MS(EE), BTech(EE),
 
Credit risk scoring model final
Credit risk scoring model finalCredit risk scoring model final
Credit risk scoring model final
Ritu Sarkar
 
Data mining to improve e-mail marketing
Data mining to improve e-mail marketing Data mining to improve e-mail marketing
Data mining to improve e-mail marketing
Ritu Sarkar
 
B2B Sales Hacks
B2B Sales HacksB2B Sales Hacks
B2B Sales Hacks
Anthony Brown
 
Fraudulent credit card cash-out detection On Graphs
Fraudulent credit card cash-out detection On GraphsFraudulent credit card cash-out detection On Graphs
Fraudulent credit card cash-out detection On Graphs
TigerGraph
 
Revenue Assurance Industry Update - Webinar by Dr. Gadi Solotorevsky, cVidya'...
Revenue Assurance Industry Update - Webinar by Dr. Gadi Solotorevsky, cVidya'...Revenue Assurance Industry Update - Webinar by Dr. Gadi Solotorevsky, cVidya'...
Revenue Assurance Industry Update - Webinar by Dr. Gadi Solotorevsky, cVidya'...
cVidya Networks
 
httpswww.azed.govoelaselpsUse this to see the English Lang.docx
httpswww.azed.govoelaselpsUse this to see the English Lang.docxhttpswww.azed.govoelaselpsUse this to see the English Lang.docx
httpswww.azed.govoelaselpsUse this to see the English Lang.docx
pooleavelina
 
4/11
4/114/11
4/11
SWKTECH
 
Final SAS Day 2015 Poster
Final SAS Day 2015 PosterFinal SAS Day 2015 Poster
Final SAS Day 2015 Poster
Reuben Hilliard
 
Customer analytics
Customer analyticsCustomer analytics
Customer analytics
Karl Melo
 
Socket presentation 2014
Socket presentation 2014Socket presentation 2014
Socket presentation 2014
Company Spotlight
 
Why Size Matters in Merchant Onboarding
Why Size Matters in Merchant OnboardingWhy Size Matters in Merchant Onboarding
Why Size Matters in Merchant Onboarding
Provenir
 
Case Study on Data Analytics with given Dataset (Biswadeep Ghosh Hazra) - [Ha...
Case Study on Data Analytics with given Dataset (Biswadeep Ghosh Hazra) - [Ha...Case Study on Data Analytics with given Dataset (Biswadeep Ghosh Hazra) - [Ha...
Case Study on Data Analytics with given Dataset (Biswadeep Ghosh Hazra) - [Ha...
Biswadeep Ghosh Hazra
 
Mining Loyalty Card Data for Increased Competitiveness: Case of a leading Ret...
Mining Loyalty Card Data for Increased Competitiveness: Case of a leading Ret...Mining Loyalty Card Data for Increased Competitiveness: Case of a leading Ret...
Mining Loyalty Card Data for Increased Competitiveness: Case of a leading Ret...
Atish Chattopadhyay
 
How to Maximize Discount Capture
How to Maximize Discount CaptureHow to Maximize Discount Capture
How to Maximize Discount Capture
Taulia
 
Product recommendation for Santander Bank customers
Product recommendation for Santander Bank customersProduct recommendation for Santander Bank customers
Product recommendation for Santander Bank customers
Sumit Saini
 

Similar to Customer Segmentation and Predictive Modeling (20)

Business and Data Analytics Collaborative April Meetup
Business and Data Analytics Collaborative April MeetupBusiness and Data Analytics Collaborative April Meetup
Business and Data Analytics Collaborative April Meetup
 
TransactionBasedAnalytics2010
TransactionBasedAnalytics2010TransactionBasedAnalytics2010
TransactionBasedAnalytics2010
 
Retail Energy Analytics_Marketelligent
Retail Energy Analytics_MarketelligentRetail Energy Analytics_Marketelligent
Retail Energy Analytics_Marketelligent
 
Payments Pulse Survey: Small Business Edition
Payments Pulse Survey: Small Business EditionPayments Pulse Survey: Small Business Edition
Payments Pulse Survey: Small Business Edition
 
19. saa s kp is and profitability analysis (deb sahoo)
19. saa s kp is and profitability analysis (deb sahoo)19. saa s kp is and profitability analysis (deb sahoo)
19. saa s kp is and profitability analysis (deb sahoo)
 
Credit risk scoring model final
Credit risk scoring model finalCredit risk scoring model final
Credit risk scoring model final
 
Data mining to improve e-mail marketing
Data mining to improve e-mail marketing Data mining to improve e-mail marketing
Data mining to improve e-mail marketing
 
B2B Sales Hacks
B2B Sales HacksB2B Sales Hacks
B2B Sales Hacks
 
Fraudulent credit card cash-out detection On Graphs
Fraudulent credit card cash-out detection On GraphsFraudulent credit card cash-out detection On Graphs
Fraudulent credit card cash-out detection On Graphs
 
Revenue Assurance Industry Update - Webinar by Dr. Gadi Solotorevsky, cVidya'...
Revenue Assurance Industry Update - Webinar by Dr. Gadi Solotorevsky, cVidya'...Revenue Assurance Industry Update - Webinar by Dr. Gadi Solotorevsky, cVidya'...
Revenue Assurance Industry Update - Webinar by Dr. Gadi Solotorevsky, cVidya'...
 
httpswww.azed.govoelaselpsUse this to see the English Lang.docx
httpswww.azed.govoelaselpsUse this to see the English Lang.docxhttpswww.azed.govoelaselpsUse this to see the English Lang.docx
httpswww.azed.govoelaselpsUse this to see the English Lang.docx
 
4/11
4/114/11
4/11
 
Final SAS Day 2015 Poster
Final SAS Day 2015 PosterFinal SAS Day 2015 Poster
Final SAS Day 2015 Poster
 
Customer analytics
Customer analyticsCustomer analytics
Customer analytics
 
Socket presentation 2014
Socket presentation 2014Socket presentation 2014
Socket presentation 2014
 
Why Size Matters in Merchant Onboarding
Why Size Matters in Merchant OnboardingWhy Size Matters in Merchant Onboarding
Why Size Matters in Merchant Onboarding
 
Case Study on Data Analytics with given Dataset (Biswadeep Ghosh Hazra) - [Ha...
Case Study on Data Analytics with given Dataset (Biswadeep Ghosh Hazra) - [Ha...Case Study on Data Analytics with given Dataset (Biswadeep Ghosh Hazra) - [Ha...
Case Study on Data Analytics with given Dataset (Biswadeep Ghosh Hazra) - [Ha...
 
Mining Loyalty Card Data for Increased Competitiveness: Case of a leading Ret...
Mining Loyalty Card Data for Increased Competitiveness: Case of a leading Ret...Mining Loyalty Card Data for Increased Competitiveness: Case of a leading Ret...
Mining Loyalty Card Data for Increased Competitiveness: Case of a leading Ret...
 
How to Maximize Discount Capture
How to Maximize Discount CaptureHow to Maximize Discount Capture
How to Maximize Discount Capture
 
Product recommendation for Santander Bank customers
Product recommendation for Santander Bank customersProduct recommendation for Santander Bank customers
Product recommendation for Santander Bank customers
 

Customer Segmentation and Predictive Modeling

  • 1. Angie Wang Customer Segmentation & Predictive Modeling Project
  • 2. The Dataset A rich data set with over 226,000 records, reflecting over 137,000 orders from 100,000 random U.S. customers (representative of all customers) who make purchases between 12/15/2004 and 09/17/2012. Based upon every single order line, this database records a wide range of historical sale information including customer ID number, zip code, order date, cancel date, shipping date, price, cost, channel, payment method and etc.
  • 3. IDENTIFY MEANINGFUL CUSTOMER SEGMENTS01 PROVIDE MANAGERIAL IMPLICATIONS02 DEVELOP PREDICTIVE MODELING OF TOTAL PROFIT 03 PURPOSES OF THE PROJECT
  • 4. IDENTIFY MEANGINGFUL SEGMENTS PART ONE TO UNDERSTAND CUSTOMER BEHAVIORS TO HELP BARNEYS GENERATE MORE PROFITS
  • 5. ASSUMPTION IN CUSTOMER SEGMENTATION The RFM Model identifies meaningful customer segments Recency Frequency Monetary How recently a customer makes a purchase How much a customer spends How often a customer makes a purchase
  • 6. METHODOLOGY Segment a large sample of customers into distinct groups of homogeneous customers Aggregate transactional data to customer data Identify critical variables related to RFM Model (Profit, the time between the first and last orders, and number of orders) Use SPSS Hierarchical and K-Means Cluster Analysis to identify meaningful customer segments 1 2 3
  • 7. SPSS Customer Segmentation Results 100K customers are segmented into six clusters. Customers in Cluster 3 (Middle-class shoppers) and Cluster 6 (Upper-class shoppers) are identified as the most valuable customers to Barneys based on Frequency and Monetary in the RFM model. Payment Method 1. Amex 2. Discover 3. MasterCard 4. Visa Channel 1. Phone 2. In- Store 3. Website
  • 8. Highlights of the SPSS Customer Segmentation Results Cluster 1, $5,128,492.86 , 50% Cluster 2, $1,094,995.41 , 11% Cluster 3, $1,118,466.09 , 11% Cluster 4, $1,884,696.28 , 18% Cluster 5, $791,138.06 , 8% Cluster 6, $176,439.06 , 2% TOTAL PROFIT DISTRIBUTION CLUSTER 1, $65.09 , 1% CLUSTER 2, $217.43 , 4% CLUSTER 3, $621.03 , 10% CLUSTER 4, $143.62 , 2% CLUSTER 5, $716.61 , 12% CLUSTER 6, $4,303.39 , 71% AVERAGE PROFIT OF EACH CUSTOMER
  • 9. Highlights of the SPSS Customer Segmentation Results 1 1.5 2 2.5 3 CLUSTER 1 CLUSTER 2 CLUSTER 3 CLUSTER 4 CLUSTER 5 CLUSTER 6 Most Popular Shopping Channel 0 1 2 3 4 CLUSTER 1 CLUSTER 2 CLUSTER 3 CLUSTER 4 CLUSTER 5 CLUSTER 6 Most Popular Payment Method 1. Amex 2. Discover 3. MasterCard 4. Visa 1. Phone 2. In- Store 3. Website
  • 10. Highlights of the SPSS Customer Segmentation Results 0.0 0.9 3.7 3.0 1.7 3.2 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0 CLUSTER 1 CLUSTER 2 CLUSTER 3 CLUSTER 4 CLUSTER 5 CLUSTER 6 ANNUAL PURCHASE FREQUENCY
  • 11. MANAGERIAL IMPLICATIONS PART TWO The Most Valuable Customer Segments: Cluster 3 & 6
  • 12. Managerial Implications for Cluster 3 Characteristics of an average customer • ≈ 4 purchases per year • $621 in average profit • Visa Payment • In-Store shoppers • 8 months in service Recommendations • Referral Program • Cash-Back rewards • Customer Knowledge - feedback
  • 13. Managerial Implications for Cluster 6 Characteristics of an average customer • ≈ 3 purchases per year • $4303 in average profit • American Express • In-Store shoppers • 45 months in service Recommendations • Increase Customer Lifetime Value - VIP in-store services , birthday gifts • Increase Customer Influencer Value - Customer satisfaction , word-of-mouth • Partner with American Express
  • 14. PREDICTIVE MODELING OF TOTAL PROFIT PART THREE The two selected customer segments with little similarity: Cluster 1 & 2
  • 15. METHODOLOGY Develop Predictive Modeling to Forecast Total Profit for the Two Selected Customer Segments Select variables that are relevant to total profit Run Multiple Linear Regression in SPSS with Calibration sample (60% of random sample from a cluster) and then validate the predictive modeling with Validation sample (40%), and identify outliers. 21
  • 16. ASSUMPTIONS IN PREDICTIVE MODELING TOTAL PROFIT NUMBER OF ORDERS NUMBER OF ITEMS ONLINE PURCHASE RETURN QUANTITY VISA PAYMENT Significance Criteria If the significance level of a variable is less than 0.05 in Coefficients Table, that variable will have impact on Total Profit. . Multicollinearity Criteria If toleration is greater than 0.1 or 0.25 and VIF is less than 10 or 4 in Coefficients Table, there is no multicollinearity effect. . .
  • 17. PREDICTIVE MODELING OF TOTAL PROFIT – CLUSTER 1 TotalProfit = 106.424 – 76.543*OrderNumber + 6.079*ReturnQuantity + 23.202*Quantity– 1.270*WEB - 2.075*VISA REGRESSION RESULTS FROM SPSS (Note: Web and Visa are dummy coded, 1 or 0) 24.9% of the variation in total profit in cluster 1 can be estimated by the selected five variables (Number of orders, number of items, return quantity, online purchase, and visa payment). The remaining 75.1% is unexplained by this model, due to other variables.
  • 18. 01 02 03 • Negatively related to total profit • The mode of OrderNumber  one-time shopper • Only 116 customers out of the total 78,792 customers in Cluster 1 shop at Barneys twice NUMBER OF ORDERS (-) VISA PAYMENT (-) ONLINE PURCHASE (- ) CONSUMER INSIGHTS – CLUSTER 1 • A customer who places orders online creates $1.27 less in total profit than through other channels (in-store and by phone) • Only customers in Cluster 1 prefer online shopping. • No human interaction in-store or over the phone  less profit • A customer who places orders by Visa creates $2.075 less in total profit than by other payment methods. • Fees are charged by Visa provider. TotalProfit = 106.424 – 76.543*OrderNumber + 6.079*ReturnQuantity + 23.202*Quantity– 1.270*WEB - 2.075*VISA
  • 19. PREDICTIVE MODELING OF TOTAL PROFIT – CLUSTER 2REGRESSION RESULTS FROM SPSS The two variables, WEB and VISA, are deleted from the regression model stepwise because the significance levels are greater than 0.05, indicating no significant relationships with total profit.
  • 20. NEW PREDICTIVE MODEL OF TOTAL PROFIT – CLUSTER 2REGRESSION RESULTS FROM SPSS 40.9% of variation in total profit in cluster 2 can be estimated by the THREE independent variables (Number of orders, number of items and return quantity). The remaining 59.1% is unexplained by this model, due to other variables. TotalProfit = 42.822+14.859*OrderNumber+23.193*Quantity+8.210*ReturnQuantity
  • 21. 01 02 (SAME CONSUMER INSIGHTS IN CLUSTER 1) 03 (SAME IN CLUSTER 1) • Positively related to total profit. • An average customer in Cluster 2 shops at Barneys for 3 times over 45 months in service.  greater than one time in Cluster 1 • For each additional order, total profit increases by $14.859. NUMBER OF ORDERS (+) RETURN QUANTITY (+) NUMBER OF ITEMS (+) CONSUMER INSIGHTS – CLUSTER 2 • Positively related to total profit. • The more items a customer purchases  higher total profit • For every additional item that a customer purchases, there is an increase of $23.193 in total profit. • Positively related to total profit. • The free return policy and 100% money back guarantee  Customers buy more and return more  Higher total profit TotalProfit = 42.822+14.859*OrderNumber+23.193*Quantity+8.210*ReturnQuantity

Editor's Notes

  1. Recency is not used because the dataset is between 2004 and 2012.
  2. Annual purchase frequency for an average customer = (total number of orders /the time between the first and last orders) * 12
  3. Multicollinearity: two or more predictor variables are correlated.
  4. No multicollinearity problem is detected. \
  5. Order Number (Negative):
  6. No multicollinearity problem is detected.