SlideShare a Scribd company logo
1 of 11
IMPROVING RETAIL SALES CAMPAIGN PROFITABILITY
FINAL CAPSTONE PROJECT : DATA ANALYSIS , FINDINGS AND RECOMMENDATIONS
SHOUNAK MONDAL
POSTGRADUATE DIPLOMA IN DATA SCIENCE, EMERITUS AND COLUMBIA UNIVERSITY
CONTENTS 1 BACKGROUND
2 OBJECTIVES
3 APPROACH
4 ANALYSIS RESULTS
5 RECOMMENDATIONS
1. The Customer is a B2B retailer for office supplies, office electronics and office furniture
2. A marketing campaign was executed to ~16k of its customers
3. Detailed campaign target data and results are available including key data per customer :
a) Resulting Sales
b) Historical sales
c) Type of previous purchase
d) Communication channel preferences
e) Size of target company
f) Language
Background
Analyze the campaign results and provide insights and recommendations on :
1. Which type of customers responded positively to the campaign ?
2. What can the customer be doing for better future campaign performance ?
3. How much can be the financial gains of the improved campaign strategies ?
Objective
Approach
Exploratory Data
Analysis
• Removed outliers
in Number of Year
Prior Transaction
• Drop 10 rows
which has null
values for No of
Employees and
type of previous
purchase
• Impute 442 rows
with Last
Transaction
channel as
Unknown
• Impute 4470 rows
with language as
unknown
• Remove negative
sales and
Historical sales
volume rows
Data Transformation
& Analsysis
• Transform
categorical values
to binary coded
values
• Observe
correlation
between features
and remove highly
corelated features
Model Building
• Build a robust
classification
model with most
influential features
to predict sales or
no sales
• Build a regression
model to predict
sales with the
most influential
features and
predict sales
• Using the
probability of
sales and the
amount of sales
predicted
calculate
profitability given,
gross margin , the
marketing and
transaction costs
Gains / Lift Chart
• Classify the
customers into
deciles and show
profits per decile
• Show profitable
deciles and
expected gains
over random
targeting
• Inner Join based
on customer ID, the
probabilities from
classifier and
predicted sales
sales from linear
regressor model
to formula to find
profits.
• Sort profits and
split by deciles
Recommendations
• Show Which type
of customers
responded
positively to the
campaign ?
• Show What can
XYZ Ltd be doing
for better future
campaign
performance ?
• Show How much
can be the
financial gains of
the improved
campaign
strategies ?
1. Those who have purchased before ( particularly 16-22
Transactions in prior year )
2. Have had historical sales of up to $ 720,000+ ( 75% of the
purchases )
3. Made purchases in the year 1993 and 1994 – these 2 years
seems to have created long term loyal customers
Typical Purchasers are…
0
100
200
300
400
500
600
700
1926
1950
1954
1958
1962
1966
1970
1974
1978
1982
1986
1990
1994
1998
2002
2006
2010
2014
2018
Year of First Purchase
Coefficients–DegreeofInfluence
Numberof
purchaserecords
Year of Purchase
Classification Model was built and tested on ~8000 records test data. The model can be now
used to identify, predict and target positive sales candidates for future campaigns
To build a robust model, Random Forest Classifier was built by training it on ~8000 campaign data which yielded the following :
1. Prediction accuracy score : 85% - Ability of the classifier to make correct predictions
2. Precision score: 77% - The ability of the classifier to predict true sales accurately. Cost of low score is wasted marketing cost.
3. Recall score:65% - The ability of the classifier to find all the true sales. Cost of low score is missed revenue opportunities.
4. f1 Score 70% - balance between the precision and the recall
5. Confusion Matrix
NO YES
NO 5400 438
YES 788 1453
Predicted Sale ( Yes or No )
Actual
Sale
Now that we know targeting which customers will result in sales, next we built a model to predict
“Amount of Sales” for purchasing customers and tested it on ~4384 records positive sales test data
Linear Regression was used which yielded the following :
1. Linear Regression Fit score Training data : 77%
2. Linear Regression Fit score Test data : 75%
3. Root Mean Squared Error of prediction : 559
4. R squared score 0.75 ( degree to which the model
captures and explains the variance of the data )
Findings
1. Size of the company has the largest influence on the
sales amount : larger the company, larger the sales
amounts
2. Previous purchase of office furniture and computer
equipment has next significant influence in amount of
sales.
Coefficients–DegreeofInfluence
Gains Chart for ~1120 records test data representing a future campaign
Total Actual Profit per
customer of the campaign that was executed
( 16,000 records )
The lift chart is built from about 1120 records from test
data for linear regression and same 1120 records from
the classifier by using "inner" join on customer number
i.e common records between the two dataframes in the
linear reg test and classifier test
Deciles
Number of
customers per
decile
Actual Profitability
per customer
Lift over
average
Total Profit % of Profit
Incr Proj Profit
100k Customer base
Total Proj Profit
100k Customer
base
Cuml Incr Profit
100k Customer
base
Cuml Total
Profit
100k Customer
base
(1009.0, 1121.0] 112 504 497 55,630 76% 4,967 5,037 4,967 5,037
(897.0, 1009.0] 112 271 264 29,535 40% 2,637 2,707 7,604 7,744
(785.0, 897.0] 112 75 68 7,607 10% 679 749 8,283 8,493
(673.0, 785.0] 112 21 14 1,518 2% 135 205 8,419 8,699
(561.0, 673.0] 112 -1 -8 (947) -1% (85) (15) 8,334 8,684
(449.0, 561.0] 112 -12 -19 (2,113) -3% (189) (119) 8,145 8,565
(337.0, 449.0] 112 -20 -27 (2,994) -4% (267) (197) 7,878 8,368
(225.0, 337.0] 112 -28 -35 (3,959) -5% (354) (284) 7,525 8,085
(113.0, 225.0] 112 -37 -44 (4,944) -7% (441) (371) 7,083 7,713
(0.999, 113.0] 112 -49 -56 (6,268) -9% (560) (490) 6,523 7,223
Total 1120 7.00 73,063 100%
Recommendations
1. Instead of random targeting of customer base, use the prediction model to target only first 4 deciles type
customers for maximum profitability for future campaigns 1120 record test data.
2. Maximize profitability further by using lower cost channels that reach the above target of customers
effectively since marketing channels showed little or no influence on sales
3. Replicate what was done in the year 1993 and 1994, as it seems to have created long loyal customers
4. Use the model to predict sales, profitability, and expected Return on Investment and leverage it for a more
fact based budget requirements for decision by management / budget approver for next campaigns
THANK YOU

More Related Content

Similar to Improving profitability of campaigns through data science

Marketelligent Capabilities & Offerings for Sales Analytics
Marketelligent Capabilities & Offerings for Sales AnalyticsMarketelligent Capabilities & Offerings for Sales Analytics
Marketelligent Capabilities & Offerings for Sales Analytics
Marketelligent
 
eSUG-April 5-final
eSUG-April 5-finaleSUG-April 5-final
eSUG-April 5-final
Shrey Tandon
 
CRM Analytics_Marketelligent
CRM Analytics_MarketelligentCRM Analytics_Marketelligent
CRM Analytics_Marketelligent
Marketelligent
 

Similar to Improving profitability of campaigns through data science (20)

Marketelligent Capabilities & Offerings for Sales Analytics
Marketelligent Capabilities & Offerings for Sales AnalyticsMarketelligent Capabilities & Offerings for Sales Analytics
Marketelligent Capabilities & Offerings for Sales Analytics
 
Revenue Planning: How Planning for Revenue Impacts a Budget's Effectiveness
Revenue Planning: How Planning for Revenue Impacts a Budget's EffectivenessRevenue Planning: How Planning for Revenue Impacts a Budget's Effectiveness
Revenue Planning: How Planning for Revenue Impacts a Budget's Effectiveness
 
MARKETING INTELLIGENCE_Marketing ROI, come aumentare i ritorni sugli investim...
MARKETING INTELLIGENCE_Marketing ROI, come aumentare i ritorni sugli investim...MARKETING INTELLIGENCE_Marketing ROI, come aumentare i ritorni sugli investim...
MARKETING INTELLIGENCE_Marketing ROI, come aumentare i ritorni sugli investim...
 
B2B Sales Hacks
B2B Sales HacksB2B Sales Hacks
B2B Sales Hacks
 
eSUG-April 5-final
eSUG-April 5-finaleSUG-April 5-final
eSUG-April 5-final
 
Marketing Mix Models In a Changing Environment
Marketing Mix Models In a Changing EnvironmentMarketing Mix Models In a Changing Environment
Marketing Mix Models In a Changing Environment
 
From Zero to Hero: Building the Internal Reputation of Customer Success
From Zero to Hero: Building the Internal Reputation of Customer SuccessFrom Zero to Hero: Building the Internal Reputation of Customer Success
From Zero to Hero: Building the Internal Reputation of Customer Success
 
Building a Successful Cloud and Mobility Practice
Building a Successful Cloud and Mobility PracticeBuilding a Successful Cloud and Mobility Practice
Building a Successful Cloud and Mobility Practice
 
Trivia marketing solutions services v14 uk
Trivia marketing solutions services v14 ukTrivia marketing solutions services v14 uk
Trivia marketing solutions services v14 uk
 
Business Analytics Forum - Sheffield November 2018
Business Analytics Forum - Sheffield November 2018Business Analytics Forum - Sheffield November 2018
Business Analytics Forum - Sheffield November 2018
 
Introduction to marketing analytics
Introduction to marketing analyticsIntroduction to marketing analytics
Introduction to marketing analytics
 
Database Marketing, part two: data enhancement, analytics, and attribution
Database Marketing, part two: data enhancement, analytics, and attribution Database Marketing, part two: data enhancement, analytics, and attribution
Database Marketing, part two: data enhancement, analytics, and attribution
 
Intent Based Segmentation by CleverTap
Intent Based Segmentation by CleverTapIntent Based Segmentation by CleverTap
Intent Based Segmentation by CleverTap
 
Delivering Personalized Experiences using the Power of Data
Delivering Personalized Experiences using the Power of Data Delivering Personalized Experiences using the Power of Data
Delivering Personalized Experiences using the Power of Data
 
Wooing the Best Bank Deposit Customers
Wooing the Best Bank Deposit CustomersWooing the Best Bank Deposit Customers
Wooing the Best Bank Deposit Customers
 
Sales Transformation: The Role of Sales Strategy & Operations, Dow Jones & Co...
Sales Transformation: The Role of Sales Strategy & Operations, Dow Jones & Co...Sales Transformation: The Role of Sales Strategy & Operations, Dow Jones & Co...
Sales Transformation: The Role of Sales Strategy & Operations, Dow Jones & Co...
 
5 metrics to strengthen your multichannel sales strategy
5 metrics to strengthen your multichannel sales strategy5 metrics to strengthen your multichannel sales strategy
5 metrics to strengthen your multichannel sales strategy
 
Segmenting the Market to Find Areas of Opportunity
Segmenting the Market to Find Areas of OpportunitySegmenting the Market to Find Areas of Opportunity
Segmenting the Market to Find Areas of Opportunity
 
CRM Analytics_Marketelligent
CRM Analytics_MarketelligentCRM Analytics_Marketelligent
CRM Analytics_Marketelligent
 
Assignment johnson
Assignment johnsonAssignment johnson
Assignment johnson
 

Recently uploaded

FULL ENJOY Call Girls In Majnu.Ka.Tilla Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Majnu.Ka.Tilla Delhi Contact Us 8377877756FULL ENJOY Call Girls In Majnu.Ka.Tilla Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Majnu.Ka.Tilla Delhi Contact Us 8377877756
dollysharma2066
 

Recently uploaded (20)

Unveiling the Legacy of the Rosetta stone A Key to Ancient Knowledge.pptx
Unveiling the Legacy of the Rosetta stone A Key to Ancient Knowledge.pptxUnveiling the Legacy of the Rosetta stone A Key to Ancient Knowledge.pptx
Unveiling the Legacy of the Rosetta stone A Key to Ancient Knowledge.pptx
 
FULL ENJOY Call Girls In Majnu.Ka.Tilla Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Majnu.Ka.Tilla Delhi Contact Us 8377877756FULL ENJOY Call Girls In Majnu.Ka.Tilla Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Majnu.Ka.Tilla Delhi Contact Us 8377877756
 
Alpha Media March 2024 Buyers Guide.pptx
Alpha Media March 2024 Buyers Guide.pptxAlpha Media March 2024 Buyers Guide.pptx
Alpha Media March 2024 Buyers Guide.pptx
 
Rise and fall of Kulula.com, an airline won consumers by different marketing ...
Rise and fall of Kulula.com, an airline won consumers by different marketing ...Rise and fall of Kulula.com, an airline won consumers by different marketing ...
Rise and fall of Kulula.com, an airline won consumers by different marketing ...
 
2024 Social Trends Report V4 from Later.com
2024 Social Trends Report V4 from Later.com2024 Social Trends Report V4 from Later.com
2024 Social Trends Report V4 from Later.com
 
The+State+of+Careers+In+Retention+Marketing-2.pdf
The+State+of+Careers+In+Retention+Marketing-2.pdfThe+State+of+Careers+In+Retention+Marketing-2.pdf
The+State+of+Careers+In+Retention+Marketing-2.pdf
 
What is Google Search Console and What is it provide?
What is Google Search Console and What is it provide?What is Google Search Console and What is it provide?
What is Google Search Console and What is it provide?
 
W.H.Bender Quote 61 -Influential restaurant and food service industry network...
W.H.Bender Quote 61 -Influential restaurant and food service industry network...W.H.Bender Quote 61 -Influential restaurant and food service industry network...
W.H.Bender Quote 61 -Influential restaurant and food service industry network...
 
How consumers use technology and the impacts on their lives
How consumers use technology and the impacts on their livesHow consumers use technology and the impacts on their lives
How consumers use technology and the impacts on their lives
 
Welcome to DataMetricks Consulting (1).pptx
Welcome to DataMetricks Consulting (1).pptxWelcome to DataMetricks Consulting (1).pptx
Welcome to DataMetricks Consulting (1).pptx
 
Unraveling the Mystery of The Circleville Letters.pptx
Unraveling the Mystery of The Circleville Letters.pptxUnraveling the Mystery of The Circleville Letters.pptx
Unraveling the Mystery of The Circleville Letters.pptx
 
Social Media Marketing Portfolio - Maharsh Benday
Social Media Marketing Portfolio - Maharsh BendaySocial Media Marketing Portfolio - Maharsh Benday
Social Media Marketing Portfolio - Maharsh Benday
 
Unlocking the Mystery of the Voynich Manuscript
Unlocking the Mystery of the Voynich ManuscriptUnlocking the Mystery of the Voynich Manuscript
Unlocking the Mystery of the Voynich Manuscript
 
Distribution Ad Platform_ The Role of Distribution Ad Network.pdf
Distribution Ad Platform_ The Role of  Distribution Ad Network.pdfDistribution Ad Platform_ The Role of  Distribution Ad Network.pdf
Distribution Ad Platform_ The Role of Distribution Ad Network.pdf
 
Busty Desi⚡Call Girls in Sector 49 Noida Escorts >༒8448380779 Escort Service
Busty Desi⚡Call Girls in Sector 49 Noida Escorts >༒8448380779 Escort ServiceBusty Desi⚡Call Girls in Sector 49 Noida Escorts >༒8448380779 Escort Service
Busty Desi⚡Call Girls in Sector 49 Noida Escorts >༒8448380779 Escort Service
 
Busty Desi⚡Call Girls in Sector 135 Noida Escorts >༒8448380779 Escort Service
Busty Desi⚡Call Girls in Sector 135 Noida Escorts >༒8448380779 Escort ServiceBusty Desi⚡Call Girls in Sector 135 Noida Escorts >༒8448380779 Escort Service
Busty Desi⚡Call Girls in Sector 135 Noida Escorts >༒8448380779 Escort Service
 
personal branding kit for music business
personal branding kit for music businesspersonal branding kit for music business
personal branding kit for music business
 
[Expert Panel] New Google Shopping Ads Strategies Uncovered
[Expert Panel] New Google Shopping Ads Strategies Uncovered[Expert Panel] New Google Shopping Ads Strategies Uncovered
[Expert Panel] New Google Shopping Ads Strategies Uncovered
 
10 Email Marketing Best Practices to Increase Engagements, CTR, And ROI
10 Email Marketing Best Practices to Increase Engagements, CTR, And ROI10 Email Marketing Best Practices to Increase Engagements, CTR, And ROI
10 Email Marketing Best Practices to Increase Engagements, CTR, And ROI
 
Discover Ardency Elite: Elevate Your Lifestyle
Discover Ardency Elite: Elevate Your LifestyleDiscover Ardency Elite: Elevate Your Lifestyle
Discover Ardency Elite: Elevate Your Lifestyle
 

Improving profitability of campaigns through data science

  • 1. IMPROVING RETAIL SALES CAMPAIGN PROFITABILITY FINAL CAPSTONE PROJECT : DATA ANALYSIS , FINDINGS AND RECOMMENDATIONS SHOUNAK MONDAL POSTGRADUATE DIPLOMA IN DATA SCIENCE, EMERITUS AND COLUMBIA UNIVERSITY
  • 2. CONTENTS 1 BACKGROUND 2 OBJECTIVES 3 APPROACH 4 ANALYSIS RESULTS 5 RECOMMENDATIONS
  • 3. 1. The Customer is a B2B retailer for office supplies, office electronics and office furniture 2. A marketing campaign was executed to ~16k of its customers 3. Detailed campaign target data and results are available including key data per customer : a) Resulting Sales b) Historical sales c) Type of previous purchase d) Communication channel preferences e) Size of target company f) Language Background
  • 4. Analyze the campaign results and provide insights and recommendations on : 1. Which type of customers responded positively to the campaign ? 2. What can the customer be doing for better future campaign performance ? 3. How much can be the financial gains of the improved campaign strategies ? Objective
  • 5. Approach Exploratory Data Analysis • Removed outliers in Number of Year Prior Transaction • Drop 10 rows which has null values for No of Employees and type of previous purchase • Impute 442 rows with Last Transaction channel as Unknown • Impute 4470 rows with language as unknown • Remove negative sales and Historical sales volume rows Data Transformation & Analsysis • Transform categorical values to binary coded values • Observe correlation between features and remove highly corelated features Model Building • Build a robust classification model with most influential features to predict sales or no sales • Build a regression model to predict sales with the most influential features and predict sales • Using the probability of sales and the amount of sales predicted calculate profitability given, gross margin , the marketing and transaction costs Gains / Lift Chart • Classify the customers into deciles and show profits per decile • Show profitable deciles and expected gains over random targeting • Inner Join based on customer ID, the probabilities from classifier and predicted sales sales from linear regressor model to formula to find profits. • Sort profits and split by deciles Recommendations • Show Which type of customers responded positively to the campaign ? • Show What can XYZ Ltd be doing for better future campaign performance ? • Show How much can be the financial gains of the improved campaign strategies ?
  • 6. 1. Those who have purchased before ( particularly 16-22 Transactions in prior year ) 2. Have had historical sales of up to $ 720,000+ ( 75% of the purchases ) 3. Made purchases in the year 1993 and 1994 – these 2 years seems to have created long term loyal customers Typical Purchasers are… 0 100 200 300 400 500 600 700 1926 1950 1954 1958 1962 1966 1970 1974 1978 1982 1986 1990 1994 1998 2002 2006 2010 2014 2018 Year of First Purchase Coefficients–DegreeofInfluence Numberof purchaserecords Year of Purchase
  • 7. Classification Model was built and tested on ~8000 records test data. The model can be now used to identify, predict and target positive sales candidates for future campaigns To build a robust model, Random Forest Classifier was built by training it on ~8000 campaign data which yielded the following : 1. Prediction accuracy score : 85% - Ability of the classifier to make correct predictions 2. Precision score: 77% - The ability of the classifier to predict true sales accurately. Cost of low score is wasted marketing cost. 3. Recall score:65% - The ability of the classifier to find all the true sales. Cost of low score is missed revenue opportunities. 4. f1 Score 70% - balance between the precision and the recall 5. Confusion Matrix NO YES NO 5400 438 YES 788 1453 Predicted Sale ( Yes or No ) Actual Sale
  • 8. Now that we know targeting which customers will result in sales, next we built a model to predict “Amount of Sales” for purchasing customers and tested it on ~4384 records positive sales test data Linear Regression was used which yielded the following : 1. Linear Regression Fit score Training data : 77% 2. Linear Regression Fit score Test data : 75% 3. Root Mean Squared Error of prediction : 559 4. R squared score 0.75 ( degree to which the model captures and explains the variance of the data ) Findings 1. Size of the company has the largest influence on the sales amount : larger the company, larger the sales amounts 2. Previous purchase of office furniture and computer equipment has next significant influence in amount of sales. Coefficients–DegreeofInfluence
  • 9. Gains Chart for ~1120 records test data representing a future campaign Total Actual Profit per customer of the campaign that was executed ( 16,000 records ) The lift chart is built from about 1120 records from test data for linear regression and same 1120 records from the classifier by using "inner" join on customer number i.e common records between the two dataframes in the linear reg test and classifier test Deciles Number of customers per decile Actual Profitability per customer Lift over average Total Profit % of Profit Incr Proj Profit 100k Customer base Total Proj Profit 100k Customer base Cuml Incr Profit 100k Customer base Cuml Total Profit 100k Customer base (1009.0, 1121.0] 112 504 497 55,630 76% 4,967 5,037 4,967 5,037 (897.0, 1009.0] 112 271 264 29,535 40% 2,637 2,707 7,604 7,744 (785.0, 897.0] 112 75 68 7,607 10% 679 749 8,283 8,493 (673.0, 785.0] 112 21 14 1,518 2% 135 205 8,419 8,699 (561.0, 673.0] 112 -1 -8 (947) -1% (85) (15) 8,334 8,684 (449.0, 561.0] 112 -12 -19 (2,113) -3% (189) (119) 8,145 8,565 (337.0, 449.0] 112 -20 -27 (2,994) -4% (267) (197) 7,878 8,368 (225.0, 337.0] 112 -28 -35 (3,959) -5% (354) (284) 7,525 8,085 (113.0, 225.0] 112 -37 -44 (4,944) -7% (441) (371) 7,083 7,713 (0.999, 113.0] 112 -49 -56 (6,268) -9% (560) (490) 6,523 7,223 Total 1120 7.00 73,063 100%
  • 10. Recommendations 1. Instead of random targeting of customer base, use the prediction model to target only first 4 deciles type customers for maximum profitability for future campaigns 1120 record test data. 2. Maximize profitability further by using lower cost channels that reach the above target of customers effectively since marketing channels showed little or no influence on sales 3. Replicate what was done in the year 1993 and 1994, as it seems to have created long loyal customers 4. Use the model to predict sales, profitability, and expected Return on Investment and leverage it for a more fact based budget requirements for decision by management / budget approver for next campaigns