Hackathon
Machine Learning
Submitted by Pro Squad
Apoorva, Deepak, Kunal & Yogesh
INDEX
1. Problem Statement
2. Challenges
3. Missing Value Treatment
4. Binning
5. Data Analysis
6. ML & Business Insights
PROBLEM STATEMENT
Problem
A mall is running a coupon campaign and wants to ensure the campaign's success using a robust prediction model built with Machine Learning techniques.
Context
The mall has provided historical data comprising recommended coupons, customer details and coupon consumption details from previous years.
Relevance
The mall is going to run the campaign again and, based on the historical data on coupon effectiveness, wants to increase footfall in the mall, which will help increase business for the shops in the mall.
AIMs & Objectives
The aim of the project is to derive business insights from the data provided and to train a Machine Learning model that can predict the success of the campaign with the highest possible accuracy.
CHALLENGES IN HISTORICAL DATA
• 26 features – 9 Numerical and 17 Categorical
• Missing values in 5 Columns
• Categorical columns have many labels, up to a maximum of 25 labels in one column
• Categorical data has outliers and skewness
• Most of the features are correlated
MISSING VALUE TREATMENT
• Car – Only 84 of the 10147 rows have a value in this column, which is less than 1%, so we removed the column as it has no impact.
• Bar, CoffeeHouse, CarryAway, RestaurantLessThan20, Restaurant20To50 – Around 2% of the values in these columns are missing, so we filled the gaps with the most commonly occurring value (the mode) among the values available in each column.
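The two treatments above can be sketched in a few lines of plain Python. This is a minimal illustration, not the team's actual code: the sample rows, the helper names `drop_column` and `fill_with_mode`, and the use of `None` to mark a missing value are all assumptions made for the example.

```python
from collections import Counter

# Hypothetical sample rows; None marks a missing value.
rows = [
    {"Car": None, "Bar": "never", "CoffeeHouse": "1~3"},
    {"Car": None, "Bar": "less1", "CoffeeHouse": None},
    {"Car": "Scooter", "Bar": "never", "CoffeeHouse": "1~3"},
    {"Car": None, "Bar": None, "CoffeeHouse": "4~8"},
]

# Share of rows with a Car value, using the counts quoted on the slide.
car_present_share = 84 / 10147  # below 1%, so the column is dropped


def drop_column(rows, column):
    """Return copies of the rows without the given column."""
    return [{k: v for k, v in r.items() if k != column} for r in rows]


def fill_with_mode(rows, column):
    """Replace missing values in `column` with its most frequent value."""
    counts = Counter(r[column] for r in rows if r[column] is not None)
    mode = counts.most_common(1)[0][0]
    return [{**r, column: mode if r[column] is None else r[column]} for r in rows]


cleaned = drop_column(rows, "Car")
for col in ("Bar", "CoffeeHouse"):
    cleaned = fill_with_mode(cleaned, col)

for r in cleaned:
    print(r)
```

Filling with the mode keeps every row and introduces no new label, which suits categorical columns with only a small share of gaps.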
BINNING
The Occupation column has 25 labels, and the frequency of the labels varies widely, creating outliers and skewness. We used binning to reduce the number of labels, which removed the outliers and skewness.
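One common way to bin a high-cardinality categorical column is to merge infrequent labels into a single bucket. The slides do not say which binning scheme was used, so the sketch below is an assumption: the `bin_rare_labels` helper, the `min_count` threshold, the "Other" bucket and the sample occupations are all hypothetical.

```python
from collections import Counter


def bin_rare_labels(values, min_count, other_label="Other"):
    """Keep labels seen at least `min_count` times; merge the rest."""
    counts = Counter(values)
    return [v if counts[v] >= min_count else other_label for v in values]


# Hypothetical occupation column with two rarely seen labels.
occupations = (
    ["Student"] * 5 + ["Unemployed"] * 4 + ["Retired"] * 3
    + ["Legal"] + ["Farming"]
)

binned = bin_rare_labels(occupations, min_count=3)

print(Counter(occupations))  # five labels, two of them seen only once
print(Counter(binned))       # four labels after merging the rare ones
```

Merging the rare labels shortens the long tail of the frequency distribution, which is the effect the slide describes as removing outliers and skewness.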
BINNING CONTD.
[Fig. 1, Fig. 2, Fig. 3 and Fig. 4: images not reproduced]
Outliers: In Figure 1 we can see two dots. These are outliers, which we tackled with binning. Figure 2 shows the result of binning on the categorical column.
Skewness: In Figure 3 we can see that the curve is skewed to the right, which we tackled with binning and post-processing. Figure 4 shows the result of binning on the categorical column.
DATA ANALYSIS
Success of Coupons (Historical Data)
Coffee House: 28%
Restaurant(<20): 27%
Carry out & Take away: 25%
Bar: 11%
Restaurant(20-50): 9%
Coffee House, Carry out and Restaurant(<20) were the most successful coupons.
Age Vs Coupons (Historical Data)
Age    N      Y
<21    164    268
21     862    1271
26     817    1216
31     751    885
36     495    570
41     363    516
46     235    303
50+    692    739
Coupon usage is very high in the 21 to 31 and 50+ age groups. Below 21 years, coupon distribution is low and hence so is usage.
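The counts in the Age Vs Coupons chart can also be expressed as acceptance rates per age group. The sketch below assumes that the first eight numbers on the chart are the N series (coupon not used) and the last eight are the Y series (coupon used), in the age order shown on the axis. That reading is supported by the fact that the sixteen values sum to 10147, the row count quoted earlier.

```python
ages = ["<21", "21", "26", "31", "36", "41", "46", "50+"]
not_used = [164, 862, 817, 751, 495, 363, 235, 692]   # N series
used = [268, 1271, 1216, 885, 570, 516, 303, 739]     # Y series

# Coupons offered per age group and the share of them that were used.
offered = {a: n + y for a, n, y in zip(ages, not_used, used)}
rate = {a: y / (n + y) for a, n, y in zip(ages, not_used, used)}

for a in ages:
    print(f"{a:>3}  offered={offered[a]:>5}  used={rate[a]:.1%}")

most_used = max(ages, key=lambda a: used[ages.index(a)])
print("Group with the most used coupons:", most_used)
```

Usage counts and acceptance rates answer different questions: the counts show where most coupons are redeemed, while the rates show how likely a coupon is to be redeemed once it has been offered.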
DATA ANALYSIS CONTD.
Occupation Vs Coupon Success (Historical Data)
[Bar chart; labelled values: N, 860 and Y, 1262; vertical axis from 0 to 1400]
The success rate is high for the Student, Unemployed, Computer professionals and Retired categories.
Marital Status (Historical Data)
Single: 40%
Married partner: 38%
Unmarried partner: 17%
Divorced: 4%
Widowed: 1%
DATA ANALYSIS CONTD.
Multicollinearity Chart
Colour Legend
• Yellow shade – correlation is 0
• Red and dark green – correlation is -1 and +1 respectively
Business Understanding
• Customer ID, Temperature, Time, Weather, Direction, Passenger and Driving Distance have very low impact.
• Age, Has Children, Marital status, Gender and Occupation have intermediate impact.
• Restaurant type visit rating has the highest impact.
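The three reference points in the colour legend (0, -1 and +1) are the neutral value and the two extremes of a correlation coefficient. The slides do not state which coefficient the chart uses, so the sketch below assumes Pearson correlation; the `pearson` helper and the sample columns are hypothetical.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


base = [1, 2, 3, 4, 5]
rising = [2, 4, 6, 8, 10]      # moves with base
falling = [10, 8, 6, 4, 2]     # moves against base
unrelated = [1, -1, 0, -1, 1]  # no linear relation to base

r_pos = pearson(base, rising)
r_neg = pearson(base, falling)
r_zero = pearson(base, unrelated)
print(round(r_pos, 6), round(r_neg, 6), round(r_zero, 6))
```

Feature pairs with a coefficient close to either extreme carry largely the same information, which is why such a chart is used to spot multicollinearity.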
MACHINE LEARNING MODEL
ML Models with their accuracy scores
ML Model 1: Logistic Regression
Logistic Regression -> Cross Validation -> Accuracy 68.97%
ML Model 2: Decision Tree
Decision Tree -> Hyper Tuning -> Cross Validation -> Accuracy 70.95%
ML Model 3: Random Forest
Random Forest -> Hyper Tuning -> Cross Validation -> Accuracy 76.63%
MACHINE LEARNING (HYPERTUNING)
Random Forest – Hyper Tuning to get accuracy
No. of Estimators: We used Randomized Search and Grid Search to find the optimum number of estimators (trees) giving the highest accuracy score, and then used that number in our Machine Learning model.
No. of Folds: We used 5 folds to create random train and test splits within the model, generating 5 accuracy scores. The average of these scores was selected as the final score.
Random State: We set the random state to 80, which gives the maximum accuracy score in our model.
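The mechanics of a grid search scored by 5-fold cross-validation can be sketched with the standard library alone. This is an illustration of the procedure, not the team's pipeline: a tiny nearest-neighbour classifier stands in for the Random Forest, its neighbour count stands in for the number of estimators, and the toy data and all helper names are hypothetical. Only the 5 folds and the seed of 80 come from the slide.

```python
import random


def k_fold_indices(n, k, seed):
    """Shuffle row indices with a fixed seed and deal them into k folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def knn_predict(train, x, k_neighbors):
    """Majority vote among the k nearest training points (1-D feature)."""
    nearest = sorted(train, key=lambda p: abs(p[0] - x))[:k_neighbors]
    votes = sum(label for _, label in nearest)
    return 1 if votes * 2 > len(nearest) else 0


def cv_accuracy(data, k_neighbors, folds):
    """Return one accuracy score per fold, each fold held out in turn."""
    scores = []
    for fold in folds:
        held_out = set(fold)
        train = [data[i] for i in range(len(data)) if i not in held_out]
        test = [data[i] for i in fold]
        correct = sum(knn_predict(train, x, k_neighbors) == y for x, y in test)
        scores.append(correct / len(test))
    return scores


# Toy data: label is 1 when x > 0.5, with roughly 10% of labels flipped.
rng = random.Random(80)
data = []
for _ in range(100):
    x = rng.random()
    y = int(x > 0.5)
    if rng.random() < 0.1:
        y = 1 - y
    data.append((x, y))

folds = k_fold_indices(len(data), k=5, seed=80)

# Grid search: score every candidate by its mean accuracy over the 5 folds.
results = {}
for k_neighbors in (1, 5, 15):
    scores = cv_accuracy(data, k_neighbors, folds)
    results[k_neighbors] = sum(scores) / len(scores)

best = max(results, key=results.get)
print(results, "best:", best)
```

A randomized search follows the same loop but samples a fixed number of candidates instead of trying every value in the grid. Fixing the seed makes the fold assignment, and therefore the reported scores, reproducible from run to run.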
Business Insights
Advantages to Business
1. Coffee, Restaurant (<20) and Take away coupons are more successful.
2. Coupons are mostly used by the 21 to 31 and 50+ age groups.
3. Computer workers, retired people, students and the unemployed use the coupons the most.
4. Customers tend to use the coupons if the driving distance is between 5 and 15 minutes.
5. Customers tend to use the coupons mostly when the weather is sunny.
6. Carry away coupon utilization is highest among customers who use carry away 1~3 times a month.
7. Most footfall is at 7:00 AM and 6:00 PM, probably to pick up a snack.
Mall coupon campaign success with ML

