MONTHLY SALES PREDICTION
FOR ROSSMANN STORES
GROUP NO. : 9
1. KETKI KULKARNI (010699455)
2. MADHURA KAPLE (010653071)
3. PRATIKSHYA MISHRA (010691200)
AGENDA
• Project Synopsis
• Explaining Data Set
• Project Flow
• Implementation
PROJECT SYNOPSIS :
• Rossmann operates over 3,000 drug stores in 7 European countries
• Aim : To predict the monthly sales of Rossmann stores for a
particular store type.
• Use historical data to recommend monthly sales based on factors
such as promotions, competition and holidays.
STORE DATA : SUPPLEMENTAL
INFORMATION ABOUT THE STORES
SALES DATA : HISTORICAL DATA
INCLUDING SALES
FEATURES USED IN PROJECT
• Features from Sales.csv :
Sale amount : turnover for any given day
Holidays
Promo : whether a store is running a promotional event on that day
• Features from Store.csv :
Competition distance (distance to the nearest competitor store)
Promo2 : continuing and consecutive promotion
Promo2SinceWeek : the week when the store started participating in Promo2
PromoInterval : the consecutive intervals at which Promo2 is started anew
BIG DATA PARADIGMS :
• Ecosystem used : Hive
• MapReduce program
• Recommendation algorithm : K-nearest neighbors (KNN)
PROJECT FLOW :
Hive tables :
Store information
Sales information
PROJECT FLOW :
Hive table : Sales → Mapper 1 → Reducer 1
(transforming daily sales data to monthly sales data)
Hive table : Store → Mapper 2 → Reducer 2
(combining store and sales data)
PROJECT FLOW :
Reducer 2 output → KNN algorithm → Recommendation
IMPLEMENTATION OVERVIEW
HIVE QL
MAPPER 1 (SALES)
REDUCER 1 (SALES)
OUTPUT OF MAP-REDUCE PROGRAM 1
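The first map-reduce stage (daily sales rolled up to monthly sales) could look roughly like the sketch below. It is written in plain Python rather than as a Hadoop job, and the record layout `store_id,date,sales`, the function names and the sample values are all assumptions made for illustration; the original slides showed the real mapper and reducer as screenshots.

```python
from collections import defaultdict

def map_daily_sale(line):
    """Mapper: emit ((store, year, month), sales) for one CSV line.

    Assumed (hypothetical) layout: store_id,date(YYYY-MM-DD),sales
    """
    store, date, sales = line.strip().split(",")
    year, month, _day = date.split("-")
    return (int(store), int(year), int(month)), float(sales)

def reduce_monthly(pairs):
    """Reducer: sum the daily sales that share a (store, year, month) key."""
    totals = defaultdict(float)
    for key, sales in pairs:
        totals[key] += sales
    return dict(totals)

# Toy input, invented for this example only.
daily_lines = [
    "1,2015-07-30,5000",
    "1,2015-07-31,5263",
    "1,2015-08-01,4000",
    "2,2015-07-31,6064",
]
monthly = reduce_monthly(map_daily_sale(line) for line in daily_lines)
```

Keying on (store, year, month) is what turns many daily rows into one monthly row per store; in a real Hadoop job the shuffle phase would do the grouping that `reduce_monthly` does here in memory.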
MAPPER 2 (STORE)
REDUCER 2 (STORE)
OUTPUT OF MAP-REDUCE PROGRAM 2
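The second stage combines store attributes with the monthly sales. One common way to do this in MapReduce is a reduce-side join keyed on the store id; the sketch below assumes that approach, and the field choices (competition distance, Promo2 flag), function names and sample rows are hypothetical rather than taken from the project code.

```python
from collections import defaultdict

def tag_store(record):
    """Mapper for store rows: key by store id and tag the source."""
    store_id, competition_distance, promo2 = record
    return store_id, ("store", (competition_distance, promo2))

def tag_sales(record):
    """Mapper for monthly sales rows: key by store id and tag the source."""
    store_id, year, month, monthly_sales = record
    return store_id, ("sales", (year, month, monthly_sales))

def reduce_join(tagged_pairs):
    """Reducer: attach store attributes to every monthly sales row."""
    grouped = defaultdict(lambda: {"store": None, "sales": []})
    for store_id, (tag, payload) in tagged_pairs:
        if tag == "store":
            grouped[store_id]["store"] = payload
        else:
            grouped[store_id]["sales"].append(payload)
    joined = []
    for store_id in sorted(grouped):
        attrs = grouped[store_id]["store"]
        if attrs is None:
            continue  # inner join: drop sales rows with no store metadata
        for year, month, sales in grouped[store_id]["sales"]:
            joined.append((store_id, year, month, sales) + attrs)
    return joined

# Toy rows, invented for this example only.
stores = [(1, 1270, 0), (2, 570, 1)]
sales = [(1, 2015, 7, 10263.0), (2, 2015, 7, 6064.0), (3, 2015, 7, 100.0)]
joined = reduce_join([tag_store(s) for s in stores] + [tag_sales(s) for s in sales])
```

Each joined row then carries both the monthly target and the store-level features, which is the shape a KNN step needs as input.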
Recommendation algorithm : KNN
• Creating the feature (dimension) matrix from the input file.
• Dividing the data into training and testing sets (hold-out : 20%).
• Finding the accuracy of the model.
• Predicting the sales for an input using the model.
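The four steps above can be sketched as a small KNN regressor. This is a minimal illustration, not the project's code: the feature choice, the value of k, the plain Euclidean distance, and in particular the definition of "accuracy" (share of test rows predicted within 10% of the actual value) are assumptions, since the slides do not state how accuracy was measured.

```python
import math
import random

def split_holdout(rows, test_fraction=0.2, seed=0):
    """Shuffle and hold out a fraction of the rows for testing."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, int(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]

def knn_predict(train, features, k=3):
    """Average the target of the k training rows closest to `features`."""
    nearest = sorted(train, key=lambda row: math.dist(row[0], features))[:k]
    return sum(target for _, target in nearest) / len(nearest)

def tolerance_accuracy(train, test, k=3, tolerance=0.10):
    """Assumed metric: fraction of test rows predicted within +/- tolerance."""
    hits = 0
    for features, actual in test:
        predicted = knn_predict(train, features, k)
        if abs(predicted - actual) <= tolerance * abs(actual):
            hits += 1
    return hits / len(test)

# Synthetic feature matrix: (month, promo_days) -> monthly sales.
rows = [((m, p), 4000.0 + 100.0 * m + 50.0 * p)
        for m in range(1, 13) for p in range(0, 5)]
train, test = split_holdout(rows)
accuracy = tolerance_accuracy(train, test)
prediction = knn_predict(train, (6, 2))
```

With real data the features would usually be scaled first, because a raw column such as competition distance in metres would otherwise dominate the distance.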
KNN: Mapping Input Data to Feature (Dimension)
KNN: Creating Training and Testing Data
KNN: Testing and Finding the Accuracy
KNN: Predicting the Minimum Sales for the Input
KNN: Result
• Initial accuracy was around 67%.
• Modified the model to find the most similar neighbor by adding
additional conditions on the features.
• Accuracy increased to 88%.
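The slides do not say which conditions were added. One possible reading, shown here purely as an assumption, is that candidate neighbors must first match the query on selected categorical features (for example the Promo2 flag) before distance is computed on the numeric ones. All names and values below are hypothetical.

```python
import math

def knn_predict_filtered(train, query, k=3, match_keys=("promo2",)):
    """KNN where neighbors must agree with the query on `match_keys`.

    Falls back to the full training set if no row satisfies the conditions.
    """
    candidates = [
        row for row in train
        if all(row["cats"][key] == query["cats"][key] for key in match_keys)
    ]
    if not candidates:
        candidates = train
    nearest = sorted(
        candidates, key=lambda row: math.dist(row["nums"], query["nums"])
    )[:k]
    return sum(row["sales"] for row in nearest) / len(nearest)

# Toy rows, invented for this example only.
train_rows = [
    {"cats": {"promo2": 1}, "nums": (6,), "sales": 5000.0},
    {"cats": {"promo2": 0}, "nums": (6,), "sales": 3000.0},
    {"cats": {"promo2": 1}, "nums": (7,), "sales": 5200.0},
]
query = {"cats": {"promo2": 1}, "nums": (6,)}
filtered_prediction = knn_predict_filtered(train_rows, query, k=2)
```

In this toy case the filter excludes the non-Promo2 store even though it is equally close in month, so the prediction is averaged only over stores in a comparable promotional situation.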
INPUT : input.txt 10 06 2016 10 3 3160
OUTPUT : The sales for the store 10 for the period 06/2016 is 4513.0
INPUT : input.txt 106 10 2016 10 3 1360
OUTPUT : The sales for the store 106 for the period 10/2016 is 5600.9355
KNN: Input and output
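Reading such an input line and producing the output message might be done as in the sketch below. The first three fields are taken to be store id, month and year because the sample output reports them that way; the meaning of the remaining fields is not given in the slides, so they are kept as opaque numeric features. Function names are hypothetical.

```python
def parse_input_line(line):
    """Split one whitespace-separated input line into its parts."""
    fields = line.split()
    store_id, month, year = int(fields[0]), int(fields[1]), int(fields[2])
    # Remaining fields are undocumented in the slides; keep them as features.
    extra_features = [float(value) for value in fields[3:]]
    return store_id, month, year, extra_features

def format_prediction(store_id, month, year, sales):
    """Build the result message in the style of the sample output."""
    return (
        f"The sales for the store {store_id} "
        f"for the period {month:02d}/{year} is {sales}"
    )

store_id, month, year, extra = parse_input_line("10 06 2016 10 3 3160")
message = format_prediction(store_id, month, year, 4513.0)
```

Here the sales value 4513.0 is simply copied from the sample output; in the full pipeline it would come from the KNN prediction.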
SCRIPT FILE
INPUT
RECOMMENDED SALES FOR INPUT MONTH
CONCLUSION AND FUTURE WORK
• Holistic implementation of all the concepts taught (MapReduce,
ecosystem and machine learning).
• Future Work:
Implement the project on the entire dataset.
Use the predictions to create effective staff schedules that increase
productivity and motivation, and to plan promotional events.
REFERENCES :
• https://www.kaggle.com/c/rossmann-store-sales
• Course slides
ANY QUESTIONS?
THANK YOU.
