Data Mining for the Online Retail Industry
By: Atul Sharma
2K20/BMBA/10
AIM OF THIS PPT
“Torture The Data, and it will Confess to Anything”
-Ronald Coase
• This presentation mainly focuses on the application of data mining in the online retail industry.
• This presentation is based on the technical article “Data mining for the online retail industry: A case study of RFM model-based customer segmentation using data mining” by Daqing Chen, Sai Laing Sain & Kun Guo.
INTRODUCTION
• Nowadays online shopping is a daily practice in urban areas.
• Online penetration of retail is expected to reach 10.7% by 2024, compared with 4.7% in 2019. Moreover, the number of online shoppers in India is expected to reach 220 million by 2025.
• Various factors are responsible for this growth:
o Localization of Internet content
o Growth in cities beyond metros
o Growth of mobile commerce
o Growing usage of debit cards for cashless transactions
o COVID-19
• With this increase in smart online customers, traders and sellers also need to be more aware and to know the personalities of different buyers.
Problems faced by online retailers
• Which items' / products' web pages has a customer visited? How long has a customer stayed on each web page, and in which sequence has a customer visited a set of products' web pages?
• Who are the most / least loyal customers, and how are they characterized?
• What are customers' purchase behavior patterns? Which products / items have customers often purchased together?
• In which sequence have the products been purchased?
• Which types of customers are more likely to respond to a certain promotional mailing?
• What are the sales patterns from various perspectives such as products / items, regions and time (weekly, monthly, quarterly, yearly and seasonally)?
• Who are the most / least valuable customers to the business? What are their distinct characteristics?
Data Mining
• Data mining is the process of analyzing massive volumes of data to discover business intelligence that helps companies solve problems, mitigate risks, and seize new opportunities.
• Technically, data mining is the process of finding correlations or patterns among dozens of fields in large relational databases.
Different steps involved in Data Mining
• Data Cleaning: This step involves the removal of noisy or incomplete data from the collection.
• Data Integration: When multiple heterogeneous data sources such as databases, data cubes or files are combined for analysis, the process is called data integration.
• Data Reduction: This technique is applied to obtain the data relevant for analysis from the collection. The reduced representation is much smaller in volume while maintaining integrity.
• Data Transformation: In this process, data is transformed into a form suitable for the data mining process. Data is consolidated so that the mining process is more efficient and the patterns are easier to understand.
• Data Mining: Patterns are extracted from the data, and models are built using classification and clustering techniques.
• Pattern Evaluation: This step involves identifying the interesting patterns that represent knowledge, based on interestingness measures. Data summarization and visualization methods are used to make the data understandable to the user.
• Knowledge Representation: In this step, data visualization and knowledge representation tools are used to present the mined data. Data is visualized in the form of reports, tables, etc.
Case Study Review
Company Overview:
• In the article, the authors use a sample case to demonstrate the data mining process for the online retail industry.
• The authors consider a UK-based non-store business with some 80 members of staff.
• The company was established in 1981, mainly selling unique all-occasion gifts. For years, the merchant relied heavily on direct mailing catalogues, and orders were taken over the phone.
• It was only 2 years ago that the company launched its own web site and shifted completely to the Web.
• Since then the company has maintained a steady and healthy number of customers from all parts of the United Kingdom and Europe, and has accumulated a huge amount of data about many customers.
• The company also uses Amazon.co.uk to market and sell its products.
Overview of Data
Methodology
• The data mining in this case is done by RFM (Recency, Frequency, Monetary) model-based clustering analysis. The following steps are involved in the process:
o Data pre-processing
o K-means clustering
o Enhancing clustering analysis using a decision tree
o Conclusion
Data Pre-processing
• The first step is to select the appropriate variables of interest from the dataset, for example Invoice, StockCode, Quantity, Price, InvoiceDate and PostCode.
• Create an aggregated variable named Amount by multiplying Quantity by Price, which gives the total amount of money spent per product / item in each transaction.
• Separate the variable InvoiceDate into two variables, Date and Time. This allows different transactions created by the same consumer on the same day but at different times to be treated separately.
• Filter out any transactions that do not have a postcode associated with them. This resolves any missing-value issues in relation to the variable PostCode.
• Sort the dataset by PostCode and create three essential aggregated variables: Recency, Frequency and Monetary. Calculate the values of these variables per postcode.
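The pre-processing steps above can be sketched in plain Python. This is only an illustrative sketch, not the authors' implementation. The sample rows, the date format, the reference date and the helper name `rfm_by_postcode` are assumptions made for the example. The sketch also assumes that Recency is the number of days since the most recent purchase, Frequency is the number of distinct transactions, and Monetary is the sum of Amount; the slides do not state the exact definitions.

```python
from collections import defaultdict
from datetime import date, datetime

# Hypothetical sample rows; field names follow the variables listed above.
transactions = [
    {"Invoice": "A1", "StockCode": "S1", "Quantity": 2, "Price": 5.0,
     "InvoiceDate": "2011-11-01 10:15", "PostCode": "AB1"},
    {"Invoice": "A1", "StockCode": "S2", "Quantity": 1, "Price": 3.5,
     "InvoiceDate": "2011-11-01 10:15", "PostCode": "AB1"},
    {"Invoice": "A2", "StockCode": "S1", "Quantity": 4, "Price": 5.0,
     "InvoiceDate": "2011-12-01 09:00", "PostCode": "AB1"},
    {"Invoice": "B1", "StockCode": "S3", "Quantity": 10, "Price": 1.2,
     "InvoiceDate": "2011-10-15 14:30", "PostCode": "CD2"},
    {"Invoice": "C1", "StockCode": "S1", "Quantity": 1, "Price": 5.0,
     "InvoiceDate": "2011-12-05 16:45", "PostCode": None},  # missing postcode
]


def rfm_by_postcode(rows, reference_date):
    """Return {postcode: (recency_days, frequency, monetary)}, sorted by postcode."""
    last_purchase = {}
    invoices = defaultdict(set)
    monetary = defaultdict(float)
    for row in rows:
        postcode = row["PostCode"]
        if not postcode:
            continue  # filter out transactions without a postcode
        amount = row["Quantity"] * row["Price"]  # the aggregated Amount variable
        stamp = datetime.strptime(row["InvoiceDate"], "%Y-%m-%d %H:%M")
        purchase_date, purchase_time = stamp.date(), stamp.time()  # Date and Time
        previous = last_purchase.get(postcode)
        if previous is None or purchase_date > previous:
            last_purchase[postcode] = purchase_date
        # Same-day purchases at different times count as separate transactions.
        invoices[postcode].add((row["Invoice"], purchase_date, purchase_time))
        monetary[postcode] += amount
    return {
        pc: ((reference_date - last_purchase[pc]).days, len(invoices[pc]), monetary[pc])
        for pc in sorted(last_purchase)
    }


rfm = rfm_by_postcode(transactions, reference_date=date(2011, 12, 10))
for postcode, (recency, frequency, money) in rfm.items():
    print(postcode, recency, frequency, round(money, 2))
```

The row with no postcode is dropped before aggregation, so it contributes to none of the three variables.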
Clustering Analysis
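The K-means step named in the methodology can be sketched as follows. This is a minimal, standard-library illustration, not the tool or settings used in the article. The RFM values are made up, the min-max scaling step is an assumption (the slides do not say how the variables were scaled), and the helper names `min_max_scale` and `kmeans` are hypothetical. Seeding the centroids with the first k points keeps the example deterministic; practical work would normally use a more careful initialisation and several restarts.

```python
import math


def min_max_scale(points):
    """Rescale every column to [0, 1] so no single RFM variable dominates."""
    columns = list(zip(*points))
    lows = [min(col) for col in columns]
    spans = [(max(col) - min(col)) or 1.0 for col in columns]
    return [
        tuple((value - low) / span for value, low, span in zip(point, lows, spans))
        for point in points
    ]


def kmeans(points, k, iterations=100):
    """Plain Lloyd's algorithm; the first k points seed the centroids."""
    centroids = [tuple(p) for p in points[:k]]
    assignment = [None] * len(points)
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        new_assignment = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        if new_assignment == assignment:
            break  # converged: no point changed cluster
        assignment = new_assignment
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:
                centroids[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return assignment, centroids


# Hypothetical (Recency in days, Frequency, Monetary) rows, one per postcode.
rfm_rows = [
    (2, 10, 5000.0), (3, 12, 5200.0), (1, 11, 4800.0),
    (300, 1, 100.0), (320, 1, 150.0), (310, 2, 120.0),
]
labels, centers = kmeans(min_max_scale(rfm_rows), k=2)
print(labels)
```

In this toy data the three recent, frequent, high-spending postcodes end up in one cluster and the three lapsed, low-spending postcodes in the other.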
Enhancing clustering analysis using decision tree
• The customers can be divided into categories such as frequency more than 2.5 with an average monetary value of 990.66, frequency more than 2.5 and less than 3.5 with an average monetary value of 1056.70, and so on.
• It is also interesting to note that the relationship between frequency and monetary value seems to be monotonic and linear.
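Thresholds such as 2.5 and 3.5 arise naturally when a tree splits on an integer-valued variable, because candidate cut points are usually taken midway between neighbouring values. The sketch below illustrates this with a single split (a one-level tree, not the full decision tree of the article). The data values and the helper name `best_split` are hypothetical, and choosing the split by minimum squared error is an assumption about the criterion.

```python
def best_split(frequency, monetary):
    """Return (threshold, mean_below, mean_above) minimising squared error.

    Needs at least two distinct frequency values.
    """
    values = sorted(set(frequency))
    best = None
    for low, high in zip(values, values[1:]):
        threshold = (low + high) / 2  # midpoint, hence cut points like 2.5
        left = [m for f, m in zip(frequency, monetary) if f < threshold]
        right = [m for f, m in zip(frequency, monetary) if f >= threshold]
        mean_left = sum(left) / len(left)
        mean_right = sum(right) / len(right)
        sse = (sum((m - mean_left) ** 2 for m in left)
               + sum((m - mean_right) ** 2 for m in right))
        if best is None or sse < best[0]:
            best = (sse, threshold, mean_left, mean_right)
    return best[1:]


# Hypothetical customers: purchase frequency and total monetary value.
frequency = [1, 1, 2, 2, 3, 3, 4, 4]
monetary = [300, 340, 600, 640, 1000, 1040, 1100, 1140]
print(best_split(frequency, monetary))
```

Applying the same search again inside each branch would yield further thresholds and a deeper tree.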
Interpretation & Recommendations
• The most valuable consumers of the business contributed more than 60 per cent of the total sales in the year 2011, whereas the least valuable ones made up only 4 per cent of the total sales.
• For each of these consumer groups, it is essential to find out further which products the customers in each group have purchased, which products have been purchased together most frequently, and in which sequence the products have been purchased.
• Many of the consumers of the business were organizational consumers buying a high quantity of a product per transaction. Examining at which specific times (seasons) they purchased, and which products and types of products they purchased frequently, will be beneficial to the business.
• Another aspect worth further investigation is linking consumer groups to geographical locations. This correlation, if it exists, may help the business look into other factors, such as culture, customs, and economics, that may affect a consumer's buying intentions and preferences.
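One of the follow-up questions above, which products are purchased together most frequently, can be explored with a simple pair count. The sketch below is an assumption-laden illustration: the baskets are invented and the helper name `co_purchase_counts` is hypothetical. A full analysis would more likely use an association-rule method.

```python
from collections import Counter
from itertools import combinations


def co_purchase_counts(baskets):
    """Count how many invoices contain each unordered pair of products."""
    counts = Counter()
    for items in baskets.values():
        # sorted(set(...)) removes repeats and gives each pair one canonical order
        for pair in combinations(sorted(set(items)), 2):
            counts[pair] += 1
    return counts


# Hypothetical invoices mapped to the StockCodes they contain.
baskets = {
    "A1": ["S1", "S2", "S3"],
    "A2": ["S1", "S2"],
    "A3": ["S2", "S3"],
    "A4": ["S1", "S2"],
}
pair_counts = co_purchase_counts(baskets)
print(pair_counts.most_common(2))
```

Running the same count separately for each consumer group would show whether the groups differ in what they buy together.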
Conclusion
• As shown in the case, data mining can help businesses understand the purchasing behavior of their customers, which helps them promote their products and services.
• The segmentation of customers helps businesses differentiate between the right target customers and those who are unprofitable.
• The information businesses get through data mining and analysis is the basis for progressive strategies and growth.
THANK YOU
