SlideShare a Scribd company logo
Data Mining
Techniques for CRM
Seyyed Jamaleddin Pishvayi
Customer Relationship Management
Instructor : Dr. Taghiyare
Tehran University
Spring 1383
2
Outlines
 What is Data Mining?
 Data Mining Motivation
 Data Mining Applications
 Applications of Data Mining in CRM
 Data Mining Taxonomy
 Data Mining Techniques
3
Data Mining
 The non-trivial extraction of novel, implicit, and actionable
knowledge from large datasets.
 Extremely large datasets
 Discovery of the non-obvious
 Useful knowledge that can improve processes
 Can not be done manually
 Technology to enable data exploration, data analysis, and data
visualization of very large databases at a high level of
abstraction, without a specific hypothesis in mind.
 Sophisticated data search capability that uses statistical
algorithms to discover patterns and correlations in data.
4
Data Mining (cont.)
5
Data Mining (cont.)
 Data Mining is a step of Knowledge Discovery in
Databases (KDD) Process
 Data Warehousing
 Data Selection
 Data Preprocessing
 Data Transformation
 Data Mining
 Interpretation/Evaluation
 Data Mining is sometimes referred to as KDD and
DM and KDD tend to be used as synonyms
6
Data Mining Evaluation
7
Data Mining is Not …
 Data warehousing
 SQL / Ad Hoc Queries / Reporting
 Software Agents
 Online Analytical Processing (OLAP)
 Data Visualization
8
Data Mining Motivation
 Changes in the Business Environment
 Customers becoming more demanding
 Markets are saturated
 Databases today are huge:
 More than 1,000,000 entities/records/rows
 From 10 to 10,000 fields/attributes/variables
 Gigabytes and terabytes
 Databases a growing at an unprecedented rate
 Decisions must be made rapidly
 Decisions must be made with maximum knowledge
9
“The key in business is to know something that
nobody else knows.”
— Aristotle Onassis
“To understand is to perceive patterns.”
— Sir Isaiah Berlin
PHOTO:LUCINDADOUGLAS-MENZIES
PHOTO: HULTON-DEUTSCH COLL
Data Mining Motivation
10
Data Mining Applications
11
Data Mining Applications:
Retail
 Performing basket analysis
 Which items customers tend to purchase together. This
knowledge can improve stocking, store layout strategies, and
promotions.
 Sales forecasting
 Examining time-based patterns helps retailers make stocking
decisions. If a customer purchases an item today, when are they
likely to purchase a complementary item?
 Database marketing
 Retailers can develop profiles of customers with certain
behaviors, for example, those who purchase designer labels
clothing or those who attend sales. This information can be used
to focus cost–effective promotions.
 Merchandise planning and allocation
 When retailers add new stores, they can improve merchandise
planning and allocation by examining patterns in stores with
similar demographic characteristics. Retailers can also use data
mining to determine the ideal layout for a specific store.
12
Data Mining Applications:
Banking
 Card marketing
 By identifying customer segments, card issuers and acquirers
can improve profitability with more effective acquisition and
retention programs, targeted product development, and
customized pricing.
 Cardholder pricing and profitability
 Card issuers can take advantage of data mining technology to
price their products so as to maximize profit and minimize loss of
customers. Includes risk-based pricing.
 Fraud detection
 Fraud is enormously costly. By analyzing past transactions that
were later determined to be fraudulent, banks can identify
patterns.
 Predictive life-cycle management
 DM helps banks predict each customer’s lifetime value and to
service each segment appropriately (for example, offering
special deals and discounts).
13
Data Mining Applications:
Telecommunication
 Call detail record analysis
 Telecommunication companies accumulate detailed call
records. By identifying customer segments with similar use
patterns, the companies can develop attractive pricing and
feature promotions.
 Customer loyalty
 Some customers repeatedly switch providers, or “churn”, to
take advantage of attractive incentives by competing
companies. The companies can use DM to identify the
characteristics of customers who are likely to remain loyal
once they switch, thus enabling the companies to target
their spending on customers who will produce the most
profit.
14
Data Mining Applications:
Other Applications
 Customer segmentation
 All industries can take advantage of DM to discover discrete
segments in their customer bases by considering additional
variables beyond traditional analysis.
 Manufacturing
 Through choice boards, manufacturers are beginning to
customize products for customers; therefore they must be able to
predict which features should be bundled to meet customer
demand.
 Warranties
 Manufacturers need to predict the number of customers who will
submit warranty claims and the average cost of those claims.
 Frequent flier incentives
 Airlines can identify groups of customers that can be given
incentives to fly more.
15
Data Mining in CRM:
Customer Life Cycle
 Customer Life Cycle
 The stages in the relationship between a customer and a
business
 Key stages in the customer lifecycle
 Prospects: people who are not yet customers but are in
the target market
 Responders: prospects who show an interest in a product
or service
 Active Customers: people who are currently using the
product or service
 Former Customers: may be “bad” customers who did not
pay their bills or who incurred high costs
 It’s important to know life cycle events (e.g.
retirement)
16
Data Mining in CRM:
Customer Life Cycle
 What marketers want: Increasing customer
revenue and customer profitability
 Up-sell
 Cross-sell
 Keeping the customers for a longer period of time
 Solution: Applying data mining
17
Data Mining in CRM
 DM helps to
 Determine the behavior surrounding a particular
lifecycle event
 Find other people in similar life stages and
determine which customers are following similar
behavior patterns
18
Data Mining in CRM (cont.)
Data Warehouse Data Mining
Campaign Management
Customer Profile
Customer Life Cycle Info.
19
Data Mining in CRM:
More
 Building Data Mining Applications for CRM
by Alex Berson, Stephen Smith, Kurt
Thearling (McGraw Hill, 2000).
20
Data Mining Techniques
Data Mining Techniques
Descriptive Predictive
Clustering
Association
Classification
Regression
Sequential Analysis
Decision Tree
Rule Induction
Neural Networks
Nearest Neighbor Classification
21
Two Good Algorithm Books
 Intelligent Data
Analysis: An
Introduction
 by Berthold and Hand
 The Elements of
Statistical Learning:
Data Mining, Inference,
and Prediction
 by Hastie, Tibshirani, and
Friedman
22
Predictive Data Mining
Tridas Vickie Mike
Honest
BarneyWaldoWally
Crooked
23
Prediction
Tridas Vickie Mike
Honest = has round eyes and a smile
24
Decision Trees
 Data
height hair eyes class
short blond blue A
tall blond brown B
tall red blue A
short dark blue B
tall dark blue B
tall blond blue A
tall dark brown B
short blond brown B
25
Decision Trees (cont.)
hair
dark
red
blond
short, blue = B
tall, blue = B
tall, brown= B
{tall, blue = A} short, blue = A
tall, brown = B
tall, blue = A
short, brown = B
Completely classifies dark-haired
and red-haired people
Does not completely classify
blonde-haired people.
More work is required
26
Decision Trees (cont.)
hair
dark
red
blond
short, blue = B
tall, blue = B
tall, brown= B
{tall, blue = A} short, blue = A
tall, brown = B
tall, blue = A
short, brown = B
eye
blue brown
short = A
tall = A
tall = B
short = B
Decision tree is complete because
1. All 8 cases appear at nodes
2. At each node, all cases are in
the same class (A or B)
27
Decision Trees:
Learned Predictive Rules
hair
eyesB
B
A
A
dark
red
blond
blue brown
28
Decision Trees:
Another Example
Total list
50% member
0-1 child 2-3 child
20% member
4+ children
$50-75k income
15% member
$75k+ income
70% member
$50-75k income $20-50k income
85% member
Age: 40-60
80% member
Age: 20-40
45% member
29
Rule Induction
 Try to find rules of the form
IF <left-hand-side> THEN <right-hand-side>
 This is the reverse of a rule-based agent, where the rules are
given and the agent must act. Here the actions are given
and we have to discover the rules!
 Prevalence = probability that LHS and RHS
occur together (sometimes called “support factor,”
“leverage” or “lift”)
 Predictability = probability of RHS given LHS
(sometimes called “confidence” or “strength”)
30
Association Rules from
Market Basket Analysis
 <Dairy-Milk-Refrigerated> → <Soft Drinks Carbonated>
 prevalence = 4.99%, predictability = 22.89%
 <Dry Dinners - Pasta> → <Soup-Canned>
 prevalence = 0.94%, predictability = 28.14%
 <Dry Dinners - Pasta> → <Cereal - Ready to Eat>
 prevalence = 1.36%, predictability = 41.02%
 <Cheese Slices > → <Cereal - Ready to Eat>
 prevalence = 1.16%, predictability = 38.01%
31
Use of Rule Associations
 Coupons, discounts
 Don’t give discounts on 2 items that are frequently bought
together. Use the discount on 1 to “pull” the other
 Product placement
 Offer correlated products to the customer at the same time.
Increases sales
 Timing of cross-marketing
 Send camcorder offer to VCR purchasers 2-3 months after
VCR purchase
 Discovery of patterns
 People who bought X, Y and Z (but not any pair) bought W
over half the time
32
Finding Rule Associations
Algorithm
 Example: grocery shopping
 For each item, count # of occurrences (say out of 100,000)
apples 1891, caviar 3, ice cream 1088, …
 Drop the ones that are below a minimum support level
apples 1891, ice cream 1088, pet food 2451, …
 Make a table of each item against each other item:
 Discard cells below support threshold. Now make a cube for triples,
etc. Add 1 dimension for each product on LHS.
apples ice cream pet food
apples 1891 685 24
ice cream ----- 1088 322
pet food ----- ----- 2451
33
Clustering
 The art of finding groups in data
 Objective: gather items from a database into
sets according to (unknown) common
characteristics
 Much more difficult than classification since
the classes are not known in advance (no
training)
 Technique: unsupervised learning
34
The K-Means Clustering
Method
0
1
2
3
4
5
6
7
8
9
10
0 1 2 3 4 5 6 7 8 9 10
0
1
2
3
4
5
6
7
8
9
10
0 1 2 3 4 5 6 7 8 9 10
0
1
2
3
4
5
6
7
8
9
10
0 1 2 3 4 5 6 7 8 9 10
0
1
2
3
4
5
6
7
8
9
10
0 1 2 3 4 5 6 7 8 9 10
0
1
2
3
4
5
6
7
8
9
10
0 1 2 3 4 5 6 7 8 9 10
K=2
Arbitrarily choose K
objects as initial
cluster center
Assign
each of
the
objects
to most
similar
center
Update
the
cluster
means
Update
the
cluster
means
reassignreassign
Thanks
Seyyed Jamaleddin Pishvayi
Customer Relationship Management
Instructor : Dr. Taghiyare
Tehran University
Spring 1383

More Related Content

What's hot

Predicting Bank Customer Churn Using Classification
Predicting Bank Customer Churn Using ClassificationPredicting Bank Customer Churn Using Classification
Predicting Bank Customer Churn Using Classification
Vishva Abeyrathne
 
customer_profiling_based_on_fuzzy_principals_linkedin
customer_profiling_based_on_fuzzy_principals_linkedincustomer_profiling_based_on_fuzzy_principals_linkedin
customer_profiling_based_on_fuzzy_principals_linkedinAsoka Korale
 
Presentation Title
Presentation TitlePresentation Title
Presentation Titlebutest
 
Reduction in customer complaints - Mortgage Industry
Reduction in customer complaints - Mortgage IndustryReduction in customer complaints - Mortgage Industry
Reduction in customer complaints - Mortgage Industry
Pranov Mishra
 
What is the Decision Tree Analysis and How Does it Help a Business to Analyze...
What is the Decision Tree Analysis and How Does it Help a Business to Analyze...What is the Decision Tree Analysis and How Does it Help a Business to Analyze...
What is the Decision Tree Analysis and How Does it Help a Business to Analyze...
Smarten Augmented Analytics
 
IRJET- Customer Buying Prediction using Machine-Learning Techniques: A Survey
IRJET- Customer Buying Prediction using Machine-Learning Techniques: A SurveyIRJET- Customer Buying Prediction using Machine-Learning Techniques: A Survey
IRJET- Customer Buying Prediction using Machine-Learning Techniques: A Survey
IRJET Journal
 
Teaching Descriptive Analytics, Customer Profiling and Clustering
Teaching Descriptive Analytics, Customer Profiling and ClusteringTeaching Descriptive Analytics, Customer Profiling and Clustering
Teaching Descriptive Analytics, Customer Profiling and Clustering
St. Edward's University
 
11.direct marketing with the application of data mining
11.direct marketing with the application of data mining11.direct marketing with the application of data mining
11.direct marketing with the application of data mining
Alexander Decker
 
Direct marketing with the application of data mining
Direct marketing with the application of data miningDirect marketing with the application of data mining
Direct marketing with the application of data mining
Alexander Decker
 
Customer Segmentation Project
Customer Segmentation ProjectCustomer Segmentation Project
Customer Segmentation Project
Aditya Ekawade
 
Sales Performance Deep Dive and Forecast: A ML Driven Analytics Solution
Sales Performance Deep Dive and Forecast: A ML Driven Analytics SolutionSales Performance Deep Dive and Forecast: A ML Driven Analytics Solution
Sales Performance Deep Dive and Forecast: A ML Driven Analytics Solution
Pranov Mishra
 
An impact of knowledge mining on satisfaction of consumers in super bazaars
An impact of knowledge mining on satisfaction of consumers in super bazaarsAn impact of knowledge mining on satisfaction of consumers in super bazaars
An impact of knowledge mining on satisfaction of consumers in super bazaarsIAEME Publication
 
Predictive Modelling & Market-Basket Analysis.
Predictive Modelling & Market-Basket Analysis.Predictive Modelling & Market-Basket Analysis.
Predictive Modelling & Market-Basket Analysis.
Siddhanth Chaurasiya
 
Purchase Prediction for Insurance Company
Purchase Prediction for Insurance CompanyPurchase Prediction for Insurance Company
Purchase Prediction for Insurance Company
Keerthana Ashok Kumar
 
Day 1 (Lecture 1): Data Management- The Foundation of all Analytics
Day 1 (Lecture 1): Data Management- The Foundation of all AnalyticsDay 1 (Lecture 1): Data Management- The Foundation of all Analytics
Day 1 (Lecture 1): Data Management- The Foundation of all Analytics
Aseda Owusua Addai-Deseh
 
A high level overview of all that is Analytics
A high level overview of all that is AnalyticsA high level overview of all that is Analytics
A high level overview of all that is Analytics
Ramkumar Ravichandran
 
Day 1 (Lecture 2): Business Analytics
Day 1 (Lecture 2): Business AnalyticsDay 1 (Lecture 2): Business Analytics
Day 1 (Lecture 2): Business Analytics
Aseda Owusua Addai-Deseh
 
Segmentation
SegmentationSegmentation
Segmentation
ModakAnalytics
 
Using R for customer segmentation
Using R  for customer segmentationUsing R  for customer segmentation
Using R for customer segmentation
Kumar P
 
Customer Clustering Based on Customer Purchasing Sequence Data
Customer Clustering Based on Customer Purchasing Sequence DataCustomer Clustering Based on Customer Purchasing Sequence Data
Customer Clustering Based on Customer Purchasing Sequence Data
IJERA Editor
 

What's hot (20)

Predicting Bank Customer Churn Using Classification
Predicting Bank Customer Churn Using ClassificationPredicting Bank Customer Churn Using Classification
Predicting Bank Customer Churn Using Classification
 
customer_profiling_based_on_fuzzy_principals_linkedin
customer_profiling_based_on_fuzzy_principals_linkedincustomer_profiling_based_on_fuzzy_principals_linkedin
customer_profiling_based_on_fuzzy_principals_linkedin
 
Presentation Title
Presentation TitlePresentation Title
Presentation Title
 
Reduction in customer complaints - Mortgage Industry
Reduction in customer complaints - Mortgage IndustryReduction in customer complaints - Mortgage Industry
Reduction in customer complaints - Mortgage Industry
 
What is the Decision Tree Analysis and How Does it Help a Business to Analyze...
What is the Decision Tree Analysis and How Does it Help a Business to Analyze...What is the Decision Tree Analysis and How Does it Help a Business to Analyze...
What is the Decision Tree Analysis and How Does it Help a Business to Analyze...
 
IRJET- Customer Buying Prediction using Machine-Learning Techniques: A Survey
IRJET- Customer Buying Prediction using Machine-Learning Techniques: A SurveyIRJET- Customer Buying Prediction using Machine-Learning Techniques: A Survey
IRJET- Customer Buying Prediction using Machine-Learning Techniques: A Survey
 
Teaching Descriptive Analytics, Customer Profiling and Clustering
Teaching Descriptive Analytics, Customer Profiling and ClusteringTeaching Descriptive Analytics, Customer Profiling and Clustering
Teaching Descriptive Analytics, Customer Profiling and Clustering
 
11.direct marketing with the application of data mining
11.direct marketing with the application of data mining11.direct marketing with the application of data mining
11.direct marketing with the application of data mining
 
Direct marketing with the application of data mining
Direct marketing with the application of data miningDirect marketing with the application of data mining
Direct marketing with the application of data mining
 
Customer Segmentation Project
Customer Segmentation ProjectCustomer Segmentation Project
Customer Segmentation Project
 
Sales Performance Deep Dive and Forecast: A ML Driven Analytics Solution
Sales Performance Deep Dive and Forecast: A ML Driven Analytics SolutionSales Performance Deep Dive and Forecast: A ML Driven Analytics Solution
Sales Performance Deep Dive and Forecast: A ML Driven Analytics Solution
 
An impact of knowledge mining on satisfaction of consumers in super bazaars
An impact of knowledge mining on satisfaction of consumers in super bazaarsAn impact of knowledge mining on satisfaction of consumers in super bazaars
An impact of knowledge mining on satisfaction of consumers in super bazaars
 
Predictive Modelling & Market-Basket Analysis.
Predictive Modelling & Market-Basket Analysis.Predictive Modelling & Market-Basket Analysis.
Predictive Modelling & Market-Basket Analysis.
 
Purchase Prediction for Insurance Company
Purchase Prediction for Insurance CompanyPurchase Prediction for Insurance Company
Purchase Prediction for Insurance Company
 
Day 1 (Lecture 1): Data Management- The Foundation of all Analytics
Day 1 (Lecture 1): Data Management- The Foundation of all AnalyticsDay 1 (Lecture 1): Data Management- The Foundation of all Analytics
Day 1 (Lecture 1): Data Management- The Foundation of all Analytics
 
A high level overview of all that is Analytics
A high level overview of all that is AnalyticsA high level overview of all that is Analytics
A high level overview of all that is Analytics
 
Day 1 (Lecture 2): Business Analytics
Day 1 (Lecture 2): Business AnalyticsDay 1 (Lecture 2): Business Analytics
Day 1 (Lecture 2): Business Analytics
 
Segmentation
SegmentationSegmentation
Segmentation
 
Using R for customer segmentation
Using R  for customer segmentationUsing R  for customer segmentation
Using R for customer segmentation
 
Customer Clustering Based on Customer Purchasing Sequence Data
Customer Clustering Based on Customer Purchasing Sequence DataCustomer Clustering Based on Customer Purchasing Sequence Data
Customer Clustering Based on Customer Purchasing Sequence Data
 

Similar to Chapter14 example2

Datamining for crm
Datamining for crmDatamining for crm
Datamining for crm
Anonim Anonimler
 
Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?
Seerat Malik
 
Data Mining Concepts with Customer Relationship Management
Data Mining Concepts with Customer Relationship ManagementData Mining Concepts with Customer Relationship Management
Data Mining Concepts with Customer Relationship Management
IJERA Editor
 
Data Mining Presentation for College Harsh.pptx
Data Mining Presentation for College Harsh.pptxData Mining Presentation for College Harsh.pptx
Data Mining Presentation for College Harsh.pptx
hp41112004
 
Data mining techniques and dss
Data mining techniques and dssData mining techniques and dss
Data mining techniques and dssNiyitegekabilly
 
IBM Transforming Customer Relationships Through Predictive Analytics
IBM Transforming Customer Relationships Through Predictive AnalyticsIBM Transforming Customer Relationships Through Predictive Analytics
IBM Transforming Customer Relationships Through Predictive Analytics
SFIMA
 
Data MiningData MiningData MiningData Mining
Data MiningData MiningData MiningData MiningData MiningData MiningData MiningData Mining
Data MiningData MiningData MiningData Mining
abdulraqeebalareqi1
 
Data Mining in Life Insurance Business
Data Mining in Life Insurance BusinessData Mining in Life Insurance Business
Data Mining in Life Insurance BusinessAnkur Khanna
 
Equitec Case - Consumer Behavior
Equitec Case - Consumer BehaviorEquitec Case - Consumer Behavior
Equitec Case - Consumer Behavior
bdorne
 
Datamining
DataminingDatamining
Datamining
DataminingDatamining
USE OF DATA MINING IN BANKING SECTOR
USE OF DATA MINING IN BANKING SECTORUSE OF DATA MINING IN BANKING SECTOR
USE OF DATA MINING IN BANKING SECTOR
arpit bhadoriya
 
Data Mining in Retail Industries
Data Mining in Retail IndustriesData Mining in Retail Industries
Data Mining in Retail Industries
Rahul Sinha
 
Big data: What's the big deal?
Big data: What's the big deal?Big data: What's the big deal?
Big data: What's the big deal?
Penser
 
Unit I-Final MArketing analytics unit 1 ppt
Unit I-Final MArketing analytics unit 1 pptUnit I-Final MArketing analytics unit 1 ppt
Unit I-Final MArketing analytics unit 1 ppt
PriyadharshiniG41
 
Developing a customer data platform
Developing a customer data platformDeveloping a customer data platform
Developing a customer data platform
Tredence Inc
 
Data mining & data warehousing
Data mining & data warehousingData mining & data warehousing
Data mining & data warehousing
Shubha Brota Raha
 
Big Data in Retail (White paper)
Big Data in Retail (White paper)Big Data in Retail (White paper)
Big Data in Retail (White paper)
InData Labs
 

Similar to Chapter14 example2 (20)

Datamining for crm
Datamining for crmDatamining for crm
Datamining for crm
 
Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?
 
Data Mining Concepts with Customer Relationship Management
Data Mining Concepts with Customer Relationship ManagementData Mining Concepts with Customer Relationship Management
Data Mining Concepts with Customer Relationship Management
 
Data Mining Presentation for College Harsh.pptx
Data Mining Presentation for College Harsh.pptxData Mining Presentation for College Harsh.pptx
Data Mining Presentation for College Harsh.pptx
 
Data mining techniques and dss
Data mining techniques and dssData mining techniques and dss
Data mining techniques and dss
 
IBM Transforming Customer Relationships Through Predictive Analytics
IBM Transforming Customer Relationships Through Predictive AnalyticsIBM Transforming Customer Relationships Through Predictive Analytics
IBM Transforming Customer Relationships Through Predictive Analytics
 
Data MiningData MiningData MiningData Mining
Data MiningData MiningData MiningData MiningData MiningData MiningData MiningData Mining
Data MiningData MiningData MiningData Mining
 
Data Mining in Life Insurance Business
Data Mining in Life Insurance BusinessData Mining in Life Insurance Business
Data Mining in Life Insurance Business
 
Crm
CrmCrm
Crm
 
Equitec Case - Consumer Behavior
Equitec Case - Consumer BehaviorEquitec Case - Consumer Behavior
Equitec Case - Consumer Behavior
 
Datamining
DataminingDatamining
Datamining
 
Datamining
DataminingDatamining
Datamining
 
USE OF DATA MINING IN BANKING SECTOR
USE OF DATA MINING IN BANKING SECTORUSE OF DATA MINING IN BANKING SECTOR
USE OF DATA MINING IN BANKING SECTOR
 
Data Mining in Retail Industries
Data Mining in Retail IndustriesData Mining in Retail Industries
Data Mining in Retail Industries
 
Big data: What's the big deal?
Big data: What's the big deal?Big data: What's the big deal?
Big data: What's the big deal?
 
Unit I-Final MArketing analytics unit 1 ppt
Unit I-Final MArketing analytics unit 1 pptUnit I-Final MArketing analytics unit 1 ppt
Unit I-Final MArketing analytics unit 1 ppt
 
Rahul ppt
Rahul pptRahul ppt
Rahul ppt
 
Developing a customer data platform
Developing a customer data platformDeveloping a customer data platform
Developing a customer data platform
 
Data mining & data warehousing
Data mining & data warehousingData mining & data warehousing
Data mining & data warehousing
 
Big Data in Retail (White paper)
Big Data in Retail (White paper)Big Data in Retail (White paper)
Big Data in Retail (White paper)
 

Chapter14 example2

  • 1. Data Mining Techniques for CRM Seyyed Jamaleddin Pishvayi Customer Relationship Management Instructor : Dr. Taghiyare Tehran University Spring 1383
  • 2. 2 Outlines  What is Data Mining?  Data Mining Motivation  Data Mining Applications  Applications of Data Mining in CRM  Data Mining Taxonomy  Data Mining Techniques
  • 3. 3 Data Mining  The non-trivial extraction of novel, implicit, and actionable knowledge from large datasets.  Extremely large datasets  Discovery of the non-obvious  Useful knowledge that can improve processes  Can not be done manually  Technology to enable data exploration, data analysis, and data visualization of very large databases at a high level of abstraction, without a specific hypothesis in mind.  Sophisticated data search capability that uses statistical algorithms to discover patterns and correlations in data.
  • 5. 5 Data Mining (cont.)  Data Mining is a step of Knowledge Discovery in Databases (KDD) Process  Data Warehousing  Data Selection  Data Preprocessing  Data Transformation  Data Mining  Interpretation/Evaluation  Data Mining is sometimes referred to as KDD and DM and KDD tend to be used as synonyms
  • 7. 7 Data Mining is Not …  Data warehousing  SQL / Ad Hoc Queries / Reporting  Software Agents  Online Analytical Processing (OLAP)  Data Visualization
  • 8. 8 Data Mining Motivation  Changes in the Business Environment  Customers becoming more demanding  Markets are saturated  Databases today are huge:  More than 1,000,000 entities/records/rows  From 10 to 10,000 fields/attributes/variables  Gigabytes and terabytes  Databases a growing at an unprecedented rate  Decisions must be made rapidly  Decisions must be made with maximum knowledge
  • 9. 9 “The key in business is to know something that nobody else knows.” — Aristotle Onassis “To understand is to perceive patterns.” — Sir Isaiah Berlin PHOTO:LUCINDADOUGLAS-MENZIES PHOTO: HULTON-DEUTSCH COLL Data Mining Motivation
  • 11. 11 Data Mining Applications: Retail  Performing basket analysis  Which items customers tend to purchase together. This knowledge can improve stocking, store layout strategies, and promotions.  Sales forecasting  Examining time-based patterns helps retailers make stocking decisions. If a customer purchases an item today, when are they likely to purchase a complementary item?  Database marketing  Retailers can develop profiles of customers with certain behaviors, for example, those who purchase designer labels clothing or those who attend sales. This information can be used to focus cost–effective promotions.  Merchandise planning and allocation  When retailers add new stores, they can improve merchandise planning and allocation by examining patterns in stores with similar demographic characteristics. Retailers can also use data mining to determine the ideal layout for a specific store.
  • 12. 12 Data Mining Applications: Banking  Card marketing  By identifying customer segments, card issuers and acquirers can improve profitability with more effective acquisition and retention programs, targeted product development, and customized pricing.  Cardholder pricing and profitability  Card issuers can take advantage of data mining technology to price their products so as to maximize profit and minimize loss of customers. Includes risk-based pricing.  Fraud detection  Fraud is enormously costly. By analyzing past transactions that were later determined to be fraudulent, banks can identify patterns.  Predictive life-cycle management  DM helps banks predict each customer’s lifetime value and to service each segment appropriately (for example, offering special deals and discounts).
  • 13. 13 Data Mining Applications: Telecommunication  Call detail record analysis  Telecommunication companies accumulate detailed call records. By identifying customer segments with similar use patterns, the companies can develop attractive pricing and feature promotions.  Customer loyalty  Some customers repeatedly switch providers, or “churn”, to take advantage of attractive incentives by competing companies. The companies can use DM to identify the characteristics of customers who are likely to remain loyal once they switch, thus enabling the companies to target their spending on customers who will produce the most profit.
  • 14. 14 Data Mining Applications: Other Applications  Customer segmentation  All industries can take advantage of DM to discover discrete segments in their customer bases by considering additional variables beyond traditional analysis.  Manufacturing  Through choice boards, manufacturers are beginning to customize products for customers; therefore they must be able to predict which features should be bundled to meet customer demand.  Warranties  Manufacturers need to predict the number of customers who will submit warranty claims and the average cost of those claims.  Frequent flier incentives  Airlines can identify groups of customers that can be given incentives to fly more.
  • 15. 15 Data Mining in CRM: Customer Life Cycle  Customer Life Cycle  The stages in the relationship between a customer and a business  Key stages in the customer lifecycle  Prospects: people who are not yet customers but are in the target market  Responders: prospects who show an interest in a product or service  Active Customers: people who are currently using the product or service  Former Customers: may be “bad” customers who did not pay their bills or who incurred high costs  It’s important to know life cycle events (e.g. retirement)
  • 16. 16 Data Mining in CRM: Customer Life Cycle  What marketers want: Increasing customer revenue and customer profitability  Up-sell  Cross-sell  Keeping the customers for a longer period of time  Solution: Applying data mining
  • 17. 17 Data Mining in CRM  DM helps to  Determine the behavior surrounding a particular lifecycle event  Find other people in similar life stages and determine which customers are following similar behavior patterns
  • 18. 18 Data Mining in CRM (cont.) Data Warehouse Data Mining Campaign Management Customer Profile Customer Life Cycle Info.
  • 19. 19 Data Mining in CRM: More  Building Data Mining Applications for CRM by Alex Berson, Stephen Smith, Kurt Thearling (McGraw Hill, 2000).
  • 20. 20 Data Mining Techniques Data Mining Techniques Descriptive Predictive Clustering Association Classification Regression Sequential Analysis Decision Tree Rule Induction Neural Networks Nearest Neighbor Classification
  • 21. 21 Two Good Algorithm Books  Intelligent Data Analysis: An Introduction  by Berthold and Hand  The Elements of Statistical Learning: Data Mining, Inference, and Prediction  by Hastie, Tibshirani, and Friedman
  • 22. 22 Predictive Data Mining Tridas Vickie Mike Honest BarneyWaldoWally Crooked
  • 23. 23 Prediction Tridas Vickie Mike Honest = has round eyes and a smile
  • 24. 24 Decision Trees  Data height hair eyes class short blond blue A tall blond brown B tall red blue A short dark blue B tall dark blue B tall blond blue A tall dark brown B short blond brown B
  • 25. 25 Decision Trees (cont.) hair dark red blond short, blue = B tall, blue = B tall, brown= B {tall, blue = A} short, blue = A tall, brown = B tall, blue = A short, brown = B Completely classifies dark-haired and red-haired people Does not completely classify blonde-haired people. More work is required
  • 26. 26 Decision Trees (cont.) hair dark red blond short, blue = B tall, blue = B tall, brown= B {tall, blue = A} short, blue = A tall, brown = B tall, blue = A short, brown = B eye blue brown short = A tall = A tall = B short = B Decision tree is complete because 1. All 8 cases appear at nodes 2. At each node, all cases are in the same class (A or B)
  • 27. 27 Decision Trees: Learned Predictive Rules hair eyesB B A A dark red blond blue brown
  • 28. 28 Decision Trees: Another Example Total list 50% member 0-1 child 2-3 child 20% member 4+ children $50-75k income 15% member $75k+ income 70% member $50-75k income $20-50k income 85% member Age: 40-60 80% member Age: 20-40 45% member
  • 29. 29 Rule Induction  Try to find rules of the form IF <left-hand-side> THEN <right-hand-side>  This is the reverse of a rule-based agent, where the rules are given and the agent must act. Here the actions are given and we have to discover the rules!  Prevalence = probability that LHS and RHS occur together (sometimes called “support factor,” “leverage” or “lift”)  Predictability = probability of RHS given LHS (sometimes called “confidence” or “strength”)
  • 30. 30 Association Rules from Market Basket Analysis  <Dairy-Milk-Refrigerated> → <Soft Drinks Carbonated>  prevalence = 4.99%, predictability = 22.89%  <Dry Dinners - Pasta> → <Soup-Canned>  prevalence = 0.94%, predictability = 28.14%  <Dry Dinners - Pasta> → <Cereal - Ready to Eat>  prevalence = 1.36%, predictability = 41.02%  <Cheese Slices > → <Cereal - Ready to Eat>  prevalence = 1.16%, predictability = 38.01%
  • 31. 31 Use of Rule Associations  Coupons, discounts  Don’t give discounts on 2 items that are frequently bought together. Use the discount on 1 to “pull” the other  Product placement  Offer correlated products to the customer at the same time. Increases sales  Timing of cross-marketing  Send camcorder offer to VCR purchasers 2-3 months after VCR purchase  Discovery of patterns  People who bought X, Y and Z (but not any pair) bought W over half the time
  • 32. 32 Finding Rule Associations Algorithm  Example: grocery shopping  For each item, count # of occurrences (say out of 100,000) apples 1891, caviar 3, ice cream 1088, …  Drop the ones that are below a minimum support level apples 1891, ice cream 1088, pet food 2451, …  Make a table of each item against each other item:  Discard cells below support threshold. Now make a cube for triples, etc. Add 1 dimension for each product on LHS. apples ice cream pet food apples 1891 685 24 ice cream ----- 1088 322 pet food ----- ----- 2451
  • 33. 33 Clustering  The art of finding groups in data  Objective: gather items from a database into sets according to (unknown) common characteristics  Much more difficult than classification since the classes are not known in advance (no training)  Technique: unsupervised learning
  • 34. 34 The K-Means Clustering Method 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 K=2 Arbitrarily choose K objects as initial cluster center Assign each of the objects to most similar center Update the cluster means Update the cluster means reassignreassign
  • 35. Thanks Seyyed Jamaleddin Pishvayi Customer Relationship Management Instructor : Dr. Taghiyare Tehran University Spring 1383