Directed data mining: is a top down approach, used when we know what we are looking for. This often takes the form of predictive modelling, where we know exactly what we want to predict. Undirected data mining: is a bottom up approach that lets the data speak for itself. It finds patterns in the data and leaves it up to the user to determine whether or not these patterns are important.
Black box approach used when we know what we are looking for, when we can direct the data mining effort toward a particular goal. Typically, we are using already known examples, such as prospects who already received an offer ( and either did or did not respond), and we are applying information gleaned from them to unknown examples, such as prospects who have not yet been contacted.
The predictive models use experience to assign scores (and confidence levels) to some relevant outcome in future. One of the keys to success is having enough data with the outcome already known to train the model. predictive models are never 100% accurate. They are helpful because making informed decisions in business should lead to better results. Sometimes, though, predictive accuracy is not the only or even the primary goal. Undirected data mining is about discovering new patterns inside the data. These patterns provide insight, and this insight might even prove very informative.
The latest developments in document management systems include utilising web browsers to access vital information by employees and by customers.
Is the data mining effort necessary? For example, if the purpose of a marketing campaign is to get every single possible responder, then it would be a waste of money to build a response model , i.e. a model that predicts who will respond to a marketing campaign. Should we focus on a particular segment or subgroup? For example, if a marketing campaign wants to focus on people aged 20-30, then modeling this set separately from the set of all people would produce better results. What are the relevant business rules? For example, if we are marketing an R-rated movie then we would exclude people under 18 from the model. What do they (the business people) know about the data? Domain experts know where data resides and how it is stored. They may know whether some sources are invalid and where certain data should come from. What do their intuition and experience say is important? Can be a source of insight.
The right data is often whatever is available, reasonably clean, and accessible Data must meet the requirements for solving the business problem for example if the business problem is to identify particular customers, then the data must contain information about each individual customer Data must be as complete as possible When doing predictive modeling the data needs to be complete enough that we can determine the outcome of what we are modeling
When someone walks into a corner store and pays cash, or pays for the telephone call with a pre-paid calling card, or anonymously visits a website,the only thing known about the customer is what happened during one of the transactions. Using this information, we can profile the transactions. Account level: the account- the credit card number, the telephone number, and so on- is the most common way that business units look at customers. This view has a history of transactions over time, but it is limited to a narrow product range. Complete customer: this is a view of a customer across multiple product lines.
Customer profiling and segmentation: break down your customer base into recognisable and manageable groups. By profiling your customers and dividing them into segments, you can target those groups more specifically to their needs. Additionally, you can measure actual behaviour of those groups over time against expected results. Segmentation can be done using cluster analysis and should be done at least once a year or whenever significant changes are made in your company’s business model. You can gain knowledge of your customers with every promotion by measuring response by segment. Ask more specific questions about your customers. Who are my best customers? How can I get more business from my existing customers? Retention and attrition: it is inevitable that businesses lose customers. You can determine those customers with the highest propensity to slow or discontinue business by profiling those customers who have left in the past. A retention model examines the potential for a customer to be retained after some event. An attrition model considers the likelihood that an active customer will become inactive. These are proactive goals for companies to set. Risk avoidance: How can you avoid acquiring unprofitable customers? Data mining can predict, with some accuracy, which prospects will become customers. However, it is also particularly helpful to determine which will be profitable customers.it is also important to recognise that some new customers will default and force your company to write off losses.. Therefore, risk management and marketing group must work together when new business programs are put in place. Cross-sell: The strategy of using an existing customer base for one product as prospective customers for other products how can you measure and affect the lifetime value of your customers? Shopping patterns: with electronic storefronts, it is now possible to study your customer actual behaviour. This information can be used to personalise content for customers as well as to understand your company’s web site efficiency (where are people leaving the site? At what point are shopping carts abandoned?) shopping patterns include what customers looked at or considered purchasing.
Types of data: you need data to build any segmentation or predictive model Which data? The data you study should match your goals. For segmentation models, you’ll need to sample the customer base that is representative of your current business. However, predictive models require some prior history of behaviour. If we want to predict the likelihood of responding to particular offer in an effort to acquire new customers, then we need prior campaign to reference.
Most resources say the data preparation is 80% of building a model. Data problems lead to the decrease in value of any customer data warehouse, on top of impeding the value of any model. Data preparation: Different models require data in certain formats Some require all data to be continues and ordinal. Others require all categorical data fields or need binary constructs.
Many methods are used to construct good model such as statistical methods. Test! Test! Test! When you build a predictive model (e.g., who is most likely to respond to my campaign?), it should be based on a prior campaign in which you know who actually responded and who didn’t. Score after the model is built, the prospects are then scored. This means flagging the prospects with a value to determine what their predicted behaviour might be.
Data Mining and Customer Relationship Management “ It is now conventional wisdom among marketers, and IT personnel who support them, that investments in customer data have a demonstrable return” said Phillip Russom, director of data warehousing and Business Intelligence, Hurwits Group
Directed data mining: is a top down approach, used when we know what we are looking for.
Undirected data mining: is a bottom up approach that lets the data speak for itself.
Sometimes we do not care how the model works. It is a black box and we just want the best prediction possible . Sometimes we want to use the model to gain insight into the data. We need to understand how the model works; it is more like a semitransparent box. Data mining uses both black box models and semitransparent box.