An Oracle White Paper
September 2010
Predictive Analytics:
Bringing The Tools To The Data
Predictive Analytics: Bringing the Tools to the Data
Introduction
What Are Predictive Analytics -- The Traditional View
A Better Classification
Predicting the Present
Shaping The Future
A Better Approach
Words of Warning
Introduction
"It's hard to make predictions, especially when they are about the future" is a quote usually
attributed to American baseball legend Yogi Berra [1].
Not a good start when discussing predictive analytics.
It gets even more problematic when we realize what analytics exactly are. Analytics are the
methods of decomposing concepts or substances into smaller pieces, to understand their
workings. How can you analyze the future? There is no concept or substance to break down yet.
To go further, predictive analytics even sounds like an oxymoron. Predictive is rehearsing an
important conversation beforehand: "If they say this, I will respond with that...", while analytics is
evaluating the conversation afterwards; "Gosh, I should have said that..."
It is easy to come to conclusions like this, when you don't have a good understanding of what
predictive analytics actually mean. In this paper we discuss what predictive analytics can and
cannot do, the various categories of predictive analytics, and how to best apply them.
[1] The quote has also been attributed to the American author Mark Twain and the Danish physicist Niels Bohr.
What Are Predictive Analytics -- The Traditional View
"Predictive analytics encompasses a variety of techniques from statistics, data mining and game
theory that analyze current and historical facts to make predictions about future events" [2] is how
Wikipedia describes it. The variety of techniques is usually divided into three categories: predictive
models, descriptive models and decision models. Predictive models look for certain relationships and
patterns that usually lead to a certain behavior, point to fraud, predict system failures, assess
credit worthiness, and so forth. By determining the explanatory variables, you can predict
outcomes in the dependent variables. Descriptive models aim at creating segmentations, most
often used to classify customers based on, for instance, sociodemographic characteristics, life
cycle, profitability, product preferences and so forth. Where predictive models focus on a specific
event or behavior, descriptive models identify as many different relationships as possible. Lastly,
there are decision models that use optimization techniques to predict results of decisions. This
branch of predictive analytics leans particularly heavily on operations research, including areas
such as resource optimization, route planning and so forth. Table 1 provides an overview.
PREDICTIVE MODELS
• Find causality, relationships and patterns between explanatory variables and dependent variables
• Focus on specific variables
• Examples: next customer preference, fraud, credit worthiness, system failure

DESCRIPTIVE MODELS
• Find clusters of data elements with similar characteristics
• Focus on as many variables as possible
• Examples: customer segmentation based on sociodemographic characteristics, life cycle,
profitability, product preferences

DECISION MODELS
• Find optimal and most certain outcome for a specific decision
• Focus on a specific decision
• Examples: critical path, network planning, scheduling, resource optimization, simulation,
stochastic modeling

Table 1: Practical classification
This classification is very practical; it provides an immediate understanding of the areas where
predictive analytics add value. However, there are two problems with it:
• The classification is not exhaustive. If a new area of predictive analytics were to arise
tomorrow the current classification would become invalid
• The classification doesn't tell what the categories don't do. The limitations of each
category are not clear.
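To make the idea of explanatory and dependent variables concrete, here is a minimal sketch of a predictive model with a single explanatory variable, fitted by ordinary least squares. The data (customer tenure and monthly spend) and the function names are hypothetical and chosen only for illustration; real predictive models typically use many variables and more robust techniques.

```python
def fit_line(xs, ys):
    """Ordinary least squares for one explanatory variable (xs) and one dependent variable (ys)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(model, x):
    """Predict the dependent variable for a new value of the explanatory variable."""
    slope, intercept = model
    return slope * x + intercept


# Hypothetical history: months as a customer (explanatory) vs. monthly spend (dependent)
tenure = [1, 2, 3, 4, 5]
spend = [12.0, 14.0, 16.0, 18.0, 20.0]

model = fit_line(tenure, spend)
print(predict(model, 6))  # 22.0
```

The prediction only holds for customers who resemble those in the history, which is exactly the kind of limitation the practical classification does not spell out.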
[2] www.wikipedia.com, "Predictive Analytics", August 2010
A Better Classification
The reason why there is so much misunderstanding about predictive analytics is in the common
conception that predictions have to be about the future. A more conceptual classification solves
this misconception and addresses the issues with the current segmentation. This conceptual
classification distinguishes between two types of predictive analytics:
• Predicting the Present
• Shaping the Future
Trends develop like S-curves. They start slowly, take off, and become the new paradigm and
"best practice". Then, at some moment, something unexpected happens [3]: new regulations, a
better technology, a scandal within the market, whatever. The list of possible disruptions is
endless. Within the paradigm we can predict what is happening, as long as the assumptions don't
change. Once there is a disruption, a structural break, a new S-curve builds, and the process
repeats. See Figure 1.
Figure 1: S-curves and predictive analytics.
Predictive analytics that predict the present focus on analyzing what happens within a stable and
current situation, based on a set of assumptions that describe reality. Predictive analytics that help
shape the future focus on preparing for the next S-curve, and help formulate a new set of
assumptions.
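The S-curve idea can be sketched numerically. The example below is an illustration only: it assumes each S-curve is a logistic function and that a second curve starts building around a structural break. All parameter values and function names are hypothetical.

```python
import math


def s_curve(t, ceiling, rate, midpoint):
    """Logistic S-curve: slow start, take-off, then saturation at `ceiling`."""
    return ceiling / (1.0 + math.exp(-rate * (t - midpoint)))


def within_paradigm_forecast(t):
    """Forecast that assumes the current S-curve (the current assumptions) stays valid."""
    return s_curve(t, ceiling=100.0, rate=1.0, midpoint=5.0)


def actual(t):
    """Hypothetical reality: a second S-curve builds after a disruption around t = 10."""
    return within_paradigm_forecast(t) + s_curve(t, ceiling=150.0, rate=1.0, midpoint=15.0)


for t in (5, 10, 20):
    gap = actual(t) - within_paradigm_forecast(t)
    print(t, round(within_paradigm_forecast(t), 1), round(actual(t), 1), round(gap, 1))
```

Within the first curve the forecast and reality are nearly identical; after the break the gap grows large, however much data on the first curve was available.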
[3] This moment is often referred to as a "Black Swan": something we thought couldn't happen within the rules of
the game.
Predicting the Present
Predictive analytics that predict the present are based on existing data, preferably as much as
possible. Through data mining and all other applicable techniques, patterns are detected and rules
are derived. Within the bounds of its present S-curve, there is predictive value: not towards the
future, but towards similar occurrences. For instance, if Customer A buys a pair of trousers, there is
a good chance he or she is also interested in socks. Or if expense items with the name
of a fashion brand appear on a corporate credit card, the transaction is likely irregular. The analytics take
place as deep in the process and as real-time as possible. Table 2 provides an overview of typical
uses, based on the functionality of Oracle Data Mining, which is embedded in the Oracle
database.
FUNCTIONALITY / ALGORITHM / APPLICABILITY

Classification
• Algorithms: Logistic Regression (GLM), Decision Trees, Naïve Bayes, Support Vector Machine
• Applicability: response modeling, recommending "next likely product", employee retention,
credit default modeling

Regression
• Algorithms: Multiple Regression (GLM), Support Vector Machine
• Applicability: credit scoring, customer profitability modeling

Anomaly detection
• Algorithms: One Class SVM (Support Vector Machine)
• Applicability: claims fraud, network intrusion

Attribute importance
• Algorithms: Minimum Description Length (MDL)
• Applicability: surgery preparation, triage, net promoter score

Association rules
• Algorithms: Apriori
• Applicability: market basket analysis, link analysis

Clustering
• Algorithms: Hierarchical K-Means, Hierarchical O-Cluster
• Applicability: customer segmentation, gene and protein analysis

Feature extraction
• Algorithms: Non-Negative Matrix Factorization (NMF)
• Applicability: text analysis, search

Table 2: Functionality overview Oracle Data Mining
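The trousers-and-socks example is an association rule. As a minimal sketch, the two standard measures of such a rule, support and confidence, can be computed directly from a list of baskets. The baskets and the function name are hypothetical, and this is not the Apriori algorithm itself, which additionally searches efficiently for all rules above given thresholds.

```python
def rule_metrics(baskets, antecedent, consequent):
    """Support and confidence for the rule 'antecedent -> consequent'."""
    with_antecedent = [b for b in baskets if antecedent in b]
    with_both = [b for b in with_antecedent if consequent in b]
    # Support: share of all baskets that contain both items
    support = len(with_both) / len(baskets)
    # Confidence: share of antecedent baskets that also contain the consequent
    confidence = len(with_both) / len(with_antecedent) if with_antecedent else 0.0
    return support, confidence


# Hypothetical purchase history, one set of items per transaction
baskets = [
    {"trousers", "socks"},
    {"trousers", "socks", "belt"},
    {"trousers"},
    {"shirt"},
    {"socks"},
]

support, confidence = rule_metrics(baskets, "trousers", "socks")
print(f"support={support:.2f} confidence={confidence:.2f}")
```

In this toy data, two of the three baskets with trousers also contain socks, so the rule holds for similar occurrences; it says nothing about what customers will buy once the assortment or the market changes.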
Oracle Data Mining
Oracle Data Mining uses a multitude of algorithms that can automatically sift deep into your data
at the individual record level to discover patterns, relationships, factors, clusters, associations,
profiles, and predictions that were previously "hidden". Since Oracle Data Mining functions
reside natively in the Oracle Database kernel, they deliver high performance, high scalability and
security. Because the data and the data mining functions never leave the database, the result is a
comprehensive in-database processing solution. With Oracle Data Mining, special departments of
advanced data analysts working in silos far away from the database are no longer needed.
The true value of data mining is best realized when the new insights and predictions are directly
supplied to existing business applications. With Oracle Data Mining users can supply predictive
analytics to business applications, call centers, web sites, campaign management systems,
automated teller machines (ATMs), enterprise resource management (ERM), and other
operational and business planning applications. The database becomes more than just a data
repository: it becomes an analytical database that can underpin many new advanced use cases.
Overview of benefits:
• Mines data inside Oracle Database, an industry leader in performance and reliability
• Eliminates data extraction and movement
• Provides a platform for analytics-driven database applications
• Provides increased security by leveraging database security options
• Delivers lowest total cost of ownership (TCO) compared to traditional data mining vendors
• Leverages 30+ years of experience of ever advancing Oracle Database technology
Oracle Real-Time Decisions
Where Oracle Data Mining integrates in the database, Oracle Real-Time Decisions (Oracle RTD),
as a decision framework, nests itself in process engines. Oracle RTD, at its core, is a closed-loop
recommendation engine. It helps optimize customer interactions using analytical techniques such
as business rules, data mining, or statistics. Instead of deep analysis in an offline environment --
the database -- Oracle RTD immediately influences the real-time data stream by providing
recommendations on how to complete the transaction.
Oracle RTD particularly focuses on personalized customer interaction flows. By learning from
every single interaction and adjusting the business processes in real-time, Oracle RTD optimizes
the value of each opportunity in itself, as well as every subsequent opportunity. It starts with a list
of possible next actions, and then adds information about the customer, session history,
customer history, your business goals, and what has worked or not worked in the past. RTD uses
that information to come up with a recommendation for the best action to take next. Each
decision is made using a statistical model that predicts the outcome of the interaction based on
what is known about the customer and how similar customers have responded in the past.
Because RTD operates as a business process engine, its decision framework can leverage the
real-time context of the interaction in rules or predictive models. This makes it possible to account
for data such as the time of day, the agent you are interacting with or the context of your web
session to improve the recommendation logic. Context data is a key predictor.
Oracle RTD applications can be fully automated, requiring no manual effort to build and
maintain their predictive models. Oracle RTD automatically learns from each customer
interaction by autonomously updating its predictive models in real time.
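The closed-loop principle described above (recommend, observe the response, update the model) can be illustrated with a small sketch. This is not Oracle RTD's algorithm or API; the class, the context labels and the action names are hypothetical, and the "model" is simply a smoothed acceptance rate per context and action.

```python
from collections import defaultdict


class ClosedLoopRecommender:
    """Toy closed-loop engine: recommends the action with the best observed acceptance rate."""

    def __init__(self, actions):
        self.actions = list(actions)
        self.offers = defaultdict(int)   # (context, action) -> times offered
        self.accepts = defaultdict(int)  # (context, action) -> times accepted

    def acceptance_rate(self, context, action):
        # Laplace smoothing: an action never offered in this context starts at 0.5
        key = (context, action)
        return (self.accepts[key] + 1) / (self.offers[key] + 2)

    def recommend(self, context):
        # Purely greedy choice; ties go to the first action in the list
        return max(self.actions, key=lambda a: self.acceptance_rate(context, a))

    def learn(self, context, action, accepted):
        # Close the loop: every interaction immediately updates the model
        key = (context, action)
        self.offers[key] += 1
        if accepted:
            self.accepts[key] += 1


engine = ClosedLoopRecommender(["upgrade_offer", "loyalty_discount"])
engine.learn("evening_call", "upgrade_offer", accepted=False)
engine.learn("evening_call", "loyalty_discount", accepted=True)
print(engine.recommend("evening_call"))
```

The context is part of the key, so the same customer may receive a different recommendation in a different channel or at a different time of day. A production engine would also need to keep trying less-proven actions occasionally, which this greedy sketch deliberately leaves out.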
Shaping The Future
Predicting the present helps an organization improve its organizational excellence. In other
words, it increases efficiency and improves the effectiveness of operations. However,
organizations also have another set of processes: management processes. These are aimed at
creating and implementing new strategies, and monitoring progress. Shaping the future improves
an organization's management excellence [4].
One of the most dominant predictive tools at the strategic level is a strategy map, part of the
balanced scorecard. Strategy maps aim to be predictive, as they aspire to show how decisions
made in the present could impact future results. This is done through linking leading and lagging
indicators. A leading indicator predicts future performance; a lagging indicator reports past
performance. For instance, for a postal service, the percentage of mail delivered within 24 hours
is a leading indicator for customer satisfaction. Statistics are used to discover and test these
relationships. You can use statistical techniques only if you have enough data. Data, by definition,
describes results from the past. Given that all we can truly predict about the future is that most
likely it will be different from today, you can question the predictive value of correlations found
in data describing the past. Putting it in stronger words, you could even argue that validating a
strategy map based on correlating past data, by definition, invalidates it [5].
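As a sketch of how such a relationship is typically tested, the example below correlates a leading indicator with a lagging indicator shifted by one period. The series are hypothetical and constructed so that the lagging indicator follows the leading one with a one-period delay; the function names are assumptions for illustration.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long series."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def lagged_correlation(leading, lagging, lag):
    """Correlate leading[t] with lagging[t + lag] for a non-negative lag."""
    if lag < 0:
        raise ValueError("lag must be non-negative")
    if lag == 0:
        return pearson(leading, lagging)
    return pearson(leading[:-lag], lagging[lag:])


# Hypothetical monthly figures for a postal service
on_time_pct = [90, 92, 88, 95, 91, 97]              # mail delivered within 24 hours (%)
satisfaction = [70, 75, 76, 74, 77.5, 75.5]         # customer satisfaction score

same_period = lagged_correlation(on_time_pct, satisfaction, lag=0)
one_period_later = lagged_correlation(on_time_pct, satisfaction, lag=1)
print(round(same_period, 2), round(one_period_later, 2))
```

A strong lagged correlation supports the link in the strategy map for the period the data covers. As argued above, it does not show that the link will still hold after the strategy changes.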
A different approach than predicting the present is needed; an approach in which we create the
data for the future, in terms of new assumptions, what-if questions and scenarios. It is within
these assumptions that the risk and uncertainty that are part of any strategic planning can be
captured. To create good assumptions, organizations must complete three critical steps [6]:
• Make the assumptions realistic.
• Identify the assumptions that matter most.
• Leverage the drivers that can be controlled and monitor those that cannot.
Several technologies can be helpful in this process.
[4] See all of Oracle's management excellence white papers on www.oracle.com/thoughtleadership, in the resource
center
[5] Based on Buytendijk, F.A., "Dealing with Dilemmas", Wiley, 2010
[6] Driving Strategic Planning with Predictive Modeling, July 2008, www.oracle.com
OLAP
Hyperion Essbase is an OLAP database, in which data is stored in a multidimensional format.
This means the data is categorized in the form of dimensions, representing business structures.
Typical business dimensions are 'product', 'customer', 'region' and 'time'. To perform what-if
analysis and model possible future scenarios, a 'scenario' dimension is added. Scenario-building
consists of using the following critical functionalities:
• Create multiple hierarchies. By modeling alternative ways of segmenting customers,
grouping products, consolidating organizational entities or any other type of business
hierarchy, results can be tested before they are implemented as the new standard way of
looking at the business
• Use forecast calculations. Based on multiple assumptions, forecasts can be created.
Based on likelihood and realism the best forecast can be determined and implemented.
• Data entry. There is no data about the future available in business systems. Based on
new assumptions, business users may have to input the data themselves, preferably in a
collaborative manner. Multi-user read-write capabilities and version control are needed
to manage such a process.
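The role of the 'scenario' dimension can be sketched with a very small in-memory cube. This is an illustration of the concept only, not Essbase's storage model or API; the dimension members, values and function names are hypothetical.

```python
from collections import defaultdict

DIMENSIONS = ("scenario", "product", "region", "period")

# Each cell is addressed by one member per dimension (hypothetical volumes)
cube = {
    ("actual", "letters", "north", "2010-Q1"): 120.0,
    ("actual", "letters", "south", "2010-Q1"): 80.0,
    ("actual", "parcels", "north", "2010-Q1"): 200.0,
}


def add_scenario(cube, source, target, adjust):
    """Copy all cells of one scenario into a new scenario, applying a what-if adjustment."""
    new_cells = {}
    for key, value in cube.items():
        if key[0] == source:
            new_cells[(target,) + key[1:]] = adjust(key, value)
    cube.update(new_cells)


def roll_up(cube, by):
    """Aggregate cell values along the named dimensions."""
    positions = [DIMENSIONS.index(name) for name in by]
    totals = defaultdict(float)
    for key, value in cube.items():
        totals[tuple(key[i] for i in positions)] += value
    return dict(totals)


def parcels_up_ten_percent(key, value):
    # Assumption under test: parcel volume grows 10%, letters stay flat
    return value * 1.10 if key[1] == "parcels" else value


add_scenario(cube, "actual", "what_if_parcels_up", parcels_up_ten_percent)
totals = roll_up(cube, by=("scenario",))
print(totals)
```

Because the what-if figures live in their own scenario member, they can be compared with the actuals side by side without overwriting them.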
Hyperion Strategic Planning
Financial modeling is a special activity. It requires specific functionality. At the same time the
structure in which the modeling takes place is highly standardized: a balance sheet, a profit and
loss statement and a cash flow overview. Oracle Hyperion Strategic Finance integrates strategic
planning into an enterprise planning process. It allows users to quickly develop financial models,
perform on-the-fly what-if impact analysis based on dynamic decision variables and arrive at
targets that can then be implemented. The what-if analysis toolkit is a unique and powerful set of
out-of-the-box tools that allow you to easily create an unlimited number of scenarios by business
unit. Other capabilities let you evaluate any metric’s sensitivity to key performance drivers and
run periodic “goal-seek” checks to determine the performance level needed to achieve specific
financial objectives.
Hyperion Strategic Planning allows organizations to spend more time simulating long-term
alternative strategies, developing contingent scenarios, and stress testing financial models.
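A "goal-seek" check can be illustrated with a deliberately simple financial model. The sketch below is not the product's implementation: the model, its parameter values and the bisection search are assumptions chosen to show the idea of solving for the driver value that achieves a financial target.

```python
def net_income(revenue_growth, base_revenue=1000.0, cost_ratio=0.7, fixed_costs=200.0):
    """Hypothetical model: variable costs are a fixed share of revenue."""
    revenue = base_revenue * (1.0 + revenue_growth)
    return revenue * (1.0 - cost_ratio) - fixed_costs


def goal_seek(model, target, low, high, tolerance=1e-6, max_iter=200):
    """Find the driver value for which model(driver) equals target, by bisection.

    Assumes the model is continuous and increasing between low and high.
    """
    if not (model(low) <= target <= model(high)):
        raise ValueError("target is not reachable between low and high")
    for _ in range(max_iter):
        mid = (low + high) / 2.0
        if model(mid) < target:
            low = mid
        else:
            high = mid
        if high - low < tolerance:
            break
    return (low + high) / 2.0


# Which revenue growth is needed to reach a net income of 130?
required_growth = goal_seek(net_income, target=130.0, low=0.0, high=1.0)
print(f"required revenue growth: {required_growth:.1%}")
```

Re-running the same search with a different cost ratio or fixed-cost level shows how sensitive the required growth is to those assumptions.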
Crystal Ball
Shaping the future is done outside operational business systems. It requires a modeling
framework that can handle both structure and creativity. Crystal Ball uses a spreadsheet interface
– the most widely used business analyst tool – to create the data for the future, in terms of new
assumptions, what-if questions and scenarios. Either standalone in a spreadsheet, or integrated
with Essbase and Strategic Planning, Crystal Ball facilitates the creation of good assumptions.
Going through the three critical steps that are needed to shape the future, Crystal Ball can:
• Make the assumptions realistic. Regardless of how much data you have (even none at
all), apply any or all of the available techniques to improve the model assumptions: time-
series forecasting, regression analysis, distribution fitting and simulation methods.
• Identify the assumptions that matter most. Determine the sensitivity of the forecast
to each assumption, so you know where to focus your resources. Sensitivity charts and
tornado charts rank the assumptions in the model from most to least important and
help visualize that ranking.
• Leverage the drivers that can be controlled and monitor those that cannot.
Because of the uncertainty inherent in models that shape the future, changing a driver (a
decision variable) can have a significant effect on the forecast results. For one or two
drivers, use the Decision Table tool, which quickly runs multiple simulations to test the
effects of changing the drivers. For models that contain more than a handful of decision
variables, or where you are trying to optimize the forecast results, optimization is used
to automatically find optimal solutions to simulation models.
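The three steps above can be sketched in a few lines. The example is an illustration under stated assumptions, not Crystal Ball's API: the profit model, the triangular distributions (low, most likely, high) and all names are hypothetical. It runs a Monte Carlo simulation over the assumptions and ranks them by the swing they cause in the forecast, which is the information a tornado chart displays.

```python
import random


def profit(units, price, unit_cost):
    """Hypothetical forecast model."""
    return units * (price - unit_cost)


# Each assumption is described as (low, most likely, high)
ASSUMPTIONS = {
    "units": (800.0, 1000.0, 1200.0),
    "price": (9.0, 10.0, 11.0),
    "unit_cost": (5.5, 6.0, 7.0),
}


def simulate(trials=10000, seed=42):
    """Monte Carlo simulation: draw every assumption from a triangular distribution."""
    rng = random.Random(seed)
    results = []
    for _ in range(trials):
        draw = {
            name: rng.triangular(low, high, mode)
            for name, (low, mode, high) in ASSUMPTIONS.items()
        }
        results.append(profit(**draw))
    return results


def tornado():
    """Swing in the forecast when one assumption moves from low to high, others at most likely."""
    base = {name: mode for name, (_, mode, _) in ASSUMPTIONS.items()}
    swings = {}
    for name, (low, _, high) in ASSUMPTIONS.items():
        at_low = profit(**{**base, name: low})
        at_high = profit(**{**base, name: high})
        swings[name] = abs(at_high - at_low)
    return sorted(swings.items(), key=lambda item: item[1], reverse=True)


results = simulate()
mean_profit = sum(results) / len(results)
print(f"mean simulated profit: {mean_profit:.0f}")
for name, swing in tornado():
    print(f"{name}: swing {swing:.0f}")
```

In this toy model, price causes the largest swing, so that is the assumption to refine first. The one-at-a-time ranking ignores interactions between assumptions, which a full simulation-based sensitivity analysis would capture.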
A Better Approach
Traditionally, predictive analytics require a laborious process. Figure 2 shows a typical flow of
activities.
Figure 2: Traditional predictive analytics process
Based on a hypothesis -- an assumption to test -- the data that are needed are identified and
extracted from their sources, such as an ERP or CRM system or the data
warehouse. Different predictive analytics tools have different requirements on how to best
process the data, so the data usually require some transformation to fit the specifics of the tool.
Only then can the analysis take place successfully. After the analysis and removal of all the
"noise" in the data, conclusions are drawn that lead to changes in, for instance, customer
segmentation or product clustering, and the success of the analysis is monitored. After that, the
cycle starts all over.
The key assumption of this approach is that it is best to take the data to the tools, which are
handled by the experts. This assumption is outdated. It is a costly approach, takes too much time,
robs the process of its creativity and doesn't allow the number of experiments to scale.
Oracle's strategy is different. Oracle feels it is better to bring the tools to the data. Oracle Data
Mining resides in the Oracle database itself, and is used by many business applications. Real-Time
Decisions nests itself into the process engine of business applications, generating real-time alerts,
particularly in customer contact interactions. Hyperion Strategic Finance is an add-on to
Hyperion Financial Management and Hyperion Planning, for long-term financial modeling, and
Crystal Ball integrates with Essbase and Hyperion Planning. Figure 3 shows how the tools are
brought to the data.
There are multiple advantages to bringing the tools to the data:
• It is less costly and increases speed. There is no need to extract and transform data and
move it to another environment, which is often the most laborious part of the process.
• Moreover, the increased speed makes the process more iterative, and
thereby increases the quality of the output. It becomes possible to run many more
analyses and scenarios in the same limited amount of time, before deciding what the
best outcome is.
• Predictive analytics become part of a standard business process, instead of a specialist
activity, and have a more direct business impact.
• It adds creativity to the process. If attributes need to be added while experimenting with
multiple analyses, the addition can be done without going back to the source data. The
hypothesis can be narrowed and widened without significant consequences.
In addition, Oracle Real-Time Decisions, Crystal Ball, Hyperion Essbase and Strategic Finance
integrate with non-Oracle databases and non-Oracle business applications as well.
Figure 3: Bringing the tools to the data
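The difference between taking the data to the tools and bringing the tools to the data can be sketched with any SQL database. The example below uses SQLite from the Python standard library purely as a stand-in; it does not use Oracle Data Mining, and the table, threshold and function names are hypothetical. Both functions produce the same customer segmentation, but only the first leaves the work, and the data, inside the database.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (customer TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [("A", 120.0), ("A", 80.0), ("B", 40.0), ("B", 10.0), ("C", 300.0)],
)


def segment_in_database(conn, threshold):
    """Tools to the data: the database aggregates and classifies; only results leave it."""
    query = """
        SELECT customer,
               SUM(amount) AS total,
               CASE WHEN SUM(amount) >= ? THEN 'high' ELSE 'low' END AS segment
        FROM orders
        GROUP BY customer
        ORDER BY customer
    """
    return conn.execute(query, (threshold,)).fetchall()


def segment_after_extract(conn, threshold):
    """Data to the tools: every row is extracted first, then analyzed in the client."""
    totals = {}
    for customer, amount in conn.execute("SELECT customer, amount FROM orders"):
        totals[customer] = totals.get(customer, 0.0) + amount
    return [
        (customer, total, "high" if total >= threshold else "low")
        for customer, total in sorted(totals.items())
    ]


print(segment_in_database(conn, threshold=100.0))
```

With five rows the two approaches are indistinguishable. With millions of rows, the second approach moves all of them across the network before any analysis starts, and every change to the hypothesis repeats that extraction.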
Words of Warning
To truly understand the possibilities and limitations of such advanced technologies on which
predictive analytics are based, ironically we need to go back to the old philosophers. Plato's Cave
describes an interesting thought experiment. Imagine a few people in a cave, sitting against a
small wall, facing the other end. They are locked in chains and have been for all their lives. On
top of the wall other people walk around, but they cannot be seen by the chained people. A large
fire in the back of the cave casts the shadows of those people walking around on the wall at the
other end, and the echoes of their voices make the sound seem to come from the other end too. For the
people in chains these shadows are the reality of the world; they do not have any knowledge
about the people walking around on top of the little wall. The shadows are all they know.
Analysts can easily fall into the same trap, unconsciously mistaking their model and analysis for
the real world. Reality is interpreted "to fit the model", and signals that change is coming are called
"outliers". Force-fitting reality into our frame of reference is deeply human, and we are all prone
to it. What helps counter this pitfall is to experiment [7]. Continuously try the outcomes of
predictive analytics in your operational environment. Even run multiple experiments at the same
time. See which ones lead to better results, and implement. Using simulations and other
techniques, predictive analytics should be applied continuously and incrementally.
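One common way to judge which of two concurrent experiments "leads to better results" is a two-proportion z-test on their response rates. The sketch below is one possible approach, not a method prescribed by this paper; the conversion figures and the function name are hypothetical.

```python
import math


def two_proportion_z(successes_a, trials_a, successes_b, trials_b):
    """z statistic and two-sided p-value for the difference between two response rates."""
    rate_a = successes_a / trials_a
    rate_b = successes_b / trials_b
    pooled = (successes_a + successes_b) / (trials_a + trials_b)
    std_error = math.sqrt(pooled * (1 - pooled) * (1 / trials_a + 1 / trials_b))
    z = (rate_b - rate_a) / std_error
    # Two-sided p-value under the standard normal distribution
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Hypothetical experiment: current offer (A) versus model-driven offer (B)
z, p_value = two_proportion_z(successes_a=100, trials_a=1000, successes_b=130, trials_b=1000)
print(f"z={z:.2f} p={p_value:.3f}")
```

A small p-value suggests the difference is unlikely to be chance alone, but it only describes the current operating conditions. That is one more reason to keep experimenting continuously rather than to test once and stop.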
Another word of warning comes from "Occam's Razor", based on the work of the 14th century
Franciscan friar William of Ockham. Occam's razor is a research principle. In forming a theory,
you should use the fewest elements needed to accurately predict an outcome. The more
assumptions you use, and the more you stack analysis on top of analysis, the more likely you are
to be inaccurate or wrong. To create a robust theory, you need to shave away all unnecessary
elements from the analysis (hence "razor"). Seen this way, two of the least reliable indicators of
business success are profit and customer satisfaction. They are based on many assumptions.
Better indicators are, for instance, cash flow and on-time delivery, which are much closer to direct
observation and measurement.
Although predictive analytics are based on sophisticated technologies and mathematical
techniques, their success lies in applying them with common sense.
[7] "Analytical competitors", as they are called in Davenport's book "Competing on Analytics", take this approach
to analytics.
Predictive Analytics: Bringing The Tools To The
Data
September, 2010
Authors: Frank Buytendijk, Lucie Trepanier
Oracle Corporation
World Headquarters
500 Oracle Parkway
Redwood Shores, CA 94065
U.S.A.
Worldwide Inquiries:
Phone: +1.650.506.7000
Fax: +1.650.506.7200
oracle.com
Copyright © 2010, Oracle and/or its affiliates. All rights reserved. Published in the U.S.A. This document is provided for information
purposes only, and the contents hereof are subject to change without notice. This document is not warranted to be error-free, nor
subject to any other warranties or conditions, whether expressed orally or implied in law, including implied warranties and conditions
of merchantability or fitness for a particular purpose. We specifically disclaim any liability with respect to this document and no
contractual obligations are formed either directly or indirectly by this document. This document may not be reproduced or transmitted
in any form or by any means, electronic or mechanical, for any purpose, without our prior written permission.
Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners.
10036922

Barga Galvanize Sept 2015
Roger Barga
 
Data mining
Data miningData mining
Data mining
pradeepa n
 
Introduction to Machine Learning and Data Science using the Autonomous databa...
Introduction to Machine Learning and Data Science using the Autonomous databa...Introduction to Machine Learning and Data Science using the Autonomous databa...
Introduction to Machine Learning and Data Science using the Autonomous databa...
Sandesh Rao
 
Chapter 1.pdf
Chapter 1.pdfChapter 1.pdf
Chapter 1.pdf
DrGnaneswariG
 

Similar to Risk mgmt-analysis-wp-326822 (20)

Barga, roger. predictive analytics with microsoft azure machine learning
Barga, roger. predictive analytics with microsoft azure machine learningBarga, roger. predictive analytics with microsoft azure machine learning
Barga, roger. predictive analytics with microsoft azure machine learning
 
Big data overview
Big data overviewBig data overview
Big data overview
 
Keys to extract value from the data analytics life cycle
Keys to extract value from the data analytics life cycleKeys to extract value from the data analytics life cycle
Keys to extract value from the data analytics life cycle
 
Al-Mqbali, Leila, Big Data - Research Project
Al-Mqbali, Leila, Big Data - Research ProjectAl-Mqbali, Leila, Big Data - Research Project
Al-Mqbali, Leila, Big Data - Research Project
 
Practical Machine Learning
Practical Machine LearningPractical Machine Learning
Practical Machine Learning
 
Semantic Web Based Sentiment Engine
Semantic Web Based Sentiment EngineSemantic Web Based Sentiment Engine
Semantic Web Based Sentiment Engine
 
BUILDING A GENERAL CONCEPT OF ANALYTICAL SERVICES FOR ANALYSIS OF STRUCTURED ...
BUILDING A GENERAL CONCEPT OF ANALYTICAL SERVICES FOR ANALYSIS OF STRUCTURED ...BUILDING A GENERAL CONCEPT OF ANALYTICAL SERVICES FOR ANALYSIS OF STRUCTURED ...
BUILDING A GENERAL CONCEPT OF ANALYTICAL SERVICES FOR ANALYSIS OF STRUCTURED ...
 
Data Mining
Data MiningData Mining
Data Mining
 
Introduction to feature subset selection method
Introduction to feature subset selection methodIntroduction to feature subset selection method
Introduction to feature subset selection method
 
Data Mining Presentation.pptx
Data Mining Presentation.pptxData Mining Presentation.pptx
Data Mining Presentation.pptx
 
Lesson1.2.pptx.pdf
Lesson1.2.pptx.pdfLesson1.2.pptx.pdf
Lesson1.2.pptx.pdf
 
Prognosis - An Approach to Predictive Analytics- Impetus White Paper
Prognosis - An Approach to Predictive Analytics- Impetus White PaperPrognosis - An Approach to Predictive Analytics- Impetus White Paper
Prognosis - An Approach to Predictive Analytics- Impetus White Paper
 
what is ..how to process types and methods involved in data analysis
what is ..how to process types and methods involved in data analysiswhat is ..how to process types and methods involved in data analysis
what is ..how to process types and methods involved in data analysis
 
Applied_Data_Science_Presented_by_Yhat
Applied_Data_Science_Presented_by_YhatApplied_Data_Science_Presented_by_Yhat
Applied_Data_Science_Presented_by_Yhat
 
Data mining
Data miningData mining
Data mining
 
Data mining
Data miningData mining
Data mining
 
Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Barga Galvanize Sept 2015
Barga Galvanize Sept 2015
 
Data mining
Data miningData mining
Data mining
 
Introduction to Machine Learning and Data Science using the Autonomous databa...
Introduction to Machine Learning and Data Science using the Autonomous databa...Introduction to Machine Learning and Data Science using the Autonomous databa...
Introduction to Machine Learning and Data Science using the Autonomous databa...
 
Chapter 1.pdf
Chapter 1.pdfChapter 1.pdf
Chapter 1.pdf
 

More from Shubhashish Biswas

Wphab2bintgxs101707 090314225022-phpapp01
Wphab2bintgxs101707 090314225022-phpapp01Wphab2bintgxs101707 090314225022-phpapp01
Wphab2bintgxs101707 090314225022-phpapp01Shubhashish Biswas
 
Predictive analytics-white-paper
Predictive analytics-white-paperPredictive analytics-white-paper
Predictive analytics-white-paperShubhashish Biswas
 
Ebookblogv2 120116015321-phpapp01
Ebookblogv2 120116015321-phpapp01Ebookblogv2 120116015321-phpapp01
Ebookblogv2 120116015321-phpapp01Shubhashish Biswas
 
Customer analytics for dummies
Customer analytics for dummiesCustomer analytics for dummies
Customer analytics for dummiesShubhashish Biswas
 

More from Shubhashish Biswas (11)

Wphab2bintgxs101707 090314225022-phpapp01
Wphab2bintgxs101707 090314225022-phpapp01Wphab2bintgxs101707 090314225022-phpapp01
Wphab2bintgxs101707 090314225022-phpapp01
 
W acto07
W acto07W acto07
W acto07
 
Predictivemodelingwhitepaper
PredictivemodelingwhitepaperPredictivemodelingwhitepaper
Predictivemodelingwhitepaper
 
Predictive analytics-white-paper
Predictive analytics-white-paperPredictive analytics-white-paper
Predictive analytics-white-paper
 
Hurwitz sas victory_report
Hurwitz sas victory_reportHurwitz sas victory_report
Hurwitz sas victory_report
 
Ebookblogv2 120116015321-phpapp01
Ebookblogv2 120116015321-phpapp01Ebookblogv2 120116015321-phpapp01
Ebookblogv2 120116015321-phpapp01
 
Dm
DmDm
Dm
 
Customer analytics for dummies
Customer analytics for dummiesCustomer analytics for dummies
Customer analytics for dummies
 
Big data-analytics-ebook
Big data-analytics-ebookBig data-analytics-ebook
Big data-analytics-ebook
 
1 kwyfvb
1 kwyfvb1 kwyfvb
1 kwyfvb
 
V imp analytics appli ed
V imp analytics appli edV imp analytics appli ed
V imp analytics appli ed
 

Recently uploaded

Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
DianaGray10
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
Product School
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Tobias Schneck
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
g2nightmarescribd
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
Elena Simperl
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Thierry Lestable
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
RTTS
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 

Recently uploaded (20)

Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 

Risk mgmt-analysis-wp-326822

What Are Predictive Analytics -- The Traditional View
"Predictive analytics encompasses a variety of techniques from statistics, data mining and game theory that analyze current and historical facts to make predictions about future events"2, is how Wikipedia describes it. The variety of techniques is usually divided into three categories: predictive models, descriptive models and decision models.
Predictive models look for certain relationships and patterns that usually lead to a certain behavior, point to fraud, predict system failures, assess credit worthiness, and so forth. By determining the explanatory variables, you can predict outcomes in the dependent variables.
Descriptive models aim at creating segmentations, most often used to classify customers based on, for instance, sociodemographic characteristics, life cycle, profitability, product preferences and so forth. Where predictive models focus on a specific event or behavior, descriptive models identify as many different relationships as possible.
Lastly, there are decision models that use optimization techniques to predict results of decisions. This branch of predictive analytics leans particularly heavily on operations research, including areas such as resource optimization, route planning and so forth. Table 1 provides an overview.
PREDICTIVE MODELS
• Find causality, relationships and patterns between explanatory variables and dependent variables
• Focus on specific variables
• Examples: next customer preference, fraud, credit worthiness, system failure
DESCRIPTIVE MODELS
• Find clusters of data elements with similar characteristics
• Focus on as many variables as possible
• Examples: customer segmentation based on sociodemographic characteristics, life cycle, profitability, product preferences
DECISION MODELS
• Find optimal and most certain outcome for a specific decision
• Focus on a specific decision
• Examples: critical path, network planning, scheduling, resource optimization, simulation, stochastic modeling
Table 1: Practical classification
This classification is very practical; it provides an immediate understanding of the areas where predictive analytics add value. However, there are two problems with it:
• The classification is not exhaustive. If a new area of predictive analytics were to arise tomorrow the current classification would become invalid
• The classification doesn't tell what the categories don't do. The limitations of each category are not clear.
2 www.wikipedia.com, "Predictive Analytics", August 2010
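To make the first column of Table 1 concrete, the sketch below fits the simplest possible predictive model: one explanatory variable and one dependent variable, related by a straight line. The data (months as a customer versus monthly spend) and the function names are hypothetical and chosen only for illustration; the paper does not prescribe any particular technique here.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def predict(model, x):
    """Apply a fitted (intercept, slope) pair to a new observation."""
    intercept, slope = model
    return intercept + slope * x


# Hypothetical history: explanatory variable (months as a customer)
# and dependent variable (monthly spend).
months = [1, 2, 3, 4, 5]
spend = [12.0, 14.0, 16.0, 18.0, 20.0]

model = fit_line(months, spend)
print(predict(model, 6))
```

The fitted line only describes the relationship seen in the historical data; it says nothing about whether that relationship will continue to hold.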
A Better Classification
The reason why there is so much misunderstanding about predictive analytics lies in the common conception that predictions have to be about the future. A more conceptual classification solves this misconception and addresses the issues with the current segmentation. This conceptual classification distinguishes between two types of predictive analytics:
• Predicting the Present
• Shaping the Future
Trends develop like S-curves. They start slowly, take off, and become the new paradigm and "best practice". Then, at one moment, something unexpected happens3. New regulations, a better technology, a scandal within the market, whatever. The list of possible disruptions is endless. Within the paradigm we can predict what is happening, as long as the assumptions don't change. Once there is a disruption, a structural break, a new S-curve builds, and the process repeats. See figure 1.
Figure 1: S-curves and predictive analytics.
Predictive analytics that predict the present focus on analyzing what happens within a stable and current situation, based on a set of assumptions that describe reality. Predictive analytics that help shape the future focus on preparing for the next S-curve, and help formulate a new set of assumptions.
3 This moment is often referred to as a "Black Swan". Something we thought couldn't happen within the rules of the game.
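The S-curve shape described above (slow start, take-off, saturation) is often modeled with a logistic function. The paper does not name a functional form, so the logistic curve and the parameter names below are assumptions made purely for illustration.

```python
import math


def s_curve(t, ceiling=1.0, steepness=1.0, midpoint=0.0):
    """Logistic curve: slow start, rapid take-off, then saturation at `ceiling`."""
    return ceiling / (1.0 + math.exp(-steepness * (t - midpoint)))


# Adoption level early on, at the take-off point, and once the trend
# has become the established paradigm.
for t in (-6, 0, 6):
    print(t, round(s_curve(t), 3))
```

Within one curve the next value is easy to extrapolate from the previous ones. A structural break corresponds to a new curve with a different midpoint and ceiling, which no amount of data from the old curve can reveal.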
Predicting the Present
Predictive analytics that predict the present are based on existing data, preferably as much as possible. Through data mining and all other applicable techniques, patterns are detected and rules are derived. Within the bounds of its present S-curve, there is predictive value. Not towards the future, but towards similar occurrences. For instance, if Customer A buys a pair of trousers, there is a large chance he or she is also interested in socks. Or if there are expense items with the name of a fashion brand on a corporate credit card, it is an irregular transaction. The analytics take place as deep in the process and as real-time as possible.
Table 2 provides an overview of typical uses, based on the functionality of Oracle Data Mining, which is embedded in the Oracle database.
FUNCTIONALITY | ALGORITHM | APPLICABILITY
Classification | Logistic Regression (GLM); Decision Trees; Naïve Bayes; Support Vector Machine | Response modeling; recommending "next likely product"; employee retention; credit default modelling
Regression | Multiple Regression (GLM); Support Vector Machine | Credit scoring; customer profitability modeling
Anomaly detection | One Class SVM (Support Vector Machine) | Claims fraud; network intrusion
Attribute importance | Minimum Description Length (MDL) | Surgery preparation, triage; net promoter score
Association rules | Apriori | Market basket analysis; link analysis
Clustering | Hierarchical K-Means; Hierarchical O-Cluster | Customer segmentation; gene and protein analysis
Feature extraction | Non-Negative Matrix Factorization (NMF) | Text analysis, search
Table 2: Functionality overview Oracle Data Mining
Oracle Data Mining
Oracle Data Mining uses a multitude of algorithms that can automatically sift deep into your data at the individual record level to discover patterns, relationships, factors, clusters, associations, profiles, and predictions that were previously “hidden”.
Since Oracle Data Mining functions reside natively in the Oracle Database kernel, they deliver high performance, high scalability and security. The data and data mining functions never leave the database to deliver a comprehensive in-database processing solution. With Oracle Data Mining, special departments of advanced data analysts working in silos far away from the database are no longer needed.
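The trousers-and-socks example above is an association rule of the kind listed under market basket analysis in Table 2. The sketch below computes the two standard measures for such a rule, support and confidence, over a handful of made-up baskets. It is plain Python for illustration only, not the Oracle Data Mining interface, and it evaluates a single given rule rather than running the full Apriori search for frequent itemsets.

```python
def support(transactions, items):
    """Fraction of transactions that contain every item in `items`."""
    items = set(items)
    hits = sum(1 for basket in transactions if items <= set(basket))
    return hits / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Estimated P(consequent | antecedent) from the transaction history."""
    base = support(transactions, antecedent)
    if base == 0:
        return 0.0
    both = support(transactions, set(antecedent) | set(consequent))
    return both / base


# Hypothetical purchase history.
baskets = [
    {"trousers", "socks"},
    {"trousers", "socks", "belt"},
    {"trousers"},
    {"socks"},
    {"shirt"},
]

# How often does a trousers purchase come with socks?
print(confidence(baskets, {"trousers"}, {"socks"}))
```

In this toy data three baskets contain trousers and two of those also contain socks, so the rule "trousers implies socks" holds in two out of three cases.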
The true value of data mining is best realized when the new insights and predictions are directly supplied to existing business applications. With Oracle Data Mining users can supply predictive analytics to business applications, call centers, web sites, campaign management systems, automatic teller machines (ATMs), enterprise resource management (ERM), and other operational and business planning applications. The database becomes more than just a data repository; it becomes an analytical database that can undergird many new advanced use cases.
Overview of benefits:
• Mines data inside Oracle Database, an industry leader in performance and reliability
• Eliminates data extraction and movement
• Provides a platform for analytics-driven database applications
• Provides increased security by leveraging database security options
• Delivers lowest total cost of ownership (TCO) compared to traditional data mining vendors
• Leverages 30+ years of experience of ever advancing Oracle Database technology
Oracle Real-Time Decisions
Where Oracle Data Mining integrates in the database, as a decision framework Oracle Real-Time Decisions (Oracle RTD) nests itself in process engines. Oracle RTD, at its core, is a closed-loop recommendation engine. It helps optimize customer interactions using analytical techniques such as business rules, data mining, or statistics. Instead of deep analysis in an offline environment -- the database -- Oracle RTD immediately influences the real-time data stream by providing recommendations on how to complete the transaction.
Oracle RTD particularly focuses on personalized customer interaction flows. By learning from every single interaction and adjusting the business processes in real-time, Oracle RTD optimizes the value of each opportunity in itself, as well as every subsequent opportunity.
It starts with a list of possible next actions, and then adds information about the customer, session history, customer history, your business goals, and what has worked or not worked in the past. RTD uses that information to come up with a recommendation for the best action to take next. Each decision is made using a statistical model that predicts the outcome of the interaction based on what is known about the customer and how similar customers have responded in the past.
Because RTD operates as a business process engine, its decision framework can leverage the real-time context of the interaction in rules or predictive models. This makes it possible to account for data such as the time of the day, the agent you are interacting with or the context of your web session to improve the recommendation logic. Context data is a key predictor.
Oracle RTD applications can be fully automated, requiring no manual effort for building and maintaining its predictive models. Oracle RTD automatically learns from each customer interaction by autonomously updating its predictive models in real time.
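The closed loop described above (recommend an action, observe the outcome, update the model) can be sketched in a few lines. This is a deliberately simplified illustration and not how Oracle RTD is implemented: the class name, the action names, and the use of a smoothed acceptance rate per context are all assumptions. A production system would also explore less-tried actions rather than always picking the current best.

```python
from collections import defaultdict


class ClosedLoopRecommender:
    """Toy recommender that updates itself after every interaction."""

    def __init__(self, actions):
        self.actions = list(actions)
        # (context, action) -> [times accepted, times offered]
        self.stats = defaultdict(lambda: [0, 0])

    def score(self, context, action):
        accepted, offered = self.stats[(context, action)]
        # Laplace smoothing so untried actions start at 0.5 instead of 0/0.
        return (accepted + 1) / (offered + 2)

    def recommend(self, context):
        # Greedy choice; ties go to the action listed first.
        return max(self.actions, key=lambda a: self.score(context, a))

    def learn(self, context, action, accepted):
        record = self.stats[(context, action)]
        record[0] += 1 if accepted else 0
        record[1] += 1


# Hypothetical actions and contexts.
rtd = ClosedLoopRecommender(["upgrade_offer", "loyalty_discount"])
first = rtd.recommend("evening")
rtd.learn("evening", first, accepted=False)  # feedback closes the loop
print(first, "->", rtd.recommend("evening"))
```

Keying the statistics on context reflects the point made above that context data is a key predictor: a rejection in the evening changes the evening recommendation and leaves the morning one untouched.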
Shaping The Future
Predicting the present helps an organization improve its organizational excellence. In other words, it increases efficiency and improves the effectiveness of operations. However, organizations also have another set of processes: management processes. These are aimed at creating and implementing new strategies, and monitoring progress. Shaping the future improves an organization's management excellence4.
One of the most dominant predictive tools on the strategic levels is a strategy map, part of the balanced scorecard. Strategy maps aim to be predictive, as they aspire to show how decisions made in the present could impact future results. This is done through linking leading and lagging indicators. A leading indicator predicts future performance; a lagging indicator reports past performance. For instance, for a postal service, the percentage of mail delivered within 24 hours is a leading indicator for customer satisfaction. Statistics are used to discover and test these relationships.
You can use statistical techniques only if you have enough data. Data, by definition, describes results from the past. Given that all we can truly predict about the future is that most likely it will be different from today, you can question the predictive value of correlations found in data describing the past. Putting it in stronger words, you could even argue that validating a strategy map based on correlating past data, by definition, invalidates it5.
A different approach than predicting the present is needed; an approach in which we create the data for the future, in terms of new assumptions, what-if questions and scenarios. It is within these assumptions that the risk and uncertainty that are part of any strategic planning can be captured. To create good assumptions, organizations must complete three critical steps6:
• Make the assumptions realistic.
• Identify the assumptions that matter most.
• Leverage the drivers that can be controlled and monitor those that cannot.
Several technologies can be helpful in this process.
4 See all of Oracle's management excellence white papers on www.oracle.com/thoughtleadership, in the resource center
5 Based on Buytendijk, F.A., "Dealing with Dilemmas", Wiley, 2010
6 Driving Strategic Planning with Predictive Modeling, July 2008, www.oracle.com
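The statistical testing of leading and lagging indicators mentioned in this section usually comes down to a lagged correlation: does this period's leading indicator move together with next period's lagging indicator? The sketch below illustrates that for the postal service example. The figures are invented, and the function names are hypothetical.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long series."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def lagged_correlation(leading, lagging, lag):
    """Correlate leading[t] with lagging[t + lag] for equally long series."""
    if lag < 0 or lag >= len(leading) - 1:
        raise ValueError("lag must leave at least two overlapping points")
    return pearson(leading[:len(leading) - lag], lagging[lag:])


# Invented monthly figures: percentage of mail delivered within 24 hours,
# and a customer satisfaction score on a 10-point scale.
on_time_pct = [90, 92, 88, 95, 91, 97]
satisfaction = [7.0, 7.0, 7.2, 6.8, 7.5, 7.1]

print(lagged_correlation(on_time_pct, satisfaction, lag=0))
print(lagged_correlation(on_time_pct, satisfaction, lag=1))
```

In this made-up data the relationship only shows up with a one-month lag. As the section argues, even a strong correlation of this kind describes the past; it does not guarantee that the relationship survives a change in assumptions.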
OLAP
Hyperion Essbase is an OLAP database, in which data is stored in a multidimensional format. This means the data is organized along dimensions that represent business structures. Typical business dimensions are 'product', 'customer', 'region' and 'time'. To perform what-if analysis and model possible future scenarios, a 'scenario' dimension is added. Scenario-building relies on the following critical functionalities:
• Create multiple hierarchies. By modeling alternative ways of segmenting customers, grouping products, consolidating organizational entities or any other type of business hierarchy, results can be tested before they are implemented as the new standard way of looking at the business.
• Use forecast calculations. Forecasts can be created from multiple assumptions. Based on likelihood and realism, the best forecast can be determined and implemented.
• Data entry. Business systems hold no data about the future. Based on new assumptions, business users may have to input the data themselves, preferably in a collaborative manner. Multi-user read/write capabilities and version control are needed to manage such a process.

Hyperion Strategic Finance
Financial modeling is a special activity that requires specific functionality. At the same time, the structure in which the modeling takes place is highly standardized: a balance sheet, a profit and loss statement and a cash flow overview. Oracle Hyperion Strategic Finance integrates strategic planning into an enterprise planning process. It allows users to quickly develop financial models, perform on-the-fly what-if impact analysis based on dynamic decision variables, and arrive at targets that can then be implemented. The what-if analysis toolkit is a set of out-of-the-box tools that allows you to create an unlimited number of scenarios by business unit.
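To make the idea of a 'scenario' dimension concrete, the sketch below models a tiny multidimensional cube as a Python dictionary and derives a what-if scenario from a baseline. It is not Essbase or Strategic Finance code. The dimension members, the figures and the function names are hypothetical and chosen only for illustration.

```python
from collections import defaultdict

# Hypothetical cube: each cell is keyed by (product, region, period, scenario).
cube = {
    ("Widget", "EMEA", "2011-Q1", "Baseline"): 120_000.0,
    ("Widget", "APAC", "2011-Q1", "Baseline"): 80_000.0,
    ("Gadget", "EMEA", "2011-Q1", "Baseline"): 50_000.0,
}


def add_scenario(cube, source, target, adjust):
    """Copy every cell of the `source` scenario into `target`, applying `adjust`."""
    new_cells = {}
    for (product, region, period, scenario), value in cube.items():
        if scenario == source:
            key = (product, region, period)
            new_cells[key + (target,)] = adjust(key, value)
    # Update after the loop so the dictionary is not modified while iterating.
    cube.update(new_cells)


def total_by_scenario(cube):
    """Aggregate all cells along every dimension except 'scenario'."""
    totals = defaultdict(float)
    for (_product, _region, _period, scenario), value in cube.items():
        totals[scenario] += value
    return dict(totals)


def emea_plus_ten_percent(key, value):
    """What-if assumption: EMEA revenue grows by 10%, other regions are unchanged."""
    _product, region, _period = key
    return value * 110 / 100 if region == "EMEA" else value


add_scenario(cube, "Baseline", "EMEA +10%", emea_plus_ten_percent)

if __name__ == "__main__":
    for scenario, total in total_by_scenario(cube).items():
        print(f"{scenario}: {total:,.0f}")
```

Because the scenario is just another dimension, the baseline cells stay untouched and any number of alternative scenarios can sit side by side for comparison.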
Other capabilities let you evaluate any metric's sensitivity to key performance drivers and run periodic "goal-seek" checks to determine the performance level needed to achieve specific financial objectives. Hyperion Strategic Finance allows organizations to spend more time simulating long-term alternative strategies, developing contingent scenarios, and stress testing financial models.

Crystal Ball
Shaping the future is done outside operational business systems. It requires a modeling framework that can handle both structure and creativity. Crystal Ball uses a spreadsheet interface (the most widely used business analyst tool) to create the data for the future, in terms of new assumptions, what-if questions and scenarios. Either standalone in a spreadsheet, or integrated with Essbase and Strategic Planning, Crystal Ball facilitates the creation of good assumptions. Going through the three critical steps that are needed to shape the future, Crystal Ball can:
• Make the assumptions realistic. Regardless of how much data you have (even none at all), apply any or all of the available techniques to improve the model assumptions: time-series forecasting, regression analysis, distribution fitting and simulation methods.
• Identify the assumptions that matter most. Determine the sensitivity of the forecast to each assumption, so you know where to focus your resources. Sensitivity charts and Tornado charts rank the assumptions in the model from most to least important and help visualize that ranking.
• Leverage the drivers that can be controlled and monitor those that cannot. Because of the uncertainty inherent in models that shape the future, changing a driver (a decision variable) can have a significant effect on the forecast results. For one or two drivers, use the Decision Table tool, which quickly runs multiple simulations to test the effects of changing the drivers. For models that contain more than a handful of decision variables, or where you are trying to optimize the forecast results, use optimization to automatically find optimal solutions to simulation models.

A Better Approach
Traditionally, predictive analytics require a laborious process. Figure 2 shows a typical flow of activities.

Figure 2: Traditional predictive analytics process

Based on a hypothesis (an assumption to test), the data that are needed are identified and extracted from their sources, such as an ERP or CRM system or the data warehouse. Different predictive analytics tools have different requirements on how best to process the data; usually the data require some transformation to fit the specifics of the tool. Only then can the analysis take place successfully.
After the analysis and the removal of all the "noise" in the data, conclusions are drawn that lead to changes in, for instance, customer segmentation or product clustering, and the success of the analysis is monitored. After that, the cycle starts all over.
The key assumption of this approach is that it is best to take the data to the tools, which are handled by the experts. This assumption is outdated. The approach is costly, takes too much time, robs the process of its creativity and doesn't allow the number of experiments to scale.

Oracle's strategy is different. Oracle feels it is better to bring the tools to the data. Oracle Data Mining resides in the Oracle database itself, and is used by many business applications. Real-Time Decisions nests itself into the process engine of business applications, generating real-time alerts, particularly in customer contact interactions. Hyperion Strategic Finance is an add-on to Hyperion Financial Management and Hyperion Planning for long-term financial modeling, and Crystal Ball integrates with Essbase and Hyperion Planning. Figure 3 shows how the tools are brought to the data.

There are multiple advantages to bringing the tools to the data:
• It is less costly and increases speed. There is no need to extract and transform data and move it to another environment, which is often the most laborious part of the process.
• The increased speed makes the process more iterative, and thereby increases the quality of the output. It becomes possible to run many more analyses and scenarios in the same limited amount of time before deciding what the best outcome is.
• Predictive analytics become part of a standard business process, instead of a specialist activity, and have a more direct business impact.
• It adds creativity to the process. If attributes need to be added while experimenting with multiple analyses, the addition can be done without going back to the source data. The hypothesis can be narrowed and widened without significant consequences.
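The difference between the two approaches can be sketched in a few lines. The example below uses Python's built-in `sqlite3` module purely as a stand-in for a database; it does not use Oracle Data Mining or any other Oracle product, and the `orders` table, its columns and its values are hypothetical. The first function takes the data to the tool by extracting every row and computing in the client. The second brings the computation to the data by letting the database aggregate in place.

```python
import sqlite3
from collections import defaultdict


def build_demo_db():
    """Create an in-memory database with a small, made-up orders table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE orders (segment TEXT, amount REAL)")
    conn.executemany(
        "INSERT INTO orders VALUES (?, ?)",
        [
            ("retail", 120.0),
            ("retail", 80.0),
            ("retail", 100.0),
            ("wholesale", 900.0),
            ("wholesale", 1100.0),
        ],
    )
    return conn


def average_by_extracting(conn):
    """Data to the tools: pull every row out, then aggregate in the client."""
    sums = defaultdict(float)
    counts = defaultdict(int)
    for segment, amount in conn.execute("SELECT segment, amount FROM orders"):
        sums[segment] += amount
        counts[segment] += 1
    return {segment: sums[segment] / counts[segment] for segment in sums}


def average_in_database(conn):
    """Tools to the data: the database aggregates and returns one row per segment."""
    query = "SELECT segment, AVG(amount) FROM orders GROUP BY segment"
    return dict(conn.execute(query).fetchall())


if __name__ == "__main__":
    conn = build_demo_db()
    print("extracted:  ", average_by_extracting(conn))
    print("in-database:", average_in_database(conn))
    conn.close()
```

Both functions return the same averages. With the second, only one summary row per segment leaves the database, which is the point of the first advantage listed above: no extraction, transformation or movement of the detailed data is needed.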
In addition, Oracle Real-Time Decisions, Crystal Ball, Hyperion Essbase and Strategic Finance integrate with non-Oracle databases and non-Oracle business applications as well.

Figure 3: Bringing the tools to the data

Words of Warning
To truly understand the possibilities and limitations of the advanced technologies on which predictive analytics are based, we ironically need to go back to the old philosophers.

Plato's Cave describes an interesting thought experiment. Imagine a few people in a cave, sitting against a small wall, facing the other end. They are locked in chains and have been for all their lives. On top of the wall other people walk around, but they cannot be seen by the chained people. A large fire in the back of the cave casts the shadows of the people walking around onto the wall at the other end, and the echoes of their voices make the sound come from the other end too. For the people in chains these shadows are the reality of the world; they have no knowledge of the people walking around on top of the little wall. The shadows are all they know.

Analysts can easily fall into the same trap, unconsciously mistaking their model and analysis for the real world. Reality is interpreted "to fit the model"; signals that change is coming are called
"outliers". Force-fitting reality into our frame of reference is deeply human, and we are all prone to it.

What helps counter this pitfall is to experiment⁷. Continuously try the outcomes of predictive analytics in your operational environment. Even run multiple experiments at the same time. See which ones lead to better results, and implement those. Using simulations and other techniques, predictive analytics should be applied continuously and incrementally.

Another word of warning comes from "Occam's Razor", based on the work of the 14th-century Franciscan friar William of Ockham. Occam's razor is a research principle. In forming a theory, you should use the fewest elements needed to accurately predict an outcome. The more assumptions you use, and the more you stack analysis on top of analysis, the more likely you are to be inaccurate or wrong. To create a robust theory, you need to shave away all unnecessary elements from the analysis (hence "razor"). Seen this way, two of the least reliable indicators of business success are profit and customer satisfaction, because they are based on many assumptions. Better indicators are, for instance, cash flow and on-time delivery, which are much closer to direct observation and measurement.

Although predictive analytics are based on sophisticated technologies and mathematical techniques, their success lies in applying them with common sense.

7 "Analytical competitors", as they are called in Davenport's book "Competing on Analytics", take this approach to analytics.
Predictive Analytics: Bringing The Tools To The Data
September, 2010
Authors: Frank Buytendijk, Lucie Trepanier

Oracle Corporation
World Headquarters
500 Oracle Parkway
Redwood Shores, CA 94065
U.S.A.

Worldwide Inquiries:
Phone: +1.650.506.7000
Fax: +1.650.506.7200
oracle.com

Copyright © 2010, Oracle and/or its affiliates. All rights reserved. Published in the U.S.A.
This document is provided for information purposes only, and the contents hereof are subject to change without notice. This document is not warranted to be error-free, nor subject to any other warranties or conditions, whether expressed orally or implied in law, including implied warranties and conditions of merchantability or fitness for a particular purpose. We specifically disclaim any liability with respect to this document and no contractual obligations are formed either directly or indirectly by this document. This document may not be reproduced or transmitted in any form or by any means, electronic or mechanical, for any purpose, without our prior written permission.
Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners.
10036922