Data Mining
In SSAS
Ram Kedem
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
What is Data Mining ?
• the computational process of discovering patterns in large
data sets involving methods at the intersection of artificial
intelligence, machine learning, statistics, and database
systems.
• The overall goal of the data mining process is to extract
information from a data set and transform it into an
understandable structure for further use.
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Data Mining Process
• Cross-Industry Standard Process for Data Mining (CRISP-DM)
consists of six cyclic phases :
• Business Understanding
• Data Understanding
• Data Preparation
• Modeling
• Evaluation
• Deployment
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Data Mining Algorithms
Covered In This Lesson
• Decision Trees
• Clustering
• Time Series
• Association rules
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Decision Trees
• Decision Trees
• a tree-like graph or model of decisions and their possible
consequences, including chance event outcomes, resource costs
• Very common, used for classification.
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Clustering
• Clustering
• grouping a set of objects in such a way that objects in the same
group (called a cluster) are more similar (in some sense or
another) to each other than to those in other groups (clusters)
• Widely used in areas of fraud detection
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Data Validation Sample
• http://www.sqlserverdatamining.com/ssdm/
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Data Validation Sample
• http://www.sqlserverdatamining.com/DataValidation/default.
aspx
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Time Series
• Time Series
• A time series is a collection of observations made chronologically,
Used to predict a number of some sort.
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Association rules
• Association rules
• Association rules are if/then statements that help uncover
relationships between seemingly unrelated data in a relational
database or other information repository.
Create New Mining
Structure
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Define your Data Source
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Define your DSV
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Create new mining structure
• Right click on Mining Structures and choose New Mining
Structure
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Select the Definition Method
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Create the Mining Structure
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Select Data Source View
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Specify Table Types
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Specify Training Data
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Suggest Related Columns
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Columns Content & Data Types
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Create Testing Set
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Completing the Wizard
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
View Mining Model
• Deploy and Process the Mining Model.
• View the Mining Model in Mining Models Tab
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Adding Mining Models
• It is possible to add more mining models by right-clicking at
the Mining models area and choosing New Mining Model.
• Deploy and Process once finished.
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Mining Model Viewer
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Dependency Network
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Mining Accuracy Chart - Input
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Lift Chart
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Profit Chart
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Profit Chart
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Mining Model Prediction
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Mining Model Prediction
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Query Results View
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Order the Data Differently
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Singleton Query
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Singleton Query
Time Series Mining
Structure
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Time Series Mining Structure
• Right click on Mining Structures and choose New Mining
Structure
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Select Definition Method
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Create the Data Mining
Structure
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Select DSV
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Specify Table Types
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Specify Training Data
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Specify Columns Content and
Data Types
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Complete the Wizard
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Mining Model Viewer

Data Mining in SSAS

  • 1.
  • 2.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com What is Data Mining ? • the computational process of discovering patterns in large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics, and database systems. • The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use.
  • 3.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Data Mining Process • Cross-Industry Standard Process for Data Mining (CRISP-DM) consists of six cyclic phases : • Business Understanding • Data Understanding • Data Preparation • Modeling • Evaluation • Deployment
  • 4.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Data Mining Algorithms Covered In This Lesson • Decision Trees • Clustering • Time Series • Association rules
  • 5.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Decision Trees • Decision Trees • a tree-like graph or model of decisions and their possible consequences, including chance event outcomes, resource costs • Very common, used for classification.
  • 6.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Clustering • Clustering • grouping a set of objects in such a way that objects in the same group (called a cluster) are more similar (in some sense or another) to each other than to those in other groups (clusters) • Widely used in areas of fraud detection
  • 7.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Data Validation Sample • http://www.sqlserverdatamining.com/ssdm/
  • 8.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Data Validation Sample • http://www.sqlserverdatamining.com/DataValidation/default. aspx
  • 9.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Time Series • Time Series • A time series is a collection of observations made chronologically, Used to predict a number of some sort.
  • 10.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Association rules • Association rules • Association rules are if/then statements that help uncover relationships between seemingly unrelated data in a relational database or other information repository.
  • 11.
  • 12.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Define your Data Source
  • 13.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Define your DSV
  • 14.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Create new mining structure • Right click on Mining Structures and choose New Mining Structure
  • 15.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Select the Definition Method
  • 16.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Create the Mining Structure
  • 17.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Select Data Source View
  • 18.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Specify Table Types
  • 19.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Specify Training Data
  • 20.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Suggest Related Columns
  • 21.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Columns Content & Data Types
  • 22.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Create Testing Set
  • 23.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Completing the Wizard
  • 24.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com View Mining Model • Deploy and Process the Mining Model. • View the Mining Model in Mining Models Tab
  • 25.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Adding Mining Models • It is possible to add more mining models by right-clicking at the Mining models area and choosing New Mining Model. • Deploy and Process once finished.
  • 26.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Mining Model Viewer
  • 27.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Dependency Network
  • 28.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Mining Accuracy Chart - Input
  • 29.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Lift Chart
  • 30.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Profit Chart
  • 31.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Profit Chart
  • 32.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Mining Model Prediction
  • 33.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Mining Model Prediction
  • 34.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Query Results View
  • 35.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Order the Data Differently
  • 36.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Singleton Query
  • 37.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Singleton Query
  • 38.
  • 39.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Time Series Mining Structure • Right click on Mining Structures and choose New Mining Structure
  • 40.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Select Definition Method
  • 41.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Create the Data Mining Structure
  • 42.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Select DSV
  • 43.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Specify Table Types
  • 44.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Specify Training Data
  • 45.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Specify Columns Content and Data Types
  • 46.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Complete the Wizard
  • 47.
    Copyright 2014 ©Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Mining Model Viewer