This demonstration introduces SAS, the features of SAS Enterprise Miner, and how to use SAS Enterprise Miner to build a prediction model. It was presented to a group of master's students at Brunel University.
Big data architectures and the data lakeJames Serra
With so many new technologies, it can be confusing to determine the best approach to building a big data architecture. The data lake is a great new concept, usually built in Hadoop, but what exactly is it and how does it fit in? In this presentation I'll discuss the four most common patterns in big data production implementations, the top-down vs. bottom-up approaches to analytics, and how you can use a data lake and an RDBMS data warehouse together. We will go into detail on the characteristics of a data lake and its benefits, and how you still need to perform the same data governance tasks in a data lake as you do in a data warehouse. Come to this presentation to make sure your data lake does not turn into a data swamp!
This presentation explains what data engineering is and briefly describes the phases of the data lifecycle. I used this presentation during my work as an on-demand instructor at Nooreed.com.
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...Simplilearn
This presentation about "Data Science Engineer Career, Salary, and Resume" will help you understand who is a Data Science Engineer, the salary of a Data Science Engineer, Data Science Engineer Skillset and Data Science Engineer Resume. Data science is a systematic way to analyze a massive amount of data and extract information from them. Data Science can answer a lot of questions, as well. Data Science is mainly required for
better decision making, predictive analysis, and pattern recognition.
Below are the topics we will discuss in this presentation:
1. Introduction to Data Science
2. Who is a Data Science Engineer
3. Data Science Engineer Skillset
4. Data Science Engineer job roles
5. Data Science Engineer salary trends
6. Data Science Engineer Resume
Why learn Data Science?
Data scientists are being deployed in all kinds of industries, creating a huge demand for skilled professionals. The data scientist is the pinnacle rank in an analytics organization. Glassdoor ranked data scientist first in its 25 Best Jobs for 2016, and good data scientists are scarce and in great demand. As a data scientist, you will be required to understand the business problem, design the analysis, collect and format the required data, apply algorithms or techniques using the correct tools, and finally make recommendations backed by data.
You can gain in-depth knowledge of data science by taking our Data Science with Python certification training course. With Simplilearn's course, you will prepare for a career as a Data Scientist while mastering all the key concepts and techniques. Those who complete the course will be able to:
1. Gain an in-depth understanding of data science processes, data wrangling, data exploration, data visualization, hypothesis building, and testing. You will also learn the basics of statistics.
2. Install the required Python environment and other auxiliary tools and libraries
3. Understand the essential concepts of Python programming such as data types, tuples, lists, dicts, basic operators and functions
4. Perform high-level mathematical computing using the NumPy package and its large library of mathematical functions
5. Perform scientific and technical computing using the SciPy package and its sub-packages such as Integrate, Optimize, Statistics, IO, and Weave
6. Perform data analysis and manipulation using data structures and tools provided in the Pandas package
7. Gain expertise in machine learning using the Scikit-Learn package (a short sketch follows this list)
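To give a taste of how these packages fit together, here is a minimal sketch of a pandas/scikit-learn workflow. It is illustrative only, not course material: the dataset (scikit-learn's bundled iris data) and the model choice are assumptions.

```python
# Minimal pandas/scikit-learn workflow (illustrative sketch; the
# dataset and model choice are assumptions, not course content).
import pandas as pd
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

iris = load_iris(as_frame=True)          # features as a pandas DataFrame
df = iris.frame

X_train, X_test, y_train, y_test = train_test_split(
    df[iris.feature_names], df["target"], test_size=0.25, random_state=0)

model = LogisticRegression(max_iter=200).fit(X_train, y_train)
print("held-out accuracy:", model.score(X_test, y_test))
```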
Data Science with Python is recommended for:
1. Analytics professionals who want to work with Python
2. Software professionals looking to get into the field of analytics
3. IT professionals interested in pursuing a career in analytics
4. Graduates looking to build a career in analytics and data science
Learn more at https://www.simplilearn.com/big-data-and-analytics/python-for-data-science-training
The Future of Data Science and Machine Learning at Scale: A Look at MLflow, D...Databricks
Many have dubbed the 2020s the decade of data. This is indeed an era of data zeitgeist.
From code-centric Software Development 1.0, we are entering Software Development 2.0, a data-centric and data-driven approach in which data plays a central role in our everyday lives.
As the volume and variety of data garnered from myriad sources grow at an astronomical scale, and as cloud computing offers cheap compute and storage at scale, data platforms have to match in their ability to process, analyze, and visualize at scale, at speed, and with ease. This requires paradigm shifts in how data is processed and stored, and in the programming frameworks offered to developers for working with these platforms.
In this talk, we will survey some emerging technologies that address the challenges of data at scale, how these tools help data scientists and machine learning developers with their data tasks, why they scale, and how they help future data scientists get started quickly.
In particular, we will examine in detail two open-source tools: MLflow (for machine learning life cycle development) and Delta Lake (for reliable storage of structured and unstructured data).
We will also cover other emerging tools, such as Koalas, which helps data scientists do exploratory data analysis at scale in a language and framework they are familiar with, along with emerging data + AI trends in 2021.
You will understand the challenges of machine learning model development at scale, why you need reliable and scalable storage, and what other open source tools are at your disposal to do data science and machine learning at scale.
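For a flavour of the MLflow tracking API mentioned above, here is a minimal sketch; the experiment, parameter, and metric names are placeholders of mine, not examples from the talk.

```python
# Minimal MLflow tracking sketch: log a parameter and a metric for one
# run. Names and values are placeholders for illustration.
import mlflow

mlflow.set_experiment("demo-experiment")
with mlflow.start_run():
    mlflow.log_param("alpha", 0.5)       # a hyperparameter of the run
    mlflow.log_metric("rmse", 0.92)      # a result of the run
```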
Snowflake: Your Data. No Limits (Session sponsored by Snowflake) - AWS Summit...Amazon Web Services
Struggling to keep up with an ever-increasing demand for data at your organisation? Do you spend hours tinkering with your streaming data pipelines? Does that one data scientist with direct EDW access keep you up at night? Introducing Snowflake, a brand new SQL data warehouse built for the cloud. We’ve designed and implemented a unique cloud-based architecture that addresses the most common shortcomings of existing data solutions. With Snowflake, you can unlock unlimited concurrency, enable instant scalability, and take advantage of built-in tuning and optimisation. Join us and find out what Netflix, Adobe, and Nike all have in common.
Learn to Use Databricks for Data ScienceDatabricks
Data scientists face numerous challenges throughout the data science workflow that hinder productivity. As organizations continue to become more data-driven, a collaborative environment is more critical than ever: one that provides easier access and visibility into the data, the reports and dashboards built against it, reproducibility, and the insights uncovered within it. Join us to hear how Databricks' open and collaborative platform simplifies data science by enabling you to run all types of analytics workloads, from data preparation to exploratory analysis and predictive analytics, at scale, all on one unified platform.
Every day, businesses across a wide variety of industries share data to support insights that drive efficiency and new business opportunities. However, existing approaches to data sharing (such as e-mail, FTP, EDI, and APIs) involve significant overhead and friction for both data providers and data consumers. Legacy approaches such as e-mail and FTP were never intended to support today's big data volumes, and other methods also involve enormous effort. All of them require not only that the data be extracted, copied, transformed, and loaded, but also that the related schemas and metadata be transported as well. This creates a burden on data providers to deconstruct and stage data sets, a burden that is mirrored for the data recipient, who must reconstruct the data.
As a result, companies are handicapped in their ability to fully realize the value in their data assets.
Snowflake Data Sharing allows companies to grant instant access to ready-to-use data to any number of partners or data customers without any data movement, copying, or complex pipelines.
Using Snowflake Data Sharing, companies can derive new insights and value from data much more quickly and with significantly less effort than current data sharing methods. As a result, companies now have a new approach and a powerful new tool to get the full value out of their data assets.
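As a hedged sketch of what granting a share can look like from Python: the statements below use the snowflake-connector-python package, and every identifier (account, database, share, table) is a placeholder, not something from the talk.

```python
# Sketch: create a Snowflake share and grant a consumer account access.
# All identifiers are placeholders; no data is copied or moved by
# these statements.
import snowflake.connector

conn = snowflake.connector.connect(
    account="provider_account", user="my_user", password="...")
cur = conn.cursor()
cur.execute("CREATE SHARE sales_share")
cur.execute("GRANT USAGE ON DATABASE sales_db TO SHARE sales_share")
cur.execute("GRANT USAGE ON SCHEMA sales_db.public TO SHARE sales_share")
cur.execute("GRANT SELECT ON TABLE sales_db.public.orders TO SHARE sales_share")
cur.execute("ALTER SHARE sales_share ADD ACCOUNTS = consumer_account")
```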
The right architecture is key for any IT project. This is especially true for big data projects, where there are no standard architectures that have proven their suitability over years. This session discusses the different big data architectures that have evolved over time, including the traditional Big Data Architecture, the Streaming Analytics architecture, and the Lambda and Kappa architectures, and presents a mapping of components from both open source and the Oracle stack onto these architectures.
The data lake has become extremely popular, but there is still confusion on how it should be used. In this presentation I will cover common big data architectures that use the data lake, the characteristics and benefits of a data lake, and how it works in conjunction with a relational data warehouse. Then I’ll go into details on using Azure Data Lake Store Gen2 as your data lake, and various typical use cases of the data lake. As a bonus I’ll talk about how to organize a data lake and discuss the various products that can be used in a modern data warehouse.
Data Lakehouse, Data Mesh, and Data Fabric (r2)James Serra
So many buzzwords of late: Data Lakehouse, Data Mesh, and Data Fabric. What do all these terms mean and how do they compare to a modern data warehouse? In this session I’ll cover all of them in detail and compare the pros and cons of each. They all may sound great in theory, but I'll dig into the concerns you need to be aware of before taking the plunge. I’ll also include use cases so you can see what approach will work best for your big data needs. And I'll discuss Microsoft's version of the data mesh.
Achieving Lakehouse Models with Spark 3.0Databricks
It’s very easy to be distracted by the latest and greatest approaches in technology, but sometimes there’s a reason old approaches stand the test of time. Star schemas and Kimball modelling are among those things that aren't going anywhere, but as we move towards the “Data Lakehouse” paradigm, how appropriate is this modelling technique, and how can we harness the Delta Engine and Spark 3.0 to maximise its performance?
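To make the modelling question concrete, here is a hedged PySpark sketch of a classic star-schema query, with a broadcast hint on the small dimension table so Spark avoids shuffling the large fact table. The paths, table, and column names are invented for illustration.

```python
# Star-schema query sketch in PySpark: join a large fact table to a
# small dimension with a broadcast hint. All names/paths are made up.
from pyspark.sql import SparkSession
from pyspark.sql.functions import broadcast

spark = SparkSession.builder.appName("star-schema-demo").getOrCreate()

fact_sales = spark.read.parquet("/mnt/lake/fact_sales")   # large fact table
dim_store = spark.read.parquet("/mnt/lake/dim_store")     # small dimension

report = (fact_sales
          .join(broadcast(dim_store), "store_id")  # broadcast avoids a shuffle
          .groupBy("region")
          .sum("amount"))
report.show()
```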
Data Warehousing in the Cloud: Practical Migration Strategies SnapLogic
Dave Wells of Eckerson Group discusses why cloud data warehousing has become popular, the many benefits, and the corresponding challenges. Migrating an existing data warehouse to the cloud is a complex process of moving schema, data, and ETL. The complexity increases when architectural modernization, restructuring of database schema, or rebuilding of data pipelines is needed.
Architecting Agile Data Applications for ScaleDatabricks
Data analytics and reporting platforms have historically been rigid, monolithic, hard to change, and limited in their ability to scale up or down. I can’t tell you how many times I have heard a business user ask for something as simple as an additional column in a report, only for IT to say it will take six months to add because it doesn’t exist in the data warehouse. As a former DBA, I can tell you about the countless hours I have spent “tuning” SQL queries to hit pre-established SLAs. This talk covers how to architect modern data and analytics platforms in the cloud to support agility and scalability, including end-to-end data pipeline flow, data mesh and data catalogs, live and streaming data, advanced analytics, applying agile software development practices such as CI/CD and testability to data applications, and taking advantage of the cloud for scalability both up and down.
Apache Kafka and the Data Mesh | Ben Stopford and Michael Noll, ConfluentHostedbyConfluent
Data mesh is a relatively recent term that describes a set of principles that good modern data systems uphold: a kind of “microservices” for the data-centric world. While data mesh is not a technology-specific pattern, building systems that adopt and implement data mesh principles has a relatively long history under different guises.
In this talk, we share our recommendations and picks of what every developer should know about building a streaming data mesh with Kafka. We introduce the four principles of the data mesh: domain-driven decentralization, data as a product, self-service data platform, and federated governance. We then cover the differences between working with event streams and centralized approaches, and highlight the key characteristics that make streams a great fit for implementing a mesh, such as their ability to capture both real-time and historical data. We’ll examine how to onboard data from existing systems into a mesh, how to model communication within the mesh, and how to deal with changes to your domain’s “public” data; give examples of global standards for governance; and discuss the importance of taking a product-centric view of data sources and the data sets they share.
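As a tiny illustration of "data as a product" on Kafka, here is a hedged producer sketch using the confluent-kafka Python client; the broker address, topic name, and event shape are assumptions, not examples from the talk.

```python
# Sketch: a domain team publishes an event to its public topic, which
# downstream consumers treat as that domain's data product. Broker,
# topic, and payload are placeholders.
import json
from confluent_kafka import Producer

producer = Producer({"bootstrap.servers": "localhost:9092"})
event = {"order_id": 42, "status": "shipped"}

producer.produce("orders.events.v1", json.dumps(event).encode("utf-8"))
producer.flush()   # block until the broker acknowledges delivery
```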
What is elastic data warehousing, and how does Snowflake uniquely enable it? Learn about the requirements needed to support flexible, elastic data warehousing using cloud infrastructure.
A Thorough Comparison of Delta Lake, Iceberg and HudiDatabricks
Recently, a set of modern table formats, such as Delta Lake, Hudi, and Iceberg, has sprung up. Along with the Hive Metastore, these table formats are trying to solve problems that have stood in traditional data lakes for a long time, with features like ACID transactions, schema evolution, upserts, time travel, and incremental consumption.
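For a feel of two of those features, here is a hedged PySpark sketch of a Delta Lake upsert (merge) and time travel. Paths and rows are placeholders, and it assumes a Spark session configured with the delta-spark package.

```python
# Sketch: Delta Lake upsert (MERGE) and time travel. Assumes a Spark
# session configured with delta-spark; paths and rows are placeholders.
from pyspark.sql import SparkSession
from delta.tables import DeltaTable

spark = SparkSession.builder.appName("delta-demo").getOrCreate()
path = "/tmp/delta/demo"

spark.createDataFrame([(1, "a"), (2, "b")], ["id", "value"]) \
    .write.format("delta").mode("overwrite").save(path)        # version 0

# Upsert: update matching ids, insert new ones (an ACID transaction).
updates = spark.createDataFrame([(2, "B"), (3, "c")], ["id", "value"])
(DeltaTable.forPath(spark, path).alias("t")
    .merge(updates.alias("s"), "t.id = s.id")
    .whenMatchedUpdateAll()
    .whenNotMatchedInsertAll()
    .execute())                                                # version 1

# Time travel: read the table as it was before the merge.
spark.read.format("delta").option("versionAsOf", 0).load(path).show()
```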
This presentation about Hadoop YARN will help you understand Hadoop 1.0 and Hadoop 2.0, the limitations of Hadoop 1.0, the need for YARN, what YARN is, workloads running on YARN, YARN components, and YARN architecture, and you will also go through a demo of YARN. YARN is the cluster resource management layer of the Apache Hadoop ecosystem; it schedules jobs and assigns resources. Hadoop 1.0 was designed to run MapReduce jobs only and had issues with scalability, resource utilization, etc., whereas YARN solved those issues and lets users work with multiple processing models. Now let us get started and learn YARN in detail.
The following topics are explained in this Hadoop YARN presentation:
1. Hadoop 1.0 (MapReduce 1)
2. Limitations of Hadoop 1.0 (MapReduce 1)
3. Need for YARN
4. What is YARN
5. Workloads running on YARN
6. YARN components
7. YARN architecture
8. Demo on YARN
What is this Big Data Hadoop training course about?
The Big Data Hadoop and Spark developer course has been designed to impart in-depth knowledge of big data processing using Hadoop and Spark. The course is packed with real-life projects and case studies to be executed in the CloudLab.
What are the course objectives?
This course will enable you to:
1. Understand the different components of the Hadoop ecosystem such as Hadoop 2.7, Yarn, MapReduce, Pig, Hive, Impala, HBase, Sqoop, Flume, and Apache Spark
2. Understand Hadoop Distributed File System (HDFS) and YARN as well as their architecture, and learn how to work with them for storage and resource management
3. Understand MapReduce and its characteristics, and assimilate some advanced MapReduce concepts
4. Get an overview of Sqoop and Flume and describe how to ingest data using them
5. Create database and tables in Hive and Impala, understand HBase, and use Hive and Impala for partitioning
6. Understand different types of file formats, Avro schema, using Avro with Hive and Sqoop, and schema evolution
7. Understand Flume, Flume architecture, sources, Flume sinks, channels, and Flume configurations
8. Understand HBase, its architecture, data storage, and working with HBase. You will also understand the difference between HBase and RDBMS
9. Gain a working knowledge of Pig and its components
10. Do functional programming in Spark
11. Understand resilient distributed datasets (RDD) in detail
12. Implement and build Spark applications
13. Gain an in-depth understanding of parallel processing in Spark and Spark RDD optimization techniques
14. Understand the common use-cases of Spark and the various interactive algorithms
15. Learn Spark SQL, including creating, transforming, and querying DataFrames (a short sketch follows this list)
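As a small taste of objective 15, here is a minimal PySpark sketch of creating a DataFrame, transforming it, and querying the same data with Spark SQL; the data and names are illustrative.

```python
# Minimal Spark SQL / DataFrame sketch; data and names are made up.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("spark-sql-demo").getOrCreate()
df = spark.createDataFrame([("alice", 34), ("bob", 45)], ["name", "age"])

df.filter(df.age > 40).show()           # DataFrame transformation

df.createOrReplaceTempView("people")    # expose the same data to SQL
spark.sql("SELECT name FROM people WHERE age > 40").show()
```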
Learn more at https://www.simplilearn.com/big-data-and-analytics/big-data-and-hadoop-training
Core Archive for SAP Solutions is a fully-featured archiving and document viewing solution that allows customers to archive content from the main SAP database yet still view and interact with the content directly from the Archive. Core Archive supports the archiving of all content and data from SAP and can leverage SAP ILM disciplines. Content is stored in a compliant manner ensuring that GDPR, CCPA and other standards can be met. Core Archive is entirely cloud-based, reducing the IT footprint and offering rapid time to value.
http://www.sas.com
Forecasting is ubiquitous – it’s everywhere! Whenever your company makes a decision regarding a future action, that decision is the end result of a process that starts with a guess about what is going to happen in the future.
Learn how SAS Forecasting helps you make more profitable, faster and more accurate decisions.
SAS Training | SAS Tutorials For Beginners | SAS Programming | SAS Online Tra...Edureka!
This SAS training from Edureka will help you understand the data analytics tool SAS: its components, features, and example programs, a web scraping use case, and how it is used in industry. Below are the topics covered in this tutorial:
1. What is Data Analytics?
2. Data Analytics Tools
3. Why SAS?
4. What is SAS?
5. SAS Features
6. Programming in SAS
7. Case Study - Web Scraping using SAS
8. SAS Job Trends
#asksap Analytics Innovations Community Call: SAP BW/4HANA - the Big Data War...SAP Analytics
Learn how SAP BW/4HANA delivers big data warehouse solutions that meet your current and future business analytics needs in a rapidly changing data landscape and increase your organization’s success in the next generation of business.
#askSAP: Journey to the Cloud: SAP Strategy and Roadmap for Cloud and Hybrid ...SAP Analytics
www.sap.com/businessobjects-cloud. The momentum of customers moving to the SAP BusinessObjects Cloud is rapidly accelerating – and so are the innovations being introduced by SAP. New features and functionality for cloud and on-premise deployments with SAP BusinessObjects Enterprise offer hybrid use cases that organizations can take advantage of as they embark on their journey to the cloud. View the webinar replay at http://webinars.sap.com/asksap-webinar-series/en/home#section_3.
SAP Inside Track NL talk by Sefan Linders
SAP HANA SQL DW – What’s so special?
What makes the SAP HANA SQL DW so special? Is it the native CI/CD support? The completely web-based approach? Or is it just as special as all the other SQL DWs out there? Sefan will guide you through what it is and what is new with the latest DW Foundation service pack and Web IDE feature pack.
Top 140+ Advanced SAS Interview Questions and Answers.pdfDatacademy.ai
SAS Interview Questions and Answers is a guide for individuals preparing for a job interview in the field of SAS (Statistical Analysis System). The guide includes a range of commonly asked interview questions and their answers, covering topics such as SAS programming, data manipulation, analytics, and more. It aims to help candidates prepare for the interview and showcase their knowledge and expertise in SAS.
Visit: https://www.datacademy.ai/sas-interview-questions-answers/
When the IT department of a large US oil and gas company was tasked with improving the way in which vast amounts of data were analysed, manipulated and disseminated, it investigated a number of tools that would enable users to explore, document and visualise data structures for its large SAP(r) enterprise application, before deciding to implement Safyr.
SAP Data Hub e SUSE Container as a Service PlatformSUSE Italy
SAP Data Hub is a solution for the integration, orchestration, and governance of data of any type, variety, and volume. It uses Kubernetes as its platform and is certified on SUSE CaaS Platform.
In this session, SAP and SUSE present an overview of the main features and benefits of integrating the two solutions. (Nicola Bertini, SAP Italia, and SUSE)
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfGetInData
Recently we have observed the rise of open-source Large Language Models (LLMs) that are community-driven or developed by the AI market leaders, such as Meta (Llama3), Databricks (DBRX) and Snowflake (Arctic). On the other hand, there is a growth in interest in specialized, carefully fine-tuned yet relatively small models that can efficiently assist programmers in day-to-day tasks. Finally, Retrieval-Augmented Generation (RAG) architectures have gained a lot of traction as the preferred approach for LLMs context and prompt augmentation for building conversational SQL data copilots, code copilots and chatbots.
In this presentation, we will show how we built a robust Data Copilot on these three concepts, one that helps democratize access to company data assets and boosts the performance of everyone working with data platforms. (A toy retrieval sketch follows the agenda below.)
Why do we need yet another (open-source) Copilot?
How can we build one?
Architecture and evaluation
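To make the RAG idea concrete, here is a toy retrieval sketch: score documents against a question and prepend the best match to the prompt. The bag-of-words "embeddings" are a deliberate simplification of mine; a real copilot would use a trained embedding model and an LLM.

```python
# Toy RAG retrieval: pick the most relevant document for a question
# and build an augmented prompt. Bag-of-words vectors stand in for
# real embeddings purely for illustration.
import numpy as np

docs = ["orders table holds one row per customer order",
        "customers table maps customer_id to region"]
vocab = sorted({w for d in docs for w in d.split()})

def embed(text: str) -> np.ndarray:
    words = text.split()
    return np.array([words.count(w) for w in vocab], dtype=float)

doc_vecs = np.stack([embed(d) for d in docs])
question = "which table stores orders"
q = embed(question)

# Cosine similarity of the question against every document.
sims = doc_vecs @ q / (
    np.linalg.norm(doc_vecs, axis=1) * np.linalg.norm(q) + 1e-9)
context = docs[int(np.argmax(sims))]

prompt = f"Context: {context}\nQuestion: {question}\nAnswer with SQL:"
print(prompt)  # this augmented prompt would be sent to the LLM
```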
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract: Levelwise PageRank is an alternative method of PageRank computation that decomposes the input graph into a directed acyclic block-graph of strongly connected components and processes them in topological order, one level at a time. This enables ranks to be calculated in a distributed fashion without per-iteration communication, unlike the standard method, where all vertices are processed in each iteration. It comes, however, with the precondition that the input graph contain no dead ends. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. The slowdown on the GPU is likely caused by the submission of a large number of small workloads, and is expected to be a non-issue when the computation is performed on massive graphs.
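For reference, here is a minimal sketch of the standard (monolithic) PageRank baseline by power iteration, including an even-spread handling of dead ends, the precondition the abstract mentions. The graph and parameters are illustrative, not from the report.

```python
# Standard (monolithic) PageRank by power iteration, with rank from
# dead-end vertices spread evenly. Illustrative sketch only.
def pagerank(graph, damping=0.85, iters=50):
    """graph: dict mapping each vertex to its list of out-neighbours."""
    nodes = list(graph)
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}
    for _ in range(iters):
        new = {v: (1.0 - damping) / n for v in nodes}
        for v, outs in graph.items():
            if outs:
                for u in outs:
                    new[u] += damping * rank[v] / len(outs)
            else:                        # dead end: distribute evenly
                for u in nodes:
                    new[u] += damping * rank[v] / n
        rank = new
    return rank

print(pagerank({"a": ["b"], "b": ["a", "c"], "c": []}))
```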
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...John Andrews
Chatty Kathy: Enhancing Physical Activity Among Older Adults
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
Adjusting OpenMP PageRank : SHORT REPORT / NOTESSubhajit Sahu
For massive graphs that fit in RAM but not in GPU memory, it is possible to take advantage of a shared-memory system with multiple CPUs, each with multiple cores, to accelerate PageRank computation. If the NUMA architecture of the system is properly taken into account with good vertex partitioning, the speedup can be significant. To take steps in this direction, experiments are conducted to implement PageRank in OpenMP using two different approaches, uniform and hybrid. The uniform approach runs all primitives required for PageRank in OpenMP mode (with multiple threads). On the other hand, the hybrid approach runs certain primitives (i.e., sumAt, multiply) in sequential mode.
The Building Blocks of QuestDB, a Time Series Databasejavier ramirez
Talk Delivered at Valencia Codes Meetup 2024-06.
Traditionally, databases have treated timestamps as just another data type. However, when performing real-time analytics, timestamps should be first-class citizens, and we need rich time semantics to get the most out of our data. We also need to deal with ever-growing datasets while staying performant, which is as fun as it sounds.
It is no wonder time-series databases are now more popular than ever. Join me in this session to learn about the internal architecture and building blocks of QuestDB, an open-source time-series database designed for speed. We will also review some of the changes we have made over the past two years to deal with late and unordered data, non-blocking writes, read replicas, and faster batch ingestion.
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
https://www.meetup.com/unstructured-data-meetup-new-york/
This meetup is for people working in unstructured data. Speakers will present on related topics such as vector databases, LLMs, and managing data at scale. The intended audience includes machine learning engineers, data scientists, data engineers, software engineers, and PMs. This meetup was formerly the Milvus Meetup and is sponsored by Zilliz, maintainers of Milvus.
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Round table discussion of vector databases, unstructured data, ai, big data, real-time, robots and Milvus.
A lively discussion with NJ Gen AI Meetup Lead Prasad and Procure.FYI's Co-Founder.
Adjusting primitives for graph : SHORT REPORT / NOTESSubhajit Sahu
Graph algorithms, like PageRank, typically operate on a graph representation such as Compressed Sparse Row (CSR), an adjacency-list based format. The experiments below compare implementations of the vector primitives these algorithms rely on:
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
1. An Introduction to SAS Enterprise Miner
CS5608: Big Data Analytics
By: Yasoda Jayaweera
Brunel University, London
2. OUTLINE
▪ An Introduction to SAS
▪ Importance of SAS
▪ Demo
▪ Building a decision tree with SAS Enterprise Miner
4. SAS: A HISTORY
▪ Statistical Analysis System
▪ Began at North Carolina State University, US, as a project to analyze agricultural research
▪ A US based company founded in 1976
▪ Proprietary software
▪ SAS Base is the main software
5. SAS IDEs
▪ SAS Studio
▪ SAS Enterprise Guide
▪ Used to write and run SAS code
▪ General purpose reporting and analysis (manipulate data, describe data, graph data, and perform advanced statistical analysis)
▪ SAS Enterprise Miner
▪ Specifically for predictive and descriptive modeling
▪ Interface for data mining/neural networks
▪ Used for specific data mining techniques to create statistical models and scoring models, segment data, etc.
7. IS SAS AN IMPORTANT PLAYER?
▪ Tradition/legacy
▪ Existing infrastructure (since 1976)
▪ Cost of transition
▪ Distrust of free software
▪ Lower processing times with Big Data
▪ Ability for sequential processing
8. IS SAS AN IMPORTANT PLAYER?
(Chart source: SAS Annual Report)
9. IS SAS AN IMPORTANT PLAYER?
▪ Integrates with other proprietary software well
▪ Procedures are very well documented and standardize coding
▪ Single-source support
▪ Many data scientists are not programmers and don't care about using a cool language
10. GARTNER’S MAGIC QUADRANT 2017
Gartner has recognised SAS as a “Leader” in the Magic Quadrant for data science platforms
11. SAS, R or PYTHON
(Source: Burtch Works survey, 2017)
12. SAS CERTIFICATION
▪ Global certifications
▪ Certification path
15. SAS ENTERPRISE MINER
▪ Introduction to the SAS EMiner interface
▪ SAS SEMMA process
▪ Building a decision tree (a rough Python analogue is sketched below)
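As a rough analogue for readers without SAS Enterprise Miner, the sketch below fits a decision tree in Python with scikit-learn. It is illustrative only: the demo itself is built in the EMiner GUI against DONORS_RAW_DATA, and the column names and toy values here are invented.

```python
# Illustrative decision tree on a donor-style table (invented columns
# and values; the actual demo uses SAS Enterprise Miner, not Python).
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

df = pd.DataFrame({
    "gift_count": [1, 5, 2, 8, 0, 3],
    "last_gift":  [10.0, 25.0, 5.0, 50.0, 0.0, 15.0],
    "target":     [0, 1, 0, 1, 0, 1],   # 1 = mail a solicitation
})

X_train, X_test, y_train, y_test = train_test_split(
    df[["gift_count", "last_gift"]], df["target"],
    test_size=0.33, random_state=0)

tree = DecisionTreeClassifier(max_depth=3).fit(X_train, y_train)
print(tree.predict(X_test))
```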
16. SAS SEMMA PROCESS
▪ Methodical approach that describes how an analysis is performed
Sample → Explore → Modify → Model → Assess
17. DATA
▪ The DONORS_RAW_DATA data set contains details of donations from a previous mail solicitation campaign at a charitable organization
▪ Mailing a solicitation is associated with a cost
▪ Mailed and responded - $15.00 (average donation received; a break-even check follows this slide)
▪ Mailed but no response - $0.50 (postage)
▪ Did not mail – no cost
▪ Target variable
▪ 1 - decision to mail a solicitation to an individual
▪ 0 - decision to not mail a solicitation
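A quick back-of-the-envelope check (an addition to these notes, not from the original slides): mailing a person who responds with probability p has expected value 15.00·p − 0.50·(1 − p), which is positive whenever p > 0.50/15.50 ≈ 0.032. So a model only needs to identify prospects with better than roughly a 3.2% response chance for mailing to pay off.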
18. SPEAKER
Yasoda Jayaweera
PhD Student
Brunel University, London
(yasoda.jayaweera@brunel.ac.uk)