SlideShare a Scribd company logo
1 of 89
DATA MINING Introductory and Advanced Topics Part I Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University Companion slides for the text by Dr. M.H.Dunham,  Data Mining, Introductory and Advanced Topics , Prentice Hall, 2002.
Data Mining Outline ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Introduction Outline ,[object Object],[object Object],[object Object],[object Object],[object Object],Goal:  Provide an overview of data mining.
Introduction ,[object Object],[object Object],[object Object],UNCOVER HIDDEN INFORMATION DATA MINING
Data Mining Definition ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Data Mining Algorithm ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Database Processing vs. Data Mining Processing ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Query Examples ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Data Mining Models and Tasks
Basic Data Mining Tasks ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Basic Data Mining Tasks (cont’d) ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Ex:  Time Series Analysis ,[object Object],[object Object],[object Object],[object Object]
Data Mining vs. KDD ,[object Object],[object Object]
KDD Process ,[object Object],[object Object],[object Object],[object Object],[object Object],Modified from [FPSS96C]
KDD Process Ex:  Web Log ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Data Mining Development ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
KDD Issues ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
KDD Issues (cont’d) ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Social Implications of DM ,[object Object],[object Object],[object Object]
Data Mining Metrics ,[object Object],[object Object],[object Object],[object Object]
Database Perspective on Data Mining ,[object Object],[object Object],[object Object],[object Object]
Visualization Techniques ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Related Concepts Outline ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Goal:  Examine some areas which are related to data mining.
DB & OLTP Systems ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Fuzzy Sets and Logic ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Fuzzy Sets
Classification/Prediction is Fuzzy Loan Amnt Simple Fuzzy Accept Accept Reject Reject
Information Retrieval  ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Information Retrieval (cont’d) ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
IR Query Result Measures and Classification IR Classification
Dimensional Modeling ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Relational View of Data
Dimensional Modeling Queries ,[object Object],[object Object],[object Object],[object Object],[object Object]
Cube view of Data
Aggregation Hierarchies
Star Schema
Data Warehousing ,[object Object],[object Object],[object Object],[object Object],[object Object]
Operational vs. Informational   Operational Data Data Warehouse Application OLTP OLAP Use Precise Queries Ad Hoc Temporal Snapshot Historical Modification Dynamic Static Orientation Application Business Data Operational Values Integrated Size Gigabits Terabits Level Detailed Summarized Access Often Less Often Response Few Seconds Minutes Data Schema Relational Star/Snowflake
OLAP ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
OLAP Operations Single Cell Multiple Cells Slice Dice Roll Up Drill Down
Statistics ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Machine Learning ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Pattern Matching (Recognition) ,[object Object],[object Object],[object Object]
DM vs. Related Topics
Data Mining Techniques Outline ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Goal:   Provide an overview of basic data mining techniques
Point Estimation ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Estimation Error ,[object Object],[object Object],[object Object],[object Object]
Jackknife Estimate ,[object Object],[object Object]
Maximum Likelihood Estimate (MLE) ,[object Object],[object Object],[object Object]
MLE Example ,[object Object],[object Object],[object Object]
MLE Example (cont’d) ,[object Object],[object Object]
Expectation-Maximization (EM) ,[object Object],[object Object],[object Object]
EM Example
EM Algorithm
Models Based on Summarization ,[object Object],[object Object]
Scatter Diagram
Bayes Theorem ,[object Object],[object Object],[object Object],[object Object]
Bayes Theorem Example ,[object Object],[object Object],[object Object]
Bayes Example(cont’d) ,[object Object]
Bayes Example(cont’d) ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Hypothesis Testing ,[object Object],[object Object],[object Object],[object Object]
Chi Squared Statistic ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Regression ,[object Object],[object Object],[object Object],[object Object]
Linear Regression
Correlation ,[object Object],[object Object],[object Object],[object Object],[object Object]
Similarity Measures ,[object Object],[object Object],[object Object]
Similarity Measures
Distance Measures ,[object Object]
Twenty Questions Game
Decision Trees ,[object Object],[object Object],[object Object],[object Object],[object Object]
Decision Tree Example
Decision Trees ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Decision Tree Algorithm
DT Advantages/Disadvantages ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Neural Networks  ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Neural Networks ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Neural Network Example
NN Node
NN Activation Functions ,[object Object],[object Object]
NN Activation Functions
NN Learning ,[object Object],[object Object],[object Object]
Neural Networks ,[object Object],[object Object],[object Object],[object Object],[object Object]
NN Advantages ,[object Object],[object Object],[object Object],[object Object]
NN Disadvantages ,[object Object],[object Object],[object Object],[object Object],[object Object]
Genetic Algorithms ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Genetic Algorithms ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Crossover Examples
Genetic Algorithm
GA Advantages/Disadvantages ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

More Related Content

What's hot

Data mining , Knowledge Discovery Process, Classification
Data mining , Knowledge Discovery Process, ClassificationData mining , Knowledge Discovery Process, Classification
Data mining , Knowledge Discovery Process, ClassificationDr. Abdul Ahad Abro
 
Slide 2 data models
Slide 2 data modelsSlide 2 data models
Slide 2 data modelsVisakh V
 
Object Oriented Database Management System
Object Oriented Database Management SystemObject Oriented Database Management System
Object Oriented Database Management SystemAjay Jha
 
Data mining an introduction
Data mining an introductionData mining an introduction
Data mining an introductionDr-Dipali Meher
 
Data Mining: Data cube computation and data generalization
Data Mining: Data cube computation and data generalizationData Mining: Data cube computation and data generalization
Data Mining: Data cube computation and data generalizationDataminingTools Inc
 
Data Mining: Concepts and Techniques chapter 07 : Advanced Frequent Pattern M...
Data Mining: Concepts and Techniques chapter 07 : Advanced Frequent Pattern M...Data Mining: Concepts and Techniques chapter 07 : Advanced Frequent Pattern M...
Data Mining: Concepts and Techniques chapter 07 : Advanced Frequent Pattern M...Salah Amean
 
multi dimensional data model
multi dimensional data modelmulti dimensional data model
multi dimensional data modelmoni sindhu
 
Data Integration and Transformation in Data mining
Data Integration and Transformation in Data miningData Integration and Transformation in Data mining
Data Integration and Transformation in Data miningkavitha muneeshwaran
 
4.2 spatial data mining
4.2 spatial data mining4.2 spatial data mining
4.2 spatial data miningKrish_ver2
 
Introduction to Big Data Analytics
Introduction to Big Data AnalyticsIntroduction to Big Data Analytics
Introduction to Big Data AnalyticsUtkarsh Sharma
 
Introduction to Data Mining
Introduction to Data Mining Introduction to Data Mining
Introduction to Data Mining Sushil Kulkarni
 
4.3 multimedia datamining
4.3 multimedia datamining4.3 multimedia datamining
4.3 multimedia dataminingKrish_ver2
 

What's hot (20)

Data mining , Knowledge Discovery Process, Classification
Data mining , Knowledge Discovery Process, ClassificationData mining , Knowledge Discovery Process, Classification
Data mining , Knowledge Discovery Process, Classification
 
Kdd process
Kdd processKdd process
Kdd process
 
Data Preprocessing
Data PreprocessingData Preprocessing
Data Preprocessing
 
Slide 2 data models
Slide 2 data modelsSlide 2 data models
Slide 2 data models
 
Object Oriented Database Management System
Object Oriented Database Management SystemObject Oriented Database Management System
Object Oriented Database Management System
 
Clustering in Data Mining
Clustering in Data MiningClustering in Data Mining
Clustering in Data Mining
 
Data mining an introduction
Data mining an introductionData mining an introduction
Data mining an introduction
 
Data Mining: Data cube computation and data generalization
Data Mining: Data cube computation and data generalizationData Mining: Data cube computation and data generalization
Data Mining: Data cube computation and data generalization
 
Data Mining: Concepts and Techniques chapter 07 : Advanced Frequent Pattern M...
Data Mining: Concepts and Techniques chapter 07 : Advanced Frequent Pattern M...Data Mining: Concepts and Techniques chapter 07 : Advanced Frequent Pattern M...
Data Mining: Concepts and Techniques chapter 07 : Advanced Frequent Pattern M...
 
multi dimensional data model
multi dimensional data modelmulti dimensional data model
multi dimensional data model
 
Data mining primitives
Data mining primitivesData mining primitives
Data mining primitives
 
Data Integration and Transformation in Data mining
Data Integration and Transformation in Data miningData Integration and Transformation in Data mining
Data Integration and Transformation in Data mining
 
5desc
5desc5desc
5desc
 
Data mining
Data mining Data mining
Data mining
 
4.2 spatial data mining
4.2 spatial data mining4.2 spatial data mining
4.2 spatial data mining
 
Introduction to Big Data Analytics
Introduction to Big Data AnalyticsIntroduction to Big Data Analytics
Introduction to Big Data Analytics
 
Introduction to Data Mining
Introduction to Data Mining Introduction to Data Mining
Introduction to Data Mining
 
Introduction to Relational Databases
Introduction to Relational DatabasesIntroduction to Relational Databases
Introduction to Relational Databases
 
Text mining
Text miningText mining
Text mining
 
4.3 multimedia datamining
4.3 multimedia datamining4.3 multimedia datamining
4.3 multimedia datamining
 

Viewers also liked

"STAGES IN THE LIFE OF MINE"
"STAGES IN THE LIFE OF MINE""STAGES IN THE LIFE OF MINE"
"STAGES IN THE LIFE OF MINE"Anus KK
 

Viewers also liked (20)

BPI@BPM2012
BPI@BPM2012BPI@BPM2012
BPI@BPM2012
 
"STAGES IN THE LIFE OF MINE"
"STAGES IN THE LIFE OF MINE""STAGES IN THE LIFE OF MINE"
"STAGES IN THE LIFE OF MINE"
 
Unit operations of mining
Unit operations of miningUnit operations of mining
Unit operations of mining
 
TITANIUM ORE DEPOSITS IN EGYPT
TITANIUM ORE DEPOSITS IN EGYPTTITANIUM ORE DEPOSITS IN EGYPT
TITANIUM ORE DEPOSITS IN EGYPT
 
Introductory mining
Introductory miningIntroductory mining
Introductory mining
 
ASBESTOS, VERMICULITE, COURUNDUM, MAGNESITE, AND TALC DEPOSITS IN EGYPT
ASBESTOS, VERMICULITE, COURUNDUM, MAGNESITE, AND TALC DEPOSITS IN EGYPTASBESTOS, VERMICULITE, COURUNDUM, MAGNESITE, AND TALC DEPOSITS IN EGYPT
ASBESTOS, VERMICULITE, COURUNDUM, MAGNESITE, AND TALC DEPOSITS IN EGYPT
 
CHROMITE ORE DEPOSITS IN EGYPT
CHROMITE ORE DEPOSITS IN EGYPTCHROMITE ORE DEPOSITS IN EGYPT
CHROMITE ORE DEPOSITS IN EGYPT
 
SULFIDE MINERALIZATION IN EGYPT
SULFIDE MINERALIZATION IN EGYPT SULFIDE MINERALIZATION IN EGYPT
SULFIDE MINERALIZATION IN EGYPT
 
Beneficiation and mineral processing of magnesium minerals
Beneficiation and mineral processing of magnesium mineralsBeneficiation and mineral processing of magnesium minerals
Beneficiation and mineral processing of magnesium minerals
 
MANGANESE ORE DEPOSITS IN EGYPT
MANGANESE ORE DEPOSITS IN EGYPT MANGANESE ORE DEPOSITS IN EGYPT
MANGANESE ORE DEPOSITS IN EGYPT
 
PHOSPHATE ORE DEPOSITS IN EGYPT
PHOSPHATE  ORE DEPOSITS IN EGYPTPHOSPHATE  ORE DEPOSITS IN EGYPT
PHOSPHATE ORE DEPOSITS IN EGYPT
 
Topic 2 the mining cycle
Topic 2  the mining cycleTopic 2  the mining cycle
Topic 2 the mining cycle
 
IRON ORE DEPOSITS IN EGYPT
IRON ORE DEPOSITS IN EGYPT IRON ORE DEPOSITS IN EGYPT
IRON ORE DEPOSITS IN EGYPT
 
Classification of Mineral Deposit in Egypt
Classification of Mineral Deposit in EgyptClassification of Mineral Deposit in Egypt
Classification of Mineral Deposit in Egypt
 
Topic 7-mining methods-part iii -surface mining- placer mining
Topic 7-mining methods-part iii -surface mining- placer miningTopic 7-mining methods-part iii -surface mining- placer mining
Topic 7-mining methods-part iii -surface mining- placer mining
 
Beneficiation and Mineral Processing of Calcium Carbonate and Calcium Sulphate
Beneficiation and Mineral Processing of Calcium Carbonate and Calcium Sulphate Beneficiation and Mineral Processing of Calcium Carbonate and Calcium Sulphate
Beneficiation and Mineral Processing of Calcium Carbonate and Calcium Sulphate
 
Beneficiation and mineral processing of sand and silica sand
Beneficiation and mineral processing of  sand and silica sandBeneficiation and mineral processing of  sand and silica sand
Beneficiation and mineral processing of sand and silica sand
 
Ventilation of underground mine
Ventilation of underground mineVentilation of underground mine
Ventilation of underground mine
 
Egyptian Islands الجزر المصرية
Egyptian Islands  الجزر المصريةEgyptian Islands  الجزر المصرية
Egyptian Islands الجزر المصرية
 
URANIUM ORE DEPOSITS IN EGYPT
URANIUM ORE DEPOSITS IN EGYPTURANIUM ORE DEPOSITS IN EGYPT
URANIUM ORE DEPOSITS IN EGYPT
 

Similar to Part1

Cssu dw dm
Cssu dw dmCssu dw dm
Cssu dw dmsumit621
 
Dwdmunit1 a
Dwdmunit1 aDwdmunit1 a
Dwdmunit1 abhagathk
 
Data mining concepts and work
Data mining concepts and workData mining concepts and work
Data mining concepts and workAmr Abd El Latief
 
Introduction To Data Mining
Introduction To Data MiningIntroduction To Data Mining
Introduction To Data Miningdataminers.ir
 
Introduction To Data Mining
Introduction To Data Mining   Introduction To Data Mining
Introduction To Data Mining Phi Jack
 
Knowledge discovery claudiad amato
Knowledge discovery claudiad amatoKnowledge discovery claudiad amato
Knowledge discovery claudiad amatoSSSW
 
Tutorial Knowledge Discovery
Tutorial Knowledge DiscoveryTutorial Knowledge Discovery
Tutorial Knowledge DiscoverySSSW
 
Data-Mining-ppt (1).pdf
Data-Mining-ppt (1).pdfData-Mining-ppt (1).pdf
Data-Mining-ppt (1).pdfParvathyparu25
 
Introduction to data mining
Introduction to data miningIntroduction to data mining
Introduction to data miningUjjawal
 

Similar to Part1 (20)

Cssu dw dm
Cssu dw dmCssu dw dm
Cssu dw dm
 
Data mining
Data miningData mining
Data mining
 
Dwdmunit1 a
Dwdmunit1 aDwdmunit1 a
Dwdmunit1 a
 
Talk
TalkTalk
Talk
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data Mining
 
Data mining concepts and work
Data mining concepts and workData mining concepts and work
Data mining concepts and work
 
Data mining
Data miningData mining
Data mining
 
Data mining
Data miningData mining
Data mining
 
Chapter 1: Introduction to Data Mining
Chapter 1: Introduction to Data MiningChapter 1: Introduction to Data Mining
Chapter 1: Introduction to Data Mining
 
Data mining
Data miningData mining
Data mining
 
Introduction To Data Mining
Introduction To Data MiningIntroduction To Data Mining
Introduction To Data Mining
 
Introduction To Data Mining
Introduction To Data Mining   Introduction To Data Mining
Introduction To Data Mining
 
Knowledge discovery claudiad amato
Knowledge discovery claudiad amatoKnowledge discovery claudiad amato
Knowledge discovery claudiad amato
 
data.2.pptx
data.2.pptxdata.2.pptx
data.2.pptx
 
Tutorial Knowledge Discovery
Tutorial Knowledge DiscoveryTutorial Knowledge Discovery
Tutorial Knowledge Discovery
 
Seminar Presentation
Seminar PresentationSeminar Presentation
Seminar Presentation
 
Data Mining
Data MiningData Mining
Data Mining
 
Data-Mining-ppt (1).pdf
Data-Mining-ppt (1).pdfData-Mining-ppt (1).pdf
Data-Mining-ppt (1).pdf
 
data mining
data miningdata mining
data mining
 
Introduction to data mining
Introduction to data miningIntroduction to data mining
Introduction to data mining
 

More from sumit621

142230 633685297550892500
142230 633685297550892500142230 633685297550892500
142230 633685297550892500sumit621
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousingsumit621
 
Datamining
DataminingDatamining
Dataminingsumit621
 
Chapter 13 data warehousing
Chapter 13   data warehousingChapter 13   data warehousing
Chapter 13 data warehousingsumit621
 
90300 633579030311875000
90300 63357903031187500090300 633579030311875000
90300 633579030311875000sumit621
 

More from sumit621 (11)

142230 633685297550892500
142230 633685297550892500142230 633685297550892500
142230 633685297550892500
 
Lecture1
Lecture1Lecture1
Lecture1
 
Lect4
Lect4Lect4
Lect4
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 
Datamining
DataminingDatamining
Datamining
 
Database
DatabaseDatabase
Database
 
Chapter 13 data warehousing
Chapter 13   data warehousingChapter 13   data warehousing
Chapter 13 data warehousing
 
Chapter16
Chapter16Chapter16
Chapter16
 
Chap05
Chap05Chap05
Chap05
 
90300 633579030311875000
90300 63357903031187500090300 633579030311875000
90300 633579030311875000
 
01 intro
01 intro01 intro
01 intro
 

Part1