MCA535 Data Mining and Data Warehousing
Total teaching Hours/Semester: 60 No of Lecture Hours/Week: 04
Unit I. (12)
Introduction
What Is Data Mining? Data Mining—On What Kind of Data? Data Mining
Functionalities, Classification of Data Mining Systems, Data Mining Task Primitives,
Integration of a Data Mining System with a Database or Data Warehouse System, Major
Issues in Data Mining.
Data Preprocessing
Why Preprocess the Data? Descriptive Data Summarization – Measuring the Central
Tendency – Measuring the Dispersion of Data.
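As an illustration of the descriptive summarization topics listed above, the following is a minimal sketch using only Python's standard `statistics` module. The function name `summarize` and the sample values are hypothetical, and the choice of population (rather than sample) variance is an assumption of this sketch, not something the syllabus prescribes.

```python
import statistics

def summarize(values):
    """Return common central-tendency and dispersion measures for a list of numbers."""
    ordered = sorted(values)
    # Quartiles via the module's default ("exclusive") interpolation method.
    q1, _median, q3 = statistics.quantiles(ordered, n=4)
    return {
        # Central tendency
        "mean": statistics.mean(ordered),
        "median": statistics.median(ordered),
        "modes": statistics.multimode(ordered),
        # Dispersion
        "range": ordered[-1] - ordered[0],
        "iqr": q3 - q1,
        "variance": statistics.pvariance(ordered),  # population variance (assumed)
        "std_dev": statistics.pstdev(ordered),
    }

# Hypothetical sample data, for illustration only.
summary = summarize([2, 4, 4, 4, 5, 5, 7, 9])
print(summary)
```

The interquartile range and standard deviation answer different questions: the former is robust to extreme values, while the latter is sensitive to them.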
Unit II. (12)
Data Preprocessing (cont.)
Data Cleaning – Missing Values – Noisy Data – Data Cleaning as a Process, Data Integration
and Transformation, Data Reduction – Data Cube Aggregation – Attribute Subset Selection –
Dimensionality Reduction – Numerosity Reduction.
Data Warehouse and OLAP Technology
What Is a Data Warehouse? A Multidimensional Data Model, Data Warehouse
Architecture, Data Warehouse Implementation, From Data Warehousing to Data Mining.
Unit III. (12)
Data Cube Computation and Data Generalization
Efficient Methods for Data Cube Computation – Road Map – Multiway Array Aggregation
– Star-Cubing, Further Development of Data Cube and OLAP Technology.
Mining Frequent Patterns and Associations
Basic Concepts, Efficient and Scalable Frequent Itemset Mining Methods – Apriori
Algorithm, Generating Rules – Improving Efficiency – Mining Frequent Itemsets without
Candidate Generation.
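For orientation, the level-wise idea behind the Apriori algorithm named above can be sketched as follows. This is a minimal, unoptimized illustration: the function name `apriori`, the use of an absolute support count, and the transaction data are all assumptions of this sketch, and rule generation is not shown.

```python
from itertools import combinations

def apriori(transactions, min_support):
    """Return {frozenset(itemset): support_count} for every frequent itemset.

    min_support is an absolute transaction count (an assumption of this sketch).
    """
    transactions = [frozenset(t) for t in transactions]

    def count_frequent(candidates):
        # One pass over the data: count each candidate, keep those meeting min_support.
        counts = {c: 0 for c in candidates}
        for t in transactions:
            for c in candidates:
                if c <= t:
                    counts[c] += 1
        return {c: n for c, n in counts.items() if n >= min_support}

    # Frequent 1-itemsets.
    level = count_frequent({frozenset([i]) for t in transactions for i in t})
    frequent = dict(level)
    k = 2
    while level:
        prev = set(level)
        # Join step: unions of two frequent (k-1)-itemsets that contain exactly k items.
        candidates = {a | b for a in prev for b in prev if len(a | b) == k}
        # Prune step (Apriori property): every (k-1)-subset must itself be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in prev for s in combinations(c, k - 1))
        }
        level = count_frequent(candidates)
        frequent.update(level)
        k += 1
    return frequent

# Hypothetical transaction database, for illustration only.
db = [
    {"I1", "I2", "I5"}, {"I2", "I4"}, {"I2", "I3"},
    {"I1", "I2", "I4"}, {"I1", "I3"}, {"I2", "I3"},
    {"I1", "I3"}, {"I1", "I2", "I3", "I5"}, {"I1", "I2", "I3"},
]
result = apriori(db, min_support=2)
```

The prune step is what separates Apriori from brute-force enumeration: a candidate is never counted against the data if any of its subsets is already known to be infrequent.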
Unit IV. (14)
Classification and Prediction
What Is Classification? What Is Prediction? Issues Regarding Classification and
Prediction, Classification by Decision Tree – Decision Tree Induction – Attribute
Selection, Bayesian Classification – Bayes' Theorem – Naïve Bayesian Classification, Rule-Based
Classification, Prediction, Accuracy and Error Measures.
Cluster Analysis
What Is Cluster Analysis? Types of Data in Cluster Analysis, A Categorization of Major
Clustering Methods, Partitioning Methods – K-Means and K-Medoids, Hierarchical
Methods – Agglomerative and Divisive, Density-Based Methods – DBSCAN, Outlier
Analysis – Statistical-Based.
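As a concrete reference point for the partitioning methods listed above, here is a minimal sketch of K-Means (Lloyd's iteration) in plain Python. The function name `k_means`, the sample points, and the choice to seed centroids with the first k points are assumptions made to keep the sketch short and deterministic; practical implementations usually randomize the initialization.

```python
def k_means(points, k, max_iter=100):
    """Cluster equal-length numeric tuples into k groups.

    Initial centroids are the first k points (an assumption of this sketch).
    Returns (centroids, labels).
    """
    centroids = [tuple(p) for p in points[:k]]
    labels = []
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest centroid
        # (squared Euclidean distance).
        labels = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])))
            for p in points
        ]
        # Update step: each centroid becomes the mean of its assigned points.
        new_centroids = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centroids.append(
                    tuple(sum(dim) / len(members) for dim in zip(*members))
                )
            else:
                new_centroids.append(centroids[c])  # leave an empty cluster in place
        if new_centroids == centroids:  # converged: centroids stopped moving
            break
        centroids = new_centroids
    return centroids, labels

# Hypothetical 2-D points forming two well-separated groups.
pts = [(0, 0), (10, 10), (0, 1), (10, 11), (1, 0), (11, 10)]
centers, assignment = k_means(pts, k=2)
```

K-Medoids follows the same assign-then-update pattern but restricts each cluster representative to an actual data point, which makes it less sensitive to outliers.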
Syllabus 2009 MCA 84
Christ University, Bangalore, India
Unit V. (10)
Mining Time-Series and Spatial Data
Mining Time-Series Data – Trend Analysis – Similarity Search, Spatial Data Mining –
Spatial Data Cube Construction and Spatial OLAP – Mining Spatial Association and Co-location
Patterns – Spatial Clustering, Classification Methods – Mining Raster Databases.
Applications and Trends in Data Mining
Data Mining Applications, Data Mining System Products and Research Prototypes,
Social Impacts of Data Mining.
Text Book:
1. Jiawei Han and Micheline Kamber, Data Mining: Concepts and Techniques,
Morgan Kaufmann Publishers, San Francisco, USA, 2nd edition, 2006.
Reference Books:
1. Claudia Imhoff, Nicholas Galemmo and Jonathan G. Geiger, Mastering Data Warehouse
Design, John Wiley & Sons, 2003.
2. Berson A & Smith S J, Data Warehousing, Data Mining & OLAP, McGraw-Hill,
1997.
3. Margaret H. Dunham, Data Mining: Introductory and Advanced Topics, Pearson
Education, 2003.
4. Inmon W H, Building the Data Warehouse, John Wiley & Sons, 3rd edition,
2005.
