SlideShare a Scribd company logo
1 of 12
INTRODUCTION TO
DATA MINING
Samrat Devidas Tayade
TE IT
ARMIET
prerequisites
• Knowledge of databases
• Data warehousing
• Olap
Knowledge of databases
• Database : A database is an organized collection of data, generally
stored and accessed electronically from a computer system.
Where databases are more complex they are often developed
using formal design and modeling techniques.
• Database Management System (DBMS) – add, remove, update
records – retrieve data that match certain criteria – cross-reference
data in different tables – perform complex aggregate calculation •
Database consists of columns (attributes) and rows (records).
Data warehousing
Data warehousing is the process of
constructing and using a data
warehouse. A data warehouse is
constructed by integrating data from
multiple heterogeneous sources that
support analytical reporting,
structured and/or ad hoc queries, and
decision making. Data warehousing
involves data cleaning, data
integration, and data consolidations.
OLAP
Online Analytical Processing Server (OLAP) is based on the
multidimensional data model. It allows managers, and analysts to get
an insight of the information through fast, consistent, and interactive
access to information.
TYPES OF OLAP :
1. Relational OLAP (ROLAP)
2. Multidimensional OLAP (MOLAP)
3. Hybrid OLAP (HOLAP)
4. Specialized SQL Servers
Content
• What is data mining
• Kind to be mined
• Technologies used
• Major issues in data
mining
What is data mining
• The practice of examining large pre-existing databases in order to generate new
information.
• Data Mining is defined as extracting information from huge sets of data. In other
words, we can say that data mining is the procedure of mining knowledge from
data.
• The information or knowledge extracted so can be used for any of the following
applications −
1. Market Analysis
2. Fraud Detection
3. Customer Retention
4. Production Control
5. Science Exploration
Kind to be mined
• Kind of knowledge to be mined
• It refers to the kind of functions to be performed.
• These functions are −
I. Characterization
II. Discrimination
III. Association and Correlation Analysis
IV. Classification
V. Prediction
VI. Clustering
VII. Outlier Analysis
VIII.Evolution Analysis
Kind of data mined
1.Flat Files
2.Relational Databases
3.DataWarehouse
4.Transactional Databases
5.Multimedia Databases
6.Spatial Databases
7.Time Series Databases
8.World Wide Web(WWW)
Technologies used
Major issues in data mining
References
• Google
• Wikipedia
• Tutorials point
• www.lsamratl.tk

More Related Content

What's hot

Data Warehouse Architectures
Data Warehouse ArchitecturesData Warehouse Architectures
Data Warehouse Architectures
Theju Paul
 
Mis chapter5
Mis chapter5Mis chapter5
Mis chapter5
Poleak
 
Tatyana Matvienko,Senior Java Developer, Big data storages
 Tatyana Matvienko,Senior Java Developer, Big data storages Tatyana Matvienko,Senior Java Developer, Big data storages
Tatyana Matvienko,Senior Java Developer, Big data storages
Alina Vilk
 

What's hot (16)

Data Mining: Key definitions
Data Mining: Key definitionsData Mining: Key definitions
Data Mining: Key definitions
 
Data Warehouse Architectures
Data Warehouse ArchitecturesData Warehouse Architectures
Data Warehouse Architectures
 
Data flow ii extract
Data flow   ii extractData flow   ii extract
Data flow ii extract
 
Over view of data structures
Over view of data structuresOver view of data structures
Over view of data structures
 
Data warehouse 13 data transformation
Data warehouse 13 data transformationData warehouse 13 data transformation
Data warehouse 13 data transformation
 
Session 10 data
Session 10 dataSession 10 data
Session 10 data
 
Mis chapter5
Mis chapter5Mis chapter5
Mis chapter5
 
Mis chapter5
Mis chapter5Mis chapter5
Mis chapter5
 
Tatyana Matvienko,Senior Java Developer, Big data storages
 Tatyana Matvienko,Senior Java Developer, Big data storages Tatyana Matvienko,Senior Java Developer, Big data storages
Tatyana Matvienko,Senior Java Developer, Big data storages
 
Big data storages
Big data storagesBig data storages
Big data storages
 
Final presentation
Final presentationFinal presentation
Final presentation
 
Data warehouse and olap technology
Data warehouse and olap technologyData warehouse and olap technology
Data warehouse and olap technology
 
Dw Concepts
Dw ConceptsDw Concepts
Dw Concepts
 
Data Management_TL III Annual Meet
Data Management_TL III Annual MeetData Management_TL III Annual Meet
Data Management_TL III Annual Meet
 
data warehousing and data mining
data warehousing and data mining data warehousing and data mining
data warehousing and data mining
 
Datawarehouse
DatawarehouseDatawarehouse
Datawarehouse
 

Similar to Introduction to data mining

ETL processes , Datawarehouse and Datamarts.pptx
ETL processes , Datawarehouse and Datamarts.pptxETL processes , Datawarehouse and Datamarts.pptx
ETL processes , Datawarehouse and Datamarts.pptx
ParnalSatle
 

Similar to Introduction to data mining (20)

Data warehouse introduction
Data warehouse introductionData warehouse introduction
Data warehouse introduction
 
Data ware housing - Introduction to data ware housing process.
Data ware housing - Introduction to data ware housing process.Data ware housing - Introduction to data ware housing process.
Data ware housing - Introduction to data ware housing process.
 
data warehousing
data warehousingdata warehousing
data warehousing
 
DATA WAREHOUSING.2.pptx
DATA WAREHOUSING.2.pptxDATA WAREHOUSING.2.pptx
DATA WAREHOUSING.2.pptx
 
Unit 3 part i Data mining
Unit 3 part i Data miningUnit 3 part i Data mining
Unit 3 part i Data mining
 
Data mining techniques unit 1
Data mining techniques  unit 1Data mining techniques  unit 1
Data mining techniques unit 1
 
Introduction to data mining and data warehousing
Introduction to data mining and data warehousingIntroduction to data mining and data warehousing
Introduction to data mining and data warehousing
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Various Applications of Data Warehouse.ppt
Various Applications of Data Warehouse.pptVarious Applications of Data Warehouse.ppt
Various Applications of Data Warehouse.ppt
 
Online analytical processing
Online analytical processingOnline analytical processing
Online analytical processing
 
DW (1).ppt
DW (1).pptDW (1).ppt
DW (1).ppt
 
ETL processes , Datawarehouse and Datamarts.pptx
ETL processes , Datawarehouse and Datamarts.pptxETL processes , Datawarehouse and Datamarts.pptx
ETL processes , Datawarehouse and Datamarts.pptx
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 
3 OLAP.pptx
3 OLAP.pptx3 OLAP.pptx
3 OLAP.pptx
 
Lecture2 (1).ppt
Lecture2 (1).pptLecture2 (1).ppt
Lecture2 (1).ppt
 
Dw 07032018-dr pl pradhan
Dw 07032018-dr pl pradhanDw 07032018-dr pl pradhan
Dw 07032018-dr pl pradhan
 
OLAP OnLine Analytical Processing
OLAP OnLine Analytical ProcessingOLAP OnLine Analytical Processing
OLAP OnLine Analytical Processing
 
data warehousing
data warehousingdata warehousing
data warehousing
 

Recently uploaded

Digital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptxDigital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptx
pritamlangde
 
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
AldoGarca30
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
Epec Engineered Technologies
 
Introduction to Robotics in Mechanical Engineering.pptx
Introduction to Robotics in Mechanical Engineering.pptxIntroduction to Robotics in Mechanical Engineering.pptx
Introduction to Robotics in Mechanical Engineering.pptx
hublikarsn
 

Recently uploaded (20)

S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxS1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
 
Digital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptxDigital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptx
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS Lambda
 
Introduction to Data Visualization,Matplotlib.pdf
Introduction to Data Visualization,Matplotlib.pdfIntroduction to Data Visualization,Matplotlib.pdf
Introduction to Data Visualization,Matplotlib.pdf
 
8086 Microprocessor Architecture: 16-bit microprocessor
8086 Microprocessor Architecture: 16-bit microprocessor8086 Microprocessor Architecture: 16-bit microprocessor
8086 Microprocessor Architecture: 16-bit microprocessor
 
Post office management system project ..pdf
Post office management system project ..pdfPost office management system project ..pdf
Post office management system project ..pdf
 
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdf
 
Electromagnetic relays used for power system .pptx
Electromagnetic relays used for power system .pptxElectromagnetic relays used for power system .pptx
Electromagnetic relays used for power system .pptx
 
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxHOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
 
Augmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptxAugmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptx
 
Max. shear stress theory-Maximum Shear Stress Theory ​ Maximum Distortional ...
Max. shear stress theory-Maximum Shear Stress Theory ​  Maximum Distortional ...Max. shear stress theory-Maximum Shear Stress Theory ​  Maximum Distortional ...
Max. shear stress theory-Maximum Shear Stress Theory ​ Maximum Distortional ...
 
Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...
 
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
 
Theory of Time 2024 (Universal Theory for Everything)
Theory of Time 2024 (Universal Theory for Everything)Theory of Time 2024 (Universal Theory for Everything)
Theory of Time 2024 (Universal Theory for Everything)
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
 
Introduction to Robotics in Mechanical Engineering.pptx
Introduction to Robotics in Mechanical Engineering.pptxIntroduction to Robotics in Mechanical Engineering.pptx
Introduction to Robotics in Mechanical Engineering.pptx
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdf
 
Worksharing and 3D Modeling with Revit.pptx
Worksharing and 3D Modeling with Revit.pptxWorksharing and 3D Modeling with Revit.pptx
Worksharing and 3D Modeling with Revit.pptx
 
Memory Interfacing of 8086 with DMA 8257
Memory Interfacing of 8086 with DMA 8257Memory Interfacing of 8086 with DMA 8257
Memory Interfacing of 8086 with DMA 8257
 

Introduction to data mining

  • 1. INTRODUCTION TO DATA MINING Samrat Devidas Tayade TE IT ARMIET
  • 2. prerequisites • Knowledge of databases • Data warehousing • Olap
  • 3. Knowledge of databases • Database : A database is an organized collection of data, generally stored and accessed electronically from a computer system. Where databases are more complex they are often developed using formal design and modeling techniques. • Database Management System (DBMS) – add, remove, update records – retrieve data that match certain criteria – cross-reference data in different tables – perform complex aggregate calculation • Database consists of columns (attributes) and rows (records).
  • 4. Data warehousing Data warehousing is the process of constructing and using a data warehouse. A data warehouse is constructed by integrating data from multiple heterogeneous sources that support analytical reporting, structured and/or ad hoc queries, and decision making. Data warehousing involves data cleaning, data integration, and data consolidations.
  • 5. OLAP Online Analytical Processing Server (OLAP) is based on the multidimensional data model. It allows managers, and analysts to get an insight of the information through fast, consistent, and interactive access to information. TYPES OF OLAP : 1. Relational OLAP (ROLAP) 2. Multidimensional OLAP (MOLAP) 3. Hybrid OLAP (HOLAP) 4. Specialized SQL Servers
  • 6. Content • What is data mining • Kind to be mined • Technologies used • Major issues in data mining
  • 7. What is data mining • The practice of examining large pre-existing databases in order to generate new information. • Data Mining is defined as extracting information from huge sets of data. In other words, we can say that data mining is the procedure of mining knowledge from data. • The information or knowledge extracted so can be used for any of the following applications − 1. Market Analysis 2. Fraud Detection 3. Customer Retention 4. Production Control 5. Science Exploration
  • 8. Kind to be mined • Kind of knowledge to be mined • It refers to the kind of functions to be performed. • These functions are − I. Characterization II. Discrimination III. Association and Correlation Analysis IV. Classification V. Prediction VI. Clustering VII. Outlier Analysis VIII.Evolution Analysis
  • 9. Kind of data mined 1.Flat Files 2.Relational Databases 3.DataWarehouse 4.Transactional Databases 5.Multimedia Databases 6.Spatial Databases 7.Time Series Databases 8.World Wide Web(WWW)
  • 11. Major issues in data mining
  • 12. References • Google • Wikipedia • Tutorials point • www.lsamratl.tk