SlideShare a Scribd company logo
1 I NAME OF PRESENTER
Data Mining
Ashis Kumar Chanda
Department of Computer Science and Engineering
University of Dhaka
2 I NAME OF PRESENTERCSE, DU2
Key concepts
 What is Data mining
 Why learn Data mining
 Data type
 Warehouse & OLAP
 Data Cleaning, Integration
 Associations, Item sets, Support, Confidence
3 I NAME OF PRESENTERCSE, DU3
Data Mining
 Data mining refers to Knowledge mining
from large amount of data
 Also known as “Knowledge Discovery from
Data” or KDD
 Target is to find a Hidden Pattern
4 I NAME OF PRESENTER
 We can’t get all type of information through Query
 Query not support Statistical analysis
 Again, we can apply artificial intelligence & find new
patterns or structures
CSE, DU4
Why learn data mining
Query provide values but data mining provides idea that help
to take (business ) decision
Ex: Women live at “Dhanmondi” & older than 40 years
most frequently buy “Jamdani Shari” at “Arong”
5 I NAME OF PRESENTERCSE, DU5
Data type
 Tabular (Transaction data) Most commonly
used
 Spatial Data (Remote sensing data/
encoded data)
 Tree Data ( xml )
 Graphs (www, bio-molecular)
 Sequence (DNA, activity log)
 Text, multimedia data
6 I NAME OF PRESENTERCSE, DU6
Warehouse & OLAP
Ware House
Data Source
Warehouse is an archive of information gathered from
multiple sources
Suppose a Banking database where each has a data source
that stores all transactions of that area. And all data source
will provide a clean/safe copy at Warehouse
7 I NAME OF PRESENTERCSE, DU7
Warehouse & OLAP
There is several issues about Warehouse:
 When and how to gather data
 What schema/pattern to use
 Data transformation & cleaning
 How to update
“Warehouse is a collection of data marts”
Where data mart is store of data in specialized pattern
8 I NAME OF PRESENTERCSE, DU8
Warehouse & OLAP
OLAP: Online Analytical Processing
OLAP tools support interactive analysis of summary Information
OLAP permits an analyst to view different summaries of
multidimensional data
Item name
Dress
Fig: Data Cube
9 I NAME OF PRESENTERCSE, DU9
Data cleaning
There may be some missing data, duplicate data, dirty data
So we need to data cleaning
Some methods:
 Ignore the tuple (not effective unless tuple contain many
missing attribute)
 Fill missing values (time consuming)
 Fill with a global value (like: unknown)
 Use mean attribute
 Use most probable value
10 I NAME OF PRESENTERCSE, DU10
11 I NAME OF PRESENTERCSE, DU11
Associations & Item sets
Associations:
An associations is a rule of the form if X then Y
It is denoted as X-> Y
Example: if there is an exam then I read
Item Sets:
For any rule if X->Y & Y->X Then X, Y are called item-set
Example:
People buying school books in January also by notebook
People buying school note books in January also by book
12 I NAME OF PRESENTERCSE, DU12
Support & confidence
Support:
The proportion of transactions in the data set which contains
the itemset
Confidence:
The conditional probability that an item appears in a
transaction when another item appears.
13 I NAME OF PRESENTERCSE, DU13
Support & confidence
Support for {I₁,I₂}
= support_count(I1 U I2)/ |D|
= 4/9
Confidence for I1 → I2
=support_count(I1 U I2) /
support_count(I1)
= 4/6
14 I NAME OF PRESENTERCSE, DU14
Association rules
Where, support count(AUB) is the number of transactions
containing the itemsets AUB, and support count(A) is the
number of transactions containing the itemset A.
•Association rules can be generated as follows:
1. For each frequent itemset l, generate all nonempty subsets
of l.
2. For every nonempty subset s of l, output the rule “s → (l-
s)” if support count(l)/support count(s) >= min_conf,
where min_conf is the minimum confidence threshold.
15 I NAME OF PRESENTERCSE, DU15
Summary
Basic topics: Data mining, Data cleaning, Warehouse, OLAP
Term: Association, Item-set, Support, Confidence
16 I NAME OF PRESENTERCSE, DU16
References
- Data Mining Concepts & Techniques
by J. Han & M. Kamber
- Database system Concept
by Abraham Sillberschatz, Korth, Sudarshan
- Lecture of Dr. S. Srinath
Institute of Technology at Madras, India

More Related Content

What's hot

Introduction to data pre-processing and cleaning
Introduction to data pre-processing and cleaning Introduction to data pre-processing and cleaning
Introduction to data pre-processing and cleaning
Matteo Manca
 
A basic course on Research data management, part 4: caring for your data, or ...
A basic course on Research data management, part 4: caring for your data, or ...A basic course on Research data management, part 4: caring for your data, or ...
A basic course on Research data management, part 4: caring for your data, or ...
Leon Osinski
 
23.database
23.database23.database
23.database
Bayarmaa GBayarmaa
 
A classification of methods for frequent pattern mining
A classification of methods for frequent pattern miningA classification of methods for frequent pattern mining
A classification of methods for frequent pattern mining
IOSR Journals
 
DataVsStatistics
DataVsStatisticsDataVsStatistics
DataVsStatistics
jpheintz
 
A basic course on Research data management, part 1: what and why
A basic course on Research data management, part 1: what and whyA basic course on Research data management, part 1: what and why
A basic course on Research data management, part 1: what and why
Leon Osinski
 
Data Mining: Mining ,associations, and correlations
Data Mining: Mining ,associations, and correlationsData Mining: Mining ,associations, and correlations
Data Mining: Mining ,associations, and correlations
DataminingTools Inc
 
Data mining nouman javed
Data mining   nouman javedData mining   nouman javed
Data mining nouman javed
nouman javed
 
Research trends in data warehousing and data mining
Research trends in data warehousing and data miningResearch trends in data warehousing and data mining
Research trends in data warehousing and data mining
Er. Nawaraj Bhandari
 
Simulation and Performance Analysis of Long Term Evolution (LTE) Cellular Net...
Simulation and Performance Analysis of Long Term Evolution (LTE) Cellular Net...Simulation and Performance Analysis of Long Term Evolution (LTE) Cellular Net...
Simulation and Performance Analysis of Long Term Evolution (LTE) Cellular Net...
ijsrd.com
 
1 db terms
1 db terms1 db terms
Data pre processing
Data pre processingData pre processing
Data pre processing
pommurajopt
 
Database
DatabaseDatabase
Database
Chinmay Raul
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
suganmca14
 
Data structures
Data structuresData structures
Data structures
Amrutha Rajan
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
Daniel JACOB
 
EDI Training Module 11: Publishing Data in the EDI Repository
EDI Training Module 11:  Publishing Data in the EDI RepositoryEDI Training Module 11:  Publishing Data in the EDI Repository
EDI Training Module 11: Publishing Data in the EDI Repository
Environmental Data Initiative
 
Data Mining: Classification and analysis
Data Mining: Classification and analysisData Mining: Classification and analysis
Data Mining: Classification and analysis
DataminingTools Inc
 
A Study of Various Projected Data Based Pattern Mining Algorithms
A Study of Various Projected Data Based Pattern Mining AlgorithmsA Study of Various Projected Data Based Pattern Mining Algorithms
A Study of Various Projected Data Based Pattern Mining Algorithms
ijsrd.com
 
Data Warehouse By Piyush
Data Warehouse By PiyushData Warehouse By Piyush
Data Warehouse By Piyush
astronish
 

What's hot (20)

Introduction to data pre-processing and cleaning
Introduction to data pre-processing and cleaning Introduction to data pre-processing and cleaning
Introduction to data pre-processing and cleaning
 
A basic course on Research data management, part 4: caring for your data, or ...
A basic course on Research data management, part 4: caring for your data, or ...A basic course on Research data management, part 4: caring for your data, or ...
A basic course on Research data management, part 4: caring for your data, or ...
 
23.database
23.database23.database
23.database
 
A classification of methods for frequent pattern mining
A classification of methods for frequent pattern miningA classification of methods for frequent pattern mining
A classification of methods for frequent pattern mining
 
DataVsStatistics
DataVsStatisticsDataVsStatistics
DataVsStatistics
 
A basic course on Research data management, part 1: what and why
A basic course on Research data management, part 1: what and whyA basic course on Research data management, part 1: what and why
A basic course on Research data management, part 1: what and why
 
Data Mining: Mining ,associations, and correlations
Data Mining: Mining ,associations, and correlationsData Mining: Mining ,associations, and correlations
Data Mining: Mining ,associations, and correlations
 
Data mining nouman javed
Data mining   nouman javedData mining   nouman javed
Data mining nouman javed
 
Research trends in data warehousing and data mining
Research trends in data warehousing and data miningResearch trends in data warehousing and data mining
Research trends in data warehousing and data mining
 
Simulation and Performance Analysis of Long Term Evolution (LTE) Cellular Net...
Simulation and Performance Analysis of Long Term Evolution (LTE) Cellular Net...Simulation and Performance Analysis of Long Term Evolution (LTE) Cellular Net...
Simulation and Performance Analysis of Long Term Evolution (LTE) Cellular Net...
 
1 db terms
1 db terms1 db terms
1 db terms
 
Data pre processing
Data pre processingData pre processing
Data pre processing
 
Database
DatabaseDatabase
Database
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
 
Data structures
Data structuresData structures
Data structures
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
 
EDI Training Module 11: Publishing Data in the EDI Repository
EDI Training Module 11:  Publishing Data in the EDI RepositoryEDI Training Module 11:  Publishing Data in the EDI Repository
EDI Training Module 11: Publishing Data in the EDI Repository
 
Data Mining: Classification and analysis
Data Mining: Classification and analysisData Mining: Classification and analysis
Data Mining: Classification and analysis
 
A Study of Various Projected Data Based Pattern Mining Algorithms
A Study of Various Projected Data Based Pattern Mining AlgorithmsA Study of Various Projected Data Based Pattern Mining Algorithms
A Study of Various Projected Data Based Pattern Mining Algorithms
 
Data Warehouse By Piyush
Data Warehouse By PiyushData Warehouse By Piyush
Data Warehouse By Piyush
 

Viewers also liked

Mining Data from Reservoir Simulation Result
Mining Data from Reservoir Simulation ResultMining Data from Reservoir Simulation Result
Mining Data from Reservoir Simulation Result
akmaltk96
 
10 flatteners
10 flatteners10 flatteners
10 flatteners
ernestnwanu
 
L’eutanasia
L’eutanasiaL’eutanasia
L’eutanasia
Laura Franchini
 
Activity in comp.
Activity in comp.Activity in comp.
Activity in comp.
Renz Ronald Prieto
 
Keep austin weird 2014.ppt
Keep austin weird 2014.pptKeep austin weird 2014.ppt
Keep austin weird 2014.ppt
Melinda Brasher
 
Keep Austin Weird 2013
Keep Austin Weird 2013Keep Austin Weird 2013
Keep Austin Weird 2013
Melinda Brasher
 
Tornado re brand presentation (draft)(not for reproduction)
Tornado re brand presentation (draft)(not for reproduction)Tornado re brand presentation (draft)(not for reproduction)
Tornado re brand presentation (draft)(not for reproduction)
Melinda Brasher
 
Final photos
Final photosFinal photos
Final photos
sholawelch
 
Test Powerpoint Upload
Test Powerpoint UploadTest Powerpoint Upload
Test Powerpoint Upload
Matthew Walton
 
Test audio
Test audioTest audio
Test audio
GalinaMi
 
Venticinque Aprile Un bellissimo giorno da ricordare e onorare
Venticinque   Aprile Un bellissimo giorno da ricordare e onorareVenticinque   Aprile Un bellissimo giorno da ricordare e onorare
Venticinque Aprile Un bellissimo giorno da ricordare e onorareLaura Franchini
 
Universal Design
Universal DesignUniversal Design
Universal Design
summerbloom
 
Nooges Brochure
Nooges BrochureNooges Brochure
Nooges Brochure
noogeking
 
Disney
DisneyDisney
Disney
sakuraluka
 
The colors of the flag
The colors of the flagThe colors of the flag
The colors of the flag
forever97
 
Mexican manufacturers inc 10 18
Mexican manufacturers inc 10 18Mexican manufacturers inc 10 18
Mexican manufacturers inc 10 18
John Martino
 
Nooges-T Project
Nooges-T ProjectNooges-T Project
Nooges-T Project
noogeking
 
Big Data vs. Smart Data: The Cook County Land Bank’s Data-Driven plan for lan...
Big Data vs. Smart Data: The Cook County Land Bank’s Data-Driven plan for lan...Big Data vs. Smart Data: The Cook County Land Bank’s Data-Driven plan for lan...
Big Data vs. Smart Data: The Cook County Land Bank’s Data-Driven plan for lan...
Cook County Commissioner Bridget Gainer
 
Fotosintesis
FotosintesisFotosintesis
Fotosintesis
Rudy Avengeance
 

Viewers also liked (20)

Mining Data from Reservoir Simulation Result
Mining Data from Reservoir Simulation ResultMining Data from Reservoir Simulation Result
Mining Data from Reservoir Simulation Result
 
10 flatteners
10 flatteners10 flatteners
10 flatteners
 
L’eutanasia
L’eutanasiaL’eutanasia
L’eutanasia
 
Activity in comp.
Activity in comp.Activity in comp.
Activity in comp.
 
Keep austin weird 2014.ppt
Keep austin weird 2014.pptKeep austin weird 2014.ppt
Keep austin weird 2014.ppt
 
Keep Austin Weird 2013
Keep Austin Weird 2013Keep Austin Weird 2013
Keep Austin Weird 2013
 
Tornado re brand presentation (draft)(not for reproduction)
Tornado re brand presentation (draft)(not for reproduction)Tornado re brand presentation (draft)(not for reproduction)
Tornado re brand presentation (draft)(not for reproduction)
 
Final photos
Final photosFinal photos
Final photos
 
Test Powerpoint Upload
Test Powerpoint UploadTest Powerpoint Upload
Test Powerpoint Upload
 
Test audio
Test audioTest audio
Test audio
 
Venticinque Aprile Un bellissimo giorno da ricordare e onorare
Venticinque   Aprile Un bellissimo giorno da ricordare e onorareVenticinque   Aprile Un bellissimo giorno da ricordare e onorare
Venticinque Aprile Un bellissimo giorno da ricordare e onorare
 
Universal Design
Universal DesignUniversal Design
Universal Design
 
Nooges Brochure
Nooges BrochureNooges Brochure
Nooges Brochure
 
Disney
DisneyDisney
Disney
 
The colors of the flag
The colors of the flagThe colors of the flag
The colors of the flag
 
Mexican manufacturers inc 10 18
Mexican manufacturers inc 10 18Mexican manufacturers inc 10 18
Mexican manufacturers inc 10 18
 
Nooges-T Project
Nooges-T ProjectNooges-T Project
Nooges-T Project
 
Big Data vs. Smart Data: The Cook County Land Bank’s Data-Driven plan for lan...
Big Data vs. Smart Data: The Cook County Land Bank’s Data-Driven plan for lan...Big Data vs. Smart Data: The Cook County Land Bank’s Data-Driven plan for lan...
Big Data vs. Smart Data: The Cook County Land Bank’s Data-Driven plan for lan...
 
Fotosintesis
FotosintesisFotosintesis
Fotosintesis
 
Jaguar
JaguarJaguar
Jaguar
 

Similar to Data Mining (Introduction)

Data Mining @ BSU Malolos 2019
Data Mining @ BSU Malolos 2019Data Mining @ BSU Malolos 2019
Data Mining @ BSU Malolos 2019
Edwin S. Garcia
 
MS SQL SERVER: Introduction To Database Concepts
MS SQL SERVER: Introduction To Database ConceptsMS SQL SERVER: Introduction To Database Concepts
MS SQL SERVER: Introduction To Database Concepts
sqlserver content
 
MS Sql Server: Introduction To Database Concepts
MS Sql Server: Introduction To Database ConceptsMS Sql Server: Introduction To Database Concepts
MS Sql Server: Introduction To Database Concepts
DataminingTools Inc
 
Data mining , Knowledge Discovery Process, Classification
Data mining , Knowledge Discovery Process, ClassificationData mining , Knowledge Discovery Process, Classification
Data mining , Knowledge Discovery Process, Classification
Dr. Abdul Ahad Abro
 
Lec 1 introduction
Lec 1 introductionLec 1 introduction
Lec 1 introduction
Shimul Ahmmed
 
SIM PASCA CHAPTER 4.pdf
SIM PASCA CHAPTER 4.pdfSIM PASCA CHAPTER 4.pdf
SIM PASCA CHAPTER 4.pdf
AdiSuputrq
 
Data mining
Data miningData mining
Business Intelligence
Business IntelligenceBusiness Intelligence
Business Intelligence
Sukirti Garg
 
Data Mining and Data Warehousing
Data Mining and Data WarehousingData Mining and Data Warehousing
Data Mining and Data Warehousing
Aswathy S Nair
 
DMDW Lesson 04 - Data Mining Theory
DMDW Lesson 04 - Data Mining TheoryDMDW Lesson 04 - Data Mining Theory
DMDW Lesson 04 - Data Mining Theory
Johannes Hoppe
 
Data warehousing interview questions
Data warehousing interview questionsData warehousing interview questions
Data warehousing interview questions
Satyam Jaiswal
 
Master Minds on Data Science - Arno Siebes
Master Minds on Data Science - Arno SiebesMaster Minds on Data Science - Arno Siebes
Master Minds on Data Science - Arno Siebes
Media Perspectives
 
Introduction to Data Science With R Notes
Introduction to Data Science With R NotesIntroduction to Data Science With R Notes
Introduction to Data Science With R Notes
LakshmiSarvani6
 
DMDW Lesson 03 - Data Warehouse Theory
DMDW Lesson 03 - Data Warehouse TheoryDMDW Lesson 03 - Data Warehouse Theory
DMDW Lesson 03 - Data Warehouse Theory
Johannes Hoppe
 
Database system Handbook 4th muhammad sharif.pdf
Database system Handbook 4th muhammad sharif.pdfDatabase system Handbook 4th muhammad sharif.pdf
Database system Handbook 4th muhammad sharif.pdf
Bahria University Islamabad, Pakistan
 
Database system Handbook 4th muhammad sharif.pdf
Database system Handbook 4th muhammad sharif.pdfDatabase system Handbook 4th muhammad sharif.pdf
Database system Handbook 4th muhammad sharif.pdf
Bahria University Islamabad, Pakistan
 
DMML1_overview.ppt
DMML1_overview.pptDMML1_overview.ppt
DMML1_overview.ppt
butest
 
Data Mining Concepts and Techniques
Data Mining Concepts and TechniquesData Mining Concepts and Techniques
Data Mining Concepts and Techniques
Pratik Tambekar
 
A Survey on Approaches for Frequent Item Set Mining on Apache Hadoop
A Survey on Approaches for Frequent Item Set Mining on Apache HadoopA Survey on Approaches for Frequent Item Set Mining on Apache Hadoop
A Survey on Approaches for Frequent Item Set Mining on Apache Hadoop
IJTET Journal
 
Data warehousing and Data mining
Data warehousing and Data mining Data warehousing and Data mining
Data warehousing and Data mining
Bahria University ,
 

Similar to Data Mining (Introduction) (20)

Data Mining @ BSU Malolos 2019
Data Mining @ BSU Malolos 2019Data Mining @ BSU Malolos 2019
Data Mining @ BSU Malolos 2019
 
MS SQL SERVER: Introduction To Database Concepts
MS SQL SERVER: Introduction To Database ConceptsMS SQL SERVER: Introduction To Database Concepts
MS SQL SERVER: Introduction To Database Concepts
 
MS Sql Server: Introduction To Database Concepts
MS Sql Server: Introduction To Database ConceptsMS Sql Server: Introduction To Database Concepts
MS Sql Server: Introduction To Database Concepts
 
Data mining , Knowledge Discovery Process, Classification
Data mining , Knowledge Discovery Process, ClassificationData mining , Knowledge Discovery Process, Classification
Data mining , Knowledge Discovery Process, Classification
 
Lec 1 introduction
Lec 1 introductionLec 1 introduction
Lec 1 introduction
 
SIM PASCA CHAPTER 4.pdf
SIM PASCA CHAPTER 4.pdfSIM PASCA CHAPTER 4.pdf
SIM PASCA CHAPTER 4.pdf
 
Data mining
Data miningData mining
Data mining
 
Business Intelligence
Business IntelligenceBusiness Intelligence
Business Intelligence
 
Data Mining and Data Warehousing
Data Mining and Data WarehousingData Mining and Data Warehousing
Data Mining and Data Warehousing
 
DMDW Lesson 04 - Data Mining Theory
DMDW Lesson 04 - Data Mining TheoryDMDW Lesson 04 - Data Mining Theory
DMDW Lesson 04 - Data Mining Theory
 
Data warehousing interview questions
Data warehousing interview questionsData warehousing interview questions
Data warehousing interview questions
 
Master Minds on Data Science - Arno Siebes
Master Minds on Data Science - Arno SiebesMaster Minds on Data Science - Arno Siebes
Master Minds on Data Science - Arno Siebes
 
Introduction to Data Science With R Notes
Introduction to Data Science With R NotesIntroduction to Data Science With R Notes
Introduction to Data Science With R Notes
 
DMDW Lesson 03 - Data Warehouse Theory
DMDW Lesson 03 - Data Warehouse TheoryDMDW Lesson 03 - Data Warehouse Theory
DMDW Lesson 03 - Data Warehouse Theory
 
Database system Handbook 4th muhammad sharif.pdf
Database system Handbook 4th muhammad sharif.pdfDatabase system Handbook 4th muhammad sharif.pdf
Database system Handbook 4th muhammad sharif.pdf
 
Database system Handbook 4th muhammad sharif.pdf
Database system Handbook 4th muhammad sharif.pdfDatabase system Handbook 4th muhammad sharif.pdf
Database system Handbook 4th muhammad sharif.pdf
 
DMML1_overview.ppt
DMML1_overview.pptDMML1_overview.ppt
DMML1_overview.ppt
 
Data Mining Concepts and Techniques
Data Mining Concepts and TechniquesData Mining Concepts and Techniques
Data Mining Concepts and Techniques
 
A Survey on Approaches for Frequent Item Set Mining on Apache Hadoop
A Survey on Approaches for Frequent Item Set Mining on Apache HadoopA Survey on Approaches for Frequent Item Set Mining on Apache Hadoop
A Survey on Approaches for Frequent Item Set Mining on Apache Hadoop
 
Data warehousing and Data mining
Data warehousing and Data mining Data warehousing and Data mining
Data warehousing and Data mining
 

More from Ashis Kumar Chanda

Word 2 vector
Word 2 vectorWord 2 vector
Word 2 vector
Ashis Kumar Chanda
 
Multi-class Image Classification using deep convolutional networks on extreme...
Multi-class Image Classification using deep convolutional networks on extreme...Multi-class Image Classification using deep convolutional networks on extreme...
Multi-class Image Classification using deep convolutional networks on extreme...
Ashis Kumar Chanda
 
Full resolution image compression with recurrent neural networks
Full resolution image compression with  recurrent neural networksFull resolution image compression with  recurrent neural networks
Full resolution image compression with recurrent neural networks
Ashis Kumar Chanda
 
Understanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational DatabasesUnderstanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational Databases
Ashis Kumar Chanda
 
03. Agile Development
03. Agile Development03. Agile Development
03. Agile Development
Ashis Kumar Chanda
 
Software Cost Estimation
Software Cost EstimationSoftware Cost Estimation
Software Cost Estimation
Ashis Kumar Chanda
 
Risk Management
Risk ManagementRisk Management
Risk Management
Ashis Kumar Chanda
 
Project Management
Project ManagementProject Management
Project Management
Ashis Kumar Chanda
 
MVC
MVCMVC
Requirements engineering
Requirements engineeringRequirements engineering
Requirements engineering
Ashis Kumar Chanda
 
4. UML
4. UML4. UML
2. Software process
2. Software process2. Software process
2. Software process
Ashis Kumar Chanda
 
1. Introduction
1. Introduction1. Introduction
1. Introduction
Ashis Kumar Chanda
 
Periodic pattern mining
Periodic pattern miningPeriodic pattern mining
Periodic pattern mining
Ashis Kumar Chanda
 
FPPM algorithm
FPPM algorithmFPPM algorithm
FPPM algorithm
Ashis Kumar Chanda
 
Secure software design
Secure software designSecure software design
Secure software design
Ashis Kumar Chanda
 
Sequential logic circuit optimization
Sequential logic circuit optimizationSequential logic circuit optimization
Sequential logic circuit optimization
Ashis Kumar Chanda
 
Introduction to CS
Introduction to CSIntroduction to CS
Introduction to CS
Ashis Kumar Chanda
 
Iterative deepening search
Iterative deepening searchIterative deepening search
Iterative deepening search
Ashis Kumar Chanda
 
CloudBus
CloudBusCloudBus

More from Ashis Kumar Chanda (20)

Word 2 vector
Word 2 vectorWord 2 vector
Word 2 vector
 
Multi-class Image Classification using deep convolutional networks on extreme...
Multi-class Image Classification using deep convolutional networks on extreme...Multi-class Image Classification using deep convolutional networks on extreme...
Multi-class Image Classification using deep convolutional networks on extreme...
 
Full resolution image compression with recurrent neural networks
Full resolution image compression with  recurrent neural networksFull resolution image compression with  recurrent neural networks
Full resolution image compression with recurrent neural networks
 
Understanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational DatabasesUnderstanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational Databases
 
03. Agile Development
03. Agile Development03. Agile Development
03. Agile Development
 
Software Cost Estimation
Software Cost EstimationSoftware Cost Estimation
Software Cost Estimation
 
Risk Management
Risk ManagementRisk Management
Risk Management
 
Project Management
Project ManagementProject Management
Project Management
 
MVC
MVCMVC
MVC
 
Requirements engineering
Requirements engineeringRequirements engineering
Requirements engineering
 
4. UML
4. UML4. UML
4. UML
 
2. Software process
2. Software process2. Software process
2. Software process
 
1. Introduction
1. Introduction1. Introduction
1. Introduction
 
Periodic pattern mining
Periodic pattern miningPeriodic pattern mining
Periodic pattern mining
 
FPPM algorithm
FPPM algorithmFPPM algorithm
FPPM algorithm
 
Secure software design
Secure software designSecure software design
Secure software design
 
Sequential logic circuit optimization
Sequential logic circuit optimizationSequential logic circuit optimization
Sequential logic circuit optimization
 
Introduction to CS
Introduction to CSIntroduction to CS
Introduction to CS
 
Iterative deepening search
Iterative deepening searchIterative deepening search
Iterative deepening search
 
CloudBus
CloudBusCloudBus
CloudBus
 

Recently uploaded

Bituminous road construction project based learning report
Bituminous road construction project based learning reportBituminous road construction project based learning report
Bituminous road construction project based learning report
CE19KaushlendraKumar
 
一比一原版(uoft毕业证书)加拿大多伦多大学毕业证如何办理
一比一原版(uoft毕业证书)加拿大多伦多大学毕业证如何办理一比一原版(uoft毕业证书)加拿大多伦多大学毕业证如何办理
一比一原版(uoft毕业证书)加拿大多伦多大学毕业证如何办理
sydezfe
 
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
ecqow
 
AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...
AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...
AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...
Paris Salesforce Developer Group
 
一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理
一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理
一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理
upoux
 
Power Electronics- AC -AC Converters.pptx
Power Electronics- AC -AC Converters.pptxPower Electronics- AC -AC Converters.pptx
Power Electronics- AC -AC Converters.pptx
Poornima D
 
Digital Twins Computer Networking Paper Presentation.pptx
Digital Twins Computer Networking Paper Presentation.pptxDigital Twins Computer Networking Paper Presentation.pptx
Digital Twins Computer Networking Paper Presentation.pptx
aryanpankaj78
 
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODELDEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
ijaia
 
5G Radio Network Througput Problem Analysis HCIA.pdf
5G Radio Network Througput Problem Analysis HCIA.pdf5G Radio Network Througput Problem Analysis HCIA.pdf
5G Radio Network Througput Problem Analysis HCIA.pdf
AlvianRamadhani5
 
Object Oriented Analysis and Design - OOAD
Object Oriented Analysis and Design - OOADObject Oriented Analysis and Design - OOAD
Object Oriented Analysis and Design - OOAD
PreethaV16
 
P5 Working Drawings.pdf floor plan, civil
P5 Working Drawings.pdf floor plan, civilP5 Working Drawings.pdf floor plan, civil
P5 Working Drawings.pdf floor plan, civil
AnasAhmadNoor
 
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
upoux
 
ITSM Integration with MuleSoft.pptx
ITSM  Integration with MuleSoft.pptxITSM  Integration with MuleSoft.pptx
ITSM Integration with MuleSoft.pptx
VANDANAMOHANGOUDA
 
openshift technical overview - Flow of openshift containerisatoin
openshift technical overview - Flow of openshift containerisatoinopenshift technical overview - Flow of openshift containerisatoin
openshift technical overview - Flow of openshift containerisatoin
snaprevwdev
 
Assistant Engineer (Chemical) Interview Questions.pdf
Assistant Engineer (Chemical) Interview Questions.pdfAssistant Engineer (Chemical) Interview Questions.pdf
Assistant Engineer (Chemical) Interview Questions.pdf
Seetal Daas
 
Ericsson LTE Throughput Troubleshooting Techniques.ppt
Ericsson LTE Throughput Troubleshooting Techniques.pptEricsson LTE Throughput Troubleshooting Techniques.ppt
Ericsson LTE Throughput Troubleshooting Techniques.ppt
wafawafa52
 
Introduction to Computer Networks & OSI MODEL.ppt
Introduction to Computer Networks & OSI MODEL.pptIntroduction to Computer Networks & OSI MODEL.ppt
Introduction to Computer Networks & OSI MODEL.ppt
Dwarkadas J Sanghvi College of Engineering
 
OOPS_Lab_Manual - programs using C++ programming language
OOPS_Lab_Manual - programs using C++ programming languageOOPS_Lab_Manual - programs using C++ programming language
OOPS_Lab_Manual - programs using C++ programming language
PreethaV16
 
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
MadhavJungKarki
 
Accident detection system project report.pdf
Accident detection system project report.pdfAccident detection system project report.pdf
Accident detection system project report.pdf
Kamal Acharya
 

Recently uploaded (20)

Bituminous road construction project based learning report
Bituminous road construction project based learning reportBituminous road construction project based learning report
Bituminous road construction project based learning report
 
一比一原版(uoft毕业证书)加拿大多伦多大学毕业证如何办理
一比一原版(uoft毕业证书)加拿大多伦多大学毕业证如何办理一比一原版(uoft毕业证书)加拿大多伦多大学毕业证如何办理
一比一原版(uoft毕业证书)加拿大多伦多大学毕业证如何办理
 
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
 
AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...
AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...
AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...
 
一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理
一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理
一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理
 
Power Electronics- AC -AC Converters.pptx
Power Electronics- AC -AC Converters.pptxPower Electronics- AC -AC Converters.pptx
Power Electronics- AC -AC Converters.pptx
 
Digital Twins Computer Networking Paper Presentation.pptx
Digital Twins Computer Networking Paper Presentation.pptxDigital Twins Computer Networking Paper Presentation.pptx
Digital Twins Computer Networking Paper Presentation.pptx
 
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODELDEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
 
5G Radio Network Througput Problem Analysis HCIA.pdf
5G Radio Network Througput Problem Analysis HCIA.pdf5G Radio Network Througput Problem Analysis HCIA.pdf
5G Radio Network Througput Problem Analysis HCIA.pdf
 
Object Oriented Analysis and Design - OOAD
Object Oriented Analysis and Design - OOADObject Oriented Analysis and Design - OOAD
Object Oriented Analysis and Design - OOAD
 
P5 Working Drawings.pdf floor plan, civil
P5 Working Drawings.pdf floor plan, civilP5 Working Drawings.pdf floor plan, civil
P5 Working Drawings.pdf floor plan, civil
 
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
 
ITSM Integration with MuleSoft.pptx
ITSM  Integration with MuleSoft.pptxITSM  Integration with MuleSoft.pptx
ITSM Integration with MuleSoft.pptx
 
openshift technical overview - Flow of openshift containerisatoin
openshift technical overview - Flow of openshift containerisatoinopenshift technical overview - Flow of openshift containerisatoin
openshift technical overview - Flow of openshift containerisatoin
 
Assistant Engineer (Chemical) Interview Questions.pdf
Assistant Engineer (Chemical) Interview Questions.pdfAssistant Engineer (Chemical) Interview Questions.pdf
Assistant Engineer (Chemical) Interview Questions.pdf
 
Ericsson LTE Throughput Troubleshooting Techniques.ppt
Ericsson LTE Throughput Troubleshooting Techniques.pptEricsson LTE Throughput Troubleshooting Techniques.ppt
Ericsson LTE Throughput Troubleshooting Techniques.ppt
 
Introduction to Computer Networks & OSI MODEL.ppt
Introduction to Computer Networks & OSI MODEL.pptIntroduction to Computer Networks & OSI MODEL.ppt
Introduction to Computer Networks & OSI MODEL.ppt
 
OOPS_Lab_Manual - programs using C++ programming language
OOPS_Lab_Manual - programs using C++ programming languageOOPS_Lab_Manual - programs using C++ programming language
OOPS_Lab_Manual - programs using C++ programming language
 
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
 
Accident detection system project report.pdf
Accident detection system project report.pdfAccident detection system project report.pdf
Accident detection system project report.pdf
 

Data Mining (Introduction)

  • 1. 1 I NAME OF PRESENTER Data Mining Ashis Kumar Chanda Department of Computer Science and Engineering University of Dhaka
  • 2. 2 I NAME OF PRESENTERCSE, DU2 Key concepts  What is Data mining  Why learn Data mining  Data type  Warehouse & OLAP  Data Cleaning, Integration  Associations, Item sets, Support, Confidence
  • 3. 3 I NAME OF PRESENTERCSE, DU3 Data Mining  Data mining refers to Knowledge mining from large amount of data  Also known as “Knowledge Discovery from Data” or KDD  Target is to find a Hidden Pattern
  • 4. 4 I NAME OF PRESENTER  We can’t get all type of information through Query  Query not support Statistical analysis  Again, we can apply artificial intelligence & find new patterns or structures CSE, DU4 Why learn data mining Query provide values but data mining provides idea that help to take (business ) decision Ex: Women live at “Dhanmondi” & older than 40 years most frequently buy “Jamdani Shari” at “Arong”
  • 5. 5 I NAME OF PRESENTERCSE, DU5 Data type  Tabular (Transaction data) Most commonly used  Spatial Data (Remote sensing data/ encoded data)  Tree Data ( xml )  Graphs (www, bio-molecular)  Sequence (DNA, activity log)  Text, multimedia data
  • 6. 6 I NAME OF PRESENTERCSE, DU6 Warehouse & OLAP Ware House Data Source Warehouse is an archive of information gathered from multiple sources Suppose a Banking database where each has a data source that stores all transactions of that area. And all data source will provide a clean/safe copy at Warehouse
  • 7. 7 I NAME OF PRESENTERCSE, DU7 Warehouse & OLAP There is several issues about Warehouse:  When and how to gather data  What schema/pattern to use  Data transformation & cleaning  How to update “Warehouse is a collection of data marts” Where data mart is store of data in specialized pattern
  • 8. 8 I NAME OF PRESENTERCSE, DU8 Warehouse & OLAP OLAP: Online Analytical Processing OLAP tools support interactive analysis of summary Information OLAP permits an analyst to view different summaries of multidimensional data Item name Dress Fig: Data Cube
  • 9. 9 I NAME OF PRESENTERCSE, DU9 Data cleaning There may be some missing data, duplicate data, dirty data So we need to data cleaning Some methods:  Ignore the tuple (not effective unless tuple contain many missing attribute)  Fill missing values (time consuming)  Fill with a global value (like: unknown)  Use mean attribute  Use most probable value
  • 10. 10 I NAME OF PRESENTERCSE, DU10
  • 11. 11 I NAME OF PRESENTERCSE, DU11 Associations & Item sets Associations: An associations is a rule of the form if X then Y It is denoted as X-> Y Example: if there is an exam then I read Item Sets: For any rule if X->Y & Y->X Then X, Y are called item-set Example: People buying school books in January also by notebook People buying school note books in January also by book
  • 12. 12 I NAME OF PRESENTERCSE, DU12 Support & confidence Support: The proportion of transactions in the data set which contains the itemset Confidence: The conditional probability that an item appears in a transaction when another item appears.
  • 13. 13 I NAME OF PRESENTERCSE, DU13 Support & confidence Support for {I₁,I₂} = support_count(I1 U I2)/ |D| = 4/9 Confidence for I1 → I2 =support_count(I1 U I2) / support_count(I1) = 4/6
  • 14. 14 I NAME OF PRESENTERCSE, DU14 Association rules Where, support count(AUB) is the number of transactions containing the itemsets AUB, and support count(A) is the number of transactions containing the itemset A. •Association rules can be generated as follows: 1. For each frequent itemset l, generate all nonempty subsets of l. 2. For every nonempty subset s of l, output the rule “s → (l- s)” if support count(l)/support count(s) >= min_conf, where min_conf is the minimum confidence threshold.
  • 15. 15 I NAME OF PRESENTERCSE, DU15 Summary Basic topics: Data mining, Data cleaning, Warehouse, OLAP Term: Association, Item-set, Support, Confidence
  • 16. 16 I NAME OF PRESENTERCSE, DU16 References - Data Mining Concepts & Techniques by J. Han & M. Kamber - Database system Concept by Abraham Sillberschatz, Korth, Sudarshan - Lecture of Dr. S. Srinath Institute of Technology at Madras, India