SlideShare a Scribd company logo
1 of 31
Presented to :
Dr. Rabie
By :
Amr Abd EL Latief Abd El Al
Data Mining Def.
 Def. :
 Data mining is the extraction of interesting patterns or
knowledge from huge amount of data.
Known different names :
 knowledge discovery (mining) in databases (KDD)
 knowledge extraction,
 data/pattern analysis,
 data archeology,
 data dredging,
 information harvesting,
 business intelligence and others. [1]
What is Data Mining
 Data Mining enables data exploration, data analysis,
and data visualization of huge databases at a high level
of abstraction, without a specific hypothesis in mind.
 working of data mining is understood by using a
method called modeling with it to make predictions.
Data Mining Technologies
 include :
 artificial neural networks
 decision trees
 genetic algorithms.
 Machine Learning .
 Evolutionary Computing
 MOEA Multi objective Evolutionary
Computing
Data Mining System Arch.
Data Mining Procedure
The Process of Data Mining
Classifications
Data Types
Application
Data Types
Data Structure
Functionality
Data Types Application S.V.
 Business transactions
 Scientific data
 Medical and personal data
 Surveillance video and pictures
 Satellite sensing
 Text reports and memos (e-mail messages)
 Most of the communications
 The World Wide Web repositories
types of data (Data Structure S.V.)
 Flat files
 Relational Databases
 Data Warehouses
 Transaction Databases
 Multimedia Databases
 Spatial Databases
 World Wide Web
FUNCTIONALITIES AND
CLASSIFICATIONS OF
DATA MINING
 Characterization
 Discrimination
 Association analysis
 Classification
 uses given class labels to order the objects in
 the data collection Classification approaches normally use a
 training set where all objects are already associated with
 known class labels. The classification algorithm learns from
 the training set and builds a model. The model is used to
 classify new objects.
 Prediction
 Prediction
Data Mining Systems
specialized
data source mined
dataClassification
according to the data
drawn on modmodel
el drawn on
kind of knowledge
discovered
mining techniques
used
comprehensive
Classification according to the type
of data source mined
 This classification categorizes data mining systems
according to the type of data handled:
 spatial data
 multimedia data
 time-series data
 text data
 World Wide Web.
Classification according to the data
model drawn on
 This classification categorizes data mining systems
based on the data model involved:
 Relational database
 object-oriented database
 data warehouse
 Transactional
 others
Classification according to the king
of knowledge discovered
 This classification categorizes data mining systems
based on the kind of knowledge discovered or data
mining functionalities:
 Characterization
 discrimination
 Association
 classification
 clustering
 others
Classification according to mining
techniques used
 The classification categorizes data mining systems
according to the data analysis approach used:
 machine learning
 neural networks
 Genetic algorithms
 Statistics
 visualization
 database oriented
 data warehouse-oriented
 others
take into account the degree of
user interaction involved in the
data mining process
 query-driven systems,
 interactive exploratory systems
 autonomous systems
Note:
 A comprehensive system would provide a wide variety
of data mining techniques to fit different situations
and options, and offer different degrees of user
interaction.
[2]
Papers
Data Mining Goals
 the two main goals of DM are:
 description
 prediction.
 Standard tasks in the field of DM are: description,
clustering, association discovery, sequential pattern
analysis, classification and regression.
 Description : can be obtained by characterization or by
discrimination.
 Characterization: is a summarization of the general features
 Discrimination :does not differ too much from
characterization. It consists of characterizing a class by
comparison with another one.
Data Mining Goals
 Clustering differs from classification since it analyses data
objects without knowing their class.
 Association : discovery results in a set of association rules
which represents attribute-value conditions frequently
occurring in a given set of data.
 Sequential pattern analysis : consists in searching for
frequently occurring patterns related to time.
 Regression : uses existing values of some variables in order
to forecast what values of another continuous variable will
be
Machine Learning
 A ML system uses an entire finite set of objects,
examples which represent observations of the
environment ; the learning algorithm learns a model
from this set which is called the training set.
 ML In DM include:
 databases
 data warehouses
 flat files
Classification in DM
 Classification:
is a form of data analysis that can be used to extract
models describing important classes or to predict future
trends.
 It represents :
learning paradigm which consists in segmenting data by
assigning it to groups, or classes,, that are already defined.
 the assumption is a small database size but In Data Mining
it must be scalable technique.
Classification in DM
 classes are represented by:
the values of a particular attribute called goal attribute
and remaining attributes are called predicting
attribute.
 resulting model is usually represented as:
a set of IF-THEN prediction rules where each one
predicts a class from the predicting attributes.
ML in Classification
 Procedure:
 Algorithms are first applied to the so-called training set
which contains training examples with a known class to
discover rules.
 the model is used for classification on a set of examples,
called the test set.
 The predictive accuracy of the model is evaluated on the
test set
Classification Methods
 Main classification methods are:
 decision tree induction
 Scalability problem
 Bayesian classification
 neural network learning.
 Draw Backs:
 Time-consuming
 difficulty for humans to interpret their results.
ASSOCIATION ANALYSIS
 They show relationships between attributes. Their
typical application domain is market basket and
transaction data analysis.
 Association Rules:
 An association rule is generally defined as an expression
 X=>Y,
 where X and Y are sets of attribute-value terms
ASSOCIATION ANALYSIS
 Rules are not supposed to be strictly correct in order
for them to be useful. It is generally required to find
rules which are true to some degree only.
 X implies Y
 X tends to imply Y
 Support and confidence
Apriori Algorithm
 Depends on Frqeuent occurence
 Draw Backs :
 Large number of database scans
 Large size of generated intermediate sets.
 Apriori mining only Boolean and single-dimensional
association rules.
 These rules are adapted to market basket analysis and can
GA Advantages in Data Mining
 DM problem needs: robustness of solutions and
scalability
 GA Advantages:
 there is high ability to find patterns in vey large spaces.
 parallel implementation
 It performs a kind Of global search rather than local
hill-climbing.
 the patterns produced are directly understandable
Search Challenges
 scalability problems is an important research
challenge too.
 MULTI-OBJECTIVE RULE EXTRACTION
 MOEA Issues
Aperior Ex.

More Related Content

What's hot

Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Seerat Malik
 
Data Mining: Classification and analysis
Data Mining: Classification and analysisData Mining: Classification and analysis
Data Mining: Classification and analysisDataminingTools Inc
 
Knowledge discovery thru data mining
Knowledge discovery thru data miningKnowledge discovery thru data mining
Knowledge discovery thru data miningDevakumar Jain
 
Introduction To Data Mining
Introduction To Data Mining   Introduction To Data Mining
Introduction To Data Mining Phi Jack
 
Data mining , Knowledge Discovery Process, Classification
Data mining , Knowledge Discovery Process, ClassificationData mining , Knowledge Discovery Process, Classification
Data mining , Knowledge Discovery Process, ClassificationDr. Abdul Ahad Abro
 
Application of data mining
Application of data miningApplication of data mining
Application of data miningSHIVANI SONI
 
Data mining
Data mining Data mining
Data mining AthiraR23
 
Data Mining: clustering and analysis
Data Mining: clustering and analysisData Mining: clustering and analysis
Data Mining: clustering and analysisDataminingTools Inc
 
Data mining: Classification and prediction
Data mining: Classification and predictionData mining: Classification and prediction
Data mining: Classification and predictionDataminingTools Inc
 
Data Mining & Applications
Data Mining & ApplicationsData Mining & Applications
Data Mining & ApplicationsFazle Rabbi Ador
 
Architecture of data mining system
Architecture of data mining systemArchitecture of data mining system
Architecture of data mining systemramya marichamy
 

What's hot (20)

Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data Mining
 
Data Mining: Classification and analysis
Data Mining: Classification and analysisData Mining: Classification and analysis
Data Mining: Classification and analysis
 
Knowledge discovery thru data mining
Knowledge discovery thru data miningKnowledge discovery thru data mining
Knowledge discovery thru data mining
 
web mining
web miningweb mining
web mining
 
Data mining
Data miningData mining
Data mining
 
Kdd process
Kdd processKdd process
Kdd process
 
Introduction To Data Mining
Introduction To Data Mining   Introduction To Data Mining
Introduction To Data Mining
 
Data mining , Knowledge Discovery Process, Classification
Data mining , Knowledge Discovery Process, ClassificationData mining , Knowledge Discovery Process, Classification
Data mining , Knowledge Discovery Process, Classification
 
Application of data mining
Application of data miningApplication of data mining
Application of data mining
 
Dbms schemas for decision support
Dbms schemas for decision supportDbms schemas for decision support
Dbms schemas for decision support
 
Data mining
Data mining Data mining
Data mining
 
Clustering in Data Mining
Clustering in Data MiningClustering in Data Mining
Clustering in Data Mining
 
Text mining
Text miningText mining
Text mining
 
Data Mining: clustering and analysis
Data Mining: clustering and analysisData Mining: clustering and analysis
Data Mining: clustering and analysis
 
Data mining: Classification and prediction
Data mining: Classification and predictionData mining: Classification and prediction
Data mining: Classification and prediction
 
Data mining
Data miningData mining
Data mining
 
Data Mining & Applications
Data Mining & ApplicationsData Mining & Applications
Data Mining & Applications
 
Architecture of data mining system
Architecture of data mining systemArchitecture of data mining system
Architecture of data mining system
 
Data mining
Data mining Data mining
Data mining
 

Viewers also liked

Libro l4
Libro l4Libro l4
Libro l4Tono582
 
Certificate_35
Certificate_35Certificate_35
Certificate_35Adeel Khan
 
Manger-et-penser-bio
Manger-et-penser-bioManger-et-penser-bio
Manger-et-penser-bioAude Debenest
 
Higado y vias biliares
Higado y vias biliaresHigado y vias biliares
Higado y vias biliaresdanyalara
 
Traitement et Exploitation des nuages de points (LiDAR)
Traitement et Exploitation des nuages de points (LiDAR)Traitement et Exploitation des nuages de points (LiDAR)
Traitement et Exploitation des nuages de points (LiDAR)Mourad Labguira
 
Determinantes
DeterminantesDeterminantes
Determinantestejinha
 
LiDAR et traces agraires fossiles autour de Besançon : potentiel et limites d...
LiDAR et traces agraires fossiles autour de Besançon : potentiel et limites d...LiDAR et traces agraires fossiles autour de Besançon : potentiel et limites d...
LiDAR et traces agraires fossiles autour de Besançon : potentiel et limites d...Nicolas Bernigaud
 
What, Why & How of Crowdfunding
What, Why & How of CrowdfundingWhat, Why & How of Crowdfunding
What, Why & How of CrowdfundingMakeitnow
 
Golden hollywood
Golden hollywoodGolden hollywood
Golden hollywoodMs Olive
 
3. synergy and convergence
3. synergy and convergence3. synergy and convergence
3. synergy and convergenceMs Olive
 
Jamie A Cowan, Timendo - Solocal Group UK Event "How To Drive Online Traffic ...
Jamie A Cowan, Timendo - Solocal Group UK Event "How To Drive Online Traffic ...Jamie A Cowan, Timendo - Solocal Group UK Event "How To Drive Online Traffic ...
Jamie A Cowan, Timendo - Solocal Group UK Event "How To Drive Online Traffic ...Solocal Group UK
 
Northern Illinois Rockford Heart Walk Slated for May of 2015
Northern Illinois Rockford Heart Walk Slated for May of 2015 Northern Illinois Rockford Heart Walk Slated for May of 2015
Northern Illinois Rockford Heart Walk Slated for May of 2015 Dr . Randy David Hassen
 

Viewers also liked (19)

Libro l4
Libro l4Libro l4
Libro l4
 
Certificate_35
Certificate_35Certificate_35
Certificate_35
 
La amistad
La amistadLa amistad
La amistad
 
Manger-et-penser-bio
Manger-et-penser-bioManger-et-penser-bio
Manger-et-penser-bio
 
Jun 06 jorge tuto quiroga - oea - reeleccion evo
Jun 06   jorge tuto quiroga - oea - reeleccion evoJun 06   jorge tuto quiroga - oea - reeleccion evo
Jun 06 jorge tuto quiroga - oea - reeleccion evo
 
Qualification of the NDI process
Qualification of the NDI processQualification of the NDI process
Qualification of the NDI process
 
Sorteo alianzas
Sorteo alianzasSorteo alianzas
Sorteo alianzas
 
Higado y vias biliares
Higado y vias biliaresHigado y vias biliares
Higado y vias biliares
 
Traitement et Exploitation des nuages de points (LiDAR)
Traitement et Exploitation des nuages de points (LiDAR)Traitement et Exploitation des nuages de points (LiDAR)
Traitement et Exploitation des nuages de points (LiDAR)
 
Determinantes
DeterminantesDeterminantes
Determinantes
 
Letra t t
Letra t tLetra t t
Letra t t
 
Raising Tomatoes Workshop
Raising Tomatoes WorkshopRaising Tomatoes Workshop
Raising Tomatoes Workshop
 
LiDAR et traces agraires fossiles autour de Besançon : potentiel et limites d...
LiDAR et traces agraires fossiles autour de Besançon : potentiel et limites d...LiDAR et traces agraires fossiles autour de Besançon : potentiel et limites d...
LiDAR et traces agraires fossiles autour de Besançon : potentiel et limites d...
 
ESTUDIOS DE VELOCIDADES EN CARRETERAS
ESTUDIOS DE VELOCIDADES EN CARRETERASESTUDIOS DE VELOCIDADES EN CARRETERAS
ESTUDIOS DE VELOCIDADES EN CARRETERAS
 
What, Why & How of Crowdfunding
What, Why & How of CrowdfundingWhat, Why & How of Crowdfunding
What, Why & How of Crowdfunding
 
Golden hollywood
Golden hollywoodGolden hollywood
Golden hollywood
 
3. synergy and convergence
3. synergy and convergence3. synergy and convergence
3. synergy and convergence
 
Jamie A Cowan, Timendo - Solocal Group UK Event "How To Drive Online Traffic ...
Jamie A Cowan, Timendo - Solocal Group UK Event "How To Drive Online Traffic ...Jamie A Cowan, Timendo - Solocal Group UK Event "How To Drive Online Traffic ...
Jamie A Cowan, Timendo - Solocal Group UK Event "How To Drive Online Traffic ...
 
Northern Illinois Rockford Heart Walk Slated for May of 2015
Northern Illinois Rockford Heart Walk Slated for May of 2015 Northern Illinois Rockford Heart Walk Slated for May of 2015
Northern Illinois Rockford Heart Walk Slated for May of 2015
 

Similar to Data mining concepts and work

Data Mining: Data mining classification and analysis
Data Mining: Data mining classification and analysisData Mining: Data mining classification and analysis
Data Mining: Data mining classification and analysisDatamining Tools
 
Data Mining System and Applications: A Review
Data Mining System and Applications: A ReviewData Mining System and Applications: A Review
Data Mining System and Applications: A Reviewijdpsjournal
 
20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.ppt20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.pptSamPrem3
 
Knowledge Discovery & Representation
Knowledge Discovery & RepresentationKnowledge Discovery & Representation
Knowledge Discovery & RepresentationDarshan Patil
 
Privacy preservation techniques in data mining
Privacy preservation techniques in data miningPrivacy preservation techniques in data mining
Privacy preservation techniques in data miningeSAT Journals
 
Privacy preservation techniques in data mining
Privacy preservation techniques in data miningPrivacy preservation techniques in data mining
Privacy preservation techniques in data miningeSAT Publishing House
 
20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.ppt20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.pptPalaniKumarR2
 
Data mining an introduction
Data mining an introductionData mining an introduction
Data mining an introductionDr-Dipali Meher
 
Data Mining Classification Comparison (Naïve Bayes and C4.5 Algorithms)
Data Mining Classification Comparison (Naïve Bayes and C4.5 Algorithms)Data Mining Classification Comparison (Naïve Bayes and C4.5 Algorithms)
Data Mining Classification Comparison (Naïve Bayes and C4.5 Algorithms)Universitas Pembangunan Panca Budi
 
Data Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture NotesData Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture NotesFellowBuddy.com
 
Introduction to data mining
Introduction to data miningIntroduction to data mining
Introduction to data miningUjjawal
 
Introduction to feature subset selection method
Introduction to feature subset selection methodIntroduction to feature subset selection method
Introduction to feature subset selection methodIJSRD
 

Similar to Data mining concepts and work (20)

Seminar Presentation
Seminar PresentationSeminar Presentation
Seminar Presentation
 
Talk
TalkTalk
Talk
 
Data Mining: Data mining classification and analysis
Data Mining: Data mining classification and analysisData Mining: Data mining classification and analysis
Data Mining: Data mining classification and analysis
 
data mining
data miningdata mining
data mining
 
Part1
Part1Part1
Part1
 
Unit i
Unit iUnit i
Unit i
 
G045033841
G045033841G045033841
G045033841
 
Data mining
Data miningData mining
Data mining
 
Data Mining System and Applications: A Review
Data Mining System and Applications: A ReviewData Mining System and Applications: A Review
Data Mining System and Applications: A Review
 
20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.ppt20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.ppt
 
Knowledge Discovery & Representation
Knowledge Discovery & RepresentationKnowledge Discovery & Representation
Knowledge Discovery & Representation
 
Privacy preservation techniques in data mining
Privacy preservation techniques in data miningPrivacy preservation techniques in data mining
Privacy preservation techniques in data mining
 
Privacy preservation techniques in data mining
Privacy preservation techniques in data miningPrivacy preservation techniques in data mining
Privacy preservation techniques in data mining
 
20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.ppt20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.ppt
 
Data mining an introduction
Data mining an introductionData mining an introduction
Data mining an introduction
 
Data Mining Classification Comparison (Naïve Bayes and C4.5 Algorithms)
Data Mining Classification Comparison (Naïve Bayes and C4.5 Algorithms)Data Mining Classification Comparison (Naïve Bayes and C4.5 Algorithms)
Data Mining Classification Comparison (Naïve Bayes and C4.5 Algorithms)
 
Data Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture NotesData Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture Notes
 
Introduction to data mining
Introduction to data miningIntroduction to data mining
Introduction to data mining
 
Data Mining and Knowledge
Data Mining and KnowledgeData Mining and Knowledge
Data Mining and Knowledge
 
Introduction to feature subset selection method
Introduction to feature subset selection methodIntroduction to feature subset selection method
Introduction to feature subset selection method
 

More from Amr Abd El Latief

More from Amr Abd El Latief (12)

master-journey.pptx
master-journey.pptxmaster-journey.pptx
master-journey.pptx
 
Micro frontend
Micro frontendMicro frontend
Micro frontend
 
I feel presentation [autosaved]
I feel presentation [autosaved]I feel presentation [autosaved]
I feel presentation [autosaved]
 
Design p atterns
Design p atternsDesign p atterns
Design p atterns
 
AngularJs advanced Topics
AngularJs advanced TopicsAngularJs advanced Topics
AngularJs advanced Topics
 
Angular js slides
Angular js slidesAngular js slides
Angular js slides
 
Test vector compression
Test vector compressionTest vector compression
Test vector compression
 
Designing energy efficient lte
Designing energy efficient lteDesigning energy efficient lte
Designing energy efficient lte
 
Stock market analysis using ga and neural network
Stock market analysis using ga and neural networkStock market analysis using ga and neural network
Stock market analysis using ga and neural network
 
Chromium os architecture report
Chromium os  architecture reportChromium os  architecture report
Chromium os architecture report
 
Marketing plane of cadbry bupply kids
Marketing plane of cadbry bupply kidsMarketing plane of cadbry bupply kids
Marketing plane of cadbry bupply kids
 
Test vector compression in Digital Testing
Test vector compression in Digital Testing Test vector compression in Digital Testing
Test vector compression in Digital Testing
 

Recently uploaded

Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationkaushalgiri8080
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideChristina Lin
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptkotipi9215
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...Christina Lin
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number SystemsJheuzeDellosa
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningVitsRangannavar
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)OPEN KNOWLEDGE GmbH
 
XpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsXpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsMehedi Hasan Shohan
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
Engage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyEngage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyFrank van der Linden
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWave PLM
 

Recently uploaded (20)

Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanation
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.ppt
 
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number Systems
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learning
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)
 
XpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsXpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software Solutions
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
Engage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyEngage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The Ugly
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need It
 

Data mining concepts and work

  • 1. Presented to : Dr. Rabie By : Amr Abd EL Latief Abd El Al
  • 2. Data Mining Def.  Def. :  Data mining is the extraction of interesting patterns or knowledge from huge amount of data. Known different names :  knowledge discovery (mining) in databases (KDD)  knowledge extraction,  data/pattern analysis,  data archeology,  data dredging,  information harvesting,  business intelligence and others. [1]
  • 3. What is Data Mining  Data Mining enables data exploration, data analysis, and data visualization of huge databases at a high level of abstraction, without a specific hypothesis in mind.  working of data mining is understood by using a method called modeling with it to make predictions.
  • 4. Data Mining Technologies  include :  artificial neural networks  decision trees  genetic algorithms.  Machine Learning .  Evolutionary Computing  MOEA Multi objective Evolutionary Computing
  • 7. The Process of Data Mining
  • 9. Data Types Application S.V.  Business transactions  Scientific data  Medical and personal data  Surveillance video and pictures  Satellite sensing  Text reports and memos (e-mail messages)  Most of the communications  The World Wide Web repositories
  • 10. types of data (Data Structure S.V.)  Flat files  Relational Databases  Data Warehouses  Transaction Databases  Multimedia Databases  Spatial Databases  World Wide Web
  • 11. FUNCTIONALITIES AND CLASSIFICATIONS OF DATA MINING  Characterization  Discrimination  Association analysis  Classification  uses given class labels to order the objects in  the data collection Classification approaches normally use a  training set where all objects are already associated with  known class labels. The classification algorithm learns from  the training set and builds a model. The model is used to  classify new objects.  Prediction  Prediction
  • 12. Data Mining Systems specialized data source mined dataClassification according to the data drawn on modmodel el drawn on kind of knowledge discovered mining techniques used comprehensive
  • 13. Classification according to the type of data source mined  This classification categorizes data mining systems according to the type of data handled:  spatial data  multimedia data  time-series data  text data  World Wide Web.
  • 14. Classification according to the data model drawn on  This classification categorizes data mining systems based on the data model involved:  Relational database  object-oriented database  data warehouse  Transactional  others
  • 15. Classification according to the king of knowledge discovered  This classification categorizes data mining systems based on the kind of knowledge discovered or data mining functionalities:  Characterization  discrimination  Association  classification  clustering  others
  • 16. Classification according to mining techniques used  The classification categorizes data mining systems according to the data analysis approach used:  machine learning  neural networks  Genetic algorithms  Statistics  visualization  database oriented  data warehouse-oriented  others
  • 17. take into account the degree of user interaction involved in the data mining process  query-driven systems,  interactive exploratory systems  autonomous systems Note:  A comprehensive system would provide a wide variety of data mining techniques to fit different situations and options, and offer different degrees of user interaction.
  • 19. Data Mining Goals  the two main goals of DM are:  description  prediction.  Standard tasks in the field of DM are: description, clustering, association discovery, sequential pattern analysis, classification and regression.  Description : can be obtained by characterization or by discrimination.  Characterization: is a summarization of the general features  Discrimination :does not differ too much from characterization. It consists of characterizing a class by comparison with another one.
  • 20. Data Mining Goals  Clustering differs from classification since it analyses data objects without knowing their class.  Association : discovery results in a set of association rules which represents attribute-value conditions frequently occurring in a given set of data.  Sequential pattern analysis : consists in searching for frequently occurring patterns related to time.  Regression : uses existing values of some variables in order to forecast what values of another continuous variable will be
  • 21. Machine Learning  A ML system uses an entire finite set of objects, examples which represent observations of the environment ; the learning algorithm learns a model from this set which is called the training set.  ML In DM include:  databases  data warehouses  flat files
  • 22. Classification in DM  Classification: is a form of data analysis that can be used to extract models describing important classes or to predict future trends.  It represents : learning paradigm which consists in segmenting data by assigning it to groups, or classes,, that are already defined.  the assumption is a small database size but In Data Mining it must be scalable technique.
  • 23. Classification in DM  classes are represented by: the values of a particular attribute called goal attribute and remaining attributes are called predicting attribute.  resulting model is usually represented as: a set of IF-THEN prediction rules where each one predicts a class from the predicting attributes.
  • 24. ML in Classification  Procedure:  Algorithms are first applied to the so-called training set which contains training examples with a known class to discover rules.  the model is used for classification on a set of examples, called the test set.  The predictive accuracy of the model is evaluated on the test set
  • 25. Classification Methods  Main classification methods are:  decision tree induction  Scalability problem  Bayesian classification  neural network learning.  Draw Backs:  Time-consuming  difficulty for humans to interpret their results.
  • 26. ASSOCIATION ANALYSIS  They show relationships between attributes. Their typical application domain is market basket and transaction data analysis.  Association Rules:  An association rule is generally defined as an expression  X=>Y,  where X and Y are sets of attribute-value terms
  • 27. ASSOCIATION ANALYSIS  Rules are not supposed to be strictly correct in order for them to be useful. It is generally required to find rules which are true to some degree only.  X implies Y  X tends to imply Y  Support and confidence
  • 28. Apriori Algorithm  Depends on Frqeuent occurence  Draw Backs :  Large number of database scans  Large size of generated intermediate sets.  Apriori mining only Boolean and single-dimensional association rules.  These rules are adapted to market basket analysis and can
  • 29. GA Advantages in Data Mining  DM problem needs: robustness of solutions and scalability  GA Advantages:  there is high ability to find patterns in vey large spaces.  parallel implementation  It performs a kind Of global search rather than local hill-climbing.  the patterns produced are directly understandable
  • 30. Search Challenges  scalability problems is an important research challenge too.  MULTI-OBJECTIVE RULE EXTRACTION  MOEA Issues