2.
The data mining is the process in Data Knowledge
Discovery in Database that produces useful patterns or
the modules from the data, Database.
Data mining can be used to mine
understandable meaningful patterns from large databases
and these patterns may then be converted into knowledge.
The KDD stands for Knowledge Discovery in
Database.
It refers to the overall process of discovering useful
knowledge from the data.
2
Data Mining and KDD
3.
Classification:-
Mining patterns that can classify future
data.
Association Rule Mining:-
Mining any rule of the form X Y, where X
and Y are sets of data items.
Clustering:-
Identifying a set of similarity groups in the
data.
3
Main Data Mining Tasks
4. Sequential Pattern Mining:
A sequential rule: A B, says that event A will be
immediately followed by event B with a certain
confidence.
Deviation detection:
Discovering the most significant changes in Data.
Data visualization: - Using graphical or
Diagrammatically methods to show patterns in data.
4
Main data mining tasks
(cont. …)
5.
Valid: generalize to the future.
Novel: what we don't know.
Useful: be able to take some action.
Understandable: leading to insight.
Iterative: takes multiple passes.
Interactive: human in the loop .
5
What is data mining?
6. 6
Data Mining process :-
Original
Data
Target
Data
Preprocessed
Data
Transformed
Data
Patterns
Knowledge
Selection
Preprocessing
Transformation
Data Mining
Interpretation
7.
Regression:-
Assign a new data record to one of several
predefined categories or classes. Regression deals with
predicting real-valued fields. Also called supervised
learning.
Clustering:
Partition the dataset into subsets or groups
such that elements of a group share a common set of
properties.
7
Data Mining Techniques
8.
1. Selection: Selecting data relevant to the analysis task from
the database
2. Preprocessing: Removing noise and inconsistent data;
combining multiple data sources
3. Transformation: Transforming data into appropriate forms
to perform data mining
4. Data mining: Choosing a data mining algorithm which is
appropriate to pattern in the
data; Extracting data patterns
5. Interpretation/Evaluation : Interpreting the patterns into
knowledge. 8
KDD Methods..
9. Related Areas of
Data Mining And KDD
Database technology
and data warehouses
efficient storage,
access and
manipulation
of data
DM
statistics
machine
learning
visualization
text and Web
mining
soft
computing pattern
recognition
databases
11. Conti…
Text And Web
Mining:-
Web page analysis,
Text categorization,
filtering and
structuring of
textual information
Natural language
processing
DM
Statistics
Machine
Learning
Visualization
Text and web
mining
Soft
Computing Pattern
Recognition
Databases
Text and web
mining
12.
Knowledge discovery can be broadly defined as the
automated discovery of novel and useful
information from commercial databases. Data
mining is one step at the core of the knowledge
discovery process, dealing with the extraction of
patterns and relationships from large amounts of
data. Data Mining Techniques are used to analyze
data and extract useful information from large
amount of data.
Conclusion..