2. DATA MINING - BASICS
Dr. V. Subha, B.E., M.E., Ph.D.,
Assistant Professor,
Department of Computer Science & Engineering,
Manonmaniam Sundaranar University,
Tirunelveli.
2
5. Definition of data mining
• Process of identifying valid, novel,
potentially useful, and ultimately
understandable patterns in data.
- Predicts outcomes of future
observations.
5
8. Data Mining - Motivation
• Growth in data.
• High dimensionality of data.
• Heterogeneous and complex data.
• Development of commercial data mining software.
• Growth of computing power and storage capacity.
• Limitation of humans.
8
9. Why Data Mining ?
• Credit ratings :
• Given a database of 100,000 names, which persons are the least likely
to default on their credit cards?
• Fraud detection :
• Which types of transactions are likely to be fraudulent, given the
demographics and transactional history of a particular customer?
• Customer relationship management:
• Which of my customers are likely to be the most loyal, and which are
most likely to leave for a competitor?
Data Mining helps extract such information
9
16. Data Mining Techniques…
• Unsupervised learning
- No knowledge of output
- Self-guided learning algorithm.
• Supervised learning
- Knowledge of output
- Learning with an expert/teacher
16
20. Data mining Tools - Proprietary License
Oracle Data Mining
IBM Cognos IBM SPSS Modeler
SAS Data Mining Sisense
SSDT
Teradata Board toolkit Dundas BI
20