Data Mining
Presented by: Shubham singh
To: Mr.Sanjay kumar
Outline
Definition, motivation & application

Branches of data mining

Classification, clustering, Association rule

mining
Some classification techniques

What Is Data Mining?
Data mining (knowledge discovery in databases):

Extraction of interesting (non-trivial, implicit, previously
◦
unknown and potentially useful) information or patterns
from data in large databases
Alternative names and their “inside stories”:

Data mining: a misnomer?
◦
Knowledge discovery(mining) in databases (KDD),
◦
knowledge extraction, data/pattern analysis, data
archeology, business intelligence, etc.
Data Mining Definition
Finding hidden information in a

database
Fit data to a model

Similar terms

Exploratory data analysis
◦
Data driven discovery
◦
Deductive learning
◦
Steps of data mining
Data integration

Data selection

Data cleaning

Data transformation

Data mining

Pattern evaluation

Knowledge presentation

Why Mine Data? Commercial
Viewpoint
Lots of data is being collected

and warehoused
Web data, e-commerce
◦
purchases at department/
◦
grocery stores
Bank/Credit Card
◦
transactions
Computers have become cheaper and more powerful

Competitive Pressure is Strong

Provide better, customized services for an edge (e.g. in Customer
◦
Relationship Management)
Is data mining a threat to privacy
and information security?
Purpose specifications and use

limitations.
Openness

Security measures like encryption

Data warehousing
Data warehousing provides the

Enterprise with a memory.
Data mining provides the Enterprise

and Intelligence.
Data warehousing?
A data warehouse is a subject-

oriented,integrated,time-variant and non-volatile
collection of data in support of management’s
decision making process.
The most common form of data integration.

Copy sources into a single database and try to
keep it
up-to-date.
Usual method:periodic reconstruction of the
warehouse,perhaps overnight.
Frequently essential for analytic queries.
Data mining tools
Microsoft SQL Server 2005

Microsoft SQL Server 2008

Oracle Data Mining

DBMiner

end!
THANK YOU EVERYONE!

Data mining excel.pdf

  • 1.
    Data Mining Presented by:Shubham singh To: Mr.Sanjay kumar
  • 2.
    Outline Definition, motivation &application  Branches of data mining  Classification, clustering, Association rule  mining Some classification techniques 
  • 3.
    What Is DataMining? Data mining (knowledge discovery in databases):  Extraction of interesting (non-trivial, implicit, previously ◦ unknown and potentially useful) information or patterns from data in large databases Alternative names and their “inside stories”:  Data mining: a misnomer? ◦ Knowledge discovery(mining) in databases (KDD), ◦ knowledge extraction, data/pattern analysis, data archeology, business intelligence, etc.
  • 4.
    Data Mining Definition Findinghidden information in a  database Fit data to a model  Similar terms  Exploratory data analysis ◦ Data driven discovery ◦ Deductive learning ◦
  • 5.
    Steps of datamining Data integration  Data selection  Data cleaning  Data transformation  Data mining  Pattern evaluation  Knowledge presentation 
  • 6.
    Why Mine Data?Commercial Viewpoint Lots of data is being collected  and warehoused Web data, e-commerce ◦ purchases at department/ ◦ grocery stores Bank/Credit Card ◦ transactions Computers have become cheaper and more powerful  Competitive Pressure is Strong  Provide better, customized services for an edge (e.g. in Customer ◦ Relationship Management)
  • 7.
    Is data mininga threat to privacy and information security? Purpose specifications and use  limitations. Openness  Security measures like encryption 
  • 8.
    Data warehousing Data warehousingprovides the  Enterprise with a memory. Data mining provides the Enterprise  and Intelligence.
  • 9.
    Data warehousing? A datawarehouse is a subject-  oriented,integrated,time-variant and non-volatile collection of data in support of management’s decision making process. The most common form of data integration.  Copy sources into a single database and try to keep it up-to-date. Usual method:periodic reconstruction of the warehouse,perhaps overnight. Frequently essential for analytic queries.
  • 10.
    Data mining tools MicrosoftSQL Server 2005  Microsoft SQL Server 2008  Oracle Data Mining  DBMiner 
  • 11.