Your SlideShare is downloading. ×
0
Data Mining: Classification and analysis
Data Mining: Classification and analysis
Data Mining: Classification and analysis
Data Mining: Classification and analysis
Data Mining: Classification and analysis
Data Mining: Classification and analysis
Data Mining: Classification and analysis
Data Mining: Classification and analysis
Data Mining: Classification and analysis
Data Mining: Classification and analysis
Data Mining: Classification and analysis
Data Mining: Classification and analysis
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×
Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply

Data Mining: Classification and analysis

12,484

Published on

Data Mining: Classification and analysis

Data Mining: Classification and analysis

Published in: Technology
0 Comments
12 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
12,484
On Slideshare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
0
Comments
0
Likes
12
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide

Transcript

  • 1. Data Mining :Classification and Analysis<br />
  • 2. Classification in the process of Data Mining<br />It is the process of finding a model (or function) that describes and distinguishes data classes or concepts, for the purpose of being able to use the model to predict the class of objects whose class label is unknown.<br />There are 3 models in which classification can be represented<br />IF-THEN rules, A decision tree, Neural network.<br />
  • 3. Classificationof Data Mining Systems<br />The kinds of databases mined<br />The kinds of knowledge mined<br />The kinds of techniques utilized<br />The applications adapted<br />
  • 4. What is Cluster Analysis?<br />Clustering analyzes data objects without consulting a known class label. Clustering can also facilitate taxonomy formation.<br />
  • 5. What is Outlier Mining ?<br />A database may contain data objects that do not comply with the general behavior or model of the data. These data objects are outliers. The analysis of outlier data is referred to as outlier mining.<br />
  • 6. What is Evolution Analysis?<br />Data evolution analysis describes and models regularities or trends for objects whose behavior changes over time.<br />
  • 7. What are Data Mining Task Primitives?<br />A data mining task can be specified in the form of a data mining query, which is input to the data mining system. A data mining query is defined in terms of data mining task primitives.<br />
  • 8. Integration schemes of Data Base and Data Warehouse systems<br />No coupling: No coupling means that a DM system will not utilize any function of a DB or DW system <br />Loose coupling: Loose coupling means that a DM system will use some facilities of a DB or DW system, fetching data from a data repository managed by these systems, performing data mining, and then storing the mining results either in a file or in a designated place in a database or data warehouse.<br />
  • 9. Cont..<br />Semi tight coupling: Semi tight coupling means that besides linking a DM system to a DB/DW system, efficient implementations of a few essential data mining primitives (identified by the analysis of frequently encountered data mining functions) can be provided in the DB/DW system.<br />Tight coupling: Tight coupling means that a DM system is smoothly integrated into the DB/DW system.<br />
  • 10. Some issues encountered in Data Mining<br />Mining methodology and user interaction issues<br />Mining different kinds of knowledge in databases<br />Interactive mining of knowledge at multiple levels of abstraction<br />Incorporation of background knowledge<br />Data mining query languages and ad hoc data mining<br />Presentation and visualization of data mining results<br />Handling noisy or incomplete data<br />Pattern evaluation—the interestingness problem<br />
  • 11. The performance of data mining system<br />Efficiency and scalability of data mining algorithms<br />Parallel, distributed, and incremental mining algorithms<br />Issues relating to the diversity of database types<br />Handling of relational and complex types of data<br />Mining information from heterogeneous databases and global information systems<br />
  • 12. Visit more self help tutorials<br />Pick a tutorial of your choice and browse through it at your own pace.<br />The tutorials section is free, self-guiding and will not involve any additional support.<br />Visit us at www.dataminingtools.net<br />

Ă—