• Save
Data Mining: Outlier analysis
Upcoming SlideShare
Loading in...5
×
 

Like this? Share it with your network

Share

Data Mining: Outlier analysis

on

  • 16,025 views

Data Mining: Outlier analysis

Data Mining: Outlier analysis

Statistics

Views

Total Views
16,025
Views on SlideShare
15,923
Embed Views
102

Actions

Likes
3
Downloads
0
Comments
0

2 Embeds 102

http://dataminingtools.net 53
http://www.dataminingtools.net 49

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

Data Mining: Outlier analysis Presentation Transcript

  • 1. Outlier Analysis
  • 2. What are outliers?
    Very often, there exist data objects that do not comply with the general behavior or model of the data. Such data objects, which are grossly different from or inconsistent with the remaining set of data, are called outliers.
  • 3. What is Outlier Analysis?
    The outliers may be of particular interest, such as in the case of fraud detection, where outliers may indicate fraudulent activity. Thus, outlier detection and analysis is an interesting data mining task, referred to as outlier mining or outlier analysis.
  • 4. Statistical Distribution-Based Outlier Detection
    Two basic types of procedures for detecting outliers:
    Block procedures: In this case, either all of the suspect objects are treated as outliersor all of them are accepted as consistent.
    Consecutive (or sequential) procedures: An example of such a procedure is the insideoutprocedure.
  • 5. Distance-Based Outlier Detection
    Some efficient algorithms for mining distance-based outliers are as follows:
    Index-based algorithm
    Nested-loop algorithm:
    Cell-based algorithm
    Density-Based Local Outlier Detection
    Deviation-Based Outlier Detection with Sequential Exception Technique
  • 6. OLAP Data Cube Technique
    An OLAP approach to deviation detection uses data cubes to identify regions of anomaliesin large multidimensional data
  • 7. Visit more self help tutorials
    Pick a tutorial of your choice and browse through it at your own pace.
    The tutorials section is free, self-guiding and will not involve any additional support.
    Visit us at www.dataminingtools.net