Outlier Analysis
What are outliers?Very often, there exist data objects that do not comply with the general behavior or model of the data. Such data objects, which are grossly different from or inconsistent with the remaining set of data, are called outliers.
What is Outlier Analysis?The outliers may be of particular interest, such as in the case of fraud detection, where outliers may indicate fraudulent activity. Thus, outlier detection and analysis is an interesting data mining task, referred to as outlier mining or outlier analysis.
Statistical Distribution-Based Outlier Detection   Two basic types of procedures for detecting outliers:Block procedures: In this case, either all of the suspect objects are treated as outliersor all of them are accepted as consistent.Consecutive (or sequential) procedures: An example of such a procedure is the insideoutprocedure.
Distance-Based Outlier DetectionSome efficient algorithms for mining distance-based outliers are as follows:Index-based algorithmNested-loop algorithm:Cell-based algorithmDensity-Based Local Outlier DetectionDeviation-Based Outlier Detection with Sequential Exception Technique
OLAP Data Cube TechniqueAn OLAP approach to deviation detection uses data cubes to identify regions of anomaliesin large multidimensional data
Visit more self help tutorialsPick a tutorial of your choice and browse through it at your own pace.The tutorials section is free, self-guiding and will not involve any additional support.Visit us at www.dataminingtools.net

Data Mining: Outlier analysis

  • 1.
  • 2.
    What are outliers?Veryoften, there exist data objects that do not comply with the general behavior or model of the data. Such data objects, which are grossly different from or inconsistent with the remaining set of data, are called outliers.
  • 3.
    What is OutlierAnalysis?The outliers may be of particular interest, such as in the case of fraud detection, where outliers may indicate fraudulent activity. Thus, outlier detection and analysis is an interesting data mining task, referred to as outlier mining or outlier analysis.
  • 4.
    Statistical Distribution-Based OutlierDetection Two basic types of procedures for detecting outliers:Block procedures: In this case, either all of the suspect objects are treated as outliersor all of them are accepted as consistent.Consecutive (or sequential) procedures: An example of such a procedure is the insideoutprocedure.
  • 5.
    Distance-Based Outlier DetectionSomeefficient algorithms for mining distance-based outliers are as follows:Index-based algorithmNested-loop algorithm:Cell-based algorithmDensity-Based Local Outlier DetectionDeviation-Based Outlier Detection with Sequential Exception Technique
  • 6.
    OLAP Data CubeTechniqueAn OLAP approach to deviation detection uses data cubes to identify regions of anomaliesin large multidimensional data
  • 7.
    Visit more selfhelp tutorialsPick a tutorial of your choice and browse through it at your own pace.The tutorials section is free, self-guiding and will not involve any additional support.Visit us at www.dataminingtools.net