Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
Upcoming SlideShare
Anomaly Detection Using the CLA
Next

7

Share

Science of Anomaly Detection

Presentation by Scott Purdy at Numenta Workshop on October 17, 2014.

Science of Anomaly Detection

  1. 1. Science of Anomaly Detection Numenta Workshop October 17, 2014 Scott Purdy Engineering Manager
  2. 2. What is an anomaly? Something that deviates from what is standard, normal, or expected.
  3. 3. Types of Anomalies Spatial (static) anomalies Temporal anomalies
  4. 4. Challenges with Temporal Anomalies Tolerance to noise Continuous learning
  5. 5. • How anomaly detection fits into HTM theory • How we do anomaly detection – HTM Learning Algorithms – Anomaly score processing • Evaluating anomaly detection algorithms Outline
  6. 6. What does anomaly detection have to do with Hierarchical Temporal Memory? Applications of HTM: • Prediction • Classification • Anomaly Detection No changes to the algorithm
  7. 7. Anomaly Detection with HTM HTM Algorithms Encoder SDR Prediction Raw anomaly score Time average Historical comparison Anomaly likelihood Data How do we turn a data stream into anomaly scores?
  8. 8. Raw Anomaly Score Raw anomaly score is the fraction of active columns that were not predicted. rawAnomalyScore = At -(Pt-1 Ç At ) At Pt = Predicted columns at time t At = Active columns at time t
  9. 9. Load Balancer Example 0 0.5 1 1.5 2 2.5 3 3.5 Latency(s) Latency 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 RawAnomalyScore Raw Anomaly Score
  10. 10. Historical Comparison Compute normal distribution over history Compute probability for each point relative to the distribution m = xP(x)å s = E[(X -m)2 ]
  11. 11. Anomaly Likelihood 0 0.2 0.4 0.6 0.8 1 1.2 Raw Anomaly Score Mean and Standard Deviation mean std dev 0 0.05 0.1 0.15 0.2 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Probability Probability Distribution
  12. 12. Anomalies in Load Balancer Latency
  13. 13. Continuous Learning
  14. 14. Highly Predictable Data
  15. 15. Subtle Anomalies Patterns that humans can’t see but are important
  16. 16. HTM Learning Algorithms Compared to Other Techniques Anomaly Type Sudden In predictable data In noisy data Human can’t see HTM Learning Algorithms Yes Yes Yes Yes Thresholds Yes No No No Various Statistical Yes Maybe Yes No Time Series Analysis Yes Yes No No Distance-based Yes Maybe No No Supervised Methods N/A N/A N/A N/A See “The Science of Anomaly Detection” White Paper at numenta.com
  17. 17. Benchmarking Streaming Anomaly Detection • No training/test set • No parameter tuning per data sample • Need real data samples in addition to artificial • We haven’t found any streaming anomaly detection benchmarks so far
  18. 18. Numenta Anomaly Benchmark (NAB) • Work in progress • High velocity, streaming data • Currently 21 real data samples and 11 artificial samples • Hand-labeled, requiring multiple labelers to agree • Open source, configurable – Currently runs HTM Learning Algorithms and Etsy Skyline algorithms • Follow progress at http://github.com/numenta/nab • Please participate!
  19. 19. Next Steps Read the white paper http://numenta.com/#technology Scott Purdy spurdy@numenta.com NAB http://github.com/numenta/nab Algorithm code http://numenta.org @numenta @scottmpurdy
  • KevinYu59

    Jun. 1, 2017
  • kartiktv

    Jul. 25, 2016
  • LionelKusch

    May. 14, 2016
  • artob

    Dec. 19, 2015
  • fvizer

    Nov. 18, 2015
  • MohanBavirisetty

    Nov. 11, 2015
  • keefie98

    May. 27, 2015

Presentation by Scott Purdy at Numenta Workshop on October 17, 2014.

Views

Total views

6,702

On Slideshare

0

From embeds

0

Number of embeds

3,820

Actions

Downloads

0

Shares

0

Comments

0

Likes

7

×