SlideShare a Scribd company logo
1 of 27
Download to read offline
2020/07/19
Ho Seong Lee (hoya012)
Cognex Deep Learning Lab
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 1
Contents
• Introduction
• Related Work
• Datasets for Anomaly Detection in natural images
• Methods
• MVTec-AD Dataset
• Benchmark
• Discussion
• Conclusion
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 2
Introduction
What is Anomaly Detection?
• Detecting anomalous regions in images or videos or time-series data
• Today, we will focus anomaly detection in industrial images
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 3
Reference: https://hoya012.github.io/blog/anomaly-detection-overview-1/
Introduction
Definition of Anomaly Detection
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 4
• I wrote technical blog post about anomaly detection (Korean Only TT)
• I recommend reading this post if you want to know more about anomaly detection
Reference: https://hoya012.github.io/blog/anomaly-detection-overview-1/
Introduction
Anomaly Detection in PR-12
• Unsupervised Anomaly Detection 1, One-Class Anomaly Detection 1, Out-of-distribution Detection 2
• Minor topic.. 5/263 = 1.9%
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 5
PR-115
By 강민국님
PR-148
By 강민국님
PR-190
By 강민국님
PR-235
By 이도엽님
Related Work
Datasets for Anomaly Detection in natural images
• One-Class Anomaly Detection: MNIST, CIFAR-10
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 6
Normal Class Anomaly Class
Provide a large amount of
train/test data
Anomaly samples differ
significantly from normal
samples
Related Work
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 7
Datasets for Anomaly Detection in natural images
• Unsupervised Anomaly Detection (Segmentation): DAGM 2007, NanoTWICE
Only focus on the
inspection of textured
surfaces
Only 5 defect-free images
can be used for training
Defects were generated
by similar texture models
Related Work
Methods - Generative Adversarial Networks (AnoGAN) → See PR-115!
• Train GAN using normal samples, and fix Generator and Discriminator
• Search for a latent sample that reproduces a given input image and manages to fool discriminator
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 8
Reference: “Unsupervised Anomaly Detection with Generative Adversarial Networks to Guide Marker Discovery”
Related Work
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 9
Methods – Deep Convolutional Autoencoders and Variational Autoencoders (VAE)
• Reconstruct normal training samples through a bottleneck (latent space)
• During testing, they fail to reproduce images that differ from the data that was observed during training
Many papers provide further
evidence that probabilities obtained
from VAEs and other deep generative
models might fail to model the true
likelihood of the training data
Reference: https://hoya012.github.io/blog/anomaly-detection-overview-1/
Related Work
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 10
Methods – Features of pre-trained CNN (Feature Dictionary)
• Use feature descriptors obtained from ImageNet pre-trained CNN
• Sliding window-based approach → for large images, very slow
Reference: “Anomaly Detection in Nanofibrous Materials by CNN-Based Self Similarity”
Related Work
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 11
Methods – Traditional Methods
• Use hand-crafted feature descriptors from defect-free texture images
• Gaussian Mixture Model(GMM)-based Texture Inspection Model for texture objects
• Shape-based Matching-based Variation Model for non-texture objects
MVTec-AD Dataset
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection
MVTec-AD Dataset Description
• 15 categories with 3629 images for training and validation and 1725 images for testing
• The defects were generated with the aim to produce realistic anomalies as they would occur in real-
world industrial inspection scenarios
12
OK OK / NG
MVTec-AD Dataset
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 13
Grid, Screw, Zipper = Gray(1ch)
MVTec-AD Dataset
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 14
MVTec-AD Dataset Description
• All images were acquired using a 2048 x 2048 pixel high-resolution industrial RGB sensor
• Afterwards, the images were cropped to a suitable output size (700 x 700 ~ 1024 x 1024)
• The images were acquired under highly controlled illumination conditions. For some object classes,
however, the illumination was altered intentionally to increase variability
• Provide pixel-precise ground truth labels for each defective image region
Benchmark
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 15
Evaluated Methods
• AnoGAN: use the publicly available implementation from DoYup Lee!! (PR-12 Presenter)
• L2 and SSIM Autoencoder: same CAE architecture as MVTec’s previous paper
• CNN Feature Dictionary: 512-dimensional avgpool layer of ResNet-18 pretrained on ImageNet
• GMM-Based Texture Inspection Model / Variation Model: use HALCON machine vision library from
MVTec
MVTec CAE
→ www.github.com/LeeDoYup/AnoGAN-tf
Benchmark
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 16
Data Augmentation
• Texture images: random crop rotated rectangular patches of fixed size
• Object images: random translation and rotation and additional random flip if possible
Texture Object
Benchmark
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 17
Evaluation
• Output score is one-channel spatial map. To obtain a final segmentation result, a threshold must be
determined! Use defect-free validation images to estimate the threshold
Ideal threshold.. But.. we don’t know
Original Reconstructed Anomaly Map
Reference: SUALAB Research Team Park’s Figure
Benchmark
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 18
Evaluation
• Output score is one-channel spatial map. To obtain a final segmentation result, a threshold must be
determined! Use defect-free validation images to estimate the threshold
• For every category, define a minimum defect area that a connected component in the thresholded
anomaly map must have to be classified as a defective region
• Successively segment the anomaly maps of the defect-free validation images with increasing thresholds
• The threshold that yielded this segmentation is used for further evaluation
Human’s prior
Maybe.. Blob?
Benchmark
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 19
Evaluation Metric
• For classification, use the accuracy of correctly classified test images
• For segmentation, use relative per-region overlap (IoU) and AUROC (which is independent of threshold)
Red line: GT / Green map: Prediction
Benchmark
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 20
Results
Classification Segmentation
Top row: OK
Bottom row: NG
Top row: IoU
Bottom row: AUROC
Benchmark
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 21
Results
• For texture images, none of methods emerges as a clear winner
• Autoencoder and CNN Feature Dictionary perform well
Benchmark
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 22
Results
• For object images, Autoencoder achieve the best results
• L2 AE achieves better per-region overlap values
Discussion
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection
AnoGAN
• Observe a tendency of GAN training to result in mode collapse
• AnoGAN has great difficulties with object categories for which the objects appear in various
shapes or orientations in the dataset
23
Discussion
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection
Autoencoder
• Observe stable training across all dataset categories both SSIM AE and L2 AE
• For some categories, however, fail to model small details, which results in rather blurry reconstruction
• This is especially the case for high-frequency textures, which appear in tile and zipper
24
False Positive!
Discussion
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection
CNN Feature Dictionary
• As a method proposed for detection of anomalous regions in textured surfaces, CNN Feature
Dictionary achieves satisfactory results for all textures except grid
• Its performance degenerates when evaluated on objects categories
25
False Negative!
Discussion
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection
Others (Texture Inspection Model and Variation Model)
• Good performance and Bad performance..
26
Texture
Inspection
Variation
Model
Conclusion
PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection
• Introduce MVTec Anomaly Detection dataset (MVTec-AD), a novel dataset for unsupervised
anomaly detection mimicking real-world industrial inspection scenarios
• Several methods were thoroughly evaluated on this dataset
• The evaluations provide a first benchmark on this dataset and show that there is still
considerable room for improvement
→ The first real-world industrial unsupervised anomaly detection dataset!
→ Some ambiguous evaluation metric(threshold).. And didn’t provide source code
27

More Related Content

What's hot

Multiclass classification of imbalanced data
Multiclass classification of imbalanced dataMulticlass classification of imbalanced data
Multiclass classification of imbalanced dataSaurabhWani6
 
A100 GPU 搭載! P4d インスタンス 使いこなしのコツ
A100 GPU 搭載! P4d インスタンス使いこなしのコツA100 GPU 搭載! P4d インスタンス使いこなしのコツ
A100 GPU 搭載! P4d インスタンス 使いこなしのコツKuninobu SaSaki
 
Deep convolutional neural fields for depth estimation from a single image
Deep convolutional neural fields for depth estimation from a single imageDeep convolutional neural fields for depth estimation from a single image
Deep convolutional neural fields for depth estimation from a single imageWei Yang
 
Introduction to Visual transformers
Introduction to Visual transformers Introduction to Visual transformers
Introduction to Visual transformers leopauly
 
Transformer in Vision
Transformer in VisionTransformer in Vision
Transformer in VisionSangmin Woo
 
Text Analysis with Machine Learning
Text Analysis with Machine LearningText Analysis with Machine Learning
Text Analysis with Machine LearningTuri, Inc.
 
[DL輪読会]A Simple Unified Framework for Detecting Out-of-Distribution Samples a...
[DL輪読会]A Simple Unified Framework for Detecting Out-of-Distribution Samples a...[DL輪読会]A Simple Unified Framework for Detecting Out-of-Distribution Samples a...
[DL輪読会]A Simple Unified Framework for Detecting Out-of-Distribution Samples a...Deep Learning JP
 
The Factorization Machines algorithm for building recommendation system - Paw...
The Factorization Machines algorithm for building recommendation system - Paw...The Factorization Machines algorithm for building recommendation system - Paw...
The Factorization Machines algorithm for building recommendation system - Paw...Evention
 
AIFrienz_Webinar_Tomomi_Research_Inc).pdf
AIFrienz_Webinar_Tomomi_Research_Inc).pdfAIFrienz_Webinar_Tomomi_Research_Inc).pdf
AIFrienz_Webinar_Tomomi_Research_Inc).pdfSeong-Hun Choe
 
Threat Detection in Surveillance Videos
Threat Detection in Surveillance VideosThreat Detection in Surveillance Videos
Threat Detection in Surveillance VideosDatabricks
 
Vision Transformer(ViT) / An Image is Worth 16*16 Words: Transformers for Ima...
Vision Transformer(ViT) / An Image is Worth 16*16 Words: Transformers for Ima...Vision Transformer(ViT) / An Image is Worth 16*16 Words: Transformers for Ima...
Vision Transformer(ViT) / An Image is Worth 16*16 Words: Transformers for Ima...changedaeoh
 
Thai Text processing by Transfer Learning using Transformer (Bert)
Thai Text processing by Transfer Learning using Transformer (Bert)Thai Text processing by Transfer Learning using Transformer (Bert)
Thai Text processing by Transfer Learning using Transformer (Bert)Kobkrit Viriyayudhakorn
 
[DL輪読会]Encoder-Decoder with Atrous Separable Convolution for Semantic Image S...
[DL輪読会]Encoder-Decoder with Atrous Separable Convolution for Semantic Image S...[DL輪読会]Encoder-Decoder with Atrous Separable Convolution for Semantic Image S...
[DL輪読会]Encoder-Decoder with Atrous Separable Convolution for Semantic Image S...Deep Learning JP
 
Book of the Dead: Environmental Design, Tools, and Techniques for Photo-Real ...
Book of the Dead: Environmental Design, Tools, and Techniques for Photo-Real ...Book of the Dead: Environmental Design, Tools, and Techniques for Photo-Real ...
Book of the Dead: Environmental Design, Tools, and Techniques for Photo-Real ...Unity Technologies
 
Voyage dans le monde du Deep Learning
Voyage dans le monde du Deep LearningVoyage dans le monde du Deep Learning
Voyage dans le monde du Deep LearningAlexia Audevart
 
対話テキストの自動要約
対話テキストの自動要約対話テキストの自動要約
対話テキストの自動要約Masahiro Yamamoto
 
深層学習の不確実性 - Uncertainty in Deep Neural Networks -
深層学習の不確実性 - Uncertainty in Deep Neural Networks -深層学習の不確実性 - Uncertainty in Deep Neural Networks -
深層学習の不確実性 - Uncertainty in Deep Neural Networks -tmtm otm
 
[논문리뷰] 딥러닝 적용한 EEG 연구 소개
[논문리뷰] 딥러닝 적용한 EEG 연구 소개[논문리뷰] 딥러닝 적용한 EEG 연구 소개
[논문리뷰] 딥러닝 적용한 EEG 연구 소개Donghyeon Kim
 
Introduction to MAML (Model Agnostic Meta Learning) with Discussions
Introduction to MAML (Model Agnostic Meta Learning) with DiscussionsIntroduction to MAML (Model Agnostic Meta Learning) with Discussions
Introduction to MAML (Model Agnostic Meta Learning) with DiscussionsJoonyoung Yi
 
[DL輪読会]A closer look at few shot classification
[DL輪読会]A closer look at few shot classification[DL輪読会]A closer look at few shot classification
[DL輪読会]A closer look at few shot classificationDeep Learning JP
 

What's hot (20)

Multiclass classification of imbalanced data
Multiclass classification of imbalanced dataMulticlass classification of imbalanced data
Multiclass classification of imbalanced data
 
A100 GPU 搭載! P4d インスタンス 使いこなしのコツ
A100 GPU 搭載! P4d インスタンス使いこなしのコツA100 GPU 搭載! P4d インスタンス使いこなしのコツ
A100 GPU 搭載! P4d インスタンス 使いこなしのコツ
 
Deep convolutional neural fields for depth estimation from a single image
Deep convolutional neural fields for depth estimation from a single imageDeep convolutional neural fields for depth estimation from a single image
Deep convolutional neural fields for depth estimation from a single image
 
Introduction to Visual transformers
Introduction to Visual transformers Introduction to Visual transformers
Introduction to Visual transformers
 
Transformer in Vision
Transformer in VisionTransformer in Vision
Transformer in Vision
 
Text Analysis with Machine Learning
Text Analysis with Machine LearningText Analysis with Machine Learning
Text Analysis with Machine Learning
 
[DL輪読会]A Simple Unified Framework for Detecting Out-of-Distribution Samples a...
[DL輪読会]A Simple Unified Framework for Detecting Out-of-Distribution Samples a...[DL輪読会]A Simple Unified Framework for Detecting Out-of-Distribution Samples a...
[DL輪読会]A Simple Unified Framework for Detecting Out-of-Distribution Samples a...
 
The Factorization Machines algorithm for building recommendation system - Paw...
The Factorization Machines algorithm for building recommendation system - Paw...The Factorization Machines algorithm for building recommendation system - Paw...
The Factorization Machines algorithm for building recommendation system - Paw...
 
AIFrienz_Webinar_Tomomi_Research_Inc).pdf
AIFrienz_Webinar_Tomomi_Research_Inc).pdfAIFrienz_Webinar_Tomomi_Research_Inc).pdf
AIFrienz_Webinar_Tomomi_Research_Inc).pdf
 
Threat Detection in Surveillance Videos
Threat Detection in Surveillance VideosThreat Detection in Surveillance Videos
Threat Detection in Surveillance Videos
 
Vision Transformer(ViT) / An Image is Worth 16*16 Words: Transformers for Ima...
Vision Transformer(ViT) / An Image is Worth 16*16 Words: Transformers for Ima...Vision Transformer(ViT) / An Image is Worth 16*16 Words: Transformers for Ima...
Vision Transformer(ViT) / An Image is Worth 16*16 Words: Transformers for Ima...
 
Thai Text processing by Transfer Learning using Transformer (Bert)
Thai Text processing by Transfer Learning using Transformer (Bert)Thai Text processing by Transfer Learning using Transformer (Bert)
Thai Text processing by Transfer Learning using Transformer (Bert)
 
[DL輪読会]Encoder-Decoder with Atrous Separable Convolution for Semantic Image S...
[DL輪読会]Encoder-Decoder with Atrous Separable Convolution for Semantic Image S...[DL輪読会]Encoder-Decoder with Atrous Separable Convolution for Semantic Image S...
[DL輪読会]Encoder-Decoder with Atrous Separable Convolution for Semantic Image S...
 
Book of the Dead: Environmental Design, Tools, and Techniques for Photo-Real ...
Book of the Dead: Environmental Design, Tools, and Techniques for Photo-Real ...Book of the Dead: Environmental Design, Tools, and Techniques for Photo-Real ...
Book of the Dead: Environmental Design, Tools, and Techniques for Photo-Real ...
 
Voyage dans le monde du Deep Learning
Voyage dans le monde du Deep LearningVoyage dans le monde du Deep Learning
Voyage dans le monde du Deep Learning
 
対話テキストの自動要約
対話テキストの自動要約対話テキストの自動要約
対話テキストの自動要約
 
深層学習の不確実性 - Uncertainty in Deep Neural Networks -
深層学習の不確実性 - Uncertainty in Deep Neural Networks -深層学習の不確実性 - Uncertainty in Deep Neural Networks -
深層学習の不確実性 - Uncertainty in Deep Neural Networks -
 
[논문리뷰] 딥러닝 적용한 EEG 연구 소개
[논문리뷰] 딥러닝 적용한 EEG 연구 소개[논문리뷰] 딥러닝 적용한 EEG 연구 소개
[논문리뷰] 딥러닝 적용한 EEG 연구 소개
 
Introduction to MAML (Model Agnostic Meta Learning) with Discussions
Introduction to MAML (Model Agnostic Meta Learning) with DiscussionsIntroduction to MAML (Model Agnostic Meta Learning) with Discussions
Introduction to MAML (Model Agnostic Meta Learning) with Discussions
 
[DL輪読会]A closer look at few shot classification
[DL輪読会]A closer look at few shot classification[DL輪読会]A closer look at few shot classification
[DL輪読会]A closer look at few shot classification
 

Similar to MVTec AD: A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection

深度學習在AOI的應用
深度學習在AOI的應用深度學習在AOI的應用
深度學習在AOI的應用CHENHuiMei
 
Deep learning fundamental and Research project on IBM POWER9 system from NUS
Deep learning fundamental and Research project on IBM POWER9 system from NUSDeep learning fundamental and Research project on IBM POWER9 system from NUS
Deep learning fundamental and Research project on IBM POWER9 system from NUSGanesan Narayanasamy
 
Revisiting the Calibration of Modern Neural Networks
Revisiting the Calibration of Modern Neural NetworksRevisiting the Calibration of Modern Neural Networks
Revisiting the Calibration of Modern Neural NetworksSungchul Kim
 
Computer vision-nit-silchar-hackathon
Computer vision-nit-silchar-hackathonComputer vision-nit-silchar-hackathon
Computer vision-nit-silchar-hackathonAditya Bhattacharya
 
Video Annotation for Visual Tracking via Selection and Refinement_tran.pptx
Video Annotation for Visual Tracking via Selection and Refinement_tran.pptxVideo Annotation for Visual Tracking via Selection and Refinement_tran.pptx
Video Annotation for Visual Tracking via Selection and Refinement_tran.pptxAlyaaMachi
 
Multispectral imaging in Plant Sciences with VideometerLab 3
Multispectral imaging in Plant Sciences with VideometerLab 3Multispectral imaging in Plant Sciences with VideometerLab 3
Multispectral imaging in Plant Sciences with VideometerLab 3Adrian Waltho
 
Online video object segmentation via convolutional trident network
Online video object segmentation via convolutional trident networkOnline video object segmentation via convolutional trident network
Online video object segmentation via convolutional trident networkNAVER Engineering
 
PERFORMANCE EVALUATION OF SPATIAL AND FRACTAL WATERMARKING ALGORITHM
PERFORMANCE EVALUATION OF SPATIAL AND FRACTAL WATERMARKING ALGORITHMPERFORMANCE EVALUATION OF SPATIAL AND FRACTAL WATERMARKING ALGORITHM
PERFORMANCE EVALUATION OF SPATIAL AND FRACTAL WATERMARKING ALGORITHMAM Publications
 
IRJET- Object Detection and Recognition using Single Shot Multi-Box Detector
IRJET- Object Detection and Recognition using Single Shot Multi-Box DetectorIRJET- Object Detection and Recognition using Single Shot Multi-Box Detector
IRJET- Object Detection and Recognition using Single Shot Multi-Box DetectorIRJET Journal
 
Overview of DuraMat software tool development
Overview of DuraMat software tool developmentOverview of DuraMat software tool development
Overview of DuraMat software tool developmentAnubhav Jain
 
How might machine learning help advance solar PV research?
How might machine learning help advance solar PV research?How might machine learning help advance solar PV research?
How might machine learning help advance solar PV research?Anubhav Jain
 
End-to-End Object Detection with Transformers
End-to-End Object Detection with TransformersEnd-to-End Object Detection with Transformers
End-to-End Object Detection with TransformersSeunghyun Hwang
 
Blood Cell Image Classification for Detecting Malaria using CNN
Blood Cell Image Classification for Detecting Malaria using CNNBlood Cell Image Classification for Detecting Malaria using CNN
Blood Cell Image Classification for Detecting Malaria using CNNIRJET Journal
 
PR-433: Test-time Training with Masked Autoencoders
PR-433: Test-time Training with Masked AutoencodersPR-433: Test-time Training with Masked Autoencoders
PR-433: Test-time Training with Masked AutoencodersSunghoon Joo
 
2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘
2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘
2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘CHENHuiMei
 
Real Time Object Dectection using machine learning
Real Time Object Dectection using machine learningReal Time Object Dectection using machine learning
Real Time Object Dectection using machine learningpratik pratyay
 
NVIDIA 深度學習教育機構 (DLI): Medical image segmentation using digits
NVIDIA 深度學習教育機構 (DLI): Medical image segmentation using digitsNVIDIA 深度學習教育機構 (DLI): Medical image segmentation using digits
NVIDIA 深度學習教育機構 (DLI): Medical image segmentation using digitsNVIDIA Taiwan
 

Similar to MVTec AD: A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection (20)

深度學習在AOI的應用
深度學習在AOI的應用深度學習在AOI的應用
深度學習在AOI的應用
 
Image anaysis
Image anaysisImage anaysis
Image anaysis
 
Deep learning fundamental and Research project on IBM POWER9 system from NUS
Deep learning fundamental and Research project on IBM POWER9 system from NUSDeep learning fundamental and Research project on IBM POWER9 system from NUS
Deep learning fundamental and Research project on IBM POWER9 system from NUS
 
Revisiting the Calibration of Modern Neural Networks
Revisiting the Calibration of Modern Neural NetworksRevisiting the Calibration of Modern Neural Networks
Revisiting the Calibration of Modern Neural Networks
 
Computer vision-nit-silchar-hackathon
Computer vision-nit-silchar-hackathonComputer vision-nit-silchar-hackathon
Computer vision-nit-silchar-hackathon
 
Video Annotation for Visual Tracking via Selection and Refinement_tran.pptx
Video Annotation for Visual Tracking via Selection and Refinement_tran.pptxVideo Annotation for Visual Tracking via Selection and Refinement_tran.pptx
Video Annotation for Visual Tracking via Selection and Refinement_tran.pptx
 
lec6a.ppt
lec6a.pptlec6a.ppt
lec6a.ppt
 
Multispectral imaging in Plant Sciences with VideometerLab 3
Multispectral imaging in Plant Sciences with VideometerLab 3Multispectral imaging in Plant Sciences with VideometerLab 3
Multispectral imaging in Plant Sciences with VideometerLab 3
 
Online video object segmentation via convolutional trident network
Online video object segmentation via convolutional trident networkOnline video object segmentation via convolutional trident network
Online video object segmentation via convolutional trident network
 
PERFORMANCE EVALUATION OF SPATIAL AND FRACTAL WATERMARKING ALGORITHM
PERFORMANCE EVALUATION OF SPATIAL AND FRACTAL WATERMARKING ALGORITHMPERFORMANCE EVALUATION OF SPATIAL AND FRACTAL WATERMARKING ALGORITHM
PERFORMANCE EVALUATION OF SPATIAL AND FRACTAL WATERMARKING ALGORITHM
 
IRJET- Object Detection and Recognition using Single Shot Multi-Box Detector
IRJET- Object Detection and Recognition using Single Shot Multi-Box DetectorIRJET- Object Detection and Recognition using Single Shot Multi-Box Detector
IRJET- Object Detection and Recognition using Single Shot Multi-Box Detector
 
Overview of DuraMat software tool development
Overview of DuraMat software tool developmentOverview of DuraMat software tool development
Overview of DuraMat software tool development
 
How might machine learning help advance solar PV research?
How might machine learning help advance solar PV research?How might machine learning help advance solar PV research?
How might machine learning help advance solar PV research?
 
End-to-End Object Detection with Transformers
End-to-End Object Detection with TransformersEnd-to-End Object Detection with Transformers
End-to-End Object Detection with Transformers
 
slide-171212080528.pptx
slide-171212080528.pptxslide-171212080528.pptx
slide-171212080528.pptx
 
Blood Cell Image Classification for Detecting Malaria using CNN
Blood Cell Image Classification for Detecting Malaria using CNNBlood Cell Image Classification for Detecting Malaria using CNN
Blood Cell Image Classification for Detecting Malaria using CNN
 
PR-433: Test-time Training with Masked Autoencoders
PR-433: Test-time Training with Masked AutoencodersPR-433: Test-time Training with Masked Autoencoders
PR-433: Test-time Training with Masked Autoencoders
 
2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘
2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘
2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘
 
Real Time Object Dectection using machine learning
Real Time Object Dectection using machine learningReal Time Object Dectection using machine learning
Real Time Object Dectection using machine learning
 
NVIDIA 深度學習教育機構 (DLI): Medical image segmentation using digits
NVIDIA 深度學習教育機構 (DLI): Medical image segmentation using digitsNVIDIA 深度學習教育機構 (DLI): Medical image segmentation using digits
NVIDIA 深度學習教育機構 (DLI): Medical image segmentation using digits
 

More from LEE HOSEONG

Unsupervised anomaly detection using style distillation
Unsupervised anomaly detection using style distillationUnsupervised anomaly detection using style distillation
Unsupervised anomaly detection using style distillationLEE HOSEONG
 
do adversarially robust image net models transfer better
do adversarially robust image net models transfer betterdo adversarially robust image net models transfer better
do adversarially robust image net models transfer betterLEE HOSEONG
 
CNN Architecture A to Z
CNN Architecture A to ZCNN Architecture A to Z
CNN Architecture A to ZLEE HOSEONG
 
carrier of_tricks_for_image_classification
carrier of_tricks_for_image_classificationcarrier of_tricks_for_image_classification
carrier of_tricks_for_image_classificationLEE HOSEONG
 
"The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Gen...
"The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Gen..."The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Gen...
"The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Gen...LEE HOSEONG
 
Mixed Precision Training Review
Mixed Precision Training ReviewMixed Precision Training Review
Mixed Precision Training ReviewLEE HOSEONG
 
YOLOv4: optimal speed and accuracy of object detection review
YOLOv4: optimal speed and accuracy of object detection reviewYOLOv4: optimal speed and accuracy of object detection review
YOLOv4: optimal speed and accuracy of object detection reviewLEE HOSEONG
 
FixMatch:simplifying semi supervised learning with consistency and confidence
FixMatch:simplifying semi supervised learning with consistency and confidenceFixMatch:simplifying semi supervised learning with consistency and confidence
FixMatch:simplifying semi supervised learning with consistency and confidenceLEE HOSEONG
 
"Revisiting self supervised visual representation learning" Paper Review
"Revisiting self supervised visual representation learning" Paper Review"Revisiting self supervised visual representation learning" Paper Review
"Revisiting self supervised visual representation learning" Paper ReviewLEE HOSEONG
 
Unsupervised visual representation learning overview: Toward Self-Supervision
Unsupervised visual representation learning overview: Toward Self-SupervisionUnsupervised visual representation learning overview: Toward Self-Supervision
Unsupervised visual representation learning overview: Toward Self-SupervisionLEE HOSEONG
 
Human uncertainty makes classification more robust, ICCV 2019 Review
Human uncertainty makes classification more robust, ICCV 2019 ReviewHuman uncertainty makes classification more robust, ICCV 2019 Review
Human uncertainty makes classification more robust, ICCV 2019 ReviewLEE HOSEONG
 
Single Image Super Resolution Overview
Single Image Super Resolution OverviewSingle Image Super Resolution Overview
Single Image Super Resolution OverviewLEE HOSEONG
 
2019 ICLR Best Paper Review
2019 ICLR Best Paper Review2019 ICLR Best Paper Review
2019 ICLR Best Paper ReviewLEE HOSEONG
 
2019 cvpr paper_overview
2019 cvpr paper_overview2019 cvpr paper_overview
2019 cvpr paper_overviewLEE HOSEONG
 
"Google Vizier: A Service for Black-Box Optimization" Paper Review
"Google Vizier: A Service for Black-Box Optimization" Paper Review"Google Vizier: A Service for Black-Box Optimization" Paper Review
"Google Vizier: A Service for Black-Box Optimization" Paper ReviewLEE HOSEONG
 
"Searching for Activation Functions" Paper Review
"Searching for Activation Functions" Paper Review"Searching for Activation Functions" Paper Review
"Searching for Activation Functions" Paper ReviewLEE HOSEONG
 
"Learning transferable architectures for scalable image recognition" Paper Re...
"Learning transferable architectures for scalable image recognition" Paper Re..."Learning transferable architectures for scalable image recognition" Paper Re...
"Learning transferable architectures for scalable image recognition" Paper Re...LEE HOSEONG
 
"Learning From Noisy Large-Scale Datasets With Minimal Supervision" Paper Review
"Learning From Noisy Large-Scale Datasets With Minimal Supervision" Paper Review"Learning From Noisy Large-Scale Datasets With Minimal Supervision" Paper Review
"Learning From Noisy Large-Scale Datasets With Minimal Supervision" Paper ReviewLEE HOSEONG
 
"Dataset and metrics for predicting local visible differences" Paper Review
"Dataset and metrics for predicting local visible differences" Paper Review"Dataset and metrics for predicting local visible differences" Paper Review
"Dataset and metrics for predicting local visible differences" Paper ReviewLEE HOSEONG
 
"From image level to pixel-level labeling with convolutional networks" Paper ...
"From image level to pixel-level labeling with convolutional networks" Paper ..."From image level to pixel-level labeling with convolutional networks" Paper ...
"From image level to pixel-level labeling with convolutional networks" Paper ...LEE HOSEONG
 

More from LEE HOSEONG (20)

Unsupervised anomaly detection using style distillation
Unsupervised anomaly detection using style distillationUnsupervised anomaly detection using style distillation
Unsupervised anomaly detection using style distillation
 
do adversarially robust image net models transfer better
do adversarially robust image net models transfer betterdo adversarially robust image net models transfer better
do adversarially robust image net models transfer better
 
CNN Architecture A to Z
CNN Architecture A to ZCNN Architecture A to Z
CNN Architecture A to Z
 
carrier of_tricks_for_image_classification
carrier of_tricks_for_image_classificationcarrier of_tricks_for_image_classification
carrier of_tricks_for_image_classification
 
"The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Gen...
"The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Gen..."The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Gen...
"The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Gen...
 
Mixed Precision Training Review
Mixed Precision Training ReviewMixed Precision Training Review
Mixed Precision Training Review
 
YOLOv4: optimal speed and accuracy of object detection review
YOLOv4: optimal speed and accuracy of object detection reviewYOLOv4: optimal speed and accuracy of object detection review
YOLOv4: optimal speed and accuracy of object detection review
 
FixMatch:simplifying semi supervised learning with consistency and confidence
FixMatch:simplifying semi supervised learning with consistency and confidenceFixMatch:simplifying semi supervised learning with consistency and confidence
FixMatch:simplifying semi supervised learning with consistency and confidence
 
"Revisiting self supervised visual representation learning" Paper Review
"Revisiting self supervised visual representation learning" Paper Review"Revisiting self supervised visual representation learning" Paper Review
"Revisiting self supervised visual representation learning" Paper Review
 
Unsupervised visual representation learning overview: Toward Self-Supervision
Unsupervised visual representation learning overview: Toward Self-SupervisionUnsupervised visual representation learning overview: Toward Self-Supervision
Unsupervised visual representation learning overview: Toward Self-Supervision
 
Human uncertainty makes classification more robust, ICCV 2019 Review
Human uncertainty makes classification more robust, ICCV 2019 ReviewHuman uncertainty makes classification more robust, ICCV 2019 Review
Human uncertainty makes classification more robust, ICCV 2019 Review
 
Single Image Super Resolution Overview
Single Image Super Resolution OverviewSingle Image Super Resolution Overview
Single Image Super Resolution Overview
 
2019 ICLR Best Paper Review
2019 ICLR Best Paper Review2019 ICLR Best Paper Review
2019 ICLR Best Paper Review
 
2019 cvpr paper_overview
2019 cvpr paper_overview2019 cvpr paper_overview
2019 cvpr paper_overview
 
"Google Vizier: A Service for Black-Box Optimization" Paper Review
"Google Vizier: A Service for Black-Box Optimization" Paper Review"Google Vizier: A Service for Black-Box Optimization" Paper Review
"Google Vizier: A Service for Black-Box Optimization" Paper Review
 
"Searching for Activation Functions" Paper Review
"Searching for Activation Functions" Paper Review"Searching for Activation Functions" Paper Review
"Searching for Activation Functions" Paper Review
 
"Learning transferable architectures for scalable image recognition" Paper Re...
"Learning transferable architectures for scalable image recognition" Paper Re..."Learning transferable architectures for scalable image recognition" Paper Re...
"Learning transferable architectures for scalable image recognition" Paper Re...
 
"Learning From Noisy Large-Scale Datasets With Minimal Supervision" Paper Review
"Learning From Noisy Large-Scale Datasets With Minimal Supervision" Paper Review"Learning From Noisy Large-Scale Datasets With Minimal Supervision" Paper Review
"Learning From Noisy Large-Scale Datasets With Minimal Supervision" Paper Review
 
"Dataset and metrics for predicting local visible differences" Paper Review
"Dataset and metrics for predicting local visible differences" Paper Review"Dataset and metrics for predicting local visible differences" Paper Review
"Dataset and metrics for predicting local visible differences" Paper Review
 
"From image level to pixel-level labeling with convolutional networks" Paper ...
"From image level to pixel-level labeling with convolutional networks" Paper ..."From image level to pixel-level labeling with convolutional networks" Paper ...
"From image level to pixel-level labeling with convolutional networks" Paper ...
 

Recently uploaded

Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 

Recently uploaded (20)

Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 

MVTec AD: A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection

  • 1. 2020/07/19 Ho Seong Lee (hoya012) Cognex Deep Learning Lab PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 1
  • 2. Contents • Introduction • Related Work • Datasets for Anomaly Detection in natural images • Methods • MVTec-AD Dataset • Benchmark • Discussion • Conclusion PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 2
  • 3. Introduction What is Anomaly Detection? • Detecting anomalous regions in images or videos or time-series data • Today, we will focus anomaly detection in industrial images PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 3 Reference: https://hoya012.github.io/blog/anomaly-detection-overview-1/
  • 4. Introduction Definition of Anomaly Detection PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 4 • I wrote technical blog post about anomaly detection (Korean Only TT) • I recommend reading this post if you want to know more about anomaly detection Reference: https://hoya012.github.io/blog/anomaly-detection-overview-1/
  • 5. Introduction Anomaly Detection in PR-12 • Unsupervised Anomaly Detection 1, One-Class Anomaly Detection 1, Out-of-distribution Detection 2 • Minor topic.. 5/263 = 1.9% PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 5 PR-115 By 강민국님 PR-148 By 강민국님 PR-190 By 강민국님 PR-235 By 이도엽님
  • 6. Related Work Datasets for Anomaly Detection in natural images • One-Class Anomaly Detection: MNIST, CIFAR-10 PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 6 Normal Class Anomaly Class Provide a large amount of train/test data Anomaly samples differ significantly from normal samples
  • 7. Related Work PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 7 Datasets for Anomaly Detection in natural images • Unsupervised Anomaly Detection (Segmentation): DAGM 2007, NanoTWICE Only focus on the inspection of textured surfaces Only 5 defect-free images can be used for training Defects were generated by similar texture models
  • 8. Related Work Methods - Generative Adversarial Networks (AnoGAN) → See PR-115! • Train GAN using normal samples, and fix Generator and Discriminator • Search for a latent sample that reproduces a given input image and manages to fool discriminator PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 8 Reference: “Unsupervised Anomaly Detection with Generative Adversarial Networks to Guide Marker Discovery”
  • 9. Related Work PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 9 Methods – Deep Convolutional Autoencoders and Variational Autoencoders (VAE) • Reconstruct normal training samples through a bottleneck (latent space) • During testing, they fail to reproduce images that differ from the data that was observed during training Many papers provide further evidence that probabilities obtained from VAEs and other deep generative models might fail to model the true likelihood of the training data Reference: https://hoya012.github.io/blog/anomaly-detection-overview-1/
  • 10. Related Work PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 10 Methods – Features of pre-trained CNN (Feature Dictionary) • Use feature descriptors obtained from ImageNet pre-trained CNN • Sliding window-based approach → for large images, very slow Reference: “Anomaly Detection in Nanofibrous Materials by CNN-Based Self Similarity”
  • 11. Related Work PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 11 Methods – Traditional Methods • Use hand-crafted feature descriptors from defect-free texture images • Gaussian Mixture Model(GMM)-based Texture Inspection Model for texture objects • Shape-based Matching-based Variation Model for non-texture objects
  • 12. MVTec-AD Dataset PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection MVTec-AD Dataset Description • 15 categories with 3629 images for training and validation and 1725 images for testing • The defects were generated with the aim to produce realistic anomalies as they would occur in real- world industrial inspection scenarios 12 OK OK / NG
  • 13. MVTec-AD Dataset PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 13 Grid, Screw, Zipper = Gray(1ch)
  • 14. MVTec-AD Dataset PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 14 MVTec-AD Dataset Description • All images were acquired using a 2048 x 2048 pixel high-resolution industrial RGB sensor • Afterwards, the images were cropped to a suitable output size (700 x 700 ~ 1024 x 1024) • The images were acquired under highly controlled illumination conditions. For some object classes, however, the illumination was altered intentionally to increase variability • Provide pixel-precise ground truth labels for each defective image region
  • 15. Benchmark PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 15 Evaluated Methods • AnoGAN: use the publicly available implementation from DoYup Lee!! (PR-12 Presenter) • L2 and SSIM Autoencoder: same CAE architecture as MVTec’s previous paper • CNN Feature Dictionary: 512-dimensional avgpool layer of ResNet-18 pretrained on ImageNet • GMM-Based Texture Inspection Model / Variation Model: use HALCON machine vision library from MVTec MVTec CAE → www.github.com/LeeDoYup/AnoGAN-tf
  • 16. Benchmark PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 16 Data Augmentation • Texture images: random crop rotated rectangular patches of fixed size • Object images: random translation and rotation and additional random flip if possible Texture Object
  • 17. Benchmark PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 17 Evaluation • Output score is one-channel spatial map. To obtain a final segmentation result, a threshold must be determined! Use defect-free validation images to estimate the threshold Ideal threshold.. But.. we don’t know Original Reconstructed Anomaly Map Reference: SUALAB Research Team Park’s Figure
  • 18. Benchmark PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 18 Evaluation • Output score is one-channel spatial map. To obtain a final segmentation result, a threshold must be determined! Use defect-free validation images to estimate the threshold • For every category, define a minimum defect area that a connected component in the thresholded anomaly map must have to be classified as a defective region • Successively segment the anomaly maps of the defect-free validation images with increasing thresholds • The threshold that yielded this segmentation is used for further evaluation Human’s prior Maybe.. Blob?
  • 19. Benchmark PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 19 Evaluation Metric • For classification, use the accuracy of correctly classified test images • For segmentation, use relative per-region overlap (IoU) and AUROC (which is independent of threshold) Red line: GT / Green map: Prediction
  • 20. Benchmark PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 20 Results Classification Segmentation Top row: OK Bottom row: NG Top row: IoU Bottom row: AUROC
  • 21. Benchmark PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 21 Results • For texture images, none of methods emerges as a clear winner • Autoencoder and CNN Feature Dictionary perform well
  • 22. Benchmark PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection 22 Results • For object images, Autoencoder achieve the best results • L2 AE achieves better per-region overlap values
  • 23. Discussion PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection AnoGAN • Observe a tendency of GAN training to result in mode collapse • AnoGAN has great difficulties with object categories for which the objects appear in various shapes or orientations in the dataset 23
  • 24. Discussion PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection Autoencoder • Observe stable training across all dataset categories both SSIM AE and L2 AE • For some categories, however, fail to model small details, which results in rather blurry reconstruction • This is especially the case for high-frequency textures, which appear in tile and zipper 24 False Positive!
  • 25. Discussion PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection CNN Feature Dictionary • As a method proposed for detection of anomalous regions in textured surfaces, CNN Feature Dictionary achieves satisfactory results for all textures except grid • Its performance degenerates when evaluated on objects categories 25 False Negative!
  • 26. Discussion PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection Others (Texture Inspection Model and Variation Model) • Good performance and Bad performance.. 26 Texture Inspection Variation Model
  • 27. Conclusion PR-263 | MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection • Introduce MVTec Anomaly Detection dataset (MVTec-AD), a novel dataset for unsupervised anomaly detection mimicking real-world industrial inspection scenarios • Several methods were thoroughly evaluated on this dataset • The evaluations provide a first benchmark on this dataset and show that there is still considerable room for improvement → The first real-world industrial unsupervised anomaly detection dataset! → Some ambiguous evaluation metric(threshold).. And didn’t provide source code 27