주제 : AI를 이용한 외관검사프로세스의 트렌드
내용 :
● 일본의 제조업에서 공업제품 외관검사의 현실
● 공업제품의 AI 외관검사의 과제 및 해결방안
● 최근의 공업제품의 AI외관검사의 트렌드 및 주요 알고리즘의 소개
데모 사이트 : https://tomomiresearch.herokuapp.com/
Towards Total Recall in Industrial Anomaly Detectionharmonylab
公開URL:https://openaccess.thecvf.com/content/CVPR2022/papers/Roth_Towards_Total_Recall_in_Industrial_Anomaly_Detection_CVPR_2022_paper.pdf
出典:Karsten Roth, Latha Pemula, Joaquin Zepeda, Bernhard Schölkopf, Thomas Brox, Peter Gehler: Towards Total Recall in Industrial Anomaly Detection, Conference on Computer Vision and Pattern Recognition (CVPR), pp. 14318-14328 (2022)
概要:本論文では位置情報を考慮した特徴量の集合和であるメモリバンクとCoresetによる画像パッチ特徴量の削減を行うPatchCoreアルゴリズムを提案する.結果として、異常検出のベンチマークであるMVTecにおいてAUROC99%以上の精度を出力し,2022年時点でのSoTAを記録した.また,PatchCoreによる特徴量削減により,学習のサンプル数を20%に減らした場合でも以前のSoTAに匹敵する精度となった.
Towards Total Recall in Industrial Anomaly Detectionharmonylab
公開URL:https://openaccess.thecvf.com/content/CVPR2022/papers/Roth_Towards_Total_Recall_in_Industrial_Anomaly_Detection_CVPR_2022_paper.pdf
出典:Karsten Roth, Latha Pemula, Joaquin Zepeda, Bernhard Schölkopf, Thomas Brox, Peter Gehler: Towards Total Recall in Industrial Anomaly Detection, Conference on Computer Vision and Pattern Recognition (CVPR), pp. 14318-14328 (2022)
概要:本論文では位置情報を考慮した特徴量の集合和であるメモリバンクとCoresetによる画像パッチ特徴量の削減を行うPatchCoreアルゴリズムを提案する.結果として、異常検出のベンチマークであるMVTecにおいてAUROC99%以上の精度を出力し,2022年時点でのSoTAを記録した.また,PatchCoreによる特徴量削減により,学習のサンプル数を20%に減らした場合でも以前のSoTAに匹敵する精度となった.
cvpaper.challenge の Meta Study Group 発表スライド
cvpaper.challenge はコンピュータビジョン分野の今を映し、トレンドを創り出す挑戦です。論文サマリ・アイディア考案・議論・実装・論文投稿に取り組み、凡ゆる知識を共有します。2019の目標「トップ会議30+本投稿」「2回以上のトップ会議網羅的サーベイ」
http://xpaperchallenge.org/cv/
Electronic Circuit Assessment using Machine Learning (ML)vivatechijri
Traditionally after installing all the electronics on the circuit board part, a worker make sure the circuits are working properly. Motive is to build machines that can replace the repetitive function of Human and Test Electronic Circuit Performance using Computer Vision which is one of the advancements using machine learning. Printed circuit board (PCB) testing has been a critical process in electrical production industry to ensure product quality and reliability, reduce production costs and increase production. PCB testing involves the detection of errors on a PCB and the segmentation of those errors to identify the roots of errors. The proposed algorithm is broadly divided into five categories, feature detection and feature classification. The algorithm is able to perform tests even if the image is captured rotating, measuring and translating according to a template that performs algorithm rotation, scale and translation they are different. The newness of the algorithm is still at the beginning of analyzing the feature with its unique appearance as well firmness. In addition to this, the algorithm only takes 2,528 s to scan a PCB image. Performance of the proposed algorithm is verified by performing experiments on various PCB images and shows that the proposed algorithms suitable for automatic PCB view testing
cvpaper.challenge の Meta Study Group 発表スライド
cvpaper.challenge はコンピュータビジョン分野の今を映し、トレンドを創り出す挑戦です。論文サマリ・アイディア考案・議論・実装・論文投稿に取り組み、凡ゆる知識を共有します。2019の目標「トップ会議30+本投稿」「2回以上のトップ会議網羅的サーベイ」
http://xpaperchallenge.org/cv/
Electronic Circuit Assessment using Machine Learning (ML)vivatechijri
Traditionally after installing all the electronics on the circuit board part, a worker make sure the circuits are working properly. Motive is to build machines that can replace the repetitive function of Human and Test Electronic Circuit Performance using Computer Vision which is one of the advancements using machine learning. Printed circuit board (PCB) testing has been a critical process in electrical production industry to ensure product quality and reliability, reduce production costs and increase production. PCB testing involves the detection of errors on a PCB and the segmentation of those errors to identify the roots of errors. The proposed algorithm is broadly divided into five categories, feature detection and feature classification. The algorithm is able to perform tests even if the image is captured rotating, measuring and translating according to a template that performs algorithm rotation, scale and translation they are different. The newness of the algorithm is still at the beginning of analyzing the feature with its unique appearance as well firmness. In addition to this, the algorithm only takes 2,528 s to scan a PCB image. Performance of the proposed algorithm is verified by performing experiments on various PCB images and shows that the proposed algorithms suitable for automatic PCB view testing
Transfer Learning Model for Image Segmentation by Integrating U-NetPlusPlus a...YutaSuzuki27
In the image classification task, we only need to learn local features, but in the image segmentation task, we also need to learn positional information. Therefore, there is a difference between the image segmentation task and the image classification task in the features to be learned. In this study, we propose SE-U-Net++, which efficiently learns both local features and positional information by incorporating SE blocks, and a transfer learning algorithm that bridges the difference between the tasks by comparing parameters in the convolutional layer.
- How to tackle an object detection competition
- Schwert's 6th-place solution on Open Images Challenge 2019
- presented at the lunch workshop of the 26th Symposium on Sensing via Image Information (2020).
For the full video of this presentation, please visit:
https://www.edge-ai-vision.com/2021/01/video-activity-recognition-with-limited-data-for-smart-home-applications-a-presentation-from-comcast/
For more information about edge AI and computer vision, please visit:
https://www.edge-ai-vision.com
Hongcheng Wang, Director of Technical Research at Comcast, presents the “Video Activity Recognition with Limited Data for Smart Home Applications” tutorial at the September 2020 Embedded Vision Summit.
Comcast’s Xfinity Home connects millions of home smart cameras and IoT devices to improve its customers’ safety and security. The company’s teams use computer vision and deep learning to understand video and sensor data from these devices to identify relevant events so that it can improve the user experience.
Specifically, Comcast has explored the spatial-temporal relationships among objects, places and actions. The company has also developed a semi-supervised learning approach for video classification (VideoSSL) to detect certain activities using limited training data. Using these techniques, and as described in this presentation, it has achieved very promising results on activity recognition with multiple datasets.
Evaluation reports of various DC-DC converter products that are available in Japan. DC-DC converter is used to provide a stable output DC voltage to its load such as rechargeable battery. We have tested 5 types DC-DC converter samples which can be available in amazon.co.jp. As of 2016/12.
For more information, please visit the following site.
www.wireless-square.com
Analysis insight about a Flyball dog competition team's performanceroli9797
Insight of my analysis about a Flyball dog competition team's last year performance. Find more: https://github.com/rolandnagy-ds/flyball_race_analysis/tree/main
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Discussion on Vector Databases, Unstructured Data and AI
https://www.meetup.com/unstructured-data-meetup-new-york/
This meetup is for people working in unstructured data. Speakers will come present about related topics such as vector databases, LLMs, and managing data at scale. The intended audience of this group includes roles like machine learning engineers, data scientists, data engineers, software engineers, and PMs.This meetup was formerly Milvus Meetup, and is sponsored by Zilliz maintainers of Milvus.
Adjusting primitives for graph : SHORT REPORT / NOTESSubhajit Sahu
Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfEnterprise Wired
In this guide, we'll explore the key considerations and features to look for when choosing a Trusted analytics platform that meets your organization's needs and delivers actionable intelligence you can trust.
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Round table discussion of vector databases, unstructured data, ai, big data, real-time, robots and Milvus.
A lively discussion with NJ Gen AI Meetup Lead, Prasad and Procure.FYI's Co-Found
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...sameer shah
"Join us for STATATHON, a dynamic 2-day event dedicated to exploring statistical knowledge and its real-world applications. From theory to practice, participants engage in intensive learning sessions, workshops, and challenges, fostering a deeper understanding of statistical methodologies and their significance in various fields."
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
1. Tomomi Research Inc.
Trends in AI Visual Inspection
2022-06-15
Tomomi Research, Inc.
Seong-Hun Choe (Dr.Eng.)
https://www.tomomi-research.com/
2. Tomomi Research Inc.
About Me
2022/6/15 3
At Makers Faire Tokyo 2018
• Name: Seong-Hun Choe (崔 成熏)
• Education:
• Seoul National University
BE at Mechanical Engineering
• Tohoku University
Dr. Eng. at Nano Mechanics
• Work at:
• HITACHI, Ltd. (Researcher)
• Hitachi GST -> Western Digital (Principal
Engineer)
• Tomomi Research, Inc. (CTO)
• SNS:
• Twitter : @wireless_power
• Linkedin:
www.linkedin.com/in/seonghunchoe/
• Email: seonghun.choe@tomomi-
research.com
4bit CPU with TTL
TD4
https://makezine.jp/event/make
rs2018/m0268/
3. Tomomi Research Inc. 2022/6/15 4
Visual Inspection Status in Industrial Area
Labor Shortage
Number of Workers in Visual Inspection:
1.4M in Japan
Ratio in total workers in manufacturing :10~ 20%
Variation in
inspection result
• Depends on individuals
• Human error
• Cost in education
4. Tomomi Research Inc. 2022/6/15 5
Visual Inspection Status in Industrial Area
単調
熟練、重労働
Variation in
inspection
results
Heavy loads
• Driver Bits
• 10M Pieces/Month
• Failure Rate: 0.01%
• Only 1 or 2 of fails occurs a
day
5. Tomomi Research Inc. 2022/6/15 6
Market Size
1) Reference: Machine Vision industry Report 2015 Yole Development, Frost&Sullivan analysis
2) Reference: World Bank, U.S. Department of Labor
Automation Rate
Business
Field
100%
Semiconductor Display Electronics Car
Plastic Steel Paper Medicine
Textile Leather Food Airplane
Main Business Filed
Total Market Size
約6.7兆円(2)
Current Market size
7,000億円(1)
6. Tomomi Research Inc. 2022/6/15 7
Market Mapping
Metal
Food
Electronics
Car
Medicine
Leather
Textile
Plastic
Steel
Semiconductor
Automation
Easy to
see
manual
Hard to
see
7. Tomomi Research Inc. 2022/6/15 8
1.Classification 2.Object Detection
Four Main Algorithm for Visual Inspection
https://www.youtube.com/watch?v=UY6xbrcViVw&
ab_channel=MobiDev
8. Tomomi Research Inc. 2022/6/15 9
3.Segementation 4. Anomaly Detection (OK DATA only)
Four Main Algorithm for Visual Inspection
https://www.youtube.com/watch?v=9taNBy7XpFU&
ab_channel=ArayaInc.
Input Output
Anomaly
map
Features can
not be
generated =
Defect
9. Tomomi Research Inc. 2022/6/15 10
The Higher Yield Rate is, Fewer NG Data will be.
Anomaly Detection with only OK data
• Metal Driver
• 10M Pieces/Month
• Failure Rate: 0.01%
• Only 1 or 2 of fails occurs a
day
Classification Object Detection
Segmentation
Anomaly
Detection with
only OK data
10. Tomomi Research Inc.
Anomaly Detection on MVTec AD
2022/6/15 11
https://paperswithcode.com/sota/anomaly-detection-on-mvtec-ad
11. Tomomi Research Inc. 2022/6/15 12
https://www.mvtec.com/company/research/datasets/mvtec-ad
Datasets : MVTec AD
12. Tomomi Research Inc. 2022/6/15 13
AUROC Two AUROCs on MVTec AD
Metrics: AUROC
Image AUROC
13. Tomomi Research Inc. 2022/6/15 14
AUROC Two AUROCs on MVTec AD
Metrics: AUROC
Pixel AUROC
14. Tomomi Research Inc. 2022/6/15 15
AUROC Two AUROCs on MVTec AD
Metrics: two AUROCs
Image AUROC Pixel AUROC
15. Tomomi Research Inc. 2022/6/15 16
1. Image Level AD 2. Pixel Level AD
Two Metrics
https://qiita.com/makotoito/items/39bc64d30ce49a
9edad8
16. Tomomi Research Inc. 2022/6/15 17
Image vs. Pixel AUROC
Image Level AUROC
(Detection AUROC)
Pixel Level AUROC
(Segmentation AUROC)
Definition
• AUROC related to the Image level
• Global Anomaly
• AUROC related the pixel level
• Local Anomaly
Features • Real world Inspection result
• Easy to calculate
• Need to mask image (label image)
• Depends on defect size
21. Tomomi Research Inc. 2022/6/15 22
Pretrained network as a Feature Extractor
2. SPADE: Sub-Image Anomaly Detection with Deep
Pyramid Correspondences [2020/05]
https://arxiv.org/pdf/1112.6209.pdf
22. Tomomi Research Inc. 2022/6/15 23
Pretrained network as Feature Extractor
2. SPADE:idea
https://static.googleusercontent.com/media/researc
h.google.com/ja//archive/unsupervised_icml2012_s
lides.pdf
23. Tomomi Research Inc. 2022/6/15 24
• Prequel to SPADE (Same authors)
• Use the pretrained ResNet with ImageNet
data without re-training
• Store the extracted features fy with normal
image data y
• At interference phase, measure the k-
nearest neighbor distance between input
data y and stored feature data fy.
https://arxiv.org/abs/2002.10445
2. SPADE: DN2 Deep Nearest Neighbor Anomaly Detection(2020)
https://arxiv.org/abs/2002.10445
24. Tomomi Research Inc. 2022/6/15 25
• Use the pretrained ResNet with ImageNet
data without re-training
• Store the extracted features f(y,p) with
normal image data y and its each pixel p.
• At interference phase, measure the k-
nearest neighbor distance between input
data y and stored feature data f(y,p).
Image level -> Pixel Level
2: SPADE
25. Tomomi Research Inc. 2022/6/15 26
Result AUROC
2: SPADE
Image AUROC Pixel AUROC
https://github.com/byungjae89/SPADE-pytorch
26. Tomomi Research Inc. 2022/6/15 27
Pretrained Feature Extractor + Mahalanobis Distance
3. Gaussian-AD
https://qiita.com/makotoito/items/39bc64d30ce49a
9edad8
https://arxiv.org/pdf/2005.14140.pdf
28. Tomomi Research Inc. 2022/6/15 29
Result AUROC (Image Level only)
3. Gaussian AD
https://github.com/byungjae89/MahalanobisAD-
pytorch
29. Tomomi Research Inc. 2022/6/15 30
Pretrained Feature Extractor : Mahalanobis Distance (PaDiM)
• SPADE : slow inference due to kNN
• Mahalanobis Distance instead of kNN
• Use the pretrained ResNet with ImageNet
data without re-training
• Store the extracted features with normal
image data y and its each pixel p to the
mean μ and covariant matrix Σ .
• At interference phase, measure the
Mahalanobis distance between input data y
and stored feature data (μ ,Σ ).
4.PaDiM: a Patch Distribution Modeling Framework for Anomaly
Detection and Localization [2020/11]
https://arxiv.org/pdf/2011.08785.pdf
31. Tomomi Research Inc. 2022/6/15 32
Pretrained Feature Extractor, Sampling : Coreset Memory Bank.
5.PatchCore: Towards Total Recall in Industrial Anomaly
Detection [2021/06]
https://arxiv.org/abs/2106.08265
• Greedy Coreset to more efficient
sampling from normal data than
random sampling
•Inference time is faster than PaDiM
•SOTA@2022
33. Tomomi Research Inc. 34
Anomaly Detection with MVTecAD
Dataset : MVTecAD
Bottleneck Nuts
Capsule Metal nut
34. Tomomi Research Inc. 2022/6/15 35
Anomaly Detection with MVTecAD
Dataset : MVTecAD
Carpet Metal grid
Leather Wood
35. Tomomi Research Inc. 2022/6/15 36
Anomaly Detection History
Generative Model
(~2019)
Pretrained Model as
Feature Extractor
(2019~2022)
Autoencoder
Autoencoder-
SSIM
AnoGAN
DN2 SPADE PaDiM PatchCore
Gaussian AD
kNN
Image
level
kNN
Pixel
level
Mahalanobis distance
Image level
Faster than SPADE
Mahalanobis distance
Pixel level
Faster than PaDiM
Coreset sampling
SSIM
Better result
MSE
36. Tomomi Research Inc. 2022/6/15 37
Feature Extractor developed by Prof. Otsu Nobuyuki (2011)
6. HLAC (Higher-order Local Auto-Correlation)
Image
HLAC
Feature
Extraction
PCA
Anomaly
Score
Prof. Otsu Nobuyuki
cv2.threshold(gray, 0, 255, cv2.THRESH_OTSU)
https://www.aist.go.jp/pdf/aist_j/syn
thesiology/vol04_02/vol04_02_p70_p
79.pdf
37. Tomomi Research Inc. 2022/6/15 38
https://adacotech.co.jp/ : Japanse Startup Non-Deep Learning
6. HLAC (Higher-order Local Auto-Correlation)
• Small Train Images : 100~
• No GPU
• Higher Accuracy
https://www.nikkei.com/article/DGXZQOUC09ADH0
Z00C22A2000000/
38. Tomomi Research Inc. 2022/6/15 39
History of Pattern Recognition
Feature Extractor Design Deep Learning
39. Tomomi Research Inc. 2022/6/15 40
History of Anomaly Detection
Feature Extractor Design
~2011
Deep Learning
~2019
Pretrained model as
Feature Extractor
2019~
HLAC AutoEncoder
Ano-GAN
SPADE PaDiM
PatchCore
Gaussian AD
51. Tomomi Research Inc. 2022/6/15 52
Strategy on Market
Metal
Food
Electronics
Car
Medicine
Leather
Textile
Plastic
Steel
Semiconductor
Automation
Easy to
see
manual
Hard to
see
52. Tomomi Research Inc. 2022/6/15 53
Anomaly Detection Real Industrial Products
Summary
Defects
invisible!
53. Tomomi Research Inc. 54
2022/6/15
https://www.tomomi-research.com/ shorturl.at/hzAGL
Please, Subscribe Our YouTube
Channel.