2/2 Predictive Maintenance in Semiconductor Industry

Machine learning on the SECOM dataset (https://archive.ics.uci.edu/ml/datasets/SECOM), following CRISP-DM. Part 4: Building the statistical model

Build the classification model
1. Select features 2. Split dataset 3. Build models 4. Assess models
1. Select features
- Remove highly correlated features (correlation > 0.75)
- Features reduced from 436 to 208
- 3 feature subsets
  ● LVQ (20 features)
  ● RFE (15 features)
  ● Boruta (11 features)
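The correlation filter in the first bullet can be sketched as follows. This is a simplified greedy variant written for illustration, not necessarily the procedure used for the slides: a feature is kept only if its absolute Pearson correlation with every already-kept feature is at most the threshold. The function names `pearson` and `drop_correlated` and the toy columns are hypothetical.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation of two equal-length numeric lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # constant column: treat as uncorrelated
    return cov / sqrt(var_x * var_y)

def drop_correlated(columns, threshold=0.75):
    """Greedy filter: keep a feature only if |r| <= threshold
    against every feature kept so far (assumed variant)."""
    kept = []
    for name, values in columns.items():
        if all(abs(pearson(values, columns[k])) <= threshold for k in kept):
            kept.append(name)
    return kept

# Toy data (hypothetical): f2 is almost a multiple of f1, f3 is not.
toy = {
    "f1": [1.0, 2.0, 3.0, 4.0, 5.0],
    "f2": [2.1, 3.9, 6.2, 8.0, 9.9],
    "f3": [5.0, 1.0, 4.0, 2.0, 3.0],
}
kept = drop_correlated(toy)
print(kept)
```

The result of a greedy filter depends on column order; other heuristics decide which member of a correlated pair to drop in a different way.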
2. Split dataset
- Model set: first 80% of the data
  ● Train: 70%
  ● Test: 30%
- Validation: last 20% of the data
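A minimal sketch of this split, assuming the rows are in time order. The slide states that the validation set is the last 20% of the data; how train and test were drawn inside the model set is not stated, so the contiguous 70/30 cut below is an assumption made for simplicity. `chronological_split` is a hypothetical helper.

```python
def chronological_split(n_rows, validation_pct=20, test_pct=30):
    """Return index ranges (train, test, validation) for time-ordered rows.

    The last `validation_pct` percent is held out for validation; the
    remaining model set is cut into train and test. The contiguous
    train/test cut is an assumption, not taken from the slides.
    """
    n_validation = n_rows * validation_pct // 100
    n_model = n_rows - n_validation
    n_test = n_model * test_pct // 100
    n_train = n_model - n_test
    train = range(0, n_train)
    test = range(n_train, n_model)
    validation = range(n_model, n_rows)
    return train, test, validation

train, test, validation = chronological_split(1000)
print(len(train), len(test), len(validation))
```

Holding out the most recent rows for validation checks the model on data that comes after everything it was fitted on, which is closer to how a maintenance model would be used.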
3. Build models
- Linear methods: Linear Discriminant Analysis and Logistic Regression
- Non-linear methods: Neural Network, SVM, kNN
- Trees and rules: CART
- Ensembles of trees: Bagging CART, Random Forest and Stochastic Gradient Boosting
4. Assess models
- Features selected using RFE gave the best results, with the minimum error rate and the highest precision
- Bagging CART selected based on Cohen’s Kappa
References: (Kursa and Rudnicki 2010), (Guyon and Elisseeff 2003), (Holte 1993), (Lee, Lessler, and Stuart 2010), (Cutler and Zhao 2001), (Mohanbir 1996), (Kohavi 1995), (Wilson 1927), (Cohen 1960)
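Cohen's Kappa compares observed agreement with the agreement expected by chance, which matters when one class dominates. The sketch below computes it from a 2x2 confusion matrix; the function name and all counts are hypothetical and are not taken from the slides.

```python
def cohens_kappa(tp, fp, fn, tn):
    """Cohen's Kappa for a binary confusion matrix.

    kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed
    agreement and p_e the agreement expected by chance.
    """
    n = tp + fp + fn + tn
    p_observed = (tp + tn) / n
    p_expected = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / (n * n)
    if p_expected == 1:
        return 0.0  # degenerate case: only one class present and predicted
    return (p_observed - p_expected) / (1 - p_expected)

# Hypothetical counts: 1000 rows, 70 failures.
# A model that never predicts a failure is 93% accurate but has kappa 0.
kappa_always_pass = cohens_kappa(tp=0, fp=0, fn=70, tn=930)

# A model that catches 10 of the 70 failures with 5 false alarms.
kappa_some_skill = cohens_kappa(tp=10, fp=5, fn=60, tn=925)

print(kappa_always_pass, kappa_some_skill)
```

Both hypothetical models have accuracy near 93%, yet only the second has a Kappa above zero, which is why Kappa is a more informative selection criterion than accuracy on data this imbalanced.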

             LVQ    RFE    Boruta
Accuracy     92%    94%    92%
Sensitivity  0%     15%    12%
Precision    0%     67%    33%
*Wilson score interval
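The footnote refers to the Wilson score interval for a proportion. A minimal sketch of the standard formula follows, assuming a 95% level (z = 1.96); the slide does not state the level used. The example count of 352 correct out of 375 is hypothetical and only chosen to be close to 94%.

```python
from math import sqrt

def wilson_interval(successes, n, z=1.96):
    """Wilson score interval for a binomial proportion.

    z = 1.96 corresponds to roughly 95% confidence (assumed level).
    """
    if n <= 0:
        raise ValueError("n must be positive")
    p = successes / n
    z2 = z * z
    denom = 1 + z2 / n
    centre = (p + z2 / (2 * n)) / denom
    half_width = z * sqrt(p * (1 - p) / n + z2 / (4 * n * n)) / denom
    # Clamp to [0, 1] to absorb floating-point rounding at the edges.
    return max(0.0, centre - half_width), min(1.0, centre + half_width)

# Hypothetical example: 352 correct predictions out of 375 test rows.
low, high = wilson_interval(352, 375)
print(f"{low:.3f} to {high:.3f}")
```

Unlike the simple normal approximation, the Wilson interval stays inside [0, 1] and still gives a non-zero width when the observed proportion is 0% or 100%, which is relevant for the 0% sensitivity and precision entries.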

Feature set  Split       Accuracy  Sensitivity  Precision
LVQ          Testing     92%       0%           0%
LVQ          Validation  95%       6%           50%
RFE          Testing     94%       15%          67%
RFE          Validation  88%       0%           0%
Boruta       Testing     92%       12%          33%
Boruta       Validation  91%       0%           0%
All          Testing     91%       0%           0%
All          Validation  92%       6%           10%

Dataset:
Testing     375
Validation  314
Prevalence  7%
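The three metrics in the table follow from the confusion matrix. The sketch below shows the definitions and why accuracy alone is misleading at 7% prevalence. The counts are hypothetical (26 failures in 375 rows, about 7%), and reporting 0 when a ratio has a zero denominator is a convention assumed here.

```python
def classification_metrics(tp, fp, fn, tn):
    """Accuracy, sensitivity (recall) and precision from binary counts.

    Ratios with a zero denominator are reported as 0.0 (assumed convention).
    """
    total = tp + fp + fn + tn
    return {
        "accuracy": (tp + tn) / total,
        "sensitivity": tp / (tp + fn) if (tp + fn) else 0.0,
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
    }

# Hypothetical baseline: never predict a failure on 375 rows with 26 failures.
baseline = classification_metrics(tp=0, fp=0, fn=26, tn=349)
print(baseline)
```

A model that never flags a failure already reaches about 93% accuracy at this prevalence, so the rows in the table are better compared on sensitivity and precision.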

BACKUP

             Gradient Boosting CART  Big Data Random Forest model  Deep neural network
Accuracy     77%                     85%                           84%
Sensitivity  60%                     48%                           29%
Precision    17%                     21%                           15%
