BigML Education Program Evaluations

•

0 likes•453 views

This document discusses best practices for evaluating supervised machine learning models, including: 1) The importance of splitting data into training and testing sets to avoid "memorizing" the data and get an accurate performance measure. 2) Common dataset splitting methods like linear and random splits. 3) The importance of metrics like accuracy, and how they can be misleading, especially for imbalanced datasets. 4) How different domains may value reducing different types of mistakes, like preferring fewer false negatives for medical diagnosis.

Data & Analytics

BigML Education Program 2Evaluations
In This Video
• Introduction to and justiﬁcation of model evaluation
• Discussion of common missteps when evaluating a
supervised model
• Creation of a dataset split for training and evaluation of
a supervised model using BigML
• Creation and interpretation of an evaluation in the
BigML user interface

BigML Education Program 3Evaluations
Evaluation Workﬂow
TRAINING
DATASET
EVALUATION
TESTING
DATASET
+
ANALYZE PREDICTIONS
SUPERVISED
MODEL

BigML Education Program 4Evaluations
Memorizing Data
TRAINED MODEL
TRAINING
DATASET
TESTING
DATASET
EVALUATE
TRAINING
DATASET
EVALUATE
Excellent performance
Model already
“knows” the data
Poor performance
if model can’t
generalize

BigML Education Program 5Evaluations
Linear Dataset Split
Patient No.
Plasma
Glucose
BMI Pregnancies Diabetes?
1 80 28,5 4 No
2 111 20,7 2 No
3 156 41,0 0 Yes
4 131 35,1 2 Yes
5 70 32,2 1 No
6 86 21,8 4 No
7 42 18,6 0 No
8 150 33,1 6 Yes
9 92 25,9 3 No
10 145 30,7 5 Yes
Test Instances
Training Instances

BigML Education Program 6Evaluations
Random Dataset Split
Plasma Glucose BMI Pregnancies Diabetes?
80 28,5 4 No
111 20,7 2 No
156 41,0 0 Yes
131 35,1 2 Yes
70 32,2 1 No
86 21,8 4 No
42 18,6 0 No
150 33,1 6 Yes
92 25,9 3 No
145 30,7 5 Yes
Test Instances
Remainder used for
training

BigML Education Program 7Evaluations
Accuracy
Positive Example
Negative Example
Classiﬁed Positive Classiﬁed Negative
A trivial classiﬁer (always predict negative)
achieves 95% accuracy on unbalanced data!

BigML Education Program 8Evaluations
Accuracy
Positive Example
Negative Example
Classiﬁed Positive Classiﬁed Negative
On a balanced dataset,
95% accuracy indicates a competent classiﬁer

BigML Education Program 9Evaluations
Mistake Costs
• Which is worse in your domain, a false positive or a
false negative?
• Medical diagnosis
• Cost of a false postive: The patient has to
undergo more testing to discover they do not
have the disease (low cost?)
• Cost of a false negative: The patient is declared
healthy and the undetected disease progresses
(high cost?)
• Solution: Select a threshold for positive classiﬁcation
that makes the appropriate trade-oﬀ between mistakes

BigML Education Program 10Evaluations
Review
• Evaluations, when done correctly allow you to assess
the performance of your learned supervised model
• You should never evaluate a model on any of the data
used to train that model
• BigML allows 1-click splitting of your dataset for proper
training and testing
• Depending on your domain, metrics other than
accuracy may be required to properly understand your
model’s performance
• The evaluation resource view can be used to select an
operating point for the model that makes the correct
trade-oﬀ between false positives and false negatives for
your domain

What's hot

accountinganamorgannpa

Webinar: How machine learning can impact manufacturing industry? Mindbowser Inc

Resume-Predicting Profitability and Customer Preference Presentation-Brian Bu...Brian Burger

Bringing big data to lifeSKIM

Managing uncertainty in ai performance target settingNoelle Ibrahim

Machine Learning Application to Manufacturing using Tableau, Tableau and Goog...Manju Devadas

Fikrimuhal Big Data Analysis Ed Con Europe 2015 PresentationSukru Hasdemir

Machine LearningM Abhishek Dora

Prediction of potential customers for term depositPranov Mishra

How to graphical calaculators (1)Angela Phillips

Rethinking product lifecycle curves to fight commoditizationThomas Emrich

Automating Discounting StrategiesKatharineStevenson

HackathonMadhumitha Chandrasekar

Master the essentials of conversion optimization Steve Clough

What's hot (14)

accounting

Webinar: How machine learning can impact manufacturing industry?

Resume-Predicting Profitability and Customer Preference Presentation-Brian Bu...

Bringing big data to life

Managing uncertainty in ai performance target setting

Machine Learning Application to Manufacturing using Tableau, Tableau and Goog...

Fikrimuhal Big Data Analysis Ed Con Europe 2015 Presentation

Machine Learning

Prediction of potential customers for term deposit

How to graphical calaculators (1)

Rethinking product lifecycle curves to fight commoditization

Automating Discounting Strategies

Hackathon

Master the essentials of conversion optimization

Similar to BigML Education Program Evaluations

The 8 Step Data Mining ProcessMarc Berman

Using minitab for Superior Quality in Food ManufacturingMinitab, LLC

Barga Data Science lecture 10Roger Barga

Hair_EOMA_1e_Chap001_PPT.pptxAsadAli104515

Module 4: Model Selection and EvaluationSara Hooker

Data mining - Machine LearningRupaDutta3

Statistical Learning and Model Selection (1).pptxrajalakshmi5921

Model Validation Magnify Analytic Solutions

The Life Cycle Of Data Science PPT.pdfPhurba Sherpa

Total Quality Management & Six SigmaShashank Varun

segmentdaSteven Cosgrove, Ph.D.

E bay amplify_finalMaria Stone

How Will Your ML Project FailElena Samuylova

Rapid Optimization Application Development Using Excel and SolverMichael Mina

Types of Machine Learning- Tanvir Siddike MoinTanvir Moin

IRJET- Improving Prediction of Potential Clients for Bank Term Deposits using...IRJET Journal

Model validation techniques in machine learning.pdfAnastasiaSteele10

ML in GRC: Supporting Human Decision Making for Regulatory Adherence with Mac...BigML, Inc

Predictive Analysis PowerPoint Presentation SlidesSlideTeam

Google Analytics 4 Setup CourseHackalogy

Similar to BigML Education Program Evaluations (20)

The 8 Step Data Mining Process

Using minitab for Superior Quality in Food Manufacturing

Barga Data Science lecture 10

Hair_EOMA_1e_Chap001_PPT.pptx

Module 4: Model Selection and Evaluation

Data mining - Machine Learning

Statistical Learning and Model Selection (1).pptx

Model Validation

The Life Cycle Of Data Science PPT.pdf

Total Quality Management & Six Sigma

segmentda

E bay amplify_final

How Will Your ML Project Fail

Rapid Optimization Application Development Using Excel and Solver

Types of Machine Learning- Tanvir Siddike Moin

IRJET- Improving Prediction of Potential Clients for Bank Term Deposits using...

Model validation techniques in machine learning.pdf

ML in GRC: Supporting Human Decision Making for Regulatory Adherence with Mac...

Predictive Analysis PowerPoint Presentation Slides

Google Analytics 4 Setup Course

Recently uploaded

Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083

Spark3's new memory model/managementakshesh doshi

代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo

(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat

E-Commerce Order PredictionShraddha Kamble.pptxBoston Institute of Analytics

Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha

Industrialised data - the key to AI success.pdfLars Albertsson

From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck

VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor

Data Science Jobs and Salaries Analysis.pptxFurkanTasci3

Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda

100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate

RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh

VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure

Brighton SEO | April 2024 | Data StorytellingNeil Barnes

PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava

Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408

04242024_CCC TUG_Joins and Relationshipsccctableauusergroup

Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...shivangimorya083

Russian Call Girls Dwarka Sector 15 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure

Recently uploaded (20)

Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call

Spark3's new memory model/management

代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改

(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service

E-Commerce Order PredictionShraddha Kamble.pptx

Call Girls In Mahipalpur O9654467111 Escorts Service

Industrialised data - the key to AI success.pdf

From idea to production in a day – Leveraging Azure ML and Streamlit to build...

VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati

Data Science Jobs and Salaries Analysis.pptx

Customer Service Analytics - Make Sense of All Your Data.pptx

100-Concepts-of-AI by Anupama Kate .pptx

RA-11058_IRR-COMPRESS Do 198 series of 1998

VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...

Brighton SEO | April 2024 | Data Storytelling

PKS-TGC-1084-630 - Stage 1 Proposal.pptx

Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps

04242024_CCC TUG_Joins and Relationships

Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...

Russian Call Girls Dwarka Sector 15 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...

BigML Education Program Evaluations

1. BigML Education Evaluations August 2017

2. BigML Education Program 2Evaluations In This Video • Introduction to and justiﬁcation of model evaluation • Discussion of common missteps when evaluating a supervised model • Creation of a dataset split for training and evaluation of a supervised model using BigML • Creation and interpretation of an evaluation in the BigML user interface

3. BigML Education Program 3Evaluations Evaluation Workﬂow TRAINING DATASET EVALUATION TESTING DATASET + ANALYZE PREDICTIONS SUPERVISED MODEL

4. BigML Education Program 4Evaluations Memorizing Data TRAINED MODEL TRAINING DATASET TESTING DATASET EVALUATE TRAINING DATASET EVALUATE Excellent performance Model already “knows” the data Poor performance if model can’t generalize

5. BigML Education Program 5Evaluations Linear Dataset Split Patient No. Plasma Glucose BMI Pregnancies Diabetes? 1 80 28,5 4 No 2 111 20,7 2 No 3 156 41,0 0 Yes 4 131 35,1 2 Yes 5 70 32,2 1 No 6 86 21,8 4 No 7 42 18,6 0 No 8 150 33,1 6 Yes 9 92 25,9 3 No 10 145 30,7 5 Yes Test Instances Training Instances

6. BigML Education Program 6Evaluations Random Dataset Split Plasma Glucose BMI Pregnancies Diabetes? 80 28,5 4 No 111 20,7 2 No 156 41,0 0 Yes 131 35,1 2 Yes 70 32,2 1 No 86 21,8 4 No 42 18,6 0 No 150 33,1 6 Yes 92 25,9 3 No 145 30,7 5 Yes Test Instances Remainder used for training

7. BigML Education Program 7Evaluations Accuracy Positive Example Negative Example Classified Positive Classified Negative A trivial classifier (always predict negative) achieves 95% accuracy on unbalanced data!

8. BigML Education Program 8Evaluations Accuracy Positive Example Negative Example Classified Positive Classified Negative On a balanced dataset, 95% accuracy indicates a competent classifier

9. BigML Education Program 9Evaluations Mistake Costs • Which is worse in your domain, a false positive or a false negative? • Medical diagnosis • Cost of a false postive: The patient has to undergo more testing to discover they do not have the disease (low cost?) • Cost of a false negative: The patient is declared healthy and the undetected disease progresses (high cost?) • Solution: Select a threshold for positive classiﬁcation that makes the appropriate trade-oﬀ between mistakes

10. BigML Education Program 10Evaluations Review • Evaluations, when done correctly allow you to assess the performance of your learned supervised model • You should never evaluate a model on any of the data used to train that model • BigML allows 1-click splitting of your dataset for proper training and testing • Depending on your domain, metrics other than accuracy may be required to properly understand your model’s performance • The evaluation resource view can be used to select an operating point for the model that makes the correct trade-oﬀ between false positives and false negatives for your domain

BigML Education Program Evaluations

Recommended

Recommended

More Related Content

What's hot

What's hot (14)

Similar to BigML Education Program Evaluations

Similar to BigML Education Program Evaluations (20)

More from BigML, Inc

More from BigML, Inc (20)

Recently uploaded

Recently uploaded (20)

BigML Education Program Evaluations