Evaluation is the last phase of the AI Project Cycle. In this phase, the model's predictions are compared with the actual results to judge how efficient the model is. The evaluation technique used depends on the type and purpose of the AI model; the common techniques are Accuracy, Precision, Recall and F1 Score.
2. WHAT IS EVALUATION?
It is the process of understanding the reliability of
an AI model by feeding a test dataset into the
model and comparing its outputs with the actual
answers.
There are different evaluation techniques,
depending on the type and purpose of the model.
Note: It is not recommended to evaluate the
model with the same data used to build it.
3. MODEL EVALUATION
TERMINOLOGIES
Let's imagine an AI-based prediction model has been
deployed in a forest that is prone to forest fires.
The efficiency of the model is checked using two
conditions: Prediction and Reality.
The prediction is the output given by the machine,
and the reality is the actual scenario in the forest
at the time the prediction was made.
11. THE SCENARIO…
Here, a forest fire has broken out in the forest,
but the machine has incorrectly predicted "No".
This case is therefore a False Negative.
12. THE CONFUSION MATRIX
The result of comparing the prediction with the
reality is recorded in what is called the
confusion matrix.
It is not an evaluation metric itself, but a record
which helps in evaluation.
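The tallying described above can be sketched in a few lines of Python. The sample predictions and realities below are made-up values for illustration, not data from the slides.

```python
# Tally a 2x2 confusion matrix by comparing Prediction with Reality.
# The sample lists are made-up values for illustration.
predictions = ["Yes", "No", "Yes", "No", "No", "Yes"]
reality     = ["Yes", "No", "No",  "No", "Yes", "Yes"]

tp = tn = fp = fn = 0
for pred, real in zip(predictions, reality):
    if pred == "Yes" and real == "Yes":
        tp += 1   # True Positive: predicted fire, fire occurred
    elif pred == "No" and real == "No":
        tn += 1   # True Negative: predicted no fire, no fire occurred
    elif pred == "Yes" and real == "No":
        fp += 1   # False Positive: a false alarm
    else:
        fn += 1   # False Negative: a missed fire

print(tp, tn, fp, fn)  # → 2 2 1 1
```

All four evaluation techniques in the following slides are computed from these four counts.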
16. ACCURACY
It is the percentage of correct predictions out
of all the observations.
A prediction can be said to be correct if it
matches the reality.
Here, we have two conditions in which the
Prediction matches with the Reality: True
Positive and True Negative.
17. ACCURACY…
Example:
Assume that the model always predicts that
there is no fire, but in reality there is a 2%
chance of a forest fire breaking out. Out of 100
cases, the model will be right for the 98 cases
with no fire, but wrong for the 2 cases in which
a forest fire actually broke out, since it still
predicted no fire.
Here, TP = 0, TN = 98, Total cases = 100.
Therefore, Accuracy = (98 + 0) / 100 = 98%.
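The arithmetic from this slide can be checked directly; the counts below are exactly those of the always-"no fire" example.

```python
# Accuracy for the always-"no fire" model over 100 cases (from the slide):
# 98 True Negatives, 2 False Negatives, no positives predicted at all.
tp, tn, fp, fn = 0, 98, 0, 2

accuracy = (tp + tn) / (tp + tn + fp + fn)
print(f"{accuracy:.0%}")  # → 98%
```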
18. ACCURACY…
This is a fairly high accuracy for an AI model, but the
parameter is useless here, as the 2 cases where the fire
actually broke out are not taken into account at all. Hence,
there is a need for another parameter which takes such
cases into account as well.
19. PRECISION
It is the percentage of True Positive cases out
of all the cases where the prediction is positive.
It takes into account the True Positives and
False Positives.
20. PRECISION…
Example: Assume that the model always predicts
that there is a forest fire, irrespective of the
reality. In this case, all the positive conditions
are taken into account, that is, True Positive
(Prediction = Yes and Reality = Yes) and False
Positive (Prediction = Yes and Reality = No).
In this case, the firefighters will check for a
fire every time, to see whether the alarm was
true or false.
21. PRECISION…
This makes Precision an important evaluation
criterion. If Precision is high, most of the
positive predictions are True Positives, which
means fewer false alarms.
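Precision is computed as TP / (TP + FP). The counts below are made-up values for an over-alarming model like the one described on this slide, chosen only for illustration.

```python
# Precision = TP / (TP + FP):
# of all the "fire" alarms raised, how many were real fires?
# Made-up counts for an over-alarming model: 50 alarms, 10 real fires.
tp, fp = 10, 40

precision = tp / (tp + fp)
print(precision)  # → 0.2
```

A Precision of 0.2 means only 1 in 5 alarms was a real fire, so the firefighters waste 4 out of every 5 trips.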
22. RECALL
It is the fraction of positive cases that are
correctly identified.
It takes into account the True Positives and
False Negatives (the real fires the model missed).
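Recall is computed as TP / (TP + FN). As a sketch, the always-"fire" model from the Precision example misses no real fires, so its Recall is perfect; the counts are illustrative.

```python
# Recall = TP / (TP + FN):
# of all the real fires, how many did the model catch?
# An always-"Yes" model misses nothing: FN = 0 (illustrative counts).
tp, fn = 10, 0

recall = tp / (tp + fn)
print(recall)  # → 1.0
```

This shows why Recall alone is also not enough: the always-"Yes" model has perfect Recall but terrible Precision.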
23. F1 SCORE
It is the measure of balance between
Precision and Recall (their harmonic mean).
An ideal situation would be when we have a
value of 1 (that is, 100%) for both Precision
and Recall. In that case, the F1 Score would
also be an ideal 1 (100%), known as the
perfect value for the F1 Score.
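The balance described above is the harmonic mean, F1 = 2 × Precision × Recall / (Precision + Recall). A minimal sketch, reusing the illustrative Precision and Recall values from the previous examples:

```python
# F1 Score = harmonic mean of Precision and Recall.
def f1_score(precision, recall):
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both are 0
    return 2 * precision * recall / (precision + recall)

print(f1_score(1.0, 1.0))            # → 1.0  (the ideal case from the slide)
print(round(f1_score(0.2, 1.0), 3))  # → 0.333 (the always-"Yes" model)
```

Note how the harmonic mean punishes imbalance: even with perfect Recall, the poor Precision of 0.2 drags the F1 Score down to about 0.33.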
24. QUESTIONS
Find out Accuracy, Precision, Recall and F1 Score for the
given problems.