Classification metrics

•

1 like•723 views

The document discusses various evaluation metrics that can be used for binary classification and click prediction, including AUC, RIG, LogLoss, precision, recall, and F1. It notes that AUC ignores predicted probabilities and considers type 1 and type 2 errors equally. RIG is bad for comparing models with different data distributions but can be used to compare multiple models trained on the same data. The document also provides a reference for more information on offline and online predictive model performance evaluations.

EVALUATION METRICS
FOR CLICK PREDICTION
Evgeniy Zhurin, RuTarget

Binary Classification Error Measurement
1) AUC
2) RIG
3) LogLoss
4) Precision/Recall
5) F1
6) PE, MSE, MAE

AUC
1) ignores the predicted probability values
2) usually we are interested in parts of
roc curve
3) considers Type 1 error and Type 2
error weights equivalently
4) dependent on the underlying distribution
of data

RIG
1) bad to compare two model
performances with different
distributions
2) can be used to compare the relative
performance of multiple models trained
and tested on the same data
3) is not informative, because score also
depends on the data distribution

Thanks!
J. Yi, Y. Chen, J. Li, S. Sett, and T. W. Yan.
Predictive model performance: Offline and
online evaluations. In KDD, pages 1294–1302,
2013.
http://chbrown.github.io/kdd-2013-
usb/kdd/p1294.pdf

What's hot

Machine Learning 3 - Decision Tree Learningbutest

Overfitting & UnderfittingSOUMIT KAR

Deep Learning for Computer Vision: Data Augmentation (UPC 2016)Universitat Politècnica de Catalunya

Data preprocessing in Machine learning pyingkodi maran

Decision trees in Machine Learning Mohammad Junaid Khan

Classification using back propagation algorithmKIRAN R

Feature selectionDong Guo

Probabilistic ReasoningJunya Tanaka

Dimensionality ReductionSaad Elbeleidy

“An Introduction to Data Augmentation Techniques in ML Frameworks,” a Present...Edge AI and Vision Alliance

Introduction to Machine Learning ClassifiersFunctional Imperative

Data mining :Concepts and Techniques Chapter 2, dataSalah Amean

Hyperparameter TuningJon Lederman

CnnNirthika Rajendran

Feed forward ,back propagation,gradient descentMuhammad Rasel

Data PreprocessingObject-Frontier Software Pvt. Ltd

Decision Tree LearningMilind Gokhale

Data Mining: Concepts and Techniques — Chapter 2 —Salah Amean

Machine Learning With Logistic RegressionKnoldus Inc.

Introduction to Linear Discriminant AnalysisJaclyn Kokx

What's hot (20)

Machine Learning 3 - Decision Tree Learning

Overfitting & Underfitting

Deep Learning for Computer Vision: Data Augmentation (UPC 2016)

Data preprocessing in Machine learning

Decision trees in Machine Learning

Classification using back propagation algorithm

Feature selection

Probabilistic Reasoning

Dimensionality Reduction

“An Introduction to Data Augmentation Techniques in ML Frameworks,” a Present...

Introduction to Machine Learning Classifiers

Data mining :Concepts and Techniques Chapter 2, data

Hyperparameter Tuning

Cnn

Feed forward ,back propagation,gradient descent

Data Preprocessing

Decision Tree Learning

Data Mining: Concepts and Techniques — Chapter 2 —

Machine Learning With Logistic Regression

Introduction to Linear Discriminant Analysis

Similar to Classification metrics

A New Concurrent Calibration Method For Nonequivalent Group Design Under Nonr...Kathryn Patel

C054Weili Xu

COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...ijiert bestjournal

MULTI-PARAMETER BASED PERFORMANCE EVALUATION OF CLASSIFICATION ALGORITHMSijcsit

PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...IJMIT JOURNAL

A New SDM Classifier Using Jaccard Mining Procedure (CASE STUDY: RHEUMATIC FE...Soaad Abd El-Badie

A new sdm classifier using jaccard mining procedure case study rheumatic feve...ijbbjournal

On Feature Selection Algorithms and Feature Selection Stability Measures : A ...AIRCC Publishing Corporation

ON FEATURE SELECTION ALGORITHMS AND FEATURE SELECTION STABILITY MEASURES: A C...ijcsit

On Feature Selection Algorithms and Feature Selection Stability Measures : A...AIRCC Publishing Corporation

Chapter 8 2Mansooreh Alavi

Engineering Method.pptxBebangMapagmahal

Classification accuracy analyses using Shannon’s EntropyIJERA Editor

An Influence of Measurement Scale of Predictor Variable on Logistic Regressio...IJECEIAES

Detecting Dif Between Conventional And Computerized Adaptive Testing.Pptbarthriley

Feature selection using modified particle swarm optimisation for face recogni...eSAT Journals

A framework for outlier detection inijfcstjournal

A Three-Layer Visual Hash Function Using Adler-32Universitas Pembangunan Panca Budi

Evaluation of image segmentation and filtering with ann in the papaya leafijcsit

Similar to Classification metrics (20)

A New Concurrent Calibration Method For Nonequivalent Group Design Under Nonr...

C054

COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...

MULTI-PARAMETER BASED PERFORMANCE EVALUATION OF CLASSIFICATION ALGORITHMS

PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...

A New SDM Classifier Using Jaccard Mining Procedure (CASE STUDY: RHEUMATIC FE...

A new sdm classifier using jaccard mining procedure case study rheumatic feve...

On Feature Selection Algorithms and Feature Selection Stability Measures : A ...

ON FEATURE SELECTION ALGORITHMS AND FEATURE SELECTION STABILITY MEASURES: A C...

On Feature Selection Algorithms and Feature Selection Stability Measures : A...

Chapter 8 2

Engineering Method.pptx

Classification accuracy analyses using Shannon’s Entropy

An Influence of Measurement Scale of Predictor Variable on Logistic Regressio...

Detecting Dif Between Conventional And Computerized Adaptive Testing.Ppt

Feature selection using modified particle swarm optimisation for face recogni...

A framework for outlier detection in

A Three-Layer Visual Hash Function Using Adler-32

Evaluation of image segmentation and filtering with ann in the papaya leaf

More from SPb_Data_Science

Diabetic Retinopathy DetectionSPb_Data_Science

Эффективные Алгоритмы Поиска Подобных Объектов Для Терабайтов ДанныхSPb_Data_Science

Meetup#4, Apache Spark as SQL Engine SPb_Data_Science

Trending Topics in Recommender SystemsSPb_Data_Science

Meetup#4, Smart.Data@OK.ruSPb_Data_Science

Intro to Deep Reinforcement LearningSPb_Data_Science

Benford’s law & fraud detection slidesSPb_Data_Science

Meetup#2. Intro to Factorization MachinesSPb_Data_Science

Meetup#2. Introduction to Algorithmic TradingSPb_Data_Science

Meetup #1. Trends, talks, cool stuff.SPb_Data_Science

Meetup #1. Building a CNN in Kaggle Data Science BowlSPb_Data_Science

More from SPb_Data_Science (11)

Diabetic Retinopathy Detection

Эффективные Алгоритмы Поиска Подобных Объектов Для Терабайтов Данных

Meetup#4, Apache Spark as SQL Engine

Classification metrics

1. EVALUATION METRICS FOR CLICK PREDICTION Evgeniy Zhurin, RuTarget

2. Binary Classification Error Measurement 1) AUC 2) RIG 3) LogLoss 4) Precision/Recall 5) F1 6) PE, MSE, MAE

8. AUC 1) ignores the predicted probability values 2) usually we are interested in parts of roc curve 3) considers Type 1 error and Type 2 error weights equivalently 4) dependent on the underlying distribution of data

10.

11.

12. RIG 1) bad to compare two model performances with different distributions 2) can be used to compare the relative performance of multiple models trained and tested on the same data 3) is not informative, because score also depends on the data distribution

13.

14. OR WRITE A SIMULATOR

15. Thanks! J. Yi, Y. Chen, J. Li, S. Sett, and T. W. Yan. Predictive model performance: Offline and online evaluations. In KDD, pages 1294–1302, 2013. http://chbrown.github.io/kdd-2013- usb/kdd/p1294.pdf

Classification metrics

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Classification metrics

Similar to Classification metrics (20)

More from SPb_Data_Science

More from SPb_Data_Science (11)

Classification metrics