SlideShare a Scribd company logo
FAIR AI
Image by James Sutherland from Pixabay
Identifying and mitigating
bias in machine learning
2
Comete Team
Privacy Fairness Causality
Catuscia Palamidessi
Director of Comete team at
Inria Saclay-Ile-de-France,
Ecole Polytechnique
Sami Zhioua
Senior Researcher at Inria
Saclay-Ile-de-France, Ecole
Polytechnique
Ruta Binkyte
Ph.D. Student at Inria
Saclay-Ile-de-France, Ecole
Polytechnique, IP Paris
3
Table of Contents
01
02
03
Sources and types of bias
Bias in AI case studies
Bias mitigation
Photo by Maximalfocus on Unsplash
4
The Case Studies
Photo by ThisisEngineering RAEng on Unsplash
5
What happened?
The algorithms for face recognition have
much lower accuracy for darker female
faces.
Racial bias
in computer
vision Why?
Black women underrepresented in the
training data.
6
What happened?
The algorithm built to predict the need
for medical interventions (sickness)
would give lower score for black patients
who where the same or more sick than
the white ones.
Racial bias in
healthcare AI
Why?
The proxy used for “sickness” was
healthcare spending, which correlated
race.
Obermeyer, Z., Powers, B., Vogeli, C., & Mullainathan, S. (2019). Dissecting racial bias in an algorithm used to manage the health of populations. Science, 366(6464), 447-453.
7
Sources and
Types of Bias
Photo by Possessed Photography on Unsplash
8
The Learning Process
Raw Data
Features/
Attributes
The Prediction
The Decision
Feed Back Loop
Machine Learning Process
9
A group of is underrepresented
in a sample.
Types of Bias
Re p r e s e n t a t i o n B i a s
A historically disadvantaged group has
lower occurrence of positive labels
H i s t o r i c a l B i a s
Sample
population
https://www.britannica.com/topic/racial-segregation
10
When under-representation
can affect discrimination? <3%
Common for
Intersectional
Sensitive
attribute
Zhioua, S., & Binkytė, R. (2023). Shedding light on underrepresentation and Sampling Bias in machine learning. arXiv preprint arXiv:2306.05068.
11
Bias Mitigation
Photo by Aideal Hwa on Unsplash
12
Representation Bias:
Does data augmentation
always solve the problem?
“When representation bias is combined
with historical bias, data augmentation
can increase the disparity ”
Zhioua, S., & Binkytė, R. (2023). Shedding light on underrepresentation and Sampling Bias in machine learning. arXiv preprint arXiv:2306.05068.
13
When representation
bias is combined with
historical bias, the
oversampling can
increase the bias
Zhioua, S., & Binkytė, R. (2023). Shedding light on underrepresentation and Sampling Bias in machine learning. arXiv preprint arXiv:2306.05068.
Discrimination while augmenting
the training set with female group
samples randomly. The male group
size is
fi
xed at 100. Dataset is
Dutch Census and training
algorithm is logistic regression.
Discrimination while augmenting
the training set with only positive
outcome female group samples.
The male group size is
fi
xed at
100. Dataset is Dutch Census and
training algorithm is logistic
regression.
Random
augmentation vs.
Positive label
augmentation
14
Most bias mitigation algorithms aim to satisfy
Statistical Parity
Photo by Possessed Photography on Unsplash
P( ̂
Y = 1|S = 0) = P( ̂
Y = 1|S = 1)
Statistical Parity
Where is the prediction, S is the sensitive
attribute
̂
Y
15
Historical Bias:
Does “equal” always
mean “fair”?
“Black patients on average have 26.3%
more chronic diseases than the white”
Obermeyer, Z., Powers, B., Vogeli, C., & Mullainathan, S. (2019). Dissecting racial bias in an algorithm used to manage the health of populations. Science, 366(6464), 447-453.
16
More nuanced fairness notions:
P( ̂
Y = 1|S = 0,E) = P( ̂
Y = 1|S = 1,E)
Conditional Statistical Parity
Where is the prediction, S is the sensitive attribute, E is
explanatory attribute and Y is the label
̂
Y
Equal Opportunity
P( ̂
Y = 1|S = 0,Y = 1) = P( ̂
Y = 1|S = 1,Y = 1)
Photo by Possessed Photography on Unsplash
17
When representation
bias is combined with
historical bias, the
oversampling can
increase the bias
BaBE:
Bayesian Bias
Elimination
Binkytė, R., Gorla, D., Palamidessi, C. (2023). BaBE: Enhancing Fairness via Estimation of Latent Explaining Variables
a) Causal relation between the
variables , and the decision
based on .
S, E, Z
YZ Z
b) Derivation of . The
decision is then based directly
on .
E|S, Z
YE
E
18
Results for Disparate Impact Remover and BaBE
The original distributions P [E|S = 0] (green), P
[E|S = 1] (orange) and the distributions of the E
computed by Disparate Impact Remover (blue
and magenta) for S=0. The probability for higher
values for S=0 is increased minimally to match
the distribution of S=1
The original distributions P [E|S = 0] (green),
P [E|S = 1] (orange) and the distributions of
the E estimated by BaBE (blue and magenta)
for S=0. BaBE accurately matches the true
distributions both for S=0 and S=1
The distributions of E (green) and biased
Z (blue) for S = 0, and for S=1 E (orange),
biased Z (magenta).
S=1 S=0
19
Questions?
Suggestions?
Collaboration?
http://www.lix.polytechnique.fr/Labo/
Ruta.BINKYTE-SADAUSKIENE/
ruta.binkyte@gmail.com 0659274196
Photo by Possessed Photography on Unsplash
Please reach out!
20
THANK YOU
Thank You !
Photo by Aideal Hwa on Unsplash

More Related Content

More from Paris Women in Machine Learning and Data Science

Sequential and reinforcement learning for demand side management by Margaux B...
Sequential and reinforcement learning for demand side management by Margaux B...Sequential and reinforcement learning for demand side management by Margaux B...
Sequential and reinforcement learning for demand side management by Margaux B...
Paris Women in Machine Learning and Data Science
 
How and why AI should fight cybersexism, by Chloe Daudier
How and why AI should fight cybersexism, by Chloe DaudierHow and why AI should fight cybersexism, by Chloe Daudier
How and why AI should fight cybersexism, by Chloe Daudier
Paris Women in Machine Learning and Data Science
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
Paris Women in Machine Learning and Data Science
 
Managing international tech teams, by Natasha Dimban
Managing international tech teams, by Natasha DimbanManaging international tech teams, by Natasha Dimban
Managing international tech teams, by Natasha Dimban
Paris Women in Machine Learning and Data Science
 
Optimizing GenAI apps, by N. El Mawass and Maria Knorps
Optimizing GenAI apps, by N. El Mawass and Maria KnorpsOptimizing GenAI apps, by N. El Mawass and Maria Knorps
Optimizing GenAI apps, by N. El Mawass and Maria Knorps
Paris Women in Machine Learning and Data Science
 
Perspectives, by M. Pannegeon
Perspectives, by M. PannegeonPerspectives, by M. Pannegeon
Evaluation strategies for dealing with partially labelled or unlabelled data
Evaluation strategies for dealing with partially labelled or unlabelled dataEvaluation strategies for dealing with partially labelled or unlabelled data
Evaluation strategies for dealing with partially labelled or unlabelled data
Paris Women in Machine Learning and Data Science
 
Combinatorial Optimisation with Policy Adaptation using latent Space Search, ...
Combinatorial Optimisation with Policy Adaptation using latent Space Search, ...Combinatorial Optimisation with Policy Adaptation using latent Space Search, ...
Combinatorial Optimisation with Policy Adaptation using latent Space Search, ...
Paris Women in Machine Learning and Data Science
 
An age-old question, by Caroline Jean-Pierre
An age-old question, by Caroline Jean-PierreAn age-old question, by Caroline Jean-Pierre
An age-old question, by Caroline Jean-Pierre
Paris Women in Machine Learning and Data Science
 
Applying Churn Prediction Approaches to the Telecom Industry, by Joëlle Lautré
Applying Churn Prediction Approaches to the Telecom Industry, by Joëlle LautréApplying Churn Prediction Approaches to the Telecom Industry, by Joëlle Lautré
Applying Churn Prediction Approaches to the Telecom Industry, by Joëlle Lautré
Paris Women in Machine Learning and Data Science
 
How to supervise a thesis in NLP in the ChatGPT era? By Laure Soulier
How to supervise a thesis in NLP in the ChatGPT era? By Laure SoulierHow to supervise a thesis in NLP in the ChatGPT era? By Laure Soulier
How to supervise a thesis in NLP in the ChatGPT era? By Laure Soulier
Paris Women in Machine Learning and Data Science
 
Global Ambitions Local Realities, by Anna Abreu
Global Ambitions Local Realities, by Anna AbreuGlobal Ambitions Local Realities, by Anna Abreu
Global Ambitions Local Realities, by Anna Abreu
Paris Women in Machine Learning and Data Science
 
Plug-and-Play methods for inverse problems in imagine, by Julie Delon
Plug-and-Play methods for inverse problems in imagine, by Julie DelonPlug-and-Play methods for inverse problems in imagine, by Julie Delon
Plug-and-Play methods for inverse problems in imagine, by Julie Delon
Paris Women in Machine Learning and Data Science
 
Sales Forecasting as a Data Product by Francesca Iannuzzi
Sales Forecasting as a Data Product by Francesca IannuzziSales Forecasting as a Data Product by Francesca Iannuzzi
Sales Forecasting as a Data Product by Francesca Iannuzzi
Paris Women in Machine Learning and Data Science
 
“Turning your ML algorithms into full web apps in no time with Python" by Mar...
“Turning your ML algorithms into full web apps in no time with Python" by Mar...“Turning your ML algorithms into full web apps in no time with Python" by Mar...
“Turning your ML algorithms into full web apps in no time with Python" by Mar...
Paris Women in Machine Learning and Data Science
 
Nature Language Processing for proteins by Amélie Héliou, Software Engineer @...
Nature Language Processing for proteins by Amélie Héliou, Software Engineer @...Nature Language Processing for proteins by Amélie Héliou, Software Engineer @...
Nature Language Processing for proteins by Amélie Héliou, Software Engineer @...
Paris Women in Machine Learning and Data Science
 
Sandrine Henry presents the BechdelAI project
Sandrine Henry presents the BechdelAI projectSandrine Henry presents the BechdelAI project
Sandrine Henry presents the BechdelAI project
Paris Women in Machine Learning and Data Science
 
Anastasiia Tryputen_War in Ukraine or how extraordinary courage reshapes geop...
Anastasiia Tryputen_War in Ukraine or how extraordinary courage reshapes geop...Anastasiia Tryputen_War in Ukraine or how extraordinary courage reshapes geop...
Anastasiia Tryputen_War in Ukraine or how extraordinary courage reshapes geop...
Paris Women in Machine Learning and Data Science
 
Khrystyna Grynko WiMLDS - From marketing to Tech.pdf
Khrystyna Grynko WiMLDS - From marketing to Tech.pdfKhrystyna Grynko WiMLDS - From marketing to Tech.pdf
Khrystyna Grynko WiMLDS - From marketing to Tech.pdf
Paris Women in Machine Learning and Data Science
 
Iana Iatsun_ML in production_20Dec2022.pdf
Iana Iatsun_ML in production_20Dec2022.pdfIana Iatsun_ML in production_20Dec2022.pdf
Iana Iatsun_ML in production_20Dec2022.pdf
Paris Women in Machine Learning and Data Science
 

More from Paris Women in Machine Learning and Data Science (20)

Sequential and reinforcement learning for demand side management by Margaux B...
Sequential and reinforcement learning for demand side management by Margaux B...Sequential and reinforcement learning for demand side management by Margaux B...
Sequential and reinforcement learning for demand side management by Margaux B...
 
How and why AI should fight cybersexism, by Chloe Daudier
How and why AI should fight cybersexism, by Chloe DaudierHow and why AI should fight cybersexism, by Chloe Daudier
How and why AI should fight cybersexism, by Chloe Daudier
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Managing international tech teams, by Natasha Dimban
Managing international tech teams, by Natasha DimbanManaging international tech teams, by Natasha Dimban
Managing international tech teams, by Natasha Dimban
 
Optimizing GenAI apps, by N. El Mawass and Maria Knorps
Optimizing GenAI apps, by N. El Mawass and Maria KnorpsOptimizing GenAI apps, by N. El Mawass and Maria Knorps
Optimizing GenAI apps, by N. El Mawass and Maria Knorps
 
Perspectives, by M. Pannegeon
Perspectives, by M. PannegeonPerspectives, by M. Pannegeon
Perspectives, by M. Pannegeon
 
Evaluation strategies for dealing with partially labelled or unlabelled data
Evaluation strategies for dealing with partially labelled or unlabelled dataEvaluation strategies for dealing with partially labelled or unlabelled data
Evaluation strategies for dealing with partially labelled or unlabelled data
 
Combinatorial Optimisation with Policy Adaptation using latent Space Search, ...
Combinatorial Optimisation with Policy Adaptation using latent Space Search, ...Combinatorial Optimisation with Policy Adaptation using latent Space Search, ...
Combinatorial Optimisation with Policy Adaptation using latent Space Search, ...
 
An age-old question, by Caroline Jean-Pierre
An age-old question, by Caroline Jean-PierreAn age-old question, by Caroline Jean-Pierre
An age-old question, by Caroline Jean-Pierre
 
Applying Churn Prediction Approaches to the Telecom Industry, by Joëlle Lautré
Applying Churn Prediction Approaches to the Telecom Industry, by Joëlle LautréApplying Churn Prediction Approaches to the Telecom Industry, by Joëlle Lautré
Applying Churn Prediction Approaches to the Telecom Industry, by Joëlle Lautré
 
How to supervise a thesis in NLP in the ChatGPT era? By Laure Soulier
How to supervise a thesis in NLP in the ChatGPT era? By Laure SoulierHow to supervise a thesis in NLP in the ChatGPT era? By Laure Soulier
How to supervise a thesis in NLP in the ChatGPT era? By Laure Soulier
 
Global Ambitions Local Realities, by Anna Abreu
Global Ambitions Local Realities, by Anna AbreuGlobal Ambitions Local Realities, by Anna Abreu
Global Ambitions Local Realities, by Anna Abreu
 
Plug-and-Play methods for inverse problems in imagine, by Julie Delon
Plug-and-Play methods for inverse problems in imagine, by Julie DelonPlug-and-Play methods for inverse problems in imagine, by Julie Delon
Plug-and-Play methods for inverse problems in imagine, by Julie Delon
 
Sales Forecasting as a Data Product by Francesca Iannuzzi
Sales Forecasting as a Data Product by Francesca IannuzziSales Forecasting as a Data Product by Francesca Iannuzzi
Sales Forecasting as a Data Product by Francesca Iannuzzi
 
“Turning your ML algorithms into full web apps in no time with Python" by Mar...
“Turning your ML algorithms into full web apps in no time with Python" by Mar...“Turning your ML algorithms into full web apps in no time with Python" by Mar...
“Turning your ML algorithms into full web apps in no time with Python" by Mar...
 
Nature Language Processing for proteins by Amélie Héliou, Software Engineer @...
Nature Language Processing for proteins by Amélie Héliou, Software Engineer @...Nature Language Processing for proteins by Amélie Héliou, Software Engineer @...
Nature Language Processing for proteins by Amélie Héliou, Software Engineer @...
 
Sandrine Henry presents the BechdelAI project
Sandrine Henry presents the BechdelAI projectSandrine Henry presents the BechdelAI project
Sandrine Henry presents the BechdelAI project
 
Anastasiia Tryputen_War in Ukraine or how extraordinary courage reshapes geop...
Anastasiia Tryputen_War in Ukraine or how extraordinary courage reshapes geop...Anastasiia Tryputen_War in Ukraine or how extraordinary courage reshapes geop...
Anastasiia Tryputen_War in Ukraine or how extraordinary courage reshapes geop...
 
Khrystyna Grynko WiMLDS - From marketing to Tech.pdf
Khrystyna Grynko WiMLDS - From marketing to Tech.pdfKhrystyna Grynko WiMLDS - From marketing to Tech.pdf
Khrystyna Grynko WiMLDS - From marketing to Tech.pdf
 
Iana Iatsun_ML in production_20Dec2022.pdf
Iana Iatsun_ML in production_20Dec2022.pdfIana Iatsun_ML in production_20Dec2022.pdf
Iana Iatsun_ML in production_20Dec2022.pdf
 

Recently uploaded

DESIGN AND ANALYSIS OF A CAR SHOWROOM USING E TABS
DESIGN AND ANALYSIS OF A CAR SHOWROOM USING E TABSDESIGN AND ANALYSIS OF A CAR SHOWROOM USING E TABS
DESIGN AND ANALYSIS OF A CAR SHOWROOM USING E TABS
itech2017
 
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressionsKuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
Victor Morales
 
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
AJAYKUMARPUND1
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
zwunae
 
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdfAKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
SamSarthak3
 
PPT on GRP pipes manufacturing and testing
PPT on GRP pipes manufacturing and testingPPT on GRP pipes manufacturing and testing
PPT on GRP pipes manufacturing and testing
anoopmanoharan2
 
Building Electrical System Design & Installation
Building Electrical System Design & InstallationBuilding Electrical System Design & Installation
Building Electrical System Design & Installation
symbo111
 
MCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdfMCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdf
Osamah Alsalih
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
Kamal Acharya
 
Fundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptxFundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptx
manasideore6
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
Kamal Acharya
 
Fundamentals of Induction Motor Drives.pptx
Fundamentals of Induction Motor Drives.pptxFundamentals of Induction Motor Drives.pptx
Fundamentals of Induction Motor Drives.pptx
manasideore6
 
Water billing management system project report.pdf
Water billing management system project report.pdfWater billing management system project report.pdf
Water billing management system project report.pdf
Kamal Acharya
 
Unbalanced Three Phase Systems and circuits.pptx
Unbalanced Three Phase Systems and circuits.pptxUnbalanced Three Phase Systems and circuits.pptx
Unbalanced Three Phase Systems and circuits.pptx
ChristineTorrepenida1
 
Modelagem de um CSTR com reação endotermica.pdf
Modelagem de um CSTR com reação endotermica.pdfModelagem de um CSTR com reação endotermica.pdf
Modelagem de um CSTR com reação endotermica.pdf
camseq
 
14 Template Contractual Notice - EOT Application
14 Template Contractual Notice - EOT Application14 Template Contractual Notice - EOT Application
14 Template Contractual Notice - EOT Application
SyedAbiiAzazi1
 
Recycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part IIIRecycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part III
Aditya Rajan Patra
 
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdfGoverning Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
WENKENLI1
 
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
ydteq
 
basic-wireline-operations-course-mahmoud-f-radwan.pdf
basic-wireline-operations-course-mahmoud-f-radwan.pdfbasic-wireline-operations-course-mahmoud-f-radwan.pdf
basic-wireline-operations-course-mahmoud-f-radwan.pdf
NidhalKahouli2
 

Recently uploaded (20)

DESIGN AND ANALYSIS OF A CAR SHOWROOM USING E TABS
DESIGN AND ANALYSIS OF A CAR SHOWROOM USING E TABSDESIGN AND ANALYSIS OF A CAR SHOWROOM USING E TABS
DESIGN AND ANALYSIS OF A CAR SHOWROOM USING E TABS
 
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressionsKuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
 
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
 
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdfAKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
 
PPT on GRP pipes manufacturing and testing
PPT on GRP pipes manufacturing and testingPPT on GRP pipes manufacturing and testing
PPT on GRP pipes manufacturing and testing
 
Building Electrical System Design & Installation
Building Electrical System Design & InstallationBuilding Electrical System Design & Installation
Building Electrical System Design & Installation
 
MCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdfMCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdf
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
 
Fundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptxFundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptx
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
 
Fundamentals of Induction Motor Drives.pptx
Fundamentals of Induction Motor Drives.pptxFundamentals of Induction Motor Drives.pptx
Fundamentals of Induction Motor Drives.pptx
 
Water billing management system project report.pdf
Water billing management system project report.pdfWater billing management system project report.pdf
Water billing management system project report.pdf
 
Unbalanced Three Phase Systems and circuits.pptx
Unbalanced Three Phase Systems and circuits.pptxUnbalanced Three Phase Systems and circuits.pptx
Unbalanced Three Phase Systems and circuits.pptx
 
Modelagem de um CSTR com reação endotermica.pdf
Modelagem de um CSTR com reação endotermica.pdfModelagem de um CSTR com reação endotermica.pdf
Modelagem de um CSTR com reação endotermica.pdf
 
14 Template Contractual Notice - EOT Application
14 Template Contractual Notice - EOT Application14 Template Contractual Notice - EOT Application
14 Template Contractual Notice - EOT Application
 
Recycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part IIIRecycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part III
 
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdfGoverning Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
 
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
 
basic-wireline-operations-course-mahmoud-f-radwan.pdf
basic-wireline-operations-course-mahmoud-f-radwan.pdfbasic-wireline-operations-course-mahmoud-f-radwan.pdf
basic-wireline-operations-course-mahmoud-f-radwan.pdf
 

Identifying and mitigating bias in machine learning, by Ruta Binkyte

  • 1. FAIR AI Image by James Sutherland from Pixabay Identifying and mitigating bias in machine learning
  • 2. 2 Comete Team Privacy Fairness Causality Catuscia Palamidessi Director of Comete team at Inria Saclay-Ile-de-France, Ecole Polytechnique Sami Zhioua Senior Researcher at Inria Saclay-Ile-de-France, Ecole Polytechnique Ruta Binkyte Ph.D. Student at Inria Saclay-Ile-de-France, Ecole Polytechnique, IP Paris
  • 3. 3 Table of Contents 01 02 03 Sources and types of bias Bias in AI case studies Bias mitigation Photo by Maximalfocus on Unsplash
  • 4. 4 The Case Studies Photo by ThisisEngineering RAEng on Unsplash
  • 5. 5 What happened? The algorithms for face recognition have much lower accuracy for darker female faces. Racial bias in computer vision Why? Black women underrepresented in the training data.
  • 6. 6 What happened? The algorithm built to predict the need for medical interventions (sickness) would give lower score for black patients who where the same or more sick than the white ones. Racial bias in healthcare AI Why? The proxy used for “sickness” was healthcare spending, which correlated race. Obermeyer, Z., Powers, B., Vogeli, C., & Mullainathan, S. (2019). Dissecting racial bias in an algorithm used to manage the health of populations. Science, 366(6464), 447-453.
  • 7. 7 Sources and Types of Bias Photo by Possessed Photography on Unsplash
  • 8. 8 The Learning Process Raw Data Features/ Attributes The Prediction The Decision Feed Back Loop Machine Learning Process
  • 9. 9 A group of is underrepresented in a sample. Types of Bias Re p r e s e n t a t i o n B i a s A historically disadvantaged group has lower occurrence of positive labels H i s t o r i c a l B i a s Sample population https://www.britannica.com/topic/racial-segregation
  • 10. 10 When under-representation can affect discrimination? <3% Common for Intersectional Sensitive attribute Zhioua, S., & Binkytė, R. (2023). Shedding light on underrepresentation and Sampling Bias in machine learning. arXiv preprint arXiv:2306.05068.
  • 11. 11 Bias Mitigation Photo by Aideal Hwa on Unsplash
  • 12. 12 Representation Bias: Does data augmentation always solve the problem? “When representation bias is combined with historical bias, data augmentation can increase the disparity ” Zhioua, S., & Binkytė, R. (2023). Shedding light on underrepresentation and Sampling Bias in machine learning. arXiv preprint arXiv:2306.05068.
  • 13. 13 When representation bias is combined with historical bias, the oversampling can increase the bias Zhioua, S., & Binkytė, R. (2023). Shedding light on underrepresentation and Sampling Bias in machine learning. arXiv preprint arXiv:2306.05068. Discrimination while augmenting the training set with female group samples randomly. The male group size is fi xed at 100. Dataset is Dutch Census and training algorithm is logistic regression. Discrimination while augmenting the training set with only positive outcome female group samples. The male group size is fi xed at 100. Dataset is Dutch Census and training algorithm is logistic regression. Random augmentation vs. Positive label augmentation
  • 14. 14 Most bias mitigation algorithms aim to satisfy Statistical Parity Photo by Possessed Photography on Unsplash P( ̂ Y = 1|S = 0) = P( ̂ Y = 1|S = 1) Statistical Parity Where is the prediction, S is the sensitive attribute ̂ Y
  • 15. 15 Historical Bias: Does “equal” always mean “fair”? “Black patients on average have 26.3% more chronic diseases than the white” Obermeyer, Z., Powers, B., Vogeli, C., & Mullainathan, S. (2019). Dissecting racial bias in an algorithm used to manage the health of populations. Science, 366(6464), 447-453.
  • 16. 16 More nuanced fairness notions: P( ̂ Y = 1|S = 0,E) = P( ̂ Y = 1|S = 1,E) Conditional Statistical Parity Where is the prediction, S is the sensitive attribute, E is explanatory attribute and Y is the label ̂ Y Equal Opportunity P( ̂ Y = 1|S = 0,Y = 1) = P( ̂ Y = 1|S = 1,Y = 1) Photo by Possessed Photography on Unsplash
  • 17. 17 When representation bias is combined with historical bias, the oversampling can increase the bias BaBE: Bayesian Bias Elimination Binkytė, R., Gorla, D., Palamidessi, C. (2023). BaBE: Enhancing Fairness via Estimation of Latent Explaining Variables a) Causal relation between the variables , and the decision based on . S, E, Z YZ Z b) Derivation of . The decision is then based directly on . E|S, Z YE E
  • 18. 18 Results for Disparate Impact Remover and BaBE The original distributions P [E|S = 0] (green), P [E|S = 1] (orange) and the distributions of the E computed by Disparate Impact Remover (blue and magenta) for S=0. The probability for higher values for S=0 is increased minimally to match the distribution of S=1 The original distributions P [E|S = 0] (green), P [E|S = 1] (orange) and the distributions of the E estimated by BaBE (blue and magenta) for S=0. BaBE accurately matches the true distributions both for S=0 and S=1 The distributions of E (green) and biased Z (blue) for S = 0, and for S=1 E (orange), biased Z (magenta). S=1 S=0
  • 20. 20 THANK YOU Thank You ! Photo by Aideal Hwa on Unsplash