1. Data, algorithms and discrimination
Fairness in AI
daniele regoli
Big Data Lab @ Intesa Sanpaolo
daniele.regoli@intesasanpaolo.com
2022.01.27 Deep Learning Italia
3. The Team
Big Data Lab @ Intesa Sanpaolo, based in Torino and Milano: 49 data scientists. We've been growing quite a lot, from 5 in 2016 to 49 in 2021.
Backgrounds: 30% mathematics, 25% physics, 25% engineering, 10% finance, 10% computer science.
8. Risks for people
AI decisions can have a huge impact on people's lives, e.g. in recruiting or loan approval.
ML could amplify and perpetuate biases already present in the data, at large scale: data act as a social mirror.
ML could disregard minority groups, effectively producing bias even if absent in the data: sample size imbalances.
12. Assessing fairness
There are a lot of different definitions of fairness, in general not compatible with one another:
- observational metrics
  - group fairness (e.g. demographic fairness, error rate parity)
  - individual fairness
- causality-based metrics
13. The basic setup:
- ground truth, e.g. repayment of the loan
- model decision, e.g. grant / don't grant the loan
- sensitive feature(s), e.g. citizenship, gender
- non-sensitive feature(s), e.g. income
- the Machine Learning model producing the decision from the features
14. (Some) group fairness metrics
- DEMOGRAPHIC PARITY (independence): same percentage of loans granted to men and women.
- CONDITIONAL DEMOGRAPHIC PARITY: given some characteristics, same percentage of loans granted to men and women.
- EQUALITY OF ODDS (separation): same error rates for men and women.
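To make these definitions concrete, here is a minimal sketch (not from the talk; function names and the toy data are ours) of how the two group metrics can be computed for binary decisions and a binary sensitive attribute:

```python
import numpy as np

def demographic_parity_gap(y_pred, group):
    """Absolute difference in positive-decision rates between the two groups."""
    return abs(y_pred[group == 0].mean() - y_pred[group == 1].mean())

def equalized_odds_gap(y_true, y_pred, group):
    """Largest between-group difference in error rates, conditioning on the
    ground truth (separation): covers both TPR and FPR."""
    gaps = []
    for y in (0, 1):
        mask = y_true == y
        gaps.append(abs(y_pred[mask & (group == 0)].mean()
                        - y_pred[mask & (group == 1)].mean()))
    return max(gaps)

rng = np.random.default_rng(0)
group = rng.integers(0, 2, 1_000)
y_true = rng.integers(0, 2, 1_000)
y_pred = rng.integers(0, 2, 1_000)   # stand-in model decisions
print(demographic_parity_gap(y_pred, group))
print(equalized_odds_gap(y_true, y_pred, group))
```

A demographic parity gap close to 0 means independence holds; an equalized odds gap close to 0 means separation holds.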
15. (Some) individual fairness metrics
- FAIRNESS THROUGH AWARENESS: similar individuals are given similar decisions.
- FAIRNESS THROUGH UNAWARENESS: the model's outcomes are functions of non-sensitive features only.
("Fairness through awareness", Dwork, Cynthia, et al., Proceedings of the 3rd Innovations in Theoretical Computer Science Conference, 2012)
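Dwork et al. formalize fairness through awareness as a Lipschitz condition: decisions on individuals that are similar under a task-specific metric must themselves be similar. A minimal sketch of such a check, with a hypothetical Euclidean similarity metric and a stand-in linear scorer (both our assumptions):

```python
import numpy as np
from itertools import combinations

def lipschitz_violations(scores, X, metric, L=1.0):
    """Count pairs (i, j) violating |f(x_i) - f(x_j)| <= L * d(x_i, x_j)."""
    return sum(
        abs(scores[i] - scores[j]) > L * metric(X[i], X[j])
        for i, j in combinations(range(len(scores)), 2)
    )

# hypothetical similarity metric and stand-in model scores
d = lambda a, b: np.linalg.norm(a - b)
X = np.random.default_rng(1).normal(size=(50, 3))
scores = X @ np.array([0.2, 0.1, 0.05])
print(lipschitz_violations(scores, X, d))  # 0: this linear scorer is 1-Lipschitz
```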
16. The devil is in the details: group fairness vs individual fairness
Fairness through unawareness: don't use the sensitive variable explicitly.
[Figure: loans granted vs rejected as a function of financial status, for domestic and foreign applicants.]
17. The devil is in the details: group fairness vs individual fairness
Demographic parity: same acceptance rate for different groups. Compare the widely cited 4/5 rule of the Uniform Guidelines on Employee Selection Procedures.
[Figure: loans granted vs rejected as a function of financial status, for domestic and foreign applicants.]
18. Demographic parity makes sense when you want to enforce equality between groups whose distributions, you deem, should be equal if there were no bias.
Fairness through unawareness: problems with correlations; which variables are we allowed to use?
21. The devil is in the details: error rate parities are not all the same
The COMPAS debate. Predictive parity approximately holds: precision = 75% for both White and Black defendants.
[Figure: score distributions for re-offenders and non-re-offenders, by group.]
22. The devil is in the details: error rate parities are not all the same
The COMPAS debate. Equality of opportunity, instead, does not hold: recall = 60% for White vs recall = 86% for Black defendants.
[Figure: score distributions for re-offenders and non-re-offenders, by group.]
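The arithmetic behind these two slides fits in a few lines. The confusion-matrix counts below are made up, but chosen to reproduce the quoted rates: precision equal at 75% in both groups while recall differs, which is possible because the two groups have different base rates:

```python
# Illustrative (made-up) counts reproducing the quoted COMPAS-style rates.
counts = {
    "White": dict(tp=30, fp=10, fn=20, tn=40),
    "Black": dict(tp=60, fp=20, fn=10, tn=10),
}
for g, c in counts.items():
    precision = c["tp"] / (c["tp"] + c["fp"])   # P(re-offend | flagged)
    recall = c["tp"] / (c["tp"] + c["fn"])      # P(flagged | re-offend)
    print(f"{g}: precision={precision:.0%}, recall={recall:.0%}")
# White: precision=75%, recall=60%
# Black: precision=75%, recall=86%
```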
26. Pre-processing
- Suppression: remove the sensitive variable(s) and the features highly correlated with them.
- Massaging: re-label enough "minority" observations so that a new "massaged" training dataset is reached which satisfies demographic parity to begin with. The choice of which observations to re-label is made by training an auxiliary model on the original target. ("Data preprocessing techniques for classification without discrimination", F. Kamiran and T. Calders, Knowledge and Information Systems, 2012)
- Sampling: resample observations in order to reach demographic parity.
- Fair representation: learn a representation of the data such that sensitive information is removed, while keeping as much information as possible from X. ("Learning fair representations", Zemel, Rich, et al., International Conference on Machine Learning, PMLR, 2013)
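As an illustration of the massaging idea, here is a simplified sketch of our own (assuming a binary target, a binary sensitive attribute with group 1 the favoured one, and a logistic regression as the auxiliary ranker; it is not the reference implementation of Kamiran and Calders):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def massage(X, y, group):
    """Re-label borderline observations so that the training set satisfies
    demographic parity (simplified version of Kamiran & Calders' massaging)."""
    # auxiliary model trained on the original target provides a ranking
    scores = LogisticRegression(max_iter=1000).fit(X, y).predict_proba(X)[:, 1]
    dep, fav = group == 0, group == 1  # deprived / favoured groups (assumed)
    # number of label flips per side needed to equalise positive rates
    m = int(round((y[fav].mean() - y[dep].mean())
                  * dep.sum() * fav.sum() / len(y)))
    y_new = y.copy()
    # promote the highest-ranked negatives of the deprived group...
    cand = np.where(dep & (y == 0))[0]
    y_new[cand[np.argsort(scores[cand])[::-1][:m]]] = 1
    # ...and demote the lowest-ranked positives of the favoured group
    cand = np.where(fav & (y == 1))[0]
    y_new[cand[np.argsort(scores[cand])[:m]]] = 0
    return y_new
```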
27. In-processing
Idea: a model tries to maximize performance while an Adversary tries to reconstruct the protected variable $Z$ from the model's outputs.
Goal: train using the following gradient steps:
- $U$ step: the Adversary is updated along $\nabla_U L_A$, i.e. it learns to reconstruct $Z$;
- $W$ step: the Predictor is updated along $\nabla_W L_P - \text{proj}_{\nabla_W L_A} \nabla_W L_P - \alpha \nabla_W L_A$, i.e. it estimates $Y$ while pushing against the Adversary and removing its effect.
[Diagram illustrating the gradients.] Without the projection term, in the pictured scenario, the predictor would move in the direction labelled $g+h$ in the diagram, which actually helps the adversary. With the projection term, the predictor will never move in a direction that helps the adversary.
("Mitigating unwanted biases with adversarial learning", B. H. Zhang, B. Lemoine, and M. Mitchell, Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society, 2018)
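A minimal sketch of one such training step, in the spirit of the Zhang et al. update rule; the toy linear predictor and adversary, the learning rate and $\alpha$ are illustrative assumptions, not the paper's setup:

```python
import torch

# Toy shapes: predictor estimates y from features, adversary tries to
# recover z from the predictor's output logit.
predictor = torch.nn.Linear(10, 1)
adversary = torch.nn.Linear(1, 1)
bce = torch.nn.functional.binary_cross_entropy_with_logits
alpha, lr = 1.0, 0.01  # illustrative hyperparameters

x = torch.randn(64, 10)
y = torch.randint(0, 2, (64, 1)).float()  # target
z = torch.randint(0, 2, (64, 1)).float()  # protected variable

logit = predictor(x)
loss_p = bce(logit, y)             # predictor loss L_P
loss_a = bce(adversary(logit), z)  # adversary loss L_A

# U step: the adversary simply descends L_A
grads_u = torch.autograd.grad(loss_a, list(adversary.parameters()), retain_graph=True)
# W step: g_P - proj_{g_A}(g_P) - alpha * g_A, per predictor weight
grads_p = torch.autograd.grad(loss_p, list(predictor.parameters()), retain_graph=True)
grads_a = torch.autograd.grad(loss_a, list(predictor.parameters()))

with torch.no_grad():
    for u, gu in zip(adversary.parameters(), grads_u):
        u -= lr * gu
    for w, gp, ga in zip(predictor.parameters(), grads_p, grads_a):
        unit = ga / (ga.norm() + 1e-8)  # unit vector along the adversary's gradient
        w -= lr * (gp - (gp * unit).sum() * unit - alpha * ga)
```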
28. Post-processing
General idea: tweak the decision threshold separately for each group of the sensitive variable.
- Demographic parity: straightforward.
- Equality of odds: need to look at the ROC curves.
(Plots from Barocas, Hardt, Narayanan, https://fairmlbook.org; "Equality of opportunity in supervised learning", Hardt, Price, Srebro, NeurIPS 2016)
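For demographic parity the recipe is indeed straightforward: pick, for each group, the threshold corresponding to the same acceptance rate. A minimal sketch (function names and the synthetic scores are ours):

```python
import numpy as np

def dp_thresholds(scores, group, rate):
    """Per-group thresholds giving every group the same acceptance rate."""
    return {g: np.quantile(scores[group == g], 1 - rate)
            for g in np.unique(group)}

rng = np.random.default_rng(0)
group = rng.integers(0, 2, 10_000)
scores = rng.beta(2 + group, 2, 10_000)  # group 1 scores higher on purpose
thr = dp_thresholds(scores, group, rate=0.3)
y_hat = np.array([s >= thr[g] for s, g in zip(scores, group)])
for g in (0, 1):
    print(g, y_hat[group == g].mean())  # ~0.30 for both groups
```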
30. Project description and objectives
Joint research collaboration on Trustworthy AI in the financial domain.
The goal is to survey the available metrics and techniques and to come up with a roadmap to follow in order to pursue fairness.
To reach this goal, we carried out explorations on a real-world financial use case of credit lending.
We collected tools for assessment, mitigation and visualization into a fairness toolbox called BeFair.
("BeFair: addressing Fairness in the Banking sector", Castelnovo, Crupi, Greco, Del Gamba, Naseer, Regoli, San Miguel Gonzalez, 2020 IEEE Big Data Conference)
31. Methodology
1. REGULATORY ASPECTS: what are the principles to be followed to identify the appropriate concept of fairness to be pursued?
2. DATASET ASSESSMENT. Target assessment: is there human intervention? Features assessment: what information is compatible with the pursued fairness concept? Correlations with protected classes.
3. CHOICE OF THE FAIRNESS METRICS: choose the metric(s) that best comply with the previous findings.
4. BIAS MITIGATION: apply a set of mitigation strategies in order to target the chosen metric.
5. COMPARISON AND CHOICE: compare and choose the best strategy.
Expert knowledge, business needs and logics, and legal and domain knowledge all feed into a context-based fairness definition.
32. Credit lending use case
~200,000 loan applications; ~50 predictors, including financial variables and personal information. The target is the final decision of a human officer.
Dataset assessment (step 2): throughout the analysis, we focus on CITIZENSHIP = {0, 1} as the sensitive attribute with respect to which fairness is assessed.
Bias, measured in terms of demographic parity, is negligible in the original target, but amplified by the application of an ML model.
("BeFair: addressing Fairness in the Banking sector", Castelnovo, Crupi, Greco, Del Gamba, Naseer, Regoli, San Miguel Gonzalez, 2020 IEEE Big Data Conference)
33. Results (step 3)
Mitigated model, group level: 75.5% of loans granted to citizens vs 75.1% to non-citizens.
Mitigated model, individual level: 77.5% of loans granted to citizens vs 51.7% to non-citizens.
35. Counterfactual fairness (step 4)
Build a causal graph with causal discovery algorithms and validate it with domain experts. Nodes are variables, while directed edges express the causal relationships among them.
Employ the causal graph to train a counterfactually fair model (Kusner et al. 2017): no causal flow from the sensitive attribute to the final decision.
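A minimal sketch of the "no causal flow" requirement, using networkx on a hypothetical graph (the variables and edges are our illustration, not the graph learned in the project): a counterfactually fair model may only use, directly, variables that are not causal descendants of the sensitive attribute.

```python
import networkx as nx

# Hypothetical causal graph for the lending setting.
g = nx.DiGraph([
    ("citizenship", "income"),
    ("age", "income"),
    ("income", "financial_status"),
    ("financial_status", "repayment"),
])

# No causal flow from the sensitive attribute to the decision means the
# model may only use variables that are not descendants of it.
forbidden = nx.descendants(g, "citizenship") | {"citizenship"}
allowed = set(g.nodes) - forbidden
print(allowed)  # {'age'}
```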
37. Comparison and choice (step 5)
Proposed methods to identify the best performance-fairness trade-off, where $\pi$ and $\phi$ are the preferred performance and fairness metrics, respectively, and $\beta$ is the weight associated with the performance metric:
- Fairness-performance trade-off:
$$\frac{(1+\beta^2)\,(1-\phi)\,\pi}{\beta^2\,(1-\phi)+\pi}$$
- Constrained performance:
$$\max_{\phi < \Phi} \pi$$
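Both criteria are easy to operationalize once each candidate mitigation strategy comes with a (performance, unfairness) pair; the candidate names and numbers below are made up for illustration:

```python
# Each candidate strategy comes with (performance pi, unfairness phi).
candidates = {"baseline": (0.82, 0.20),
              "massaging": (0.79, 0.05),
              "adversarial": (0.77, 0.02)}

def tradeoff_score(pi, phi, beta=1.0):
    """F-beta style harmonic mean of fairness (1 - phi) and performance pi."""
    return (1 + beta**2) * (1 - phi) * pi / (beta**2 * (1 - phi) + pi)

# 1) trade-off: candidate with the best combined score
best = max(candidates, key=lambda k: tradeoff_score(*candidates[k]))
# 2) constrained performance: best pi among candidates with phi below a cap
cap = 0.10
best_constrained = max((k for k, (pi, phi) in candidates.items() if phi < cap),
                       key=lambda k: candidates[k][0])
print(best, best_constrained)
```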
38. Conclusions
Bias and discrimination are a concrete risk for AI applications at scale.
A crucial point is that fairness in Machine Learning cannot be left to Data Scientists only.
More research is needed on the ethical and legal side, to clarify the needs of specific domains.
More research is needed on the technical side, e.g. to understand the relationships among different fairness metrics, in particular group vs individual, or to find mitigation strategies enforcing different metrics.
Fairness concepts are manifold, and care should be taken in any specific situation.
39. Some references
Surveys:
- Castelnovo, Crupi, Greco, Regoli, The zoo of Fairness metrics in Machine Learning, preprint (2021)
- Barocas, Selbst, Big data's disparate impact, California Law Review (2016)
- Mehrabi et al., A survey on bias and fairness in machine learning, ACM Computing Surveys (2021)
- Barocas, Hardt, Narayanan, Fairness and machine learning (2019), https://fairmlbook.org
Bias mitigation:
- Zhang, Lemoine, Mitchell, Mitigating unwanted biases with adversarial learning, Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society (2018)
- Kamiran, Calders, Data preprocessing techniques for classification without discrimination, Knowledge and Information Systems (2012)
- Hardt, Price, Srebro, Equality of opportunity in supervised learning, Advances in Neural Information Processing Systems (2016)
- Zemel et al., Learning fair representations, International Conference on Machine Learning, PMLR (2013)
40. Thank you!
daniele regoli
Big Data Lab @ Intesa Sanpaolo
daniele.regoli@intesasanpaolo.com