Published at ACL 2020
Link to the paper: https://www.aclweb.org/anthology/2020.acl-main.468.pdf
Joint work by:
Deven Shah (https://www.linkedin.com/in/deven-shah/)
Prof. Andrew Schwartz (https://www3.cs.stonybrook.edu/~has/)
Prof. Dirk Hovy (http://www.dirkhovy.com/cv/index.php)
An increasing number of natural language processing papers address the effect of bias on predictions, introducing mitigation techniques at different parts of the standard NLP pipeline (data and models). However, these works have been conducted in isolation, without a unifying framework to organize efforts within the field. This leads to repetitive approaches and an overly narrow focus on bias symptoms/effects rather than on their origins, which could limit the development of effective countermeasures. In this paper, we propose a unifying predictive bias framework for NLP. We summarize the NLP literature and suggest general mathematical definitions of predictive bias. We differentiate two consequences of bias, outcome disparities and error disparities, as well as four potential origins of bias: label bias, selection bias, model overamplification, and semantic bias. Our framework serves as an overview of predictive bias in NLP, integrating existing work into a single structure and providing a conceptual baseline for improved frameworks.
Predictive Biases in Natural Language Processing Models: A Conceptual Framework and Overview
1. Predictive Biases in Natural Language Processing Models:
A Conceptual Framework and Overview
Deven Shah, H. Andrew Schwartz, & Dirk Hovy
(dsshah, has)@cs.stonybrook.edu, dirk.hovy@unibocconi.edu
Human Language Analysis Beings
4. Predictive Models in NLP are Biased
[Framework diagram: an Embedding Corpus (Pre-trained Side) supplies embedding features 𝜃embedding; features Xsource are fit to outcomes Ysource in the Source Population (Model Side); the fitted model predicts biased outcomes Ŷtarget from features Xtarget in the Target Population (Application Side).]
Citation cloud of work on bias in NLP models: Almeida et al. 2015; Almodaresi et al. 2017; Bender and Friedman 2018; Bolukbasi et al. 2016; Caliskan et al. 2017; Coavoux et al. 2018; Corbett-Davies and Goel 2018; Culotta 2014; Elazar and Goldberg 2018; Friedler et al. 2016; Garg et al. 2018; Garimella et al. 2019; Gebru et al. 2018; Giorgi et al. 2019; Glymour and Herington 2019; Gonen and Goldberg 2019; Hitti et al. 2019; Hovy 2018; Hovy and Spruit 2016; Hovy et al. 2020; Jørgensen et al. 2015; Kern et al. 2016; Kiritchenko and Mohammad 2018; Kleinberg et al. 2018; Kurita et al. 2019; Li et al. 2018; Lynn et al. 2017; Mitchell et al. 2019; Nissim et al. 2019; Romanov et al. 2019; Rudinger et al. 2018; Sap et al. 2019; Stanovsky et al. 2019; Sun et al. 2019; Suresh and Guttag 2019; Sweeney and Najafian 2019; Webster et al. 2018; Zhao et al. 2017; Zhao et al. 2018; Zmigrod et al. 2019.
5. Predictive Models in NLP are Biased
[Same framework diagram and citation cloud as slide 4.]
Goal: To provide a conceptual framework and mathematical definitions for organizing work on biased predictive models in NLP.
8. Conceptual Framework
[Framework diagram as in slide 4.]
outcome disparity
The distribution of outcomes, given attribute A, is dissimilar to the ideal distribution:
Q(Ŷt|A) ≁ P(Yt|A)
error disparity
The distribution of error (ϵ) over at least two different values of an attribute (A) is unequal:
Q(ϵt|Ai) ≁ Q(ϵt|Aj)
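In practice, both disparities can be estimated by comparing per-group summary statistics of the predictions. The sketch below is a minimal illustration, not code from the paper; the function names and the use of held-out gold outcomes as a stand-in for the ideal distribution P(Yt|A) are assumptions.

```python
# Minimal sketch (assumed helper names, not the authors' code): estimating
# outcome disparity and error disparity from predictions on a target sample.
import numpy as np

def outcome_disparity(y_pred, y_ideal, a, group_i, group_j):
    """Gap between the predicted and the ideal between-group outcome difference.

    y_ideal approximates the ideal distribution P(Yt|A), e.g. held-out gold
    outcomes; choosing the ideal distribution is left to the practitioner.
    """
    pred_gap = y_pred[a == group_i].mean() - y_pred[a == group_j].mean()
    ideal_gap = y_ideal[a == group_i].mean() - y_ideal[a == group_j].mean()
    return pred_gap - ideal_gap  # 0 = predicted gap matches the ideal gap

def error_disparity(y_true, y_pred, a, group_i, group_j):
    """Difference in mean error between two values of the attribute A."""
    eps = np.abs(y_true - y_pred)  # per-instance error
    return eps[a == group_i].mean() - eps[a == group_j].mean()
```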
9. Outcome Disparity
[Bar chart: average predicted outcome per human attribute value (value1, value2), comparing the predicted distribution Q(Ŷt|A) with the ideal distribution P(Yt|A).]
10. Outcome Disparity (example)
[Bar chart: predicted cancer rate by gender (woman, man), comparing the predicted distribution Q(Ŷt|A) with the ideal distribution P(Yt|A).]
11. Error Disparity
[Bar chart: error of predictions per human attribute value (value1, value2), comparing a biased error distribution Q(ϵt|A) with an unbiased one P(ϵt|A).]
12. Error Disparity (example)
[Plot: prediction error versus distance from "Standard" language, which correlates with demographics (the "WSJ Effect"): Jørgensen et al. (WNUT 2015); Hovy & Søgaard (ACL 2015).]
14. Disparities
[Framework diagram as in slide 4, annotated with the outcome disparity and error disparity definitions from slide 8.]
15. Origins of Bias
[Framework diagram as in slide 4; the four origins are introduced on the following slides.]
16. Selection Bias
[Framework diagram as in slide 4.]
selection bias
The sample of observations itself is not representative of the application population.
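A standard countermeasure (see slide 31) is to re-weigh the source sample toward the target population. The sketch below is an illustration under assumptions, not the authors' implementation; the helper name and the availability of target proportions (e.g. from census data) are assumed.

```python
# Minimal post-stratification sketch (assumed helper, not the authors' code):
# weight source instances so that the attribute distribution Q(As) matches
# the target population's distribution P(At).
import numpy as np

def post_stratification_weights(a_source, target_props):
    """a_source: attribute value per training instance.
    target_props: dict of attribute value -> proportion in the target population."""
    values, counts = np.unique(a_source, return_counts=True)
    source_props = dict(zip(values, counts / len(a_source)))
    return np.array([target_props[v] / source_props[v] for v in a_source])

# Example: a sample that is 80% group "A" for a target population that is 50/50.
a = np.array(["A"] * 80 + ["B"] * 20)
w = post_stratification_weights(a, {"A": 0.5, "B": 0.5})
# Under-represented "B" instances get weight 2.5, "A" instances 0.625; most
# training routines accept such weights (e.g. a sample_weight argument).
```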
17. Selection Bias (illustration)
[Bar chart: proportion of sample per human attribute value (value1, value2), comparing the source distribution Q(AS) with the target distribution P(AT).]
18. Selection Bias (example)
[Same WSJ Effect example as slide 12: error correlates with distance from "Standard" language and with demographics (Jørgensen et al., WNUT 2015; Hovy & Søgaard, ACL 2015), here traced to training data that is not representative of the application population.]
20. Label Bias
[Framework diagram as in slide 4.]
label bias
Biased annotations, interaction, or latent bias from past classifications.
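One way to surface label bias is to test whether the annotated labels depend on a human attribute they should be independent of. The sketch below is an illustrative assumption, not the paper's procedure; it simply cross-tabulates labels against an attribute and runs a chi-square test.

```python
# Minimal label-bias probe (an assumption, not from the paper): test whether
# annotated labels and a human attribute are statistically dependent.
import numpy as np
from scipy.stats import chi2_contingency

def label_bias_check(labels, attribute):
    """labels, attribute: equal-length sequences (one entry per instance)."""
    label_vals = sorted(set(labels))
    attr_vals = sorted(set(attribute))
    table = np.array([[sum(1 for l, a in zip(labels, attribute)
                           if l == lv and a == av)
                       for av in attr_vals]
                      for lv in label_vals])
    chi2, p, _, _ = chi2_contingency(table)
    return chi2, p  # a small p-value flags a suspicious label/attribute link
```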
21. Label Bias (illustration)
[Bar chart: proportion of labels per human attribute value (value1, value2), comparing the annotated source distribution Q(YS|AS) with the ideal distribution P(YS|AS).]
22. Label Bias (example)
Sentence: "it comes out apr 30"
Two annotators disagree on the tag for "out":
Annotator 1: PRON VERB PART NOUN NUM ("It's a particle!")
Annotator 2: PRON VERB ADP NOUN NUM ("No! It's an adposition.")
24. Overamplification
[Framework diagram as in slide 4.]
over-amplification
The model discriminates on a given human attribute beyond its source base-rate.
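Over-amplification can be quantified as the gap between the model's predicted rate for a group and that group's base rate in the training data (cf. the kitchen example in the editor's notes, where a 66% training co-occurrence becomes an 84% prediction rate). The sketch below is a minimal illustration under assumed array names, not the authors' code.

```python
# Minimal over-amplification sketch (assumed names, not the authors' code):
# compare a group's predicted positive rate with its training base rate.
import numpy as np

def amplification(y_train, a_train, y_pred, a_pred, group, positive=1):
    """Positive return values mean the model exceeds the source base rate."""
    base_rate = (y_train[a_train == group] == positive).mean()
    pred_rate = (y_pred[a_pred == group] == positive).mean()
    return pred_rate - base_rate

# E.g. if 66% of training "kitchen" instances are labeled "woman" but the
# model predicts "woman" for 84% of kitchen scenes, amplification is +0.18.
```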
25. Overamplification (illustration)
[Bar chart: proportion of sample per human attribute value (value1, value2), comparing the predicted target distribution Q(ŶT|AT) with the source distribution Q(YS|AS) and the ideal distribution P(YS|AS).]
28. Semantic Bias
[Framework diagram as in slide 4.]
semantic bias
Non-ideal associations between attributed lexemes (e.g. gendered pronouns) and non-attributed lexemes (e.g. occupations).
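Such associations can be probed directly in a pre-trained embedding space, e.g. with WEAT-style similarity tests (Caliskan et al. 2017). The sketch below is a simplified illustration, not the paper's method; the embedding lookup `emb`, the word lists, and the helper names are assumptions.

```python
# Simplified semantic-bias probe (assumed helpers, cf. Caliskan et al. 2017):
# how strongly a non-attributed lexeme (an occupation) associates with
# attributed lexemes (gendered pronouns) in an embedding space.
import numpy as np

def cosine(u, v):
    return u @ v / (np.linalg.norm(u) * np.linalg.norm(v))

def gender_association(word, emb, female=("she", "her"), male=("he", "him")):
    """emb: dict-like mapping word -> vector. Positive values lean female,
    negative lean male; values near zero indicate no gendered association."""
    f = np.mean([cosine(emb[word], emb[w]) for w in female])
    m = np.mean([cosine(emb[word], emb[w]) for w in male])
    return f - m

# An unbiased embedding would score occupations such as "nurse" or
# "engineer" near zero; systematic deviations indicate semantic bias.
```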
30. Predictive Bias Framework for NLP
[Full framework figure, tying each origin of bias to the pipeline component where it arises and each consequence to the biased outcomes Ŷtarget on the application side.]
Origins:
● selection bias (features Xsource): the sample of observations itself is not representative of the application population.
● label bias (outcomes Ysource): biased annotations, interaction, or latent bias from past classifications.
● over-amplification (fitting Xsource to Ysource): the model discriminates on a given human attribute beyond its source base-rate.
● semantic bias (𝜃embedding, from the Embedding Corpus on the pre-trained side): non-ideal associations between attributed lexemes (e.g. gendered pronouns) and non-attributed lexemes (e.g. occupations).
Consequences:
● outcome disparity: Q(Ŷt|A) ≁ P(Yt|A)
● error disparity: Q(ϵt|Ai) ≁ Q(ϵt|Aj)
31. Summary of Countermeasures

Source          Origin             Countermeasures
annotation      Label Bias         Post-stratification; re-train annotators
data selection  Selection Bias     Stratified sampling, post-stratification, or re-weighing techniques
models          Overamplification  Synthetically match distributions; add outcome disparity to the cost function
embeddings      Semantic Bias      Use the above techniques and re-train embeddings
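For the overamplification row, "add outcome disparity to the cost function" can be realized as a penalty on the per-group gap in predictions during training. The PyTorch sketch below is a minimal illustration under assumptions (binary task, two attribute groups present in each batch), not the authors' implementation.

```python
# Minimal sketch (an assumption, not the authors' code): add an outcome-
# disparity penalty to a binary classification loss.
import torch
import torch.nn.functional as F

def loss_with_disparity_penalty(logits, y, group_mask, lam=1.0):
    """logits, y: model outputs and gold labels for a batch;
    group_mask: bool tensor marking one attribute value (each batch should
    contain both groups, otherwise the penalty is undefined);
    lam: weight of the disparity penalty."""
    task_loss = F.binary_cross_entropy_with_logits(logits, y.float())
    probs = torch.sigmoid(logits)
    disparity = (probs[group_mask].mean() - probs[~group_mask].mean()).abs()
    return task_loss + lam * disparity
```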
32. Takeaways
Bias, as outcome and error disparities, can result from many origins:
● the embedding model
● the feature sample
● the fitting process
● the outcome sample
See the paper!
● Survey of works and countermeasures for each origin
● Details on the predictive bias framework including mathematical definitions
This is version 0.1: we hope it inspires further work towards a unified understanding of bias and, ultimately, its mitigation!
33. References
1. Berk, R. A. (1983). An introduction to sample selection bias in sociological data. American Sociological Review, 386-398.
2. Feldman, M., Friedler, S. A., Moeller, J., Scheidegger, C., & Venkatasubramanian, S. (2015). Certifying and removing disparate impact. In Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 259-268). ACM.
3. Friedler, S. A., Scheidegger, C., & Venkatasubramanian, S. (2016). On the (im)possibility of fairness. arXiv preprint arXiv:1609.07236.
4. Baker, R., Brick, J. M., Bates, N. A., Battaglia, M., Couper, M. P., Dever, J. A., ... & Tourangeau, R. (2013). Summary report of the AAPOR task force on non-probability sampling. Journal of Survey Statistics and Methodology, 1(2), 90-143.
5. Hays, R. D., Liu, H., & Kapteyn, A. (2015). Use of internet panels to conduct surveys. Behavior Research Methods, 47(3), 685-690.
6. Hovy, D. (2015). Demographic factors improve classification performance. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers).
7. Sun, T., et al. (2019). Mitigating gender bias in natural language processing: Literature review. arXiv preprint arXiv:1906.08976.
34. Thank You
(dsshah, has)@cs.stonybrook.edu, dirk.hovy@unibocconi.edu
Deven Shah, H. Andrew Schwartz, and Dirk Hovy. 2020. Predictive Biases in Natural Language Processing Models: A Conceptual Framework and Overview. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.
Human Language Analysis Beings
Editor's Notes
"We look at the predictions we're making and consider whether there is a difference or a disparity with what is desirable."
"Building on work from bias in health, we refer such a difference as a disparity. We then contend there are two types of disparities which cover most concerns for bias in NLP: ...."
Definitions also work with continuous distributions: show a continuous distribution
"but knowing that the model has a disparity usually doesn't provide a solution. Focusing only on the disparity is a bit like focusing only on the symptoms of a disease rather than the cause. "
"so let's work from the symptoms or disparities, backwards to consider possible origins in the bias. We have to look backwards in the model development process "
For instance, given a training set with woman in the kitchen 66% of the times and man in the kitchen 33% of the time, hence there is a difference in the outcome given the gender of the person. After training an NLP model on this dataset, and testing it on unseen data, the model predicts woman in the kitchen 84% of the times while man in the kitchen 16% of the times, hence amplifying a little bias which was already present it in the data
We also propose countermeasures to avoid each of the predictive biases based on it’s origins. For Label bias problem where the source is the annotation, we could probably re-retrain the annotators or use post-stratification techniques
For selection bias where the problem is data selection: we could use stratified sampling techniques or some reweighing or post-stratification techniques to avoid it.
For overamplification problem, where the source is the model, we could synthetically match the distributions based on human attributes with respect to the outcome variables and then pass it to the model for training or add outcome disparity to the cost function.
For semantic bias, where the source is biased embeddings, we could retrain the embeddings by taking the above techniques into consideration.