SlideShare a Scribd company logo
Balanced Datasets Are Not Enough:
Estimating and Mitigating Gender Bias in
Deep Image Representations
Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest
Tianlu Wang
University of Virginia
Gender Bias in Visual Recognition Systems
Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest
Deep Neural
Network
Gender Bias in Visual Recognition Systems
Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest
Trained Deep
Neural
Network
tie:
Quantifying Bias: Leakage
Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest
Trained Deep
Neural Network
Is this prediction
biased?
Quantifying Bias: Model Leakage
Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest
Gender
Classifier
Model Leakage: gender prediction accuracy of a classifier trained on predictions.
man
woman
Object & Action Recognition Models
Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest
COCO Object Recognition
• 22k images (16k man & 6k woman)
• 80 objects (kite, ski, handbag, tie…)
• Recognition performance (F1): 53.75%
• model leakage: 70.46%
imSitu Action Recognition
• 24k images (14k man & 10k woman)
• 211 activities (cooking, shooting, lifting…)
• Recognition performance (F1): 40.11%
• model leakage: 76.93%
Quantifying Bias: Dataset Leakage
Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest
Gender
Classifier
Dataset Leakage: gender prediction accuracy of a classifier trained on annotations.
Predictions
Ground Truth
Labels
man
woman
Does the model inherit 100% dataset leakage?
Performance Matters!
Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest
Predictions
Ground Truth
Labels Predictions
F1 score = 100%
Dataset Leakage = 67.72%
F1 score = 53.75%
Model Leakage = 70.46%
Random Guess
F1 score ≈ 0
NO LEAKGE!
Random
Perturbation
Perturbed
Labels
match the
performance
Quantifying Bias: Adjusted Dataset Leakage
Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest
Gender
Classifier
man
woman
Adjusted Dataset Leakage:
gender prediction accuracy of a classifier trained on perturbed annotations.
Ground Truth
Labels
Quantifying Bias: Bias Amplification
Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest
Δ = Model Leakage – Adjusted Dataset Leakage > 0 Bias Amplification!
52
56
60
64
68
72
Model Leakage Adjusted Dataset
Leakage
COCO Object Recognition
50
55
60
65
70
75
80
Model Leakage Adjusted Dataset
Leakage
imSitu Action Recognition
20.47
9.93
Eliminating Bias: Adding Noise
Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest
35
40
45
50
55
2 4 6 8 10
F1Score(%)
Bias Amplification in COCO
original
randomization
Eliminating Bias: Balanced Datasets
Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest
man
71%
woman
29%
man
68%
woman
38% man
50%
woman
50%
Original
F1 score (%): 53.75
model leakage (%): 70.46
Balanced 3
F1 score (%): 52.60
model leakage (%): 67.78
Balanced 1
F1 score (%): 42.89
model leakage (%): 63.22
man
57%
woman
43%
Balanced 2
F1 score (%): 51.95
model leakage (%): 64.45
less images, lower performance, lower model leakage
Eliminating Bias: Balanced Datasets
Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest
35
40
45
50
55
2 4 6 8 10
F1Score(%)
Bias Amplification in COCO
original
randomization
balanced 3
balanced 2
balanced 1Balancing the co-occurance of gender and target labels
does not reduce bias amplification.
Eliminating Bias: Using Extra Annotations
Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest
blur-segmoriginal blackout-face blackout-segm blackout-box
Eliminating Bias: Using Extra Annotations
Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest
35
40
45
50
55
2 4 6 8 10
F1Score(%)
Bias Amplification in COCO
original
randomization
balanced 3
balanced 2
balanced 1
blackout-face
blur-segm
blackout-segm
blackout-box
Eliminating Bias: Adversarial Training
Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest
Convolutional
Neural
Network
(Resnet-50)
Fully-
connected
Layer
+
Logistic
Regressors
Handbag
Fork
Vase
Spoon
…
Knife
Car
Oven
Gender
Classifier
man
woman
Gradient Reversal
Eliminating Bias: Adversarial Training
Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest
35
40
45
50
55
2 4 6 8 10
F1Score(%)
Bias Amplification in COCO
original
randomization
balanced 3
balanced 2
balanced 1
blackout-face
blur-segm
blackout-segm
blackout-box
adv @ image
adv@conv4
adv @ conv5
Visualization of Adversarial Training
Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest
Convolutional
Neural
Network
(Resnet-50)
Fully-
connected
Layer
+
Logistic
Regressors
Handbag
Fork
Vase
Spoon
…
Knife
Car
Oven
Gender
Classifier
man
woman
Gradient Reversal
X
Mask Prediction
Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest
Adversarial Training: Removing Face Area
Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest
Adversarial Training: Removing Face and Skin
Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest
Adversarial Training: Removing Entire Person
Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest
Adversarial Training:
Removing Contextual Cues
Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest

More Related Content

More from Melissa Moody

Connecting citizens with public data to drive policy change
Connecting citizens with public data to drive policy changeConnecting citizens with public data to drive policy change
Connecting citizens with public data to drive policy change
Melissa Moody
 
Data Collection Methods for Building a Free Response Training Simulation
Data Collection Methods for Building a Free Response Training SimulationData Collection Methods for Building a Free Response Training Simulation
Data Collection Methods for Building a Free Response Training Simulation
Melissa Moody
 
Improving Credit Card Fraud Detection: Using Machine Learning to Profile and ...
Improving Credit Card Fraud Detection: Using Machine Learning to Profile and ...Improving Credit Card Fraud Detection: Using Machine Learning to Profile and ...
Improving Credit Card Fraud Detection: Using Machine Learning to Profile and ...
Melissa Moody
 
Deep Learning Meets Biology: How Does a Protein Helix Know Where to Start and...
Deep Learning Meets Biology: How Does a Protein Helix Know Where to Start and...Deep Learning Meets Biology: How Does a Protein Helix Know Where to Start and...
Deep Learning Meets Biology: How Does a Protein Helix Know Where to Start and...
Melissa Moody
 
Automatic detection of online abuse and analysis of problematic users in wiki...
Automatic detection of online abuse and analysis of problematic users in wiki...Automatic detection of online abuse and analysis of problematic users in wiki...
Automatic detection of online abuse and analysis of problematic users in wiki...
Melissa Moody
 
Plans for the University of Virginia School of Data Science
Plans for the University of Virginia School of Data SciencePlans for the University of Virginia School of Data Science
Plans for the University of Virginia School of Data Science
Melissa Moody
 
Wikimedia Foundation, Trust & Safety: Cyber Harassment Classification and Pre...
Wikimedia Foundation, Trust & Safety: Cyber Harassment Classification and Pre...Wikimedia Foundation, Trust & Safety: Cyber Harassment Classification and Pre...
Wikimedia Foundation, Trust & Safety: Cyber Harassment Classification and Pre...
Melissa Moody
 
Collective Biographies of Women: A Deep Learning Approach to Paragraph Annota...
Collective Biographies of Women: A Deep Learning Approach to Paragraph Annota...Collective Biographies of Women: A Deep Learning Approach to Paragraph Annota...
Collective Biographies of Women: A Deep Learning Approach to Paragraph Annota...
Melissa Moody
 
Introduction to Social Network Analysis
Introduction to Social Network AnalysisIntroduction to Social Network Analysis
Introduction to Social Network Analysis
Melissa Moody
 
Ethical Priniciples for the All Data Revolution
Ethical Priniciples for the All Data RevolutionEthical Priniciples for the All Data Revolution
Ethical Priniciples for the All Data Revolution
Melissa Moody
 
Steering Model Selection with Visual Diagnostics
Steering Model Selection with Visual DiagnosticsSteering Model Selection with Visual Diagnostics
Steering Model Selection with Visual Diagnostics
Melissa Moody
 
Assessing the reproducibility of DNA microarray studies
Assessing the reproducibility of DNA microarray studiesAssessing the reproducibility of DNA microarray studies
Assessing the reproducibility of DNA microarray studies
Melissa Moody
 
Modeling the Impact of R & Python Packages: Dependency and Contributor Networks
Modeling the Impact of R & Python Packages: Dependency and Contributor NetworksModeling the Impact of R & Python Packages: Dependency and Contributor Networks
Modeling the Impact of R & Python Packages: Dependency and Contributor Networks
Melissa Moody
 
How to Beat the House: Predicting Football Results with Hyperparameter Optimi...
How to Beat the House: Predicting Football Results with Hyperparameter Optimi...How to Beat the House: Predicting Football Results with Hyperparameter Optimi...
How to Beat the House: Predicting Football Results with Hyperparameter Optimi...
Melissa Moody
 
A Modified K-Means Clustering Approach to Redrawing US Congressional Districts
A Modified K-Means Clustering Approach to Redrawing US Congressional DistrictsA Modified K-Means Clustering Approach to Redrawing US Congressional Districts
A Modified K-Means Clustering Approach to Redrawing US Congressional Districts
Melissa Moody
 
Joining Separate Paradigms: Text Mining & Deep Neural Networks to Character...
Joining Separate Paradigms:  Text Mining & Deep Neural Networks  to Character...Joining Separate Paradigms:  Text Mining & Deep Neural Networks  to Character...
Joining Separate Paradigms: Text Mining & Deep Neural Networks to Character...
Melissa Moody
 

More from Melissa Moody (16)

Connecting citizens with public data to drive policy change
Connecting citizens with public data to drive policy changeConnecting citizens with public data to drive policy change
Connecting citizens with public data to drive policy change
 
Data Collection Methods for Building a Free Response Training Simulation
Data Collection Methods for Building a Free Response Training SimulationData Collection Methods for Building a Free Response Training Simulation
Data Collection Methods for Building a Free Response Training Simulation
 
Improving Credit Card Fraud Detection: Using Machine Learning to Profile and ...
Improving Credit Card Fraud Detection: Using Machine Learning to Profile and ...Improving Credit Card Fraud Detection: Using Machine Learning to Profile and ...
Improving Credit Card Fraud Detection: Using Machine Learning to Profile and ...
 
Deep Learning Meets Biology: How Does a Protein Helix Know Where to Start and...
Deep Learning Meets Biology: How Does a Protein Helix Know Where to Start and...Deep Learning Meets Biology: How Does a Protein Helix Know Where to Start and...
Deep Learning Meets Biology: How Does a Protein Helix Know Where to Start and...
 
Automatic detection of online abuse and analysis of problematic users in wiki...
Automatic detection of online abuse and analysis of problematic users in wiki...Automatic detection of online abuse and analysis of problematic users in wiki...
Automatic detection of online abuse and analysis of problematic users in wiki...
 
Plans for the University of Virginia School of Data Science
Plans for the University of Virginia School of Data SciencePlans for the University of Virginia School of Data Science
Plans for the University of Virginia School of Data Science
 
Wikimedia Foundation, Trust & Safety: Cyber Harassment Classification and Pre...
Wikimedia Foundation, Trust & Safety: Cyber Harassment Classification and Pre...Wikimedia Foundation, Trust & Safety: Cyber Harassment Classification and Pre...
Wikimedia Foundation, Trust & Safety: Cyber Harassment Classification and Pre...
 
Collective Biographies of Women: A Deep Learning Approach to Paragraph Annota...
Collective Biographies of Women: A Deep Learning Approach to Paragraph Annota...Collective Biographies of Women: A Deep Learning Approach to Paragraph Annota...
Collective Biographies of Women: A Deep Learning Approach to Paragraph Annota...
 
Introduction to Social Network Analysis
Introduction to Social Network AnalysisIntroduction to Social Network Analysis
Introduction to Social Network Analysis
 
Ethical Priniciples for the All Data Revolution
Ethical Priniciples for the All Data RevolutionEthical Priniciples for the All Data Revolution
Ethical Priniciples for the All Data Revolution
 
Steering Model Selection with Visual Diagnostics
Steering Model Selection with Visual DiagnosticsSteering Model Selection with Visual Diagnostics
Steering Model Selection with Visual Diagnostics
 
Assessing the reproducibility of DNA microarray studies
Assessing the reproducibility of DNA microarray studiesAssessing the reproducibility of DNA microarray studies
Assessing the reproducibility of DNA microarray studies
 
Modeling the Impact of R & Python Packages: Dependency and Contributor Networks
Modeling the Impact of R & Python Packages: Dependency and Contributor NetworksModeling the Impact of R & Python Packages: Dependency and Contributor Networks
Modeling the Impact of R & Python Packages: Dependency and Contributor Networks
 
How to Beat the House: Predicting Football Results with Hyperparameter Optimi...
How to Beat the House: Predicting Football Results with Hyperparameter Optimi...How to Beat the House: Predicting Football Results with Hyperparameter Optimi...
How to Beat the House: Predicting Football Results with Hyperparameter Optimi...
 
A Modified K-Means Clustering Approach to Redrawing US Congressional Districts
A Modified K-Means Clustering Approach to Redrawing US Congressional DistrictsA Modified K-Means Clustering Approach to Redrawing US Congressional Districts
A Modified K-Means Clustering Approach to Redrawing US Congressional Districts
 
Joining Separate Paradigms: Text Mining & Deep Neural Networks to Character...
Joining Separate Paradigms:  Text Mining & Deep Neural Networks  to Character...Joining Separate Paradigms:  Text Mining & Deep Neural Networks  to Character...
Joining Separate Paradigms: Text Mining & Deep Neural Networks to Character...
 

Recently uploaded

一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Linda486226
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
John Andrews
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
ocavb
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
ewymefz
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar
 

Recently uploaded (20)

一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
 

Balanced Datasets Are Not Enough: Estimating and Mitigating Gender Bias in Deep Image Representations

  • 1. Balanced Datasets Are Not Enough: Estimating and Mitigating Gender Bias in Deep Image Representations Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest Tianlu Wang University of Virginia
  • 2. Gender Bias in Visual Recognition Systems Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest Deep Neural Network
  • 3. Gender Bias in Visual Recognition Systems Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest Trained Deep Neural Network tie:
  • 4. Quantifying Bias: Leakage Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest Trained Deep Neural Network Is this prediction biased?
  • 5. Quantifying Bias: Model Leakage Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest Gender Classifier Model Leakage: gender prediction accuracy of a classifier trained on predictions. man woman
  • 6. Object & Action Recognition Models Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest COCO Object Recognition • 22k images (16k man & 6k woman) • 80 objects (kite, ski, handbag, tie…) • Recognition performance (F1): 53.75% • model leakage: 70.46% imSitu Action Recognition • 24k images (14k man & 10k woman) • 211 activities (cooking, shooting, lifting…) • Recognition performance (F1): 40.11% • model leakage: 76.93%
  • 7. Quantifying Bias: Dataset Leakage Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest Gender Classifier Dataset Leakage: gender prediction accuracy of a classifier trained on annotations. Predictions Ground Truth Labels man woman Does the model inherit 100% dataset leakage?
  • 8. Performance Matters! Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest Predictions Ground Truth Labels Predictions F1 score = 100% Dataset Leakage = 67.72% F1 score = 53.75% Model Leakage = 70.46% Random Guess F1 score ≈ 0 NO LEAKGE!
  • 9. Random Perturbation Perturbed Labels match the performance Quantifying Bias: Adjusted Dataset Leakage Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest Gender Classifier man woman Adjusted Dataset Leakage: gender prediction accuracy of a classifier trained on perturbed annotations. Ground Truth Labels
  • 10. Quantifying Bias: Bias Amplification Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest Δ = Model Leakage – Adjusted Dataset Leakage > 0 Bias Amplification! 52 56 60 64 68 72 Model Leakage Adjusted Dataset Leakage COCO Object Recognition 50 55 60 65 70 75 80 Model Leakage Adjusted Dataset Leakage imSitu Action Recognition 20.47 9.93
  • 11. Eliminating Bias: Adding Noise Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest 35 40 45 50 55 2 4 6 8 10 F1Score(%) Bias Amplification in COCO original randomization
  • 12. Eliminating Bias: Balanced Datasets Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest man 71% woman 29% man 68% woman 38% man 50% woman 50% Original F1 score (%): 53.75 model leakage (%): 70.46 Balanced 3 F1 score (%): 52.60 model leakage (%): 67.78 Balanced 1 F1 score (%): 42.89 model leakage (%): 63.22 man 57% woman 43% Balanced 2 F1 score (%): 51.95 model leakage (%): 64.45 less images, lower performance, lower model leakage
  • 13. Eliminating Bias: Balanced Datasets Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest 35 40 45 50 55 2 4 6 8 10 F1Score(%) Bias Amplification in COCO original randomization balanced 3 balanced 2 balanced 1Balancing the co-occurance of gender and target labels does not reduce bias amplification.
  • 14. Eliminating Bias: Using Extra Annotations Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest blur-segmoriginal blackout-face blackout-segm blackout-box
  • 15. Eliminating Bias: Using Extra Annotations Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest 35 40 45 50 55 2 4 6 8 10 F1Score(%) Bias Amplification in COCO original randomization balanced 3 balanced 2 balanced 1 blackout-face blur-segm blackout-segm blackout-box
  • 16. Eliminating Bias: Adversarial Training Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest Convolutional Neural Network (Resnet-50) Fully- connected Layer + Logistic Regressors Handbag Fork Vase Spoon … Knife Car Oven Gender Classifier man woman Gradient Reversal
  • 17. Eliminating Bias: Adversarial Training Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest 35 40 45 50 55 2 4 6 8 10 F1Score(%) Bias Amplification in COCO original randomization balanced 3 balanced 2 balanced 1 blackout-face blur-segm blackout-segm blackout-box adv @ image adv@conv4 adv @ conv5
  • 18. Visualization of Adversarial Training Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest Convolutional Neural Network (Resnet-50) Fully- connected Layer + Logistic Regressors Handbag Fork Vase Spoon … Knife Car Oven Gender Classifier man woman Gradient Reversal X Mask Prediction
  • 19. Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest Adversarial Training: Removing Face Area
  • 20. Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest Adversarial Training: Removing Face and Skin
  • 21. Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest Adversarial Training: Removing Entire Person
  • 22. Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest Adversarial Training: Removing Contextual Cues
  • 23. Tianlu Wang-Gender Bias in Deep Image Representation-Applied Machine Learning Conference, Tom Tom Fest

Editor's Notes

  1. During training, we feed images containing tie and man into the model.
  2. At test time, the model is not able to recognize “tie” when there is a woman in the image.
  3. Gender information leaked through the predictions which are generated by the model.
  4. With an accuracy 70.46%, you can tell the gender of the person in the image correctly, just from the prediction. The model leaks gender information may because the dataset is not balanced.
  5. Instead of using predictions, we use ground truth labels to train the gender classifier. Gender information revealed by ground truth annotations (dataset).
  6. Perturbed labels have the same F1 score as the model, introduce some randomness which may reduce the bias Gender revealed by perturbed ground truth labels at different levels of accuracy. Or: Gender leakage of a model with certain accuracy, whose errors are due entirely to chance.
  7. compare model leakage and adjusted dataset leakage, they have same performance. Imaging we have an ideal model which has the same performance as our model but make mistakes entirely due to chance, not systematic bias.
  8. Woman and woman
  9. Woman and woman
  10. Man and man
  11. woman