SlideShare a Scribd company logo
1 of 19
Download to read offline
2019.07.09
Mitigating Unwanted Biases
with Adversarial Learning
- 인구통계학적 집단에 대한 편견이 훈련 데이터에 존재할 때, 이에 따라 훈련된 모델 또한 편견이 포함된다.
- Protected group를 modeling하려는 adversary와 model을 예측하려는 predictor를 동시에 학습시켜,
편견을 완화시키고자 한다.
- Measurements for Fairness:
> Demographic Parity
> Equality of Odds
> Equality of Opportunity
X
data
Y
predict
Z
Protected
attribute
predictor
Adversary
Adversarial Debiasing
1. Demographic parity
2. Equality of Odds
3. Equality of Opportunity
Adversarial Debiasing
Adversarial Debiasing
> 𝐿𝑃 ො
𝑦, 𝑦 를 최소화하기 위해 W를 update (using stochastic gradient descent)
Word Embedding
> 단어 유사성을 제시하지만 여기에 bias가 포함되어 있다는 문제
Adversarial Debiasing
AI Fairness 360 (AIF360)
AI Fairness 360 (AIF 360)
> 인공지능 기술의 활용 과정에서 등장할 수 있는 편향성을 시정하기 위해 IBM에서 이를 open source 형태로 발표.
https://github.com/IBM/AIF360
- Dataset과 model에서의 “편향성(bias)”을 완화하기 위한 알고리즘과 평가지표에 대한 설명
- Tutorial과 demo notebook 공개
AI Fairness 360 (AIF360)
https://arxiv.org/pdf/1810.01943.pdf
AI Fairness 360 (AIF360)
Algorithm Method
pre-processing
:데이터 자체의 편향성 문제
Re-weighing
(Kamiran&Calders, 2012)
Optimized pre-processing
(Calmon et al.,2017)
Learning fair
representation(LFR)
(Zemel et al.,2013)
Disparate import remover
(Feldman et al.,2015)
In-processing
: 특정 feature의 가중치로 인해
생성된 모델의 편향성 문제
Adversarial Debiasing
(Zhang et al.,2018)
Prejudice remover
(Kamishima et al.,2012)
Post-processing
: Test dataset자체의 편향성 문제
Equalized odds post-
processing (Hardt et al.,2016)
Calibrated eq. odds
postprocessing (Pleiss et al.,2017)
Reject Option classification
(Kamiran et al.,2012)
pre-processing
In-processing
Post-processing
Adversarial Debiasing (in-processing)
: 예측 정확도를 최대화하고 동시에 예측으로부터 protected attribute를 결정할 수 있는 Adversary’s ability를 감소시키는 classifier를 학습한다.
즉, adversary가 이용할 수 있는 집단 간 차별 정보(privileged group & unprivileged group)를 예측에 전달할 수 없기 때문에 공정한 classifier가 된다.
Adult / Census Income Dataset
In-processing _ Adversarial Debiasing
- Income이 >$50K인지를 예측하는 데이터셋
- 해당 모델에 대해 “Equality of Odds”를 강화하고자 한다.
- Protected Attribute: Sex
- Privileged Group: Male / Unprivileged Group: Female
In-processing _ Adversarial Debiasing
- Epoch: 50
- Batch_size: 128
- Plain model: without debias / model: with debias
In-processing _ Adversarial Debiasing
Statistical Parity Difference
= Pr(Unprivileged group) – Pr(privileged group)
Fairness는 -0.1~0.1의 값으로 평가된다.
Equal Opportunity Difference
= true positive rate
value < 0 , privileged group의 이익 / value > 0, unprivileged group의 이익
Fairness는 -0.1~0.1의 값으로 평가된다.
Average Odds Difference
= (false positive rate + true positive rate) / 2
value < 0, privileged group의 이익 / value > 0, unprivileged group의 이익
Fairness는 -0.1~0.1의 값으로 평가된다.
Disparate Impact
= Pr(Unprivileged group) / Pr(privileged group)
value < 1 , privileged group의 이익 / value > 1, unprivileged group의 이익
Fairness는 0.8~1.2의 값으로 평가된다.
Theil Index
= generalized entrop
각각의 data에 대한 inequality을 의미한다.
Perfect fairness = 0 (value 값이 낮을수록 fairness / 높을수록 problematic)
In-processing _ Adversarial Debiasing
Privileged Group: Male / Unprivileged Group: Female
Accuracy: 82%
Accuracy: 70%
In-processing _ Adversarial Debiasing
http://aif360.mybluemix.net/
Pre-processing _ Reweighing
The German credit dataset
> Protected attribute: AGE
> algorithm: Reweighing (pre-processing)
(protected attribute에 따라 편향성을 줄이도록 데이터셋을 변형시킨다.)
Privileged group이 training dataset에서 17%의 positive한 결과를 갖는다.
즉, 이러한 bias한 결과를 완화해야한다.
The German credit dataset
> Protected attribute: AGE
> algorithm: Reweighing (pre-processing)
(protected attribute에 따라 편향성을 줄이도록 데이터셋을 변형시킨다.)
Re-weighing model (pre-processing) 모델을 학습한 결과,
이전의 편향성이 0으로 줄어든 것을 확인할 수 있다.
Re-weighing model: classification이전에 fairness를 확인하기 위해 feature들의 조합(group, label)에 가중치를 부여한다.
Pre-processing _ Reweighing
Post-processing _ calibrate eq odds postprocessing
The Adult / Census Income dataset
> Protected attribute: Sex
> algorithm: Calibrated_eq_odds postprocessing (post-processing)
Logistic
regression
The Adult / Census Income dataset
> Protected attribute: Sex
> algorithm: Calibrated_eq_odds postprocessing (post-processing)
Post-processing
Post-processing _ calibrate eq odds postprocessing
The Adult / Census Income dataset
> Protected attribute: Sex
> algorithm: Calibrated_eq_odds postprocessing (post-processing)
Post-processing _ calibrate eq odds postprocessing
THANK YOU.

More Related Content

What's hot

AIF360 - Trusted and Fair AI
AIF360 - Trusted and Fair AIAIF360 - Trusted and Fair AI
AIF360 - Trusted and Fair AIAnimesh Singh
 
GANs Presentation.pptx
GANs Presentation.pptxGANs Presentation.pptx
GANs Presentation.pptxMAHMOUD729246
 
NLP 101 + Chatbots
NLP 101 + ChatbotsNLP 101 + Chatbots
NLP 101 + ChatbotsChris Shei
 
PPT presentation on ARTIFICIAL INTELLIGENCE
PPT presentation on ARTIFICIAL  INTELLIGENCEPPT presentation on ARTIFICIAL  INTELLIGENCE
PPT presentation on ARTIFICIAL INTELLIGENCEAnushka Ghosh
 
Bio Inspired Computing Final Version
Bio Inspired Computing Final VersionBio Inspired Computing Final Version
Bio Inspired Computing Final VersionThomas Petry
 
Deep Learning Tutorial | Deep Learning Tutorial for Beginners | Neural Networ...
Deep Learning Tutorial | Deep Learning Tutorial for Beginners | Neural Networ...Deep Learning Tutorial | Deep Learning Tutorial for Beginners | Neural Networ...
Deep Learning Tutorial | Deep Learning Tutorial for Beginners | Neural Networ...Edureka!
 
Responsible AI in Industry: Practical Challenges and Lessons Learned
Responsible AI in Industry: Practical Challenges and Lessons LearnedResponsible AI in Industry: Practical Challenges and Lessons Learned
Responsible AI in Industry: Practical Challenges and Lessons LearnedKrishnaram Kenthapadi
 
Responsible AI
Responsible AIResponsible AI
Responsible AINeo4j
 
Fairness in Machine Learning and AI
Fairness in Machine Learning and AIFairness in Machine Learning and AI
Fairness in Machine Learning and AISeth Grimes
 
The Future of AI is Generative not Discriminative 5/26/2021
The Future of AI is Generative not Discriminative 5/26/2021The Future of AI is Generative not Discriminative 5/26/2021
The Future of AI is Generative not Discriminative 5/26/2021Steve Omohundro
 
Generative Adversarial Networks (GAN)
Generative Adversarial Networks (GAN)Generative Adversarial Networks (GAN)
Generative Adversarial Networks (GAN)Manohar Mukku
 
Bias in Artificial Intelligence
Bias in Artificial IntelligenceBias in Artificial Intelligence
Bias in Artificial IntelligenceNeelima Kumar
 
Module 8: Natural language processing Pt 1
Module 8:  Natural language processing Pt 1Module 8:  Natural language processing Pt 1
Module 8: Natural language processing Pt 1Sara Hooker
 
Generative adversarial networks
Generative adversarial networksGenerative adversarial networks
Generative adversarial networksYunjey Choi
 
Ai vs machine learning vs deep learning
Ai vs machine learning vs deep learningAi vs machine learning vs deep learning
Ai vs machine learning vs deep learningSanjay Patel
 
Data science and Artificial Intelligence
Data science and Artificial IntelligenceData science and Artificial Intelligence
Data science and Artificial IntelligenceSuman Srinivasan
 
Recommender system algorithm and architecture
Recommender system algorithm and architectureRecommender system algorithm and architecture
Recommender system algorithm and architectureLiang Xiang
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)Yuriy Guts
 

What's hot (20)

AIF360 - Trusted and Fair AI
AIF360 - Trusted and Fair AIAIF360 - Trusted and Fair AI
AIF360 - Trusted and Fair AI
 
GANs Presentation.pptx
GANs Presentation.pptxGANs Presentation.pptx
GANs Presentation.pptx
 
NLP 101 + Chatbots
NLP 101 + ChatbotsNLP 101 + Chatbots
NLP 101 + Chatbots
 
PPT presentation on ARTIFICIAL INTELLIGENCE
PPT presentation on ARTIFICIAL  INTELLIGENCEPPT presentation on ARTIFICIAL  INTELLIGENCE
PPT presentation on ARTIFICIAL INTELLIGENCE
 
Bio Inspired Computing Final Version
Bio Inspired Computing Final VersionBio Inspired Computing Final Version
Bio Inspired Computing Final Version
 
Deep Learning Tutorial | Deep Learning Tutorial for Beginners | Neural Networ...
Deep Learning Tutorial | Deep Learning Tutorial for Beginners | Neural Networ...Deep Learning Tutorial | Deep Learning Tutorial for Beginners | Neural Networ...
Deep Learning Tutorial | Deep Learning Tutorial for Beginners | Neural Networ...
 
Bleu vs rouge
Bleu vs rougeBleu vs rouge
Bleu vs rouge
 
Responsible AI in Industry: Practical Challenges and Lessons Learned
Responsible AI in Industry: Practical Challenges and Lessons LearnedResponsible AI in Industry: Practical Challenges and Lessons Learned
Responsible AI in Industry: Practical Challenges and Lessons Learned
 
Responsible AI
Responsible AIResponsible AI
Responsible AI
 
Fairness in Machine Learning and AI
Fairness in Machine Learning and AIFairness in Machine Learning and AI
Fairness in Machine Learning and AI
 
Bias in AI
Bias in AIBias in AI
Bias in AI
 
The Future of AI is Generative not Discriminative 5/26/2021
The Future of AI is Generative not Discriminative 5/26/2021The Future of AI is Generative not Discriminative 5/26/2021
The Future of AI is Generative not Discriminative 5/26/2021
 
Generative Adversarial Networks (GAN)
Generative Adversarial Networks (GAN)Generative Adversarial Networks (GAN)
Generative Adversarial Networks (GAN)
 
Bias in Artificial Intelligence
Bias in Artificial IntelligenceBias in Artificial Intelligence
Bias in Artificial Intelligence
 
Module 8: Natural language processing Pt 1
Module 8:  Natural language processing Pt 1Module 8:  Natural language processing Pt 1
Module 8: Natural language processing Pt 1
 
Generative adversarial networks
Generative adversarial networksGenerative adversarial networks
Generative adversarial networks
 
Ai vs machine learning vs deep learning
Ai vs machine learning vs deep learningAi vs machine learning vs deep learning
Ai vs machine learning vs deep learning
 
Data science and Artificial Intelligence
Data science and Artificial IntelligenceData science and Artificial Intelligence
Data science and Artificial Intelligence
 
Recommender system algorithm and architecture
Recommender system algorithm and architectureRecommender system algorithm and architecture
Recommender system algorithm and architecture
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)
 

More from Danbi Cho

Crf based named entity recognition using a korean lexical semantic network
Crf based named entity recognition using a korean lexical semantic networkCrf based named entity recognition using a korean lexical semantic network
Crf based named entity recognition using a korean lexical semantic networkDanbi Cho
 
Attention boosted deep networks for video classification
Attention boosted deep networks for video classificationAttention boosted deep networks for video classification
Attention boosted deep networks for video classificationDanbi Cho
 
A survey on deep learning based approaches for action and gesture recognition...
A survey on deep learning based approaches for action and gesture recognition...A survey on deep learning based approaches for action and gesture recognition...
A survey on deep learning based approaches for action and gesture recognition...Danbi Cho
 
ELECTRA_Pretraining Text Encoders as Discriminators rather than Generators
ELECTRA_Pretraining Text Encoders as Discriminators rather than GeneratorsELECTRA_Pretraining Text Encoders as Discriminators rather than Generators
ELECTRA_Pretraining Text Encoders as Discriminators rather than GeneratorsDanbi Cho
 
A survey on automatic detection of hate speech in text
A survey on automatic detection of hate speech in textA survey on automatic detection of hate speech in text
A survey on automatic detection of hate speech in textDanbi Cho
 
Zero wall detecting zero-day web attacks through encoder-decoder recurrent ne...
Zero wall detecting zero-day web attacks through encoder-decoder recurrent ne...Zero wall detecting zero-day web attacks through encoder-decoder recurrent ne...
Zero wall detecting zero-day web attacks through encoder-decoder recurrent ne...Danbi Cho
 
Decision tree and ensemble
Decision tree and ensembleDecision tree and ensemble
Decision tree and ensembleDanbi Cho
 
Can recurrent neural networks warp time
Can recurrent neural networks warp timeCan recurrent neural networks warp time
Can recurrent neural networks warp timeDanbi Cho
 
Man is to computer programmer as woman is to homemaker debiasing word embeddings
Man is to computer programmer as woman is to homemaker debiasing word embeddingsMan is to computer programmer as woman is to homemaker debiasing word embeddings
Man is to computer programmer as woman is to homemaker debiasing word embeddingsDanbi Cho
 
Situation recognition visual semantic role labeling for image understanding
Situation recognition visual semantic role labeling for image understandingSituation recognition visual semantic role labeling for image understanding
Situation recognition visual semantic role labeling for image understandingDanbi Cho
 

More from Danbi Cho (11)

Crf based named entity recognition using a korean lexical semantic network
Crf based named entity recognition using a korean lexical semantic networkCrf based named entity recognition using a korean lexical semantic network
Crf based named entity recognition using a korean lexical semantic network
 
Gpt models
Gpt modelsGpt models
Gpt models
 
Attention boosted deep networks for video classification
Attention boosted deep networks for video classificationAttention boosted deep networks for video classification
Attention boosted deep networks for video classification
 
A survey on deep learning based approaches for action and gesture recognition...
A survey on deep learning based approaches for action and gesture recognition...A survey on deep learning based approaches for action and gesture recognition...
A survey on deep learning based approaches for action and gesture recognition...
 
ELECTRA_Pretraining Text Encoders as Discriminators rather than Generators
ELECTRA_Pretraining Text Encoders as Discriminators rather than GeneratorsELECTRA_Pretraining Text Encoders as Discriminators rather than Generators
ELECTRA_Pretraining Text Encoders as Discriminators rather than Generators
 
A survey on automatic detection of hate speech in text
A survey on automatic detection of hate speech in textA survey on automatic detection of hate speech in text
A survey on automatic detection of hate speech in text
 
Zero wall detecting zero-day web attacks through encoder-decoder recurrent ne...
Zero wall detecting zero-day web attacks through encoder-decoder recurrent ne...Zero wall detecting zero-day web attacks through encoder-decoder recurrent ne...
Zero wall detecting zero-day web attacks through encoder-decoder recurrent ne...
 
Decision tree and ensemble
Decision tree and ensembleDecision tree and ensemble
Decision tree and ensemble
 
Can recurrent neural networks warp time
Can recurrent neural networks warp timeCan recurrent neural networks warp time
Can recurrent neural networks warp time
 
Man is to computer programmer as woman is to homemaker debiasing word embeddings
Man is to computer programmer as woman is to homemaker debiasing word embeddingsMan is to computer programmer as woman is to homemaker debiasing word embeddings
Man is to computer programmer as woman is to homemaker debiasing word embeddings
 
Situation recognition visual semantic role labeling for image understanding
Situation recognition visual semantic role labeling for image understandingSituation recognition visual semantic role labeling for image understanding
Situation recognition visual semantic role labeling for image understanding
 

Recently uploaded

OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...Shane Coughlan
 
What Goes Wrong with Language Definitions and How to Improve the Situation
What Goes Wrong with Language Definitions and How to Improve the SituationWhat Goes Wrong with Language Definitions and How to Improve the Situation
What Goes Wrong with Language Definitions and How to Improve the SituationJuha-Pekka Tolvanen
 
WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2
 
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...WSO2
 
WSO2CON2024 - Why Should You Consider Ballerina for Your Next Integration
WSO2CON2024 - Why Should You Consider Ballerina for Your Next IntegrationWSO2CON2024 - Why Should You Consider Ballerina for Your Next Integration
WSO2CON2024 - Why Should You Consider Ballerina for Your Next IntegrationWSO2
 
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With SimplicityWSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With SimplicityWSO2
 
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024VictoriaMetrics
 
WSO2Con2024 - Software Delivery in Hybrid Environments
WSO2Con2024 - Software Delivery in Hybrid EnvironmentsWSO2Con2024 - Software Delivery in Hybrid Environments
WSO2Con2024 - Software Delivery in Hybrid EnvironmentsWSO2
 
WSO2CON 2024 - Designing Event-Driven Enterprises: Stories of Transformation
WSO2CON 2024 - Designing Event-Driven Enterprises: Stories of TransformationWSO2CON 2024 - Designing Event-Driven Enterprises: Stories of Transformation
WSO2CON 2024 - Designing Event-Driven Enterprises: Stories of TransformationWSO2
 
WSO2Con204 - Hard Rock Presentation - Keynote
WSO2Con204 - Hard Rock Presentation - KeynoteWSO2Con204 - Hard Rock Presentation - Keynote
WSO2Con204 - Hard Rock Presentation - KeynoteWSO2
 
WSO2CON 2024 - How CSI Piemonte Is Apifying the Public Administration
WSO2CON 2024 - How CSI Piemonte Is Apifying the Public AdministrationWSO2CON 2024 - How CSI Piemonte Is Apifying the Public Administration
WSO2CON 2024 - How CSI Piemonte Is Apifying the Public AdministrationWSO2
 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...SelfMade bd
 
WSO2CON 2024 - Not Just Microservices: Rightsize Your Services!
WSO2CON 2024 - Not Just Microservices: Rightsize Your Services!WSO2CON 2024 - Not Just Microservices: Rightsize Your Services!
WSO2CON 2024 - Not Just Microservices: Rightsize Your Services!WSO2
 
WSO2CON 2024 - IoT Needs CIAM: The Importance of Centralized IAM in a Growing...
WSO2CON 2024 - IoT Needs CIAM: The Importance of Centralized IAM in a Growing...WSO2CON 2024 - IoT Needs CIAM: The Importance of Centralized IAM in a Growing...
WSO2CON 2024 - IoT Needs CIAM: The Importance of Centralized IAM in a Growing...WSO2
 
WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open Source
WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open SourceWSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open Source
WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open SourceWSO2
 
Announcing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareAnnouncing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareJim McKeeth
 
WSO2CON 2024 - How CSI Piemonte Is Apifying the Public Administration
WSO2CON 2024 - How CSI Piemonte Is Apifying the Public AdministrationWSO2CON 2024 - How CSI Piemonte Is Apifying the Public Administration
WSO2CON 2024 - How CSI Piemonte Is Apifying the Public AdministrationWSO2
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension AidPhilip Schwarz
 
Artyushina_Guest lecture_YorkU CS May 2024.pptx
Artyushina_Guest lecture_YorkU CS May 2024.pptxArtyushina_Guest lecture_YorkU CS May 2024.pptx
Artyushina_Guest lecture_YorkU CS May 2024.pptxAnnaArtyushina1
 
WSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go PlatformlessWSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go PlatformlessWSO2
 

Recently uploaded (20)

OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
 
What Goes Wrong with Language Definitions and How to Improve the Situation
What Goes Wrong with Language Definitions and How to Improve the SituationWhat Goes Wrong with Language Definitions and How to Improve the Situation
What Goes Wrong with Language Definitions and How to Improve the Situation
 
WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?
 
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
 
WSO2CON2024 - Why Should You Consider Ballerina for Your Next Integration
WSO2CON2024 - Why Should You Consider Ballerina for Your Next IntegrationWSO2CON2024 - Why Should You Consider Ballerina for Your Next Integration
WSO2CON2024 - Why Should You Consider Ballerina for Your Next Integration
 
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With SimplicityWSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
 
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
 
WSO2Con2024 - Software Delivery in Hybrid Environments
WSO2Con2024 - Software Delivery in Hybrid EnvironmentsWSO2Con2024 - Software Delivery in Hybrid Environments
WSO2Con2024 - Software Delivery in Hybrid Environments
 
WSO2CON 2024 - Designing Event-Driven Enterprises: Stories of Transformation
WSO2CON 2024 - Designing Event-Driven Enterprises: Stories of TransformationWSO2CON 2024 - Designing Event-Driven Enterprises: Stories of Transformation
WSO2CON 2024 - Designing Event-Driven Enterprises: Stories of Transformation
 
WSO2Con204 - Hard Rock Presentation - Keynote
WSO2Con204 - Hard Rock Presentation - KeynoteWSO2Con204 - Hard Rock Presentation - Keynote
WSO2Con204 - Hard Rock Presentation - Keynote
 
WSO2CON 2024 - How CSI Piemonte Is Apifying the Public Administration
WSO2CON 2024 - How CSI Piemonte Is Apifying the Public AdministrationWSO2CON 2024 - How CSI Piemonte Is Apifying the Public Administration
WSO2CON 2024 - How CSI Piemonte Is Apifying the Public Administration
 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
 
WSO2CON 2024 - Not Just Microservices: Rightsize Your Services!
WSO2CON 2024 - Not Just Microservices: Rightsize Your Services!WSO2CON 2024 - Not Just Microservices: Rightsize Your Services!
WSO2CON 2024 - Not Just Microservices: Rightsize Your Services!
 
WSO2CON 2024 - IoT Needs CIAM: The Importance of Centralized IAM in a Growing...
WSO2CON 2024 - IoT Needs CIAM: The Importance of Centralized IAM in a Growing...WSO2CON 2024 - IoT Needs CIAM: The Importance of Centralized IAM in a Growing...
WSO2CON 2024 - IoT Needs CIAM: The Importance of Centralized IAM in a Growing...
 
WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open Source
WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open SourceWSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open Source
WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open Source
 
Announcing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareAnnouncing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK Software
 
WSO2CON 2024 - How CSI Piemonte Is Apifying the Public Administration
WSO2CON 2024 - How CSI Piemonte Is Apifying the Public AdministrationWSO2CON 2024 - How CSI Piemonte Is Apifying the Public Administration
WSO2CON 2024 - How CSI Piemonte Is Apifying the Public Administration
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
 
Artyushina_Guest lecture_YorkU CS May 2024.pptx
Artyushina_Guest lecture_YorkU CS May 2024.pptxArtyushina_Guest lecture_YorkU CS May 2024.pptx
Artyushina_Guest lecture_YorkU CS May 2024.pptx
 
WSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go PlatformlessWSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go Platformless
 

Mitigating unwanted biases with adversarial learning

  • 2. - 인구통계학적 집단에 대한 편견이 훈련 데이터에 존재할 때, 이에 따라 훈련된 모델 또한 편견이 포함된다. - Protected group를 modeling하려는 adversary와 model을 예측하려는 predictor를 동시에 학습시켜, 편견을 완화시키고자 한다. - Measurements for Fairness: > Demographic Parity > Equality of Odds > Equality of Opportunity X data Y predict Z Protected attribute predictor Adversary Adversarial Debiasing
  • 3. 1. Demographic parity 2. Equality of Odds 3. Equality of Opportunity Adversarial Debiasing
  • 4. Adversarial Debiasing > 𝐿𝑃 ො 𝑦, 𝑦 를 최소화하기 위해 W를 update (using stochastic gradient descent) Word Embedding > 단어 유사성을 제시하지만 여기에 bias가 포함되어 있다는 문제
  • 6. AI Fairness 360 (AIF360) AI Fairness 360 (AIF 360) > 인공지능 기술의 활용 과정에서 등장할 수 있는 편향성을 시정하기 위해 IBM에서 이를 open source 형태로 발표. https://github.com/IBM/AIF360 - Dataset과 model에서의 “편향성(bias)”을 완화하기 위한 알고리즘과 평가지표에 대한 설명 - Tutorial과 demo notebook 공개
  • 7. AI Fairness 360 (AIF360) https://arxiv.org/pdf/1810.01943.pdf
  • 8. AI Fairness 360 (AIF360) Algorithm Method pre-processing :데이터 자체의 편향성 문제 Re-weighing (Kamiran&Calders, 2012) Optimized pre-processing (Calmon et al.,2017) Learning fair representation(LFR) (Zemel et al.,2013) Disparate import remover (Feldman et al.,2015) In-processing : 특정 feature의 가중치로 인해 생성된 모델의 편향성 문제 Adversarial Debiasing (Zhang et al.,2018) Prejudice remover (Kamishima et al.,2012) Post-processing : Test dataset자체의 편향성 문제 Equalized odds post- processing (Hardt et al.,2016) Calibrated eq. odds postprocessing (Pleiss et al.,2017) Reject Option classification (Kamiran et al.,2012) pre-processing In-processing Post-processing
  • 9. Adversarial Debiasing (in-processing) : 예측 정확도를 최대화하고 동시에 예측으로부터 protected attribute를 결정할 수 있는 Adversary’s ability를 감소시키는 classifier를 학습한다. 즉, adversary가 이용할 수 있는 집단 간 차별 정보(privileged group & unprivileged group)를 예측에 전달할 수 없기 때문에 공정한 classifier가 된다. Adult / Census Income Dataset In-processing _ Adversarial Debiasing
  • 10. - Income이 >$50K인지를 예측하는 데이터셋 - 해당 모델에 대해 “Equality of Odds”를 강화하고자 한다. - Protected Attribute: Sex - Privileged Group: Male / Unprivileged Group: Female In-processing _ Adversarial Debiasing
  • 11. - Epoch: 50 - Batch_size: 128 - Plain model: without debias / model: with debias In-processing _ Adversarial Debiasing
  • 12. Statistical Parity Difference = Pr(Unprivileged group) – Pr(privileged group) Fairness는 -0.1~0.1의 값으로 평가된다. Equal Opportunity Difference = true positive rate value < 0 , privileged group의 이익 / value > 0, unprivileged group의 이익 Fairness는 -0.1~0.1의 값으로 평가된다. Average Odds Difference = (false positive rate + true positive rate) / 2 value < 0, privileged group의 이익 / value > 0, unprivileged group의 이익 Fairness는 -0.1~0.1의 값으로 평가된다. Disparate Impact = Pr(Unprivileged group) / Pr(privileged group) value < 1 , privileged group의 이익 / value > 1, unprivileged group의 이익 Fairness는 0.8~1.2의 값으로 평가된다. Theil Index = generalized entrop 각각의 data에 대한 inequality을 의미한다. Perfect fairness = 0 (value 값이 낮을수록 fairness / 높을수록 problematic) In-processing _ Adversarial Debiasing
  • 13. Privileged Group: Male / Unprivileged Group: Female Accuracy: 82% Accuracy: 70% In-processing _ Adversarial Debiasing http://aif360.mybluemix.net/
  • 14. Pre-processing _ Reweighing The German credit dataset > Protected attribute: AGE > algorithm: Reweighing (pre-processing) (protected attribute에 따라 편향성을 줄이도록 데이터셋을 변형시킨다.) Privileged group이 training dataset에서 17%의 positive한 결과를 갖는다. 즉, 이러한 bias한 결과를 완화해야한다.
  • 15. The German credit dataset > Protected attribute: AGE > algorithm: Reweighing (pre-processing) (protected attribute에 따라 편향성을 줄이도록 데이터셋을 변형시킨다.) Re-weighing model (pre-processing) 모델을 학습한 결과, 이전의 편향성이 0으로 줄어든 것을 확인할 수 있다. Re-weighing model: classification이전에 fairness를 확인하기 위해 feature들의 조합(group, label)에 가중치를 부여한다. Pre-processing _ Reweighing
  • 16. Post-processing _ calibrate eq odds postprocessing The Adult / Census Income dataset > Protected attribute: Sex > algorithm: Calibrated_eq_odds postprocessing (post-processing) Logistic regression
  • 17. The Adult / Census Income dataset > Protected attribute: Sex > algorithm: Calibrated_eq_odds postprocessing (post-processing) Post-processing Post-processing _ calibrate eq odds postprocessing
  • 18. The Adult / Census Income dataset > Protected attribute: Sex > algorithm: Calibrated_eq_odds postprocessing (post-processing) Post-processing _ calibrate eq odds postprocessing