SlideShare a Scribd company logo
1 of 30
Download to read offline
Combating Bias in
Production Computer
Vision Systems
Alex Thaman
Chief Architect
Red Cell Partners
About Red Cell Partners
2
© 2023 Red Cell Partners
Red Cell is an incubation firm that builds & invests in rapidly scalable, technology-led
companies bringing revolutionary advancements to market in National Security & Healthcare,
sectors where we have a distinct competitive advantage.
Our Mission
United by a shared sense of duty & deep belief in the power of innovation, we
develop technology to address our Nation’s most pressing problems.
Domain knowledge
Technology expertise (Big Data,
AI/ML, AR/VR, Kinetics, etc.)
Strategic networks
Exceptional capabilities in
regulated end markets
Our Strategy
We incubate & invest in technology-led companies that address key issues in
National Security & Healthcare, backing the most promising emerging firms &
dramatically accelerating their growth by leveraging our team’s:
3
© 2023 Red Cell Partners
What Is The Problem?
Understanding and Evaluating Racial
Biases in Image Captioning
• Males appear 2x more than females
• Light skin appears 7.5x more than dark
• Dark-skinned females appear 23.1x less
frequently than dark-skinned males
• Lighter-skinned appear more with
indoor and furniture objects
• Darker-skinned appear more with
outdoor and vehicle objects
Dataset Statistics in Coco Dataset
4
© 2023 Red Cell Partners
Zhao, D., Wang, A., & Russakovsky, O. (2021). Understanding and
Evaluating Racial Biases in Image Captioning. In International Conference
on Computer Vision (ICCV).
1
2
3
4
5
6
• CLIP
• 400M images from 500K word queries
based based on frequency in Wikipedia
• Multimodal neurons in artificial neural
networks (openai.com)
• We have observed, for example, a
“Middle East” neuron with an
association with terrorism; and an
“immigration” neuron that responds to
Latin America. We have even found a
neuron that fires for both dark-skinned
people and gorillas...
Encoded Bias
5
© 2023 Red Cell Partners
https://microscope.openai.com/models/contrastive_v2/image_
block_4_2_Add_6_0/1895
Generative AI Bias
6
© 2023 Red Cell Partners
Prompt: “A person cleaning a living room”
Source: Bing Image Creator
Generative AI Bias
7
© 2023 Red Cell Partners
Prompt: “A housecleaner cleaning a bathroom”
Source: Bing Image Creator
Generative AI Bias
8
© 2023 Red Cell Partners
Search Query: “A housecleaner cleaning a bathroom”
Source: Google Image Search
Generative AI Bias
9
© 2023 Red Cell Partners
Prompt: “A nurse taking care of a hospital patient”
Source: Bing Image Creator
10
© 2023 Red Cell Partners
Measuring Bias
• Labeling data for analytics
• Distribution mismatches
• Find correlations
• Consider labeling costs
• Sampling your samples
• Instructions
• Pre-labeling
Metadata Labeling
11
© 2023 Red Cell Partners
Zhao, D., Wang, A., & Russakovsky, O. (2021). Understanding and
Evaluating Racial Biases in Image Captioning. In International Conference
on Computer Vision (ICCV).
• Fitzpatrick scale
• 6 classifications
• Skewed towards white skin
variations
• Monk scale
• 10 classifications
• Tested socially across population
groups for fair representation
Skin Tone Labeling
12
© 2023 Red Cell Partners
Metadata Statistics
13
© 2023 Red Cell Partners
Metadata Statistics
14
© 2023 Red Cell Partners
• Live data collection
• Sample data in production
• Compare inference results across
dimensions
• Without access to the data
• Human supervision is difficult
• Use metadata models for insights
• Data drift in embedding space –
KL divergence, WS distance, etc.
Measuring Bias in Production
15
© 2023 Red Cell Partners
UMAP of CLIP embedding of cropped boxes, colored by class
16
© 2023 Red Cell Partners
Controlling Bias
• Data collection
• Targeted data collection for specific population groups
• Collect more data than you think you need
• Dataset balancing
• Sample training data across dimensions
• Overweight underrepresented data
Controlling Bias
17
© 2023 Red Cell Partners
• Synthetic data
• Infinite labels/metadata
• Study model behavior
across dimensions
• Learn domain-invariant
features via pre-training
Controlling Bias
18
© 2023 Red Cell Partners
Microsoft Face Synthetics: https://microsoft.github.io/FaceSynthetics/
Synthetic Data Study
19
© 2023 Red Cell Partners
20
© 2023 Red Cell Partners
Other Considerations
• Skin tone
• Perceived differently by
different cultures
• Multidimensional
• Image characteristics and
environment
• Sex/gender not globally
normalized
Descriptive Metadata Challenges
21
© 2023 Red Cell Partners
What skin tone are these 2 people?
• “To have any inductive process make predictions on unseen
data, an agent requires a bias. What constitutes a good bias is
an empirical question about which biases work best in practice”
Poole, D. & Mackworth, A. (2019). Artificial Intelligence: Foundations of Computational Agents.
• Think about tradeoffs
• Can the application layer help?
The Cost of Bias
22
© 2023 Red Cell Partners
Clothes fitting
Racial associations Obscure Misrecognitions
Helpful
Harmful
• All AI systems have bias that results from the data that was used
• AI systems need deep evaluation prior to widespread deployment
• Analysis through additional metadata, statistics, and controlled
experiments help predict how these systems may perform in the
real world
Conclusions
23
© 2023 Red Cell Partners
Resources
Contact Info
LinkedIn:
https://www.linkedin.com/in/alex-
thaman-93436659/
Email:
alex.thaman@redcellpartners.com
24
© 2023 Red Cell Partners
GitHub repository for this talk
https://github.com/alexthaman/evs2023
Understanding and Evaluating Racial
Biases in Image Captioning
https://arxiv.org/abs/2106.08503
Themis AI – evaluate bias in your model
https://themisai.io/
25
© 2023 Red Cell Partners
Appendix
• Detecting data drift - https://www.evidentlyai.com/blog/data-drift-
detection-large-datasets
• Monk Scale blog
https://blog.google/technology/research/ai-monk-scale-skin-tone-
story/
• REVISE (open source tool for evaluating bias) -
https://github.com/princetonvisualai/revise-tool
• Other products to evaluate bias: Manot (https://www.manot.ai/)
Additional Resources
26
© 2023 Red Cell Partners
• Digital Humans Synthetic data
• Synthesis AI - https://synthesis.ai/
• Datagen - https://datagen.tech/
• Infinity AI - https://infinity.ai/
• Unity Digital Humans - https://github.com/Unity-
Technologies/com.unity.cv.synthetichumans
• Microsoft Face Synthetics - https://microsoft.github.io/FaceSynthetics/
Additional Resources
27
© 2023 Red Cell Partners
Bias AI systems produce biased humans: Experiment Setup
AI-Human Feedback Loop
28
© 2023 Red Cell Partners
Glickman, M., & Sharot, T. (2022, November 15). Biased AI systems produce biased humans.
https://doi.org/10.31219/osf.io/c4e7r
AI-Human Feedback Loop
29
© 2023 Red Cell Partners
Glickman, M., & Sharot, T. (2022, November 15). Biased AI systems produce biased humans.
https://doi.org/10.31219/osf.io/c4e7r
Conclusion 1: Human-AI interactions create bias feedback loops
AI-Human Feedback Loop
30
© 2023 Red Cell Partners
Glickman, M., & Sharot, T. (2022, November 15). Biased AI systems produce biased humans.
https://doi.org/10.31219/osf.io/c4e7r
Conclusion 2: Humans underestimate the impact of the bias from AI

More Related Content

Similar to “Combating Bias in Production Computer Vision Systems,” a Presentation from Red Cell Partners

Keynote: Graphs in Government_Lance Walter, CMO
Keynote:  Graphs in Government_Lance Walter, CMOKeynote:  Graphs in Government_Lance Walter, CMO
Keynote: Graphs in Government_Lance Walter, CMONeo4j
 
Community of practice on socio-economic data
Community of practice on socio-economic dataCommunity of practice on socio-economic data
Community of practice on socio-economic dataCGIAR
 
Community of practice on socio-economic data
Community of practice on socio-economic dataCommunity of practice on socio-economic data
Community of practice on socio-economic dataIFPRI-PIM
 
How to Consume Your Data for AI
How to Consume Your Data for AIHow to Consume Your Data for AI
How to Consume Your Data for AIDATAVERSITY
 
Distributed Trust Architecture: The New Reality of ML-based Systems
Distributed Trust Architecture: The New Reality of ML-based SystemsDistributed Trust Architecture: The New Reality of ML-based Systems
Distributed Trust Architecture: The New Reality of ML-based SystemsLiming Zhu
 
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...Denodo
 
From information to intelligence
From information to intelligence From information to intelligence
From information to intelligence Srini Koushik
 
The Dangers of Machine Learning
The Dangers of Machine LearningThe Dangers of Machine Learning
The Dangers of Machine LearningtothepointIT
 
Scaling Training Data for AI Applications
Scaling Training Data for AI ApplicationsScaling Training Data for AI Applications
Scaling Training Data for AI ApplicationsApplause
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
 
Glantus Presentation Slides - Ethical Data Science - BoI Analytics Connect 2018
Glantus Presentation Slides - Ethical Data Science - BoI Analytics Connect 2018Glantus Presentation Slides - Ethical Data Science - BoI Analytics Connect 2018
Glantus Presentation Slides - Ethical Data Science - BoI Analytics Connect 2018Joe Keating
 
Data minig with Big data analysis
Data minig with Big data analysisData minig with Big data analysis
Data minig with Big data analysisPoonam Kshirsagar
 
How would AI shape Future Integrations?
How would AI shape Future Integrations?How would AI shape Future Integrations?
How would AI shape Future Integrations?Srinath Perera
 
El camino hacia el éxito con las bases de datos de grafos, la ciencia de dato...
El camino hacia el éxito con las bases de datos de grafos, la ciencia de dato...El camino hacia el éxito con las bases de datos de grafos, la ciencia de dato...
El camino hacia el éxito con las bases de datos de grafos, la ciencia de dato...Neo4j
 
The value of big data analytics
The value of big data analyticsThe value of big data analytics
The value of big data analyticsMarc Vael
 
Discovering Big Data in the Fog: Why Catalogs Matter
 Discovering Big Data in the Fog: Why Catalogs Matter Discovering Big Data in the Fog: Why Catalogs Matter
Discovering Big Data in the Fog: Why Catalogs MatterEric Kavanagh
 
Keith prabhu global high on cloud summit
Keith prabhu  global high on cloud summitKeith prabhu  global high on cloud summit
Keith prabhu global high on cloud summitadministrator_confidis
 
Glantus Presentation: Ethical Data Science - BoI Analytics Connect 2018
Glantus Presentation: Ethical Data Science - BoI Analytics Connect 2018Glantus Presentation: Ethical Data Science - BoI Analytics Connect 2018
Glantus Presentation: Ethical Data Science - BoI Analytics Connect 2018Joe Keating
 
Digital Transformation Business Evolution
Digital Transformation Business Evolution Digital Transformation Business Evolution
Digital Transformation Business Evolution Digital Catapult
 
UNICOM Conference on Digital Transformation - The Trust Framework Initiative ...
UNICOM Conference on Digital Transformation - The Trust Framework Initiative ...UNICOM Conference on Digital Transformation - The Trust Framework Initiative ...
UNICOM Conference on Digital Transformation - The Trust Framework Initiative ...MicheleNati
 

Similar to “Combating Bias in Production Computer Vision Systems,” a Presentation from Red Cell Partners (20)

Keynote: Graphs in Government_Lance Walter, CMO
Keynote:  Graphs in Government_Lance Walter, CMOKeynote:  Graphs in Government_Lance Walter, CMO
Keynote: Graphs in Government_Lance Walter, CMO
 
Community of practice on socio-economic data
Community of practice on socio-economic dataCommunity of practice on socio-economic data
Community of practice on socio-economic data
 
Community of practice on socio-economic data
Community of practice on socio-economic dataCommunity of practice on socio-economic data
Community of practice on socio-economic data
 
How to Consume Your Data for AI
How to Consume Your Data for AIHow to Consume Your Data for AI
How to Consume Your Data for AI
 
Distributed Trust Architecture: The New Reality of ML-based Systems
Distributed Trust Architecture: The New Reality of ML-based SystemsDistributed Trust Architecture: The New Reality of ML-based Systems
Distributed Trust Architecture: The New Reality of ML-based Systems
 
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
 
From information to intelligence
From information to intelligence From information to intelligence
From information to intelligence
 
The Dangers of Machine Learning
The Dangers of Machine LearningThe Dangers of Machine Learning
The Dangers of Machine Learning
 
Scaling Training Data for AI Applications
Scaling Training Data for AI ApplicationsScaling Training Data for AI Applications
Scaling Training Data for AI Applications
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Glantus Presentation Slides - Ethical Data Science - BoI Analytics Connect 2018
Glantus Presentation Slides - Ethical Data Science - BoI Analytics Connect 2018Glantus Presentation Slides - Ethical Data Science - BoI Analytics Connect 2018
Glantus Presentation Slides - Ethical Data Science - BoI Analytics Connect 2018
 
Data minig with Big data analysis
Data minig with Big data analysisData minig with Big data analysis
Data minig with Big data analysis
 
How would AI shape Future Integrations?
How would AI shape Future Integrations?How would AI shape Future Integrations?
How would AI shape Future Integrations?
 
El camino hacia el éxito con las bases de datos de grafos, la ciencia de dato...
El camino hacia el éxito con las bases de datos de grafos, la ciencia de dato...El camino hacia el éxito con las bases de datos de grafos, la ciencia de dato...
El camino hacia el éxito con las bases de datos de grafos, la ciencia de dato...
 
The value of big data analytics
The value of big data analyticsThe value of big data analytics
The value of big data analytics
 
Discovering Big Data in the Fog: Why Catalogs Matter
 Discovering Big Data in the Fog: Why Catalogs Matter Discovering Big Data in the Fog: Why Catalogs Matter
Discovering Big Data in the Fog: Why Catalogs Matter
 
Keith prabhu global high on cloud summit
Keith prabhu  global high on cloud summitKeith prabhu  global high on cloud summit
Keith prabhu global high on cloud summit
 
Glantus Presentation: Ethical Data Science - BoI Analytics Connect 2018
Glantus Presentation: Ethical Data Science - BoI Analytics Connect 2018Glantus Presentation: Ethical Data Science - BoI Analytics Connect 2018
Glantus Presentation: Ethical Data Science - BoI Analytics Connect 2018
 
Digital Transformation Business Evolution
Digital Transformation Business Evolution Digital Transformation Business Evolution
Digital Transformation Business Evolution
 
UNICOM Conference on Digital Transformation - The Trust Framework Initiative ...
UNICOM Conference on Digital Transformation - The Trust Framework Initiative ...UNICOM Conference on Digital Transformation - The Trust Framework Initiative ...
UNICOM Conference on Digital Transformation - The Trust Framework Initiative ...
 

More from Edge AI and Vision Alliance

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...Edge AI and Vision Alliance
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...Edge AI and Vision Alliance
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...Edge AI and Vision Alliance
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...Edge AI and Vision Alliance
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...Edge AI and Vision Alliance
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...Edge AI and Vision Alliance
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...Edge AI and Vision Alliance
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsightsEdge AI and Vision Alliance
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...Edge AI and Vision Alliance
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...Edge AI and Vision Alliance
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...Edge AI and Vision Alliance
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...Edge AI and Vision Alliance
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...Edge AI and Vision Alliance
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...Edge AI and Vision Alliance
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...Edge AI and Vision Alliance
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from SamsaraEdge AI and Vision Alliance
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...Edge AI and Vision Alliance
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...Edge AI and Vision Alliance
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...Edge AI and Vision Alliance
 
“Tracking and Fusing Diverse Risk Factors to Drive a SAFER Future,” a Present...
“Tracking and Fusing Diverse Risk Factors to Drive a SAFER Future,” a Present...“Tracking and Fusing Diverse Risk Factors to Drive a SAFER Future,” a Present...
“Tracking and Fusing Diverse Risk Factors to Drive a SAFER Future,” a Present...Edge AI and Vision Alliance
 

More from Edge AI and Vision Alliance (20)

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
 
“Tracking and Fusing Diverse Risk Factors to Drive a SAFER Future,” a Present...
“Tracking and Fusing Diverse Risk Factors to Drive a SAFER Future,” a Present...“Tracking and Fusing Diverse Risk Factors to Drive a SAFER Future,” a Present...
“Tracking and Fusing Diverse Risk Factors to Drive a SAFER Future,” a Present...
 

Recently uploaded

costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 

Recently uploaded (20)

costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 

“Combating Bias in Production Computer Vision Systems,” a Presentation from Red Cell Partners

  • 1. Combating Bias in Production Computer Vision Systems Alex Thaman Chief Architect Red Cell Partners
  • 2. About Red Cell Partners 2 © 2023 Red Cell Partners Red Cell is an incubation firm that builds & invests in rapidly scalable, technology-led companies bringing revolutionary advancements to market in National Security & Healthcare, sectors where we have a distinct competitive advantage. Our Mission United by a shared sense of duty & deep belief in the power of innovation, we develop technology to address our Nation’s most pressing problems. Domain knowledge Technology expertise (Big Data, AI/ML, AR/VR, Kinetics, etc.) Strategic networks Exceptional capabilities in regulated end markets Our Strategy We incubate & invest in technology-led companies that address key issues in National Security & Healthcare, backing the most promising emerging firms & dramatically accelerating their growth by leveraging our team’s:
  • 3. 3 © 2023 Red Cell Partners What Is The Problem?
  • 4. Understanding and Evaluating Racial Biases in Image Captioning • Males appear 2x more than females • Light skin appears 7.5x more than dark • Dark-skinned females appear 23.1x less frequently than dark-skinned males • Lighter-skinned appear more with indoor and furniture objects • Darker-skinned appear more with outdoor and vehicle objects Dataset Statistics in Coco Dataset 4 © 2023 Red Cell Partners Zhao, D., Wang, A., & Russakovsky, O. (2021). Understanding and Evaluating Racial Biases in Image Captioning. In International Conference on Computer Vision (ICCV). 1 2 3 4 5 6
  • 5. • CLIP • 400M images from 500K word queries based based on frequency in Wikipedia • Multimodal neurons in artificial neural networks (openai.com) • We have observed, for example, a “Middle East” neuron with an association with terrorism; and an “immigration” neuron that responds to Latin America. We have even found a neuron that fires for both dark-skinned people and gorillas... Encoded Bias 5 © 2023 Red Cell Partners https://microscope.openai.com/models/contrastive_v2/image_ block_4_2_Add_6_0/1895
  • 6. Generative AI Bias 6 © 2023 Red Cell Partners Prompt: “A person cleaning a living room” Source: Bing Image Creator
  • 7. Generative AI Bias 7 © 2023 Red Cell Partners Prompt: “A housecleaner cleaning a bathroom” Source: Bing Image Creator
  • 8. Generative AI Bias 8 © 2023 Red Cell Partners Search Query: “A housecleaner cleaning a bathroom” Source: Google Image Search
  • 9. Generative AI Bias 9 © 2023 Red Cell Partners Prompt: “A nurse taking care of a hospital patient” Source: Bing Image Creator
  • 10. 10 © 2023 Red Cell Partners Measuring Bias
  • 11. • Labeling data for analytics • Distribution mismatches • Find correlations • Consider labeling costs • Sampling your samples • Instructions • Pre-labeling Metadata Labeling 11 © 2023 Red Cell Partners Zhao, D., Wang, A., & Russakovsky, O. (2021). Understanding and Evaluating Racial Biases in Image Captioning. In International Conference on Computer Vision (ICCV).
  • 12. • Fitzpatrick scale • 6 classifications • Skewed towards white skin variations • Monk scale • 10 classifications • Tested socially across population groups for fair representation Skin Tone Labeling 12 © 2023 Red Cell Partners
  • 13. Metadata Statistics 13 © 2023 Red Cell Partners
  • 14. Metadata Statistics 14 © 2023 Red Cell Partners
  • 15. • Live data collection • Sample data in production • Compare inference results across dimensions • Without access to the data • Human supervision is difficult • Use metadata models for insights • Data drift in embedding space – KL divergence, WS distance, etc. Measuring Bias in Production 15 © 2023 Red Cell Partners UMAP of CLIP embedding of cropped boxes, colored by class
  • 16. 16 © 2023 Red Cell Partners Controlling Bias
  • 17. • Data collection • Targeted data collection for specific population groups • Collect more data than you think you need • Dataset balancing • Sample training data across dimensions • Overweight underrepresented data Controlling Bias 17 © 2023 Red Cell Partners
  • 18. • Synthetic data • Infinite labels/metadata • Study model behavior across dimensions • Learn domain-invariant features via pre-training Controlling Bias 18 © 2023 Red Cell Partners Microsoft Face Synthetics: https://microsoft.github.io/FaceSynthetics/
  • 19. Synthetic Data Study 19 © 2023 Red Cell Partners
  • 20. 20 © 2023 Red Cell Partners Other Considerations
  • 21. • Skin tone • Perceived differently by different cultures • Multidimensional • Image characteristics and environment • Sex/gender not globally normalized Descriptive Metadata Challenges 21 © 2023 Red Cell Partners What skin tone are these 2 people?
  • 22. • “To have any inductive process make predictions on unseen data, an agent requires a bias. What constitutes a good bias is an empirical question about which biases work best in practice” Poole, D. & Mackworth, A. (2019). Artificial Intelligence: Foundations of Computational Agents. • Think about tradeoffs • Can the application layer help? The Cost of Bias 22 © 2023 Red Cell Partners Clothes fitting Racial associations Obscure Misrecognitions Helpful Harmful
  • 23. • All AI systems have bias that results from the data that was used • AI systems need deep evaluation prior to widespread deployment • Analysis through additional metadata, statistics, and controlled experiments help predict how these systems may perform in the real world Conclusions 23 © 2023 Red Cell Partners
  • 24. Resources Contact Info LinkedIn: https://www.linkedin.com/in/alex- thaman-93436659/ Email: alex.thaman@redcellpartners.com 24 © 2023 Red Cell Partners GitHub repository for this talk https://github.com/alexthaman/evs2023 Understanding and Evaluating Racial Biases in Image Captioning https://arxiv.org/abs/2106.08503 Themis AI – evaluate bias in your model https://themisai.io/
  • 25. 25 © 2023 Red Cell Partners Appendix
  • 26. • Detecting data drift - https://www.evidentlyai.com/blog/data-drift- detection-large-datasets • Monk Scale blog https://blog.google/technology/research/ai-monk-scale-skin-tone- story/ • REVISE (open source tool for evaluating bias) - https://github.com/princetonvisualai/revise-tool • Other products to evaluate bias: Manot (https://www.manot.ai/) Additional Resources 26 © 2023 Red Cell Partners
  • 27. • Digital Humans Synthetic data • Synthesis AI - https://synthesis.ai/ • Datagen - https://datagen.tech/ • Infinity AI - https://infinity.ai/ • Unity Digital Humans - https://github.com/Unity- Technologies/com.unity.cv.synthetichumans • Microsoft Face Synthetics - https://microsoft.github.io/FaceSynthetics/ Additional Resources 27 © 2023 Red Cell Partners
  • 28. Bias AI systems produce biased humans: Experiment Setup AI-Human Feedback Loop 28 © 2023 Red Cell Partners Glickman, M., & Sharot, T. (2022, November 15). Biased AI systems produce biased humans. https://doi.org/10.31219/osf.io/c4e7r
  • 29. AI-Human Feedback Loop 29 © 2023 Red Cell Partners Glickman, M., & Sharot, T. (2022, November 15). Biased AI systems produce biased humans. https://doi.org/10.31219/osf.io/c4e7r Conclusion 1: Human-AI interactions create bias feedback loops
  • 30. AI-Human Feedback Loop 30 © 2023 Red Cell Partners Glickman, M., & Sharot, T. (2022, November 15). Biased AI systems produce biased humans. https://doi.org/10.31219/osf.io/c4e7r Conclusion 2: Humans underestimate the impact of the bias from AI