Generative AI for Social Good at Open Data Science East 2024

•Download as PPTX, PDF•

0 likes•17 views

A brief overview of generative AI technologies and their use for social good initiatives, including cultural training, medical image generation, drug design, and public health.

Data & Analytics

Generative
AI for Social
Good
Colleen Farrelly, Post Urban

What is generative AI?
• Deep learning frameworks that can produce new data based on
input prompts and large training datasets
• Can have any/all of these steps in the framework:
• Encoder-decoder structure
• Training sample matches
• Random noise and blending components
• Comparison steps to ensure realism

Examples
ChatGPT
DALL-E
Stable Diffusion
Large Language Models on
Hugging Face
Custom Generative Adversarial
Networks for other data types

Text Generators
• Massive training datasets
• Typically scraped and
possibly quality controlled
• Mostly in English
• Deep learning frameworks with
billions of parameters to train
• Can be modified by fine-tuning
• Specific examples relevant to
text generation task at hand
• LoRA as quicker way to train

Image Generators
• Many types
• Encoder-decoder steps in some
• Pull up related images
• Blend images
• Add random noise to back-fill
• Image generators plus comparison steps
• Two competing generators with one a
few training steps ahead of the other
• Comparison step to benchmark
against real dataset
• Some rely heavily on topology

Case 1: Medical
Image Generation
• Medical imaging data issues:
• Small sample sizes
• Sample imbalance (rare diseases…)
• Issues when augmenting small samples
or imbalanced samples:
• Biological structure fidelity in
generation (ex: ventricles in brain)
• Image variety in generation

TopoGAN
• Solution involves a generative
adversarial network with
topological awareness
• Topology
• Betti number
introduction
• Advantages:
• Preserves structures
like branching and
loops
• Generates large
number of images
close to target images

Case 2: Human Resource Diversity
Training
• Mindbloom
• Addresses training needs by providing synthetic people with whom to discuss
several types of conversations
• Employee reporting sexual harassment
• Addressing cultural mismatch of new employee
• Policy changes that impact employees
• Misgendering in the workplace
• Conversation and voice generation with proprietary generative algorithms
• Demo

Automated Reporting on Skill Improvement

Case 3: Protein
Generation
• Designing and testing new drugs takes a lot of
time and money.
• Not good for new pandemics in urgent need
of treatment
• Increased drug costs for consumers
• Many types of proteins/molecules in venom of
different animals
• Metalloproteinases, three finger toxins,
phospholipidase A2, disintigrins…
• Varies by geography and species
• Slight modifications of toxins as good
initial drug designs

Graph Generators
• Approach to protein/molecule-specific generative models:
• Translate protein/molecule to graph form
• Define properties of interest (solubility, for instance) or binding score
• Create generative model to work on generating similar graphs
• GAN trials generate new proteins/molecules with:
• Better target properties
• More variety
• Less time/cost to generation than other models/human generation

Case 4: Public Health Campaigns
• Many recent infectious diseases that can be spread from person to
person:
• Ebola
• COVID-19
• HIV
• Issues with traditional generation of video and poster messaging to
address behaviors contributing to spread
• Time to create script, image, and translations for local populations
• Lives lost in delays

Coupling Generators
• Generate culturally-relevant
images
• Generate text
• Translate text to local languages

Potential Bad
Behaviors
• Deep fakes
• Fake news
• Biased data
• Hallucinations and jailbreaks
• Manipulation of algorithm by
text engineering

Open-Source Resources
• https://huggingface.co/models
• https://www.craiyon.com/
• https://github.com/TopoXLab/TopoGAN-ECCV2020
• https://github.com/Biomatter-Designs/ProteinGAN

Similar to Generative AI for Social Good at Open Data Science East 2024

ASA conference Feb 2013mrkwr

Introduction•Super Computer developed by IBM Research•Named for .pdfanupambedcovers

Melissa Informatics - Data Quality and AImelissadata

Considerations and challenges in building an end to-end microbiome workflowEagle Genomics

1 d.1Society for Scholarly Publishing

N=10^9: Automated Experimentation at ScaleOptimizely

Social Listening for Scientists - BLA Case StudyMasood Akhtar

(Em)Powering Science: High-Performance Infrastructure in Biomedical ScienceAri Berman

2016 09 cxo forumChris Dwan

Health information professionals and Artificial Intelligencecoxamcoxam

Text MiningBiniam Asnake

The Simulacrum, a Synthetic Cancer DatasetCongChen35

Ethics and computing to healthcareBoysRelax

How to do science in a large IT company (ICPC World Finals 2021, Moscow)Alexander Borzunov

Can we induce change with what we measure?Michaela Greiler

Ontologies: What Librarians Need to KnowBarry Smith

MIS Unit-2.pptxZulfequarAliAhmad

Using Bioinformatics Data to inform Therapeutics discovery and developmentEleanor Howe

Intro_To_FHIR.pptxPierluigi10

Intro to machine learningTamir Taha

Similar to Generative AI for Social Good at Open Data Science East 2024 (20)

ASA conference Feb 2013

Introduction•Super Computer developed by IBM Research•Named for .pdf

Melissa Informatics - Data Quality and AI

Considerations and challenges in building an end to-end microbiome workflow

1 d.1

N=10^9: Automated Experimentation at Scale

Social Listening for Scientists - BLA Case Study

(Em)Powering Science: High-Performance Infrastructure in Biomedical Science

2016 09 cxo forum

Health information professionals and Artificial Intelligence

Text Mining

The Simulacrum, a Synthetic Cancer Dataset

Ethics and computing to healthcare

How to do science in a large IT company (ICPC World Finals 2021, Moscow)

Can we induce change with what we measure?

Ontologies: What Librarians Need to Know

MIS Unit-2.pptx

Using Bioinformatics Data to inform Therapeutics discovery and development

Intro_To_FHIR.pptx

Intro to machine learning

Recently uploaded

Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H

VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor

RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh

Data Warehouse , Data Cube Computationsit20ad004

Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha

VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor

Data Science Jobs and Salaries Analysis.pptxFurkanTasci3

{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal

B2 Creative Industry Response Evaluation.docxStephen266013

Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083

High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh

Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha

VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor

Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna

RadioAdProWritingCinderellabyButleri.pdfgstagge

꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083

Brighton SEO | April 2024 | Data StorytellingNeil Barnes

Ukraine War presentation: KNOW THE BASICSAishani27

Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson

Industrialised data - the key to AI success.pdfLars Albertsson

Recently uploaded (20)

Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf

VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati

RA-11058_IRR-COMPRESS Do 198 series of 1998

Data Warehouse , Data Cube Computation

Call Girls In Mahipalpur O9654467111 Escorts Service

VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...

Data Science Jobs and Salaries Analysis.pptx

{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...

B2 Creative Industry Response Evaluation.docx

Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call

High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...

Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...

VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130

Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...

RadioAdProWritingCinderellabyButleri.pdf

꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call

Brighton SEO | April 2024 | Data Storytelling

Ukraine War presentation: KNOW THE BASICS

Schema on read is obsolete. Welcome metaprogramming..pdf

Industrialised data - the key to AI success.pdf

Generative AI for Social Good at Open Data Science East 2024

1. Generative AI for Social Good Colleen Farrelly, Post Urban

2. What is generative AI? • Deep learning frameworks that can produce new data based on input prompts and large training datasets • Can have any/all of these steps in the framework: • Encoder-decoder structure • Training sample matches • Random noise and blending components • Comparison steps to ensure realism

3. Examples ChatGPT DALL-E Stable Diffusion Large Language Models on Hugging Face Custom Generative Adversarial Networks for other data types

4. Text Generators • Massive training datasets • Typically scraped and possibly quality controlled • Mostly in English • Deep learning frameworks with billions of parameters to train • Can be modified by fine-tuning • Specific examples relevant to text generation task at hand • LoRA as quicker way to train

5. Image Generators • Many types • Encoder-decoder steps in some • Pull up related images • Blend images • Add random noise to back-fill • Image generators plus comparison steps • Two competing generators with one a few training steps ahead of the other • Comparison step to benchmark against real dataset • Some rely heavily on topology

6. Case Studies

7. Case 1: Medical Image Generation • Medical imaging data issues: • Small sample sizes • Sample imbalance (rare diseases…) • Issues when augmenting small samples or imbalanced samples: • Biological structure fidelity in generation (ex: ventricles in brain) • Image variety in generation

8. TopoGAN • Solution involves a generative adversarial network with topological awareness • Topology • Betti number introduction • Advantages: • Preserves structures like branching and loops • Generates large number of images close to target images

9. Case 2: Human Resource Diversity Training • Mindbloom • Addresses training needs by providing synthetic people with whom to discuss several types of conversations • Employee reporting sexual harassment • Addressing cultural mismatch of new employee • Policy changes that impact employees • Misgendering in the workplace • Conversation and voice generation with proprietary generative algorithms • Demo

10. Automated Reporting on Skill Improvement

11. Case 3: Protein Generation • Designing and testing new drugs takes a lot of time and money. • Not good for new pandemics in urgent need of treatment • Increased drug costs for consumers • Many types of proteins/molecules in venom of different animals • Metalloproteinases, three finger toxins, phospholipidase A2, disintigrins… • Varies by geography and species • Slight modifications of toxins as good initial drug designs

12. Graph Generators • Approach to protein/molecule-specific generative models: • Translate protein/molecule to graph form • Define properties of interest (solubility, for instance) or binding score • Create generative model to work on generating similar graphs • GAN trials generate new proteins/molecules with: • Better target properties • More variety • Less time/cost to generation than other models/human generation

13. Case 4: Public Health Campaigns • Many recent infectious diseases that can be spread from person to person: • Ebola • COVID-19 • HIV • Issues with traditional generation of video and poster messaging to address behaviors contributing to spread • Time to create script, image, and translations for local populations • Lives lost in delays

14. Coupling Generators • Generate culturally-relevant images • Generate text • Translate text to local languages

15. Ethical Considerations

16. Potential Bad Behaviors • Deep fakes • Fake news • Biased data • Hallucinations and jailbreaks • Manipulation of algorithm by text engineering

17. Open-Source Resources • https://huggingface.co/models • https://www.craiyon.com/ • https://github.com/TopoXLab/TopoGAN-ECCV2020 • https://github.com/Biomatter-Designs/ProteinGAN

Generative AI for Social Good at Open Data Science East 2024

Recommended

Recommended

More Related Content

Similar to Generative AI for Social Good at Open Data Science East 2024

Similar to Generative AI for Social Good at Open Data Science East 2024 (20)

More from Colleen Farrelly

More from Colleen Farrelly (20)

Recently uploaded

Recently uploaded (20)

Generative AI for Social Good at Open Data Science East 2024