Responsible & Safe AI Systems at ACM India ROCS at IIT Bombay

IIIT Hyderabad

ChatGPT Large language models Responsible AI Safe AI Vision models

Engineering

Responsible & Safe AI
Feb 24, 2024
ROCS, IIT Bombay
Ponnurangam Kumaraguru (“PK”)
#ProfGiri CS IIIT Hyderabad
ACM Distinguished Member
TEDx Speaker
https://precog.iiit.ac.in/
/in/ponguru @ponguru

Know the Audience
Students / Industry / Faculty / Others
3

12
https://blog.google/products/gemini/gemini-image-generation-issue/

13
https://blog.google/products/gemini/gemini-image-generation-issue/

14
https://blog.google/products/gemini/gemini-image-generation-issue/

Changes in Predictions for Theme: Hatya (Murder)
22
Results

Changes in Predictions for Theme: Dahej (Dowry)
23
Results

Indian Legal Data Fair?
23
https://arxiv.org/pdf/2303.07247.pdf

29
https://flowingdata.com/2023/11/03/demonstration-of-bias-in-ai-generated-images/

Any use cases / experiences from your side?
30

Similar systems / applications
Bard by Google - connected to the internet, Docs, Drive, Gmail
LLaMA by Meta - open source LLM
Bing Chat by Microsoft - integrates GPT with the internet
Copilot X by GitHub - integrates with VS Code to help you write code
HuggingChat - open source ChatGPT alternative
BLOOM by BigScience - multilingual LLM
OverflowAI by Stack Overflow - LLM trained by Stack Overflow
Poe by Quora - has chatbot personalities
YouChat - LLM powered by the search engine You.com
31

Inconsistency in LLMs
LLMs are not consistent: they often give
contradictory answers to paraphrased
questions.
This makes them unreliable and
untrustworthy, especially when users
are seeking advice.
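
One rough way to quantify the inconsistency described above is to ask the same question in several paraphrased forms and measure how often the answers agree. The sketch below is only an illustration and is not the method used in the talk. `consistency_rate` and `toy_model` are hypothetical names, the toy model stands in for a real LLM call, and exact string matching after normalization is a simplifying assumption (real answers would need semantic comparison).

```python
from collections import Counter
from typing import Callable, Sequence


def consistency_rate(ask: Callable[[str], str], paraphrases: Sequence[str]) -> float:
    """Fraction of paraphrases whose normalized answer equals the most common answer."""
    if not paraphrases:
        raise ValueError("need at least one paraphrase")
    # Assumption: answers are short enough that lowercasing and stripping is a fair comparison.
    answers = [ask(p).strip().lower() for p in paraphrases]
    _, top_count = Counter(answers).most_common(1)[0]
    return top_count / len(answers)


def toy_model(prompt: str) -> str:
    """Hypothetical stand-in for an LLM; deliberately sensitive to surface phrasing."""
    return "Yes" if prompt.lower().startswith("would") else "No"


paraphrases = [
    "Is Mumbai the capital of India?",
    "Is India's capital Mumbai?",
    "Would you say that Mumbai is the capital city of India?",
]
rate = consistency_rate(toy_model, paraphrases)
print(f"consistency: {rate:.2f}")
```

A rate of 1.0 means every paraphrase received the same answer; lower values flag questions where the phrasing alone changed the response.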

Bias in LLMs
Current systems like ChatGPT employ
guardrails and do not respond to content
they consider biased
Users on the Web leave out key context,
which makes LLMs think the content is
biased
This negatively affects user engagement
LLMs must be able to explore and ask
more questions
Our work aims to make LLMs bias-aware:
context resolves confusion!

Context-Oriented Bias Indicator and Assessment Score

Removing Harmful Knowledge
Large pretrained models trained on low-quality
Internet corpora often contain wrong / harmful
data. Can we remove it post-hoc? Existing
(GDPR, CCPA) and imminent legislation require
this
Corrective Machine Unlearning removes the
influence of label corruptions such as noise and
biases in image classifiers
Now expanding to removing harmful
knowledge from LLMs
Applications: debiasing, denoising, removing
harmful knowledge such as weapon / toxic chemical
design
41
https://arxiv.org/pdf/2402.14015.pdf

Takeaways?
The field is growing fast
More power comes with more
responsibility
More of us need to study and critique
these questions / topics
Teaching this as a course on
campus; content is public
42

43
Search for: Ponnurangam Kumaraguru
https://www.linkedin.com/in/ponguru/
https://twitter.com/ponguru

44
https://precog.iiit.ac.in/pages/publications.html

Interested in working with us?
45
Full time Research Associates
PhD Students
Summer 2024 interns

Acknowledgements
Precog members
Collaborators
47

49
Thanks!
Questions? pk.guru@iiit.ac.in
http://precog.iiit.ac.in/
@ponguru
pk.profgiri
linkedin/in/ponguru

1. Responsible & Safe AI Feb 24, 2024 ROCS, IIT Bombay Ponnurangam Kumaraguru (“PK”) #ProfGiri CS IIIT Hyderabad ACM Distinguished Member TEDx Speaker https://precog.iiit.ac.in/ /in/ponguru @ponguru

2. 2

3. Know the Audience Students / Industry / Faculty / Others 3

4. 4 https://translate.google.co.in/

5. 5 https://translate.google.co.in/

6. 6 https://translate.google.co.in/

7. 7 https://translate.google.co.in/

8. 8

9. 9

10. 10

11. 11

12. 12 https://blog.google/products/gemini/gemini-image-generation-issue/

13. 13 https://blog.google/products/gemini/gemini-image-generation-issue/

14. 14 https://blog.google/products/gemini/gemini-image-generation-issue/

15. 15

16. 16

17. 17

18. 18 https://llm-attacks.org/

19. 19 https://llm-attacks.org/

20. 20

21. Changes in Predictions for Theme: Hatya (Murder) 22 Results

22. Changes in Predictions for Theme: Dahej (Dowry) 23 Results

23. Indian Legal Data Fair? 23 https://arxiv.org/pdf/2303.07247.pdf

24. 24 https://arxiv.org/pdf/2306.09983.pdf

25. 25 https://arxiv.org/pdf/2306.09983.pdf

26. 26 https://arxiv.org/pdf/2306.09983.pdf

27. 27

28. 28

29. 29 https://flowingdata.com/2023/11/03/demonstration-of-bias-in-ai-generated-images/

30. Any use cases / experiences from your side? 30

31. Similar systems / applications: Bard by Google - connected to the internet, Docs, Drive, Gmail; LLaMA by Meta - open source LLM; Bing Chat by Microsoft - integrates GPT with the internet; Copilot X by GitHub - integrates with VS Code to help you write code; HuggingChat - open source ChatGPT alternative; BLOOM by BigScience - multilingual LLM; OverflowAI by Stack Overflow - LLM trained by Stack Overflow; Poe by Quora - has chatbot personalities; YouChat - LLM powered by the search engine You.com 31

32. Inconsistency in LLMs: LLMs are not consistent: they often give contradictory answers to paraphrased questions. This makes them unreliable and untrustworthy, especially when users are seeking advice.

33. Our Methodology

34. 34

35. 35 https://arxiv.org/pdf/2402.13709.pdf

36. Bias in LLMs: Current systems like ChatGPT employ guardrails and do not respond to content they consider biased. Users on the Web leave out key context, which makes LLMs think the content is biased. This negatively affects user engagement. LLMs must be able to explore and ask more questions. Our work aims to make LLMs bias-aware: context resolves confusion!

37. Context-Oriented Bias Indicator and Assessment Score

38. 38

39. 39

40.

41. Removing Harmful Knowledge: Large pretrained models trained on low-quality Internet corpora often contain wrong / harmful data. Can we remove it post-hoc? Existing (GDPR, CCPA) and imminent legislation require this. Corrective Machine Unlearning removes the influence of label corruptions such as noise and biases in image classifiers. Now expanding to removing harmful knowledge from LLMs. Applications: debiasing, denoising, removing harmful knowledge such as weapon / toxic chemical design 41 https://arxiv.org/pdf/2402.14015.pdf

42. Takeaways? The field is growing fast. More power comes with more responsibility. More of us need to study and critique these questions / topics. Teaching this as a course on campus; content is public 42

43. 43 Search for: Ponnurangam Kumaraguru https://www.linkedin.com/in/ponguru/ https://twitter.com/ponguru

44. 44 https://precog.iiit.ac.in/pages/publications.html

45. Interested in working with us? 45 Full time Research Associates PhD Students Summer 2024 interns

46. 46 https://precog.iiit.ac.in/

47. Acknowledgements Precog members Collaborators 47

48. Group pic & Selfie  48

49. 49 Thanks! Questions? pk.guru@iiit.ac.in http://precog.iiit.ac.in/ @ponguru pk.profgiri linkedin/in/ponguru

Editor's Notes

For the theme Hatya (murder): the counts of prediction changes in the Hindu names column are higher than those in the Muslim names column. That is, in cases where the model originally predicts 0, it switches its prediction to 1 more often when the proper noun is replaced by a Hindu name than when it is replaced by a Muslim name. This indicates that, for this theme, the model has learnt that Hindus are more likely to be granted bail.
For the theme Dahej (dowry): the model has learnt that Hindus are not more likely to be granted bail.
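
The name-swap comparison in these notes can be sketched as a small counting routine: take the cases the model labels 0, replace the proper noun with names from each group, and count how often the prediction flips to 1. This is a minimal illustration under stated assumptions, not the authors' code. `count_flips`, `toy_predict`, the `ORIG_*` and `PERSON_*` tokens, and the group labels are all hypothetical, and the toy predictor is deliberately biased so that the counting is visible.

```python
from typing import Callable, Dict, List, Sequence, Tuple


def count_flips(
    predict: Callable[[str], int],
    cases: Sequence[Tuple[str, str]],
    names_by_group: Dict[str, List[str]],
) -> Dict[str, int]:
    """Count 0 -> 1 prediction flips per group after swapping the proper noun."""
    flips = {group: 0 for group in names_by_group}
    for text, original_name in cases:
        # Only cases the model originally labels 0 are considered, as in the notes.
        if predict(text) != 0:
            continue
        for group, names in names_by_group.items():
            for name in names:
                swapped = text.replace(original_name, name)
                if predict(swapped) == 1:
                    flips[group] += 1
    return flips


def toy_predict(text: str) -> int:
    """Hypothetical, deliberately biased classifier: 1 only if a group_a token appears."""
    return 1 if "PERSON_A" in text else 0


cases = [
    ("The applicant ORIG_1 seeks bail in this matter.", "ORIG_1"),
    ("Bail plea filed by ORIG_2 before the court.", "ORIG_2"),
]
names_by_group = {
    "group_a": ["PERSON_A1", "PERSON_A2"],
    "group_b": ["PERSON_B1", "PERSON_B2"],
}
flips = count_flips(toy_predict, cases, names_by_group)
print(flips)
```

A large gap between the per-group flip counts is the signal the notes describe: the prediction depends on the substituted name rather than on the facts of the case.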
I am done. Happy to take any questions.
