SlideShare a Scribd company logo
1 of 23
Robust Testing Strategies for Machine Learning Models
Agile Testing Alliance Hyderabad Meet
TMI Networks, Hyderabad, 22 July 2023
Tilottama Goswami, Ph.D. (University Of Hyderabad)
Professor, Department Of Information Technology
Vasavi College Of Engineering
Hyderabad, INDIA
Agenda
1. Motivation
2. Real-World Examples of ML Model Failures
3. Key Factors for Robust ML Model Testing
4. Conclusion
Motivation
Fourth Industrial Revolution
 Digital Transformation
 AI & Automation – Intelligent Systems
 Data Usage - Privacy Security Ethics
 Social Transformation – Quality of Life
Robust Testing Strategies
o Performance
o Security
o Reliability
o Seamless Integration & Deployment
o Building Trust
DEMAND
Impact of Industrial
Revolution 4.0 in
Real World
Scenarios
ROBOTIC
PROCESS
AUTOMATION
MACHINE
LEARNING
Explore.entenic.com
Depositphotos.com
Repetitive
Rule Based
Structured Data
Pre-Programmed Rules
Not Adaptable to
handle variations
No Cognitive
Capabilities
Struggles with
Unstructured Data –
Audio/Image/Text
Learn From Data
Predictions
Unstructured Data
Predictions on new
unseen Data
Adaptable and Evolves
with changes – Flexible
Complex Cognitive
Tasks – Reasoning
Unstructured Data –
Image Recognition
Language Translation
Data Entry & Transactional Tasks
Sentiment Analysis & Pattern Recognition
Machine Learning
Robotic Process Automation
AI & CV
https://medium.com/swlh/a-beginners-guide-to-understanding-the-buzz-words-ai-ml-nlp-deep-learning-computer-vision-a877ee1c2cde
The goal of AI is to capture the collective intelligence of humans and do a given task better than any individual human
can ever do
Computer Vision Tasks
Object Detection: Vehicle
Object Recognition: Car
Object Tracking: Speed Limit
https://www.optisolbusiness.com/insight/an-overview-of-image-segmentation-part-1
Classification
Sematic Segmentation
Classification &
Localization
Instance Segmentation
Feature Extractor
Optical Character Recognition
content.iospress.com
Computer Vision Tasks with Natural Language Processing
Real-World Examples of ML Model Failures
IBM Watson's
Cancer Treatment
Recommendations
Amazon's AI
Recruitment Tool
Google Photos'
Racist Labelling
Tesla's Autopilot
Accidents
Microsoft's Tay
Chatbot
Real-World Examples of ML Model Failures
IBM Watson's
Cancer
Treatment
Recommendatio
ns
Amazon's AI
Recruitment
Tool
Google
Photos' Racist
Labelling
Tesla's
Autopilot
Accidents
Microsoft's
Tay Chatbot
Bias Against
Female Candidates
Limitation of Training Data
and Biased Training
Real time Decision Making in
complex environments
Learnt offensive and
inappropriate
conversations from
tweets
Lacked proper testing and Validation
Erroneous Recommendations
Challenges
A. IBM Watson's Cancer Treatment Recommendations
1. Challenges with training data and complexity of cancer treatment
2. Interpretation of unstructured data and limited contextual
understanding
3. Lessons learned and improvements made
B. Microsoft's Tay Chatbot
1. Vulnerability to manipulation and lack of contextual
understanding
2. Rapid learning and amplification of bias
3. Importance of human oversight and responsibility
Challenges
C. Google Photos' Racist Labeling / Amazon’s AI Recruitment Tool
1. Biased training data and insufficient testing
2. Limited diversity in development teams
3. Ethical considerations and response to the incident
D. Tesla's Autopilot Accidents
1. Overreliance on the Autopilot system and
inattentive driving
2. System limitations and edge cases
3. Regulatory and legal challenges
Key Factors for Robust ML Model Testing
1.BIAS-VARIANCE Trade Off Overfitting/Underfitting
2.Comprehensive Training Data
3.Hyperparameter Tuning
4.Validation and Evaluation Techniques
5.Adversarial Testing
6.Continuous Monitoring and Maintenance
1. BIAS-VARIANCE Trade Off
Overfitting/Underfitting
Courtesy: Medium.com
15
Bias-Variance
The goal of any predictive modelling machine learning algorithm is to achieve low
bias and low variance.
 Bias are the simplifying assumptions made by a model to make the target function
easier to learn.
 V
ariance is the amount that the estimate of the target function will change if
different training data was used.
Address BIAS in ML
High Bias = Underfitting
Building Ethical and
Trustworthy AI systems
Promote Fairness and
Inclusivity
Diverse and
Representative
Data
Regularization &
Post Processing
Fairness-aware
Algorithms
Bias Aware
Evaluation
Feature
Engineering
L
O
W
B
I
A
S
Address VARIANCE in ML
High Variance = Overfitting
Stable and Robust AI Systems
L
O
W
V
A
R
I
A
N
C
E
Generalization
Adequate Variance
to avoid
Underfitting
Feature
Engineering
Regularization &
Post Processing
Cross Validation Ensemble Methods
Early Stopping of
Training
Comprehensive Training Data
1. Importance of diverse, representative, and unbiased training data
2. Data quality, data augmentation, and addressing class imbalance
3. Rigorous Hyperparameter Tuning
Rigorous Hyperparameter Tuning
1.Optimizing model performance through systematic
exploration
2.Techniques such as grid search, random search, and
Bayesian optimization
Validation & Evaluation Techniques
1.Cross-validation and holdout validation for assessing
model performance
2.Metrics selection, including accuracy, precision, recall,
F1-score, and AUC-ROC
3.Uncovering vulnerabilities and weaknesses in ML models
2. Crafting deceptive inputs and evaluating model
robustness
Adversarial Testing
1. Uncovering vulnerabilities and weaknesses in ML
models
2. Crafting deceptive inputs and evaluating model
robustness
Continuous Monitoring &
Maintenance
1. Importance of ongoing model performance monitoring
2. Regular updates, retraining, and version control
3. Human Feedback in Loop
Conclusion
Responsible Feature Engineering
Responsible development & deployment of ML Model
Building RITE System- Reliable, Inclusive, Trustworthy and Ethical systems
Thank you
AgileTestingAlliance
TMI Networks
Vasavi College of Engineering

More Related Content

Similar to Robust Testing Strategies for Machine Learning Models

Machine Learning for automated diagnosis of distributed ...AE
Machine Learning for automated diagnosis of distributed ...AEMachine Learning for automated diagnosis of distributed ...AE
Machine Learning for automated diagnosis of distributed ...AE
butest
 
Unlocking the Potential of Artificial Intelligence_ Machine Learning in Pract...
Unlocking the Potential of Artificial Intelligence_ Machine Learning in Pract...Unlocking the Potential of Artificial Intelligence_ Machine Learning in Pract...
Unlocking the Potential of Artificial Intelligence_ Machine Learning in Pract...
eswaralaldevadoss
 
AI and its use in the training Sector.pdf
AI and its use in the training Sector.pdfAI and its use in the training Sector.pdf
AI and its use in the training Sector.pdf
GilbertoCardoso32
 

Similar to Robust Testing Strategies for Machine Learning Models (20)

How to use LLMs in synthesizing training data?
How to use LLMs in synthesizing training data?How to use LLMs in synthesizing training data?
How to use LLMs in synthesizing training data?
 
Practical Applications of Machine Learning in Cybersecurity
Practical Applications of Machine Learning in CybersecurityPractical Applications of Machine Learning in Cybersecurity
Practical Applications of Machine Learning in Cybersecurity
 
Post Graduate Admission Prediction System
Post Graduate Admission Prediction SystemPost Graduate Admission Prediction System
Post Graduate Admission Prediction System
 
Machine-Learning-Unlocking-the-Power-of-Data.pptx
Machine-Learning-Unlocking-the-Power-of-Data.pptxMachine-Learning-Unlocking-the-Power-of-Data.pptx
Machine-Learning-Unlocking-the-Power-of-Data.pptx
 
Machine Learning On Big Data: Opportunities And Challenges- Future Research D...
Machine Learning On Big Data: Opportunities And Challenges- Future Research D...Machine Learning On Big Data: Opportunities And Challenges- Future Research D...
Machine Learning On Big Data: Opportunities And Challenges- Future Research D...
 
Machine Learning for automated diagnosis of distributed ...AE
Machine Learning for automated diagnosis of distributed ...AEMachine Learning for automated diagnosis of distributed ...AE
Machine Learning for automated diagnosis of distributed ...AE
 
Trustworthy Recommender Systems
Trustworthy Recommender SystemsTrustworthy Recommender Systems
Trustworthy Recommender Systems
 
Guide to Successful AI.pdf
Guide to Successful AI.pdfGuide to Successful AI.pdf
Guide to Successful AI.pdf
 
AI model security.pdf
AI model security.pdfAI model security.pdf
AI model security.pdf
 
AI TESTING: ENSURING A GOOD DATA SPLIT BETWEEN DATA SETS (TRAINING AND TEST) ...
AI TESTING: ENSURING A GOOD DATA SPLIT BETWEEN DATA SETS (TRAINING AND TEST) ...AI TESTING: ENSURING A GOOD DATA SPLIT BETWEEN DATA SETS (TRAINING AND TEST) ...
AI TESTING: ENSURING A GOOD DATA SPLIT BETWEEN DATA SETS (TRAINING AND TEST) ...
 
Technovision
TechnovisionTechnovision
Technovision
 
Best Practices for Harnessing Generative AI and LLMs1.pdf
Best Practices for Harnessing Generative AI and LLMs1.pdfBest Practices for Harnessing Generative AI and LLMs1.pdf
Best Practices for Harnessing Generative AI and LLMs1.pdf
 
Algorithmic Impact Assessment: Fairness, Robustness and Explainability in Aut...
Algorithmic Impact Assessment: Fairness, Robustness and Explainability in Aut...Algorithmic Impact Assessment: Fairness, Robustness and Explainability in Aut...
Algorithmic Impact Assessment: Fairness, Robustness and Explainability in Aut...
 
Machine Learning The Powerhouse of AI Explained.pdf
Machine Learning The Powerhouse of AI Explained.pdfMachine Learning The Powerhouse of AI Explained.pdf
Machine Learning The Powerhouse of AI Explained.pdf
 
Unlocking the Potential of Artificial Intelligence_ Machine Learning in Pract...
Unlocking the Potential of Artificial Intelligence_ Machine Learning in Pract...Unlocking the Potential of Artificial Intelligence_ Machine Learning in Pract...
Unlocking the Potential of Artificial Intelligence_ Machine Learning in Pract...
 
Enhancing AI Models With Outsourced Data Annotation Services
Enhancing AI Models With Outsourced Data Annotation ServicesEnhancing AI Models With Outsourced Data Annotation Services
Enhancing AI Models With Outsourced Data Annotation Services
 
Machine Learning_Unit 2_Full.ppt.pdf
Machine Learning_Unit 2_Full.ppt.pdfMachine Learning_Unit 2_Full.ppt.pdf
Machine Learning_Unit 2_Full.ppt.pdf
 
Interpretable Machine Learning_ Techniques for Model Explainability.
Interpretable Machine Learning_ Techniques for Model Explainability.Interpretable Machine Learning_ Techniques for Model Explainability.
Interpretable Machine Learning_ Techniques for Model Explainability.
 
AI and its use in the training Sector.pdf
AI and its use in the training Sector.pdfAI and its use in the training Sector.pdf
AI and its use in the training Sector.pdf
 
IRJET - Employee Performance Prediction System using Data Mining
IRJET - Employee Performance Prediction System using Data MiningIRJET - Employee Performance Prediction System using Data Mining
IRJET - Employee Performance Prediction System using Data Mining
 

More from Agile Testing Alliance

More from Agile Testing Alliance (20)

#Interactive Session by Anindita Rath and Mahathee Dandibhotla, "From Good to...
#Interactive Session by Anindita Rath and Mahathee Dandibhotla, "From Good to...#Interactive Session by Anindita Rath and Mahathee Dandibhotla, "From Good to...
#Interactive Session by Anindita Rath and Mahathee Dandibhotla, "From Good to...
 
#Interactive Session by Ajay Balamurugadas, "Where Are The Real Testers In T...
#Interactive Session by  Ajay Balamurugadas, "Where Are The Real Testers In T...#Interactive Session by  Ajay Balamurugadas, "Where Are The Real Testers In T...
#Interactive Session by Ajay Balamurugadas, "Where Are The Real Testers In T...
 
#Interactive Session by Jishnu Nambiar and Mayur Ovhal, "Monitoring Web Per...
#Interactive Session by  Jishnu Nambiar and  Mayur Ovhal, "Monitoring Web Per...#Interactive Session by  Jishnu Nambiar and  Mayur Ovhal, "Monitoring Web Per...
#Interactive Session by Jishnu Nambiar and Mayur Ovhal, "Monitoring Web Per...
 
#Interactive Session by Pradipta Biswas and Sucheta Saurabh Chitale, "Navigat...
#Interactive Session by Pradipta Biswas and Sucheta Saurabh Chitale, "Navigat...#Interactive Session by Pradipta Biswas and Sucheta Saurabh Chitale, "Navigat...
#Interactive Session by Pradipta Biswas and Sucheta Saurabh Chitale, "Navigat...
 
#Interactive Session by Apoorva Ram, "The Art of Storytelling for Testers" at...
#Interactive Session by Apoorva Ram, "The Art of Storytelling for Testers" at...#Interactive Session by Apoorva Ram, "The Art of Storytelling for Testers" at...
#Interactive Session by Apoorva Ram, "The Art of Storytelling for Testers" at...
 
#Interactive Session by Nikhil Jain, "Catch All Mail With Graph" at #ATAGTR2023.
#Interactive Session by Nikhil Jain, "Catch All Mail With Graph" at #ATAGTR2023.#Interactive Session by Nikhil Jain, "Catch All Mail With Graph" at #ATAGTR2023.
#Interactive Session by Nikhil Jain, "Catch All Mail With Graph" at #ATAGTR2023.
 
#Interactive Session by Ashok Kumar S, "Test Data the key to robust test cove...
#Interactive Session by Ashok Kumar S, "Test Data the key to robust test cove...#Interactive Session by Ashok Kumar S, "Test Data the key to robust test cove...
#Interactive Session by Ashok Kumar S, "Test Data the key to robust test cove...
 
#Interactive Session by Seema Kohli, "Test Leadership in the Era of Artificia...
#Interactive Session by Seema Kohli, "Test Leadership in the Era of Artificia...#Interactive Session by Seema Kohli, "Test Leadership in the Era of Artificia...
#Interactive Session by Seema Kohli, "Test Leadership in the Era of Artificia...
 
#Interactive Session by Ashwini Lalit, RRR of Test Automation Maintenance" at...
#Interactive Session by Ashwini Lalit, RRR of Test Automation Maintenance" at...#Interactive Session by Ashwini Lalit, RRR of Test Automation Maintenance" at...
#Interactive Session by Ashwini Lalit, RRR of Test Automation Maintenance" at...
 
#Interactive Session by Srithanga Aishvarya T, "Machine Learning Model to aut...
#Interactive Session by Srithanga Aishvarya T, "Machine Learning Model to aut...#Interactive Session by Srithanga Aishvarya T, "Machine Learning Model to aut...
#Interactive Session by Srithanga Aishvarya T, "Machine Learning Model to aut...
 
#Interactive Session by Kirti Ranjan Satapathy and Nandini K, "Elements of Qu...
#Interactive Session by Kirti Ranjan Satapathy and Nandini K, "Elements of Qu...#Interactive Session by Kirti Ranjan Satapathy and Nandini K, "Elements of Qu...
#Interactive Session by Kirti Ranjan Satapathy and Nandini K, "Elements of Qu...
 
#Interactive Session by Sudhir Upadhyay and Ashish Kumar, "Strengthening Test...
#Interactive Session by Sudhir Upadhyay and Ashish Kumar, "Strengthening Test...#Interactive Session by Sudhir Upadhyay and Ashish Kumar, "Strengthening Test...
#Interactive Session by Sudhir Upadhyay and Ashish Kumar, "Strengthening Test...
 
#Interactive Session by Sayan Deb Kundu, "Testing Gen AI Applications" at #AT...
#Interactive Session by Sayan Deb Kundu, "Testing Gen AI Applications" at #AT...#Interactive Session by Sayan Deb Kundu, "Testing Gen AI Applications" at #AT...
#Interactive Session by Sayan Deb Kundu, "Testing Gen AI Applications" at #AT...
 
#Interactive Session by Dinesh Boravke, "Zero Defects – Myth or Reality" at #...
#Interactive Session by Dinesh Boravke, "Zero Defects – Myth or Reality" at #...#Interactive Session by Dinesh Boravke, "Zero Defects – Myth or Reality" at #...
#Interactive Session by Dinesh Boravke, "Zero Defects – Myth or Reality" at #...
 
#Interactive Session by Saby Saurabh Bhardwaj, "Redefine Quality Assurance –...
#Interactive Session by  Saby Saurabh Bhardwaj, "Redefine Quality Assurance –...#Interactive Session by  Saby Saurabh Bhardwaj, "Redefine Quality Assurance –...
#Interactive Session by Saby Saurabh Bhardwaj, "Redefine Quality Assurance –...
 
#Keynote Session by Sanjay Kumar, "Innovation Inspired Testing!!" at #ATAGTR2...
#Keynote Session by Sanjay Kumar, "Innovation Inspired Testing!!" at #ATAGTR2...#Keynote Session by Sanjay Kumar, "Innovation Inspired Testing!!" at #ATAGTR2...
#Keynote Session by Sanjay Kumar, "Innovation Inspired Testing!!" at #ATAGTR2...
 
#Keynote Session by Schalk Cronje, "Don’t Containerize me" at #ATAGTR2023.
#Keynote Session by Schalk Cronje, "Don’t Containerize me" at #ATAGTR2023.#Keynote Session by Schalk Cronje, "Don’t Containerize me" at #ATAGTR2023.
#Keynote Session by Schalk Cronje, "Don’t Containerize me" at #ATAGTR2023.
 
#Interactive Session by Chidambaram Vetrivel and Venkatesh Belde, "Revolution...
#Interactive Session by Chidambaram Vetrivel and Venkatesh Belde, "Revolution...#Interactive Session by Chidambaram Vetrivel and Venkatesh Belde, "Revolution...
#Interactive Session by Chidambaram Vetrivel and Venkatesh Belde, "Revolution...
 
#Interactive Session by Aniket Diwakar Kadukar and Padimiti Vaidik Eswar Dat...
#Interactive Session by Aniket Diwakar Kadukar and  Padimiti Vaidik Eswar Dat...#Interactive Session by Aniket Diwakar Kadukar and  Padimiti Vaidik Eswar Dat...
#Interactive Session by Aniket Diwakar Kadukar and Padimiti Vaidik Eswar Dat...
 
#Interactive Session by Vivek Patle and Jahnavi Umarji, "Empowering Functiona...
#Interactive Session by Vivek Patle and Jahnavi Umarji, "Empowering Functiona...#Interactive Session by Vivek Patle and Jahnavi Umarji, "Empowering Functiona...
#Interactive Session by Vivek Patle and Jahnavi Umarji, "Empowering Functiona...
 

Recently uploaded

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Recently uploaded (20)

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 

Robust Testing Strategies for Machine Learning Models

  • 1. Robust Testing Strategies for Machine Learning Models Agile Testing Alliance Hyderabad Meet TMI Networks, Hyderabad, 22 July 2023 Tilottama Goswami, Ph.D. (University Of Hyderabad) Professor, Department Of Information Technology Vasavi College Of Engineering Hyderabad, INDIA
  • 2. Agenda 1. Motivation 2. Real-World Examples of ML Model Failures 3. Key Factors for Robust ML Model Testing 4. Conclusion
  • 3. Motivation Fourth Industrial Revolution  Digital Transformation  AI & Automation – Intelligent Systems  Data Usage - Privacy Security Ethics  Social Transformation – Quality of Life Robust Testing Strategies o Performance o Security o Reliability o Seamless Integration & Deployment o Building Trust DEMAND
  • 4. Impact of Industrial Revolution 4.0 in Real World Scenarios ROBOTIC PROCESS AUTOMATION MACHINE LEARNING Explore.entenic.com Depositphotos.com
  • 5. Repetitive Rule Based Structured Data Pre-Programmed Rules Not Adaptable to handle variations No Cognitive Capabilities Struggles with Unstructured Data – Audio/Image/Text Learn From Data Predictions Unstructured Data Predictions on new unseen Data Adaptable and Evolves with changes – Flexible Complex Cognitive Tasks – Reasoning Unstructured Data – Image Recognition Language Translation Data Entry & Transactional Tasks Sentiment Analysis & Pattern Recognition Machine Learning Robotic Process Automation
  • 6. AI & CV https://medium.com/swlh/a-beginners-guide-to-understanding-the-buzz-words-ai-ml-nlp-deep-learning-computer-vision-a877ee1c2cde The goal of AI is to capture the collective intelligence of humans and do a given task better than any individual human can ever do
  • 7. Computer Vision Tasks Object Detection: Vehicle Object Recognition: Car Object Tracking: Speed Limit https://www.optisolbusiness.com/insight/an-overview-of-image-segmentation-part-1 Classification Sematic Segmentation Classification & Localization Instance Segmentation Feature Extractor Optical Character Recognition
  • 8. content.iospress.com Computer Vision Tasks with Natural Language Processing
  • 9. Real-World Examples of ML Model Failures IBM Watson's Cancer Treatment Recommendations Amazon's AI Recruitment Tool Google Photos' Racist Labelling Tesla's Autopilot Accidents Microsoft's Tay Chatbot
  • 10. Real-World Examples of ML Model Failures IBM Watson's Cancer Treatment Recommendatio ns Amazon's AI Recruitment Tool Google Photos' Racist Labelling Tesla's Autopilot Accidents Microsoft's Tay Chatbot Bias Against Female Candidates Limitation of Training Data and Biased Training Real time Decision Making in complex environments Learnt offensive and inappropriate conversations from tweets Lacked proper testing and Validation Erroneous Recommendations
  • 11. Challenges A. IBM Watson's Cancer Treatment Recommendations 1. Challenges with training data and complexity of cancer treatment 2. Interpretation of unstructured data and limited contextual understanding 3. Lessons learned and improvements made B. Microsoft's Tay Chatbot 1. Vulnerability to manipulation and lack of contextual understanding 2. Rapid learning and amplification of bias 3. Importance of human oversight and responsibility
  • 12. Challenges C. Google Photos' Racist Labeling / Amazon’s AI Recruitment Tool 1. Biased training data and insufficient testing 2. Limited diversity in development teams 3. Ethical considerations and response to the incident D. Tesla's Autopilot Accidents 1. Overreliance on the Autopilot system and inattentive driving 2. System limitations and edge cases 3. Regulatory and legal challenges
  • 13. Key Factors for Robust ML Model Testing 1.BIAS-VARIANCE Trade Off Overfitting/Underfitting 2.Comprehensive Training Data 3.Hyperparameter Tuning 4.Validation and Evaluation Techniques 5.Adversarial Testing 6.Continuous Monitoring and Maintenance
  • 14. 1. BIAS-VARIANCE Trade Off Overfitting/Underfitting Courtesy: Medium.com
  • 15. 15 Bias-Variance The goal of any predictive modelling machine learning algorithm is to achieve low bias and low variance.  Bias are the simplifying assumptions made by a model to make the target function easier to learn.  V ariance is the amount that the estimate of the target function will change if different training data was used.
  • 16. Address BIAS in ML High Bias = Underfitting Building Ethical and Trustworthy AI systems Promote Fairness and Inclusivity Diverse and Representative Data Regularization & Post Processing Fairness-aware Algorithms Bias Aware Evaluation Feature Engineering L O W B I A S Address VARIANCE in ML High Variance = Overfitting Stable and Robust AI Systems L O W V A R I A N C E Generalization Adequate Variance to avoid Underfitting Feature Engineering Regularization & Post Processing Cross Validation Ensemble Methods Early Stopping of Training
  • 17. Comprehensive Training Data 1. Importance of diverse, representative, and unbiased training data 2. Data quality, data augmentation, and addressing class imbalance 3. Rigorous Hyperparameter Tuning
  • 18. Rigorous Hyperparameter Tuning 1.Optimizing model performance through systematic exploration 2.Techniques such as grid search, random search, and Bayesian optimization
  • 19. Validation & Evaluation Techniques 1.Cross-validation and holdout validation for assessing model performance 2.Metrics selection, including accuracy, precision, recall, F1-score, and AUC-ROC 3.Uncovering vulnerabilities and weaknesses in ML models 2. Crafting deceptive inputs and evaluating model robustness
  • 20. Adversarial Testing 1. Uncovering vulnerabilities and weaknesses in ML models 2. Crafting deceptive inputs and evaluating model robustness
  • 21. Continuous Monitoring & Maintenance 1. Importance of ongoing model performance monitoring 2. Regular updates, retraining, and version control 3. Human Feedback in Loop
  • 22. Conclusion Responsible Feature Engineering Responsible development & deployment of ML Model Building RITE System- Reliable, Inclusive, Trustworthy and Ethical systems