SlideShare a Scribd company logo
1 of 36
Download to read offline
Testing the Intelligence of your AI
Iosif Itkin, CEO and co-founder
Elena Treshcheva, Researcher
Iosif Itkin, CEO and co-founder
Elena Treshcheva, Business Development Manager and Researcher
Exactpro Overview
● A specialist firm focused on functional and non-functional testing of
exchanges, clearing houses, depositories, trade repositories and other
financial market infrastructures.
● Incorporated in 2009 with 10 people, our company has experienced
significant growth and is now employing over 550 specialists.
● We were part of the London Stock Exchange Group (LSEG) from May 2015
till January 2018. Exactpro management buyout from LSEG was successfully
completed in January 2018. We are headquartered in the UK and have
operations in the US, Georgia and Russia.
Exactpro Client Network
AI-based Systems in Finance
Machine Learning in financial organizations:
- already passed an initial development phase
- the usage of live ML applications is about to
dramatically increase over the next three years
https://www.bankofengland.co.uk/-
/media/boe/files/report/2019/machine-learning-in-uk-
financial-services.pdf
AI-based Systems in Finance
Machine Learning in financial organizations:
- already passed an initial development phase
- the usage of live ML applications is about to
dramatically increase over the next three years
● Market Surveillance Systems
● Conversational Assistants
● Algo Trading Systems
● Pricing Calculators
● Machine Readable News
● Insurance Claims
https://www.bankofengland.co.uk/-
/media/boe/files/report/2019/machine-learning-in-uk-
financial-services.pdf
AI-based Systems’ Quality Characteristics:
- Ability to learn: The capacity of the system to learn from use for the
system itself, or data and events it is exposed to.
- Trustworthiness: The degree to which the system is trusted by
stakeholders, for example a health diagnostic
- Ability to generalize: The ability of the system to apply to different
and previously unseen scenarios.
A4Q AI and Software Testing
Foundation
Syllabus https://www.gasq.org/en/exam-modules/a4q-ai-and-software-testing.html
Testing the
Intelligence
of your AI
Ability to Learn:
https://www.deeplearning.ai/
• Training set — Which you run your learning algorithm on.
• Development set — Which you use to tune parameters, select
features, and make other decisions regarding the learning algorithm.
Sometimes also called the hold-out cross validation set.
• Test set — which you use to evaluate the performance of the algorithm,
but not to make any decisions regarding what learning algorithm or
parameters to use.
Trustworthiness:
https://innovation.defense.gov/ai/
During the DIB’s quarterly public meeting on October 31, 2019, the DIB
members voted to approve the proposed AI Principles.
Trustworthiness:
https://www.mas.gov.sg/news/media-releases/2019/mas-partners-financial-
industry-to-create-framework-for-responsible-use-of-ai
Ability to Generalize: Scope of End-to-End and Negative Testing
Congruence bias
Confirmation
bias
Law of triviality
Zero-risk bias
Anthropocentric
thinking
Illusion of control
Cognitive Biases Affecting Software Testing of AI-based Systems
Automation bias
AI-based Systems: Machine-Readable News
Confirmation Bias
Salman, I. (2016). Cognitive biases in software quality and testing. Proceedings of
the 38th International Conference on Software Engineering Companion - ICSE ’16.
Pp. 823-826.
Mohanani, R., Salman, I., Turhan, B., Rodríguez, P., & Ralph, P. (2018).
Cognitive Biases in Software Engineering: A Systematic Mapping Study.
IEEE Transactions on Software Engineering
AI-based Systems: Conversational Assistants (Chatbots)
Chatbot
Anthropocentric Bias
We should not
humanize computers.
Anthropocentric bias
They dislike it a lot!
Anthropocentric Bias: Testing a Mine-Defusing Robot
Anthropocentric Bias: Why We Treat Robots Like Humans
Darling, Kate and Nandy, Palash and Breazeal,
Cynthia “Empathic Concern and the Effect of
Stories in Human-Robot Interaction” (2015).
Proceedings of the IEEE International Workshop on
Robot and Human Communication (ROMAN),
2015. 6 p.
https://www.ted.com/talks/kate_darling_why_we_ha
ve_an_emotional_connection_to_robots
Anthropocentric Bias: Testing Chatbots
Anaphora / Context
Human: I bought 500 Company X shares two years ago. The stocks’
cost was 60,000 USD. What’s their today’s cost?
Chatbot: What currency would you like to have for the rate? X
Spelling / overall correctness
Human: What is the setlement date of the tradeId XXX??
Chatbot: ???
AI-based Systems: Algo Trading
Congruence Bias
Direct
Testing
Indirect Testing Methods
Information
extraction and
Machine learning
End-to-End
Automated Test
Library
Whatever it
takes!
Test execution
data and log
analysis
Passive Testing
Whatever it
takes!
Applications of the Proposed Approach
https://unsplash.com/search/photos/san-francisco
The First IEEE International Conference on Artificial
Intelligence Testing (IEEE AITest 2019), April 4-9 2019,
San Francisco East Bay, CA, USA
User-Assisted Log Analysis for Quality
Control of Distributed Fintech Systems
Iosif Itkin, Anna Gromova, Anton Sitnikov, Rostislav Yavorskiy,
Evgenii Tsymbalov, Andrey Novikov and Kirill Rudakov.
AI-based Systems: Pricing Calculator
Law of Triviality (the Bike-Shed Effect)
Automation Bias
AI-based Systems: Fraud Detection and Market Surveillance
Build Software to Test Software
Click to know more about
Exactpro Test Tools
AI-based Systems: Insurance Claims
Zero-Risk Bias
Non-deterministic Systems: Financial Market Infrastructures
The Illusion of Control and Happiness
Sherman, G. D., Lee, J. J., Cuddy, A. J. C., Renshon, J., Oveis, C., Gross, J. J., &
Lerner, J. S. (2012). Leadership is associated with lower levels of stress.
Proceedings of the National Academy of Sciences, 109(44), 17903–17907.
Fenton-O’Creevy, M., Nicholson, N., Soane, E.,
& Willman, P. (2003). “Trading on illusions:
Unrealistic perceptions of control and trading
performance”. Journal of Occupational and
Organizational Psychology, 76(1), 53–68.
The Illusion of Control and Performance
Thank you

More Related Content

What's hot

Lionel Briand ICSM 2011 Keynote
Lionel Briand ICSM 2011 KeynoteLionel Briand ICSM 2011 Keynote
Lionel Briand ICSM 2011 Keynote
ICSM 2011
 

What's hot (20)

Lionel Briand ICSM 2011 Keynote
Lionel Briand ICSM 2011 KeynoteLionel Briand ICSM 2011 Keynote
Lionel Briand ICSM 2011 Keynote
 
Conversion Hotel 2018 Keynote: Aleksander Fabijan
Conversion Hotel 2018 Keynote: Aleksander FabijanConversion Hotel 2018 Keynote: Aleksander Fabijan
Conversion Hotel 2018 Keynote: Aleksander Fabijan
 
IEEE augmented reality learning experience model (ARLEM)
IEEE augmented reality learning experience model (ARLEM)IEEE augmented reality learning experience model (ARLEM)
IEEE augmented reality learning experience model (ARLEM)
 
Exploratory testing STEW 2016
Exploratory testing STEW 2016Exploratory testing STEW 2016
Exploratory testing STEW 2016
 
Software testing using genetic algorithms
Software testing using genetic algorithmsSoftware testing using genetic algorithms
Software testing using genetic algorithms
 
XAI or DIE at Data Science Summit 2019
XAI or DIE at Data Science Summit 2019XAI or DIE at Data Science Summit 2019
XAI or DIE at Data Science Summit 2019
 
Challenges and strategies in bringing AI models to production
Challenges and strategies in bringing AI models to productionChallenges and strategies in bringing AI models to production
Challenges and strategies in bringing AI models to production
 
Software Engineering for ML/AI, keynote at FAS*/ICAC/SASO 2019
Software Engineering for ML/AI, keynote at FAS*/ICAC/SASO 2019Software Engineering for ML/AI, keynote at FAS*/ICAC/SASO 2019
Software Engineering for ML/AI, keynote at FAS*/ICAC/SASO 2019
 
On Parameter Tuning in Search-Based Software Engineering: A Replicated Empiri...
On Parameter Tuning in Search-Based Software Engineering: A Replicated Empiri...On Parameter Tuning in Search-Based Software Engineering: A Replicated Empiri...
On Parameter Tuning in Search-Based Software Engineering: A Replicated Empiri...
 
Case Study Research in Software Engineering
Case Study Research in Software EngineeringCase Study Research in Software Engineering
Case Study Research in Software Engineering
 
IEEE p1589 'ARLEM' virtual meeting, September 9, 2015
IEEE p1589 'ARLEM' virtual meeting, September 9, 2015IEEE p1589 'ARLEM' virtual meeting, September 9, 2015
IEEE p1589 'ARLEM' virtual meeting, September 9, 2015
 
SETTA'18 Keynote: Intelligent Software Engineering: Synergy between AI and So...
SETTA'18 Keynote: Intelligent Software Engineering: Synergy between AI and So...SETTA'18 Keynote: Intelligent Software Engineering: Synergy between AI and So...
SETTA'18 Keynote: Intelligent Software Engineering: Synergy between AI and So...
 
Se research update
Se research updateSe research update
Se research update
 
Controlled experiments, Hypothesis Testing, Test Selection, Threats to Validity
Controlled experiments, Hypothesis Testing, Test Selection, Threats to ValidityControlled experiments, Hypothesis Testing, Test Selection, Threats to Validity
Controlled experiments, Hypothesis Testing, Test Selection, Threats to Validity
 
Agile Data
Agile DataAgile Data
Agile Data
 
Synergy of Human and Artificial Intelligence in Software Engineering
Synergy of Human and Artificial Intelligence in Software EngineeringSynergy of Human and Artificial Intelligence in Software Engineering
Synergy of Human and Artificial Intelligence in Software Engineering
 
Software Engineering Ontology and Software Testing
Software Engineering Ontology and Software Testing�Software Engineering Ontology and Software Testing�
Software Engineering Ontology and Software Testing
 
Past and Future of Software Testing and Analysis
Past and Future of Software Testing and AnalysisPast and Future of Software Testing and Analysis
Past and Future of Software Testing and Analysis
 
Machine Learning Goes Production
Machine Learning Goes ProductionMachine Learning Goes Production
Machine Learning Goes Production
 
Machine learning testing survey, landscapes and horizons, the Cliff Notes
Machine learning testing  survey, landscapes and horizons, the Cliff NotesMachine learning testing  survey, landscapes and horizons, the Cliff Notes
Machine learning testing survey, landscapes and horizons, the Cliff Notes
 

Similar to Testing the Intelligence of your AI

200109-Open AI Chat GPT.pptx
200109-Open AI Chat GPT.pptx200109-Open AI Chat GPT.pptx
200109-Open AI Chat GPT.pptx
AnkurGuputa
 
test - Future of Ecommerce: How to Improve the Online Shopping Experience Usi...
test - Future of Ecommerce: How to Improve the Online Shopping Experience Usi...test - Future of Ecommerce: How to Improve the Online Shopping Experience Usi...
test - Future of Ecommerce: How to Improve the Online Shopping Experience Usi...
Skyl.ai
 

Similar to Testing the Intelligence of your AI (20)

Maruti gollapudi cv
Maruti gollapudi cvMaruti gollapudi cv
Maruti gollapudi cv
 
Ai in insurance how to automate insurance claim processing with machine lear...
Ai in insurance  how to automate insurance claim processing with machine lear...Ai in insurance  how to automate insurance claim processing with machine lear...
Ai in insurance how to automate insurance claim processing with machine lear...
 
IRJET- NEEV: An Education Informational Chatbot
IRJET-  	  NEEV: An Education Informational ChatbotIRJET-  	  NEEV: An Education Informational Chatbot
IRJET- NEEV: An Education Informational Chatbot
 
How to Build an AI System A Complete Guide.pdf
How to Build an AI System A Complete Guide.pdfHow to Build an AI System A Complete Guide.pdf
How to Build an AI System A Complete Guide.pdf
 
How to Build an AI System A Complete Guide.pdf
How to Build an AI System A Complete Guide.pdfHow to Build an AI System A Complete Guide.pdf
How to Build an AI System A Complete Guide.pdf
 
Artificial intelligence, machine learning and internet of things
Artificial intelligence, machine learning and internet of thingsArtificial intelligence, machine learning and internet of things
Artificial intelligence, machine learning and internet of things
 
Top 5 Machine Learning Tools for Software Development in 2024.pdf
Top 5 Machine Learning Tools for Software Development in 2024.pdfTop 5 Machine Learning Tools for Software Development in 2024.pdf
Top 5 Machine Learning Tools for Software Development in 2024.pdf
 
How Machine Learning Will Transform Finance
How Machine Learning Will Transform FinanceHow Machine Learning Will Transform Finance
How Machine Learning Will Transform Finance
 
IRJET - A Review on Machine Learning Algorithms and their Applications
IRJET -  	  A Review on Machine Learning Algorithms and their ApplicationsIRJET -  	  A Review on Machine Learning Algorithms and their Applications
IRJET - A Review on Machine Learning Algorithms and their Applications
 
Introduction To Predictive Modelling
Introduction To Predictive ModellingIntroduction To Predictive Modelling
Introduction To Predictive Modelling
 
Machine Learning for Finance Master Class
Machine Learning for Finance Master Class Machine Learning for Finance Master Class
Machine Learning for Finance Master Class
 
Ai trend report
Ai trend reportAi trend report
Ai trend report
 
[DSC Europe 22] AI Ethics and AI Quality By Design - Muthu Ramachandran
[DSC Europe 22] AI Ethics and AI Quality By Design - Muthu Ramachandran[DSC Europe 22] AI Ethics and AI Quality By Design - Muthu Ramachandran
[DSC Europe 22] AI Ethics and AI Quality By Design - Muthu Ramachandran
 
AI in Insurance: How to Automate Insurance Claim Processing with Machine Lear...
AI in Insurance: How to Automate Insurance Claim Processing with Machine Lear...AI in Insurance: How to Automate Insurance Claim Processing with Machine Lear...
AI in Insurance: How to Automate Insurance Claim Processing with Machine Lear...
 
200109-Open AI Chat GPT.pptx
200109-Open AI Chat GPT.pptx200109-Open AI Chat GPT.pptx
200109-Open AI Chat GPT.pptx
 
Open AI Chat GPT.
Open AI Chat GPT.Open AI Chat GPT.
Open AI Chat GPT.
 
Smart Data Webinar: A Roadmap for Deploying Modern AI in Business
Smart Data Webinar: A Roadmap for Deploying Modern AI in BusinessSmart Data Webinar: A Roadmap for Deploying Modern AI in Business
Smart Data Webinar: A Roadmap for Deploying Modern AI in Business
 
Emotion Recognition By Textual Tweets Using Machine Learning
Emotion Recognition By Textual Tweets Using Machine LearningEmotion Recognition By Textual Tweets Using Machine Learning
Emotion Recognition By Textual Tweets Using Machine Learning
 
Resume kartikeya sharma
Resume kartikeya sharmaResume kartikeya sharma
Resume kartikeya sharma
 
test - Future of Ecommerce: How to Improve the Online Shopping Experience Usi...
test - Future of Ecommerce: How to Improve the Online Shopping Experience Usi...test - Future of Ecommerce: How to Improve the Online Shopping Experience Usi...
test - Future of Ecommerce: How to Improve the Online Shopping Experience Usi...
 

More from Iosif Itkin

Using Cluster Analysis for Characteristics Detection in Software Defect Reports
Using Cluster Analysis for Characteristics Detection in Software Defect ReportsUsing Cluster Analysis for Characteristics Detection in Software Defect Reports
Using Cluster Analysis for Characteristics Detection in Software Defect Reports
Iosif Itkin
 

More from Iosif Itkin (20)

Foundations of Software Testing Lecture 4
Foundations of Software Testing Lecture 4Foundations of Software Testing Lecture 4
Foundations of Software Testing Lecture 4
 
Exactpro FinTech Webinar - Global Exchanges Test Oracles
Exactpro FinTech Webinar - Global Exchanges Test OraclesExactpro FinTech Webinar - Global Exchanges Test Oracles
Exactpro FinTech Webinar - Global Exchanges Test Oracles
 
Exactpro FinTech Webinar - Global Exchanges FIX Protocol
Exactpro FinTech Webinar - Global Exchanges FIX ProtocolExactpro FinTech Webinar - Global Exchanges FIX Protocol
Exactpro FinTech Webinar - Global Exchanges FIX Protocol
 
Operational Resilience in Financial Market Infrastructures
Operational Resilience in Financial Market InfrastructuresOperational Resilience in Financial Market Infrastructures
Operational Resilience in Financial Market Infrastructures
 
20 Simple Questions from Exactpro for Your Enjoyment This Holiday Season
20 Simple Questions from Exactpro for Your Enjoyment This Holiday Season20 Simple Questions from Exactpro for Your Enjoyment This Holiday Season
20 Simple Questions from Exactpro for Your Enjoyment This Holiday Season
 
EXTENT 2019: Exactpro Quality Assurance for Financial Market Infrastructures
EXTENT 2019: Exactpro Quality Assurance for Financial Market InfrastructuresEXTENT 2019: Exactpro Quality Assurance for Financial Market Infrastructures
EXTENT 2019: Exactpro Quality Assurance for Financial Market Infrastructures
 
ClearTH Test Automation Framework: Case Study in IRS & CDS Swaps Lifecycle Mo...
ClearTH Test Automation Framework: Case Study in IRS & CDS Swaps Lifecycle Mo...ClearTH Test Automation Framework: Case Study in IRS & CDS Swaps Lifecycle Mo...
ClearTH Test Automation Framework: Case Study in IRS & CDS Swaps Lifecycle Mo...
 
EXTENT Talks 2019 Tbilisi: Failover and Recovery Test Automation - Ivan Shamrai
EXTENT Talks 2019 Tbilisi: Failover and Recovery Test Automation - Ivan ShamraiEXTENT Talks 2019 Tbilisi: Failover and Recovery Test Automation - Ivan Shamrai
EXTENT Talks 2019 Tbilisi: Failover and Recovery Test Automation - Ivan Shamrai
 
EXTENT Talks QA Community Tbilisi 20 April 2019 - Conference Open
EXTENT Talks QA Community Tbilisi 20 April 2019 - Conference OpenEXTENT Talks QA Community Tbilisi 20 April 2019 - Conference Open
EXTENT Talks QA Community Tbilisi 20 April 2019 - Conference Open
 
User-Assisted Log Analysis for Quality Control of Distributed Fintech Applica...
User-Assisted Log Analysis for Quality Control of Distributed Fintech Applica...User-Assisted Log Analysis for Quality Control of Distributed Fintech Applica...
User-Assisted Log Analysis for Quality Control of Distributed Fintech Applica...
 
QAFF Chicago 2019 - Complex Post-Trade Systems, Requirements Traceability and...
QAFF Chicago 2019 - Complex Post-Trade Systems, Requirements Traceability and...QAFF Chicago 2019 - Complex Post-Trade Systems, Requirements Traceability and...
QAFF Chicago 2019 - Complex Post-Trade Systems, Requirements Traceability and...
 
QA Community Saratov: Past, Present, Future (2019-02-08)
QA Community Saratov: Past, Present, Future (2019-02-08)QA Community Saratov: Past, Present, Future (2019-02-08)
QA Community Saratov: Past, Present, Future (2019-02-08)
 
Machine Learning and RoboCop Testing
Machine Learning and RoboCop TestingMachine Learning and RoboCop Testing
Machine Learning and RoboCop Testing
 
Behaviour Driven Development: Oltre i limiti del possibile
Behaviour Driven Development: Oltre i limiti del possibileBehaviour Driven Development: Oltre i limiti del possibile
Behaviour Driven Development: Oltre i limiti del possibile
 
2018 - Exactpro Year in Review
2018 - Exactpro Year in Review2018 - Exactpro Year in Review
2018 - Exactpro Year in Review
 
Exactpro Discussion about Joy and Strategy
Exactpro Discussion about Joy and StrategyExactpro Discussion about Joy and Strategy
Exactpro Discussion about Joy and Strategy
 
FIX EMEA Conference 2018 - Post Trade Software Testing Challenges
FIX EMEA Conference 2018 - Post Trade Software Testing ChallengesFIX EMEA Conference 2018 - Post Trade Software Testing Challenges
FIX EMEA Conference 2018 - Post Trade Software Testing Challenges
 
BDD. The Outer Limits. Iosif Itkin at Youcon (in Russian)
BDD. The Outer Limits. Iosif Itkin at Youcon (in Russian)BDD. The Outer Limits. Iosif Itkin at Youcon (in Russian)
BDD. The Outer Limits. Iosif Itkin at Youcon (in Russian)
 
Sibos 2017: Disruptive functional testing - the next frontier in post-trade s...
Sibos 2017: Disruptive functional testing - the next frontier in post-trade s...Sibos 2017: Disruptive functional testing - the next frontier in post-trade s...
Sibos 2017: Disruptive functional testing - the next frontier in post-trade s...
 
Using Cluster Analysis for Characteristics Detection in Software Defect Reports
Using Cluster Analysis for Characteristics Detection in Software Defect ReportsUsing Cluster Analysis for Characteristics Detection in Software Defect Reports
Using Cluster Analysis for Characteristics Detection in Software Defect Reports
 

Recently uploaded

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Recently uploaded (20)

Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Navigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern EnterpriseNavigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern Enterprise
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Modernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaModernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using Ballerina
 
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Simplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxSimplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptx
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 

Testing the Intelligence of your AI

  • 1. Testing the Intelligence of your AI Iosif Itkin, CEO and co-founder Elena Treshcheva, Researcher Iosif Itkin, CEO and co-founder Elena Treshcheva, Business Development Manager and Researcher
  • 2. Exactpro Overview ● A specialist firm focused on functional and non-functional testing of exchanges, clearing houses, depositories, trade repositories and other financial market infrastructures. ● Incorporated in 2009 with 10 people, our company has experienced significant growth and is now employing over 550 specialists. ● We were part of the London Stock Exchange Group (LSEG) from May 2015 till January 2018. Exactpro management buyout from LSEG was successfully completed in January 2018. We are headquartered in the UK and have operations in the US, Georgia and Russia.
  • 4. AI-based Systems in Finance Machine Learning in financial organizations: - already passed an initial development phase - the usage of live ML applications is about to dramatically increase over the next three years https://www.bankofengland.co.uk/- /media/boe/files/report/2019/machine-learning-in-uk- financial-services.pdf
  • 5. AI-based Systems in Finance Machine Learning in financial organizations: - already passed an initial development phase - the usage of live ML applications is about to dramatically increase over the next three years ● Market Surveillance Systems ● Conversational Assistants ● Algo Trading Systems ● Pricing Calculators ● Machine Readable News ● Insurance Claims https://www.bankofengland.co.uk/- /media/boe/files/report/2019/machine-learning-in-uk- financial-services.pdf
  • 6. AI-based Systems’ Quality Characteristics: - Ability to learn: The capacity of the system to learn from use for the system itself, or data and events it is exposed to. - Trustworthiness: The degree to which the system is trusted by stakeholders, for example a health diagnostic - Ability to generalize: The ability of the system to apply to different and previously unseen scenarios. A4Q AI and Software Testing Foundation Syllabus https://www.gasq.org/en/exam-modules/a4q-ai-and-software-testing.html Testing the Intelligence of your AI
  • 7. Ability to Learn: https://www.deeplearning.ai/ • Training set — Which you run your learning algorithm on. • Development set — Which you use to tune parameters, select features, and make other decisions regarding the learning algorithm. Sometimes also called the hold-out cross validation set. • Test set — which you use to evaluate the performance of the algorithm, but not to make any decisions regarding what learning algorithm or parameters to use.
  • 8. Trustworthiness: https://innovation.defense.gov/ai/ During the DIB’s quarterly public meeting on October 31, 2019, the DIB members voted to approve the proposed AI Principles.
  • 10. Ability to Generalize: Scope of End-to-End and Negative Testing
  • 11. Congruence bias Confirmation bias Law of triviality Zero-risk bias Anthropocentric thinking Illusion of control Cognitive Biases Affecting Software Testing of AI-based Systems Automation bias
  • 14. Salman, I. (2016). Cognitive biases in software quality and testing. Proceedings of the 38th International Conference on Software Engineering Companion - ICSE ’16. Pp. 823-826.
  • 15. Mohanani, R., Salman, I., Turhan, B., Rodríguez, P., & Ralph, P. (2018). Cognitive Biases in Software Engineering: A Systematic Mapping Study. IEEE Transactions on Software Engineering
  • 16. AI-based Systems: Conversational Assistants (Chatbots) Chatbot
  • 17. Anthropocentric Bias We should not humanize computers.
  • 19. Anthropocentric Bias: Testing a Mine-Defusing Robot
  • 20. Anthropocentric Bias: Why We Treat Robots Like Humans Darling, Kate and Nandy, Palash and Breazeal, Cynthia “Empathic Concern and the Effect of Stories in Human-Robot Interaction” (2015). Proceedings of the IEEE International Workshop on Robot and Human Communication (ROMAN), 2015. 6 p. https://www.ted.com/talks/kate_darling_why_we_ha ve_an_emotional_connection_to_robots
  • 21. Anthropocentric Bias: Testing Chatbots Anaphora / Context Human: I bought 500 Company X shares two years ago. The stocks’ cost was 60,000 USD. What’s their today’s cost? Chatbot: What currency would you like to have for the rate? X Spelling / overall correctness Human: What is the setlement date of the tradeId XXX?? Chatbot: ???
  • 24. Indirect Testing Methods Information extraction and Machine learning End-to-End Automated Test Library Whatever it takes! Test execution data and log analysis Passive Testing Whatever it takes!
  • 25. Applications of the Proposed Approach https://unsplash.com/search/photos/san-francisco The First IEEE International Conference on Artificial Intelligence Testing (IEEE AITest 2019), April 4-9 2019, San Francisco East Bay, CA, USA User-Assisted Log Analysis for Quality Control of Distributed Fintech Systems Iosif Itkin, Anna Gromova, Anton Sitnikov, Rostislav Yavorskiy, Evgenii Tsymbalov, Andrey Novikov and Kirill Rudakov.
  • 27. Law of Triviality (the Bike-Shed Effect)
  • 29. AI-based Systems: Fraud Detection and Market Surveillance
  • 30. Build Software to Test Software Click to know more about Exactpro Test Tools
  • 33. Non-deterministic Systems: Financial Market Infrastructures
  • 34. The Illusion of Control and Happiness Sherman, G. D., Lee, J. J., Cuddy, A. J. C., Renshon, J., Oveis, C., Gross, J. J., & Lerner, J. S. (2012). Leadership is associated with lower levels of stress. Proceedings of the National Academy of Sciences, 109(44), 17903–17907.
  • 35. Fenton-O’Creevy, M., Nicholson, N., Soane, E., & Willman, P. (2003). “Trading on illusions: Unrealistic perceptions of control and trading performance”. Journal of Occupational and Organizational Psychology, 76(1), 53–68. The Illusion of Control and Performance