This is slides used at Arithmer seminar given by Dr. Masaaki Uesaka at Arithmer inc.
It is a summary of recent methods for quality assurance of machine learning model.
Arithmer Seminar is weekly held, where professionals from within our company give lectures on their respective expertise.
Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。
Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.
Slide for Arithmer Seminar given by Dr. Daisuke Sato (Arithmer) at Arithmer inc.
The topic is on "explainable AI".
"Arithmer Seminar" is weekly held, where professionals from within and outside our company give lectures on their respective expertise.
The slides are made by the lecturer from outside our company, and shared here with his/her permission.
Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。
Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.
Recommendation algorithm using reinforcement learningArithmer Inc.
Slide for study session given by Lu Juanjuan at Arithmer inc.
It is a summary of recent methods for recommendation system using reinforcement learning.
Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。
Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.
ICML 2018 included papers on generative models, music and audio applications, and AI security. On generative models, papers explored topics like learning many-to-many mappings between domains, joint distribution learning, and reducing amortization gaps in VAEs. In music, works examined hierarchical latent space models for music structure and style transfer for speech synthesis. Regarding security, studies analyzed adversarial attacks across domains, the threat of adversarial examples, and circumventing defenses through obfuscated gradients.
This tutorial provides an overview of recent advances in deep generative models. It will cover three types of generative models: Markov models, latent variable models, and implicit models. The tutorial aims to give attendees a full understanding of the latest developments in generative modeling and how these models can be applied to high-dimensional data. Several challenges and open questions in the field will also be discussed. The tutorial is intended for the 2017 conference of the International Society for Bayesian Analysis.
Deep generative models can generate synthetic images, speech, text and other data types. There are three popular types: autoregressive models which generate data step-by-step; variational autoencoders which learn the distribution of latent variables to generate data; and generative adversarial networks which train a generator and discriminator in an adversarial game to generate high quality samples. Generative models have applications in image generation, translation between domains, and simulation.
Recsys 2016: Modeling Contextual Information in Session-Aware Recommender Sys...Bartlomiej Twardowski
Modeling Contextual Information in Session-Aware Recommender Systems with Neural Networks, RecSys 2016 Boston, Bartłomiej Twardowski
Presentation for a paper:
http://dl.acm.org/citation.cfm?id=2959162
Abstract:
Preparing recommendations for unknown users or such that correctly respond to the short-term needs of a particular user is one of the fundamental problems for e-commerce. Most of the common Recommender Systems assume that user identification must be explicit. In this paper a Session-Aware Recommender System approach is presented where no straightforward user information is required. The recommendation process is based only on user activity within a single session, defined as a sequence of events. This information is incorporated in the recommendation process by explicit context modeling with factorization methods and a novel approach with Recurrent Neural Network (RNN). Compared to the session modeling approach, RNN directly models the dependency of user observed sequential behavior throughout its recurrent structure. The evaluation discusses the results based on sessions from real-life system with ephemeral items (identified only by the set of their attributes) for the task of top-n best recommendations.
Slide for Arithmer Seminar given by Dr. Daisuke Sato (Arithmer) at Arithmer inc.
The topic is on "explainable AI".
"Arithmer Seminar" is weekly held, where professionals from within and outside our company give lectures on their respective expertise.
The slides are made by the lecturer from outside our company, and shared here with his/her permission.
Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。
Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.
Recommendation algorithm using reinforcement learningArithmer Inc.
Slide for study session given by Lu Juanjuan at Arithmer inc.
It is a summary of recent methods for recommendation system using reinforcement learning.
Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。
Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.
ICML 2018 included papers on generative models, music and audio applications, and AI security. On generative models, papers explored topics like learning many-to-many mappings between domains, joint distribution learning, and reducing amortization gaps in VAEs. In music, works examined hierarchical latent space models for music structure and style transfer for speech synthesis. Regarding security, studies analyzed adversarial attacks across domains, the threat of adversarial examples, and circumventing defenses through obfuscated gradients.
This tutorial provides an overview of recent advances in deep generative models. It will cover three types of generative models: Markov models, latent variable models, and implicit models. The tutorial aims to give attendees a full understanding of the latest developments in generative modeling and how these models can be applied to high-dimensional data. Several challenges and open questions in the field will also be discussed. The tutorial is intended for the 2017 conference of the International Society for Bayesian Analysis.
Deep generative models can generate synthetic images, speech, text and other data types. There are three popular types: autoregressive models which generate data step-by-step; variational autoencoders which learn the distribution of latent variables to generate data; and generative adversarial networks which train a generator and discriminator in an adversarial game to generate high quality samples. Generative models have applications in image generation, translation between domains, and simulation.
Recsys 2016: Modeling Contextual Information in Session-Aware Recommender Sys...Bartlomiej Twardowski
Modeling Contextual Information in Session-Aware Recommender Systems with Neural Networks, RecSys 2016 Boston, Bartłomiej Twardowski
Presentation for a paper:
http://dl.acm.org/citation.cfm?id=2959162
Abstract:
Preparing recommendations for unknown users or such that correctly respond to the short-term needs of a particular user is one of the fundamental problems for e-commerce. Most of the common Recommender Systems assume that user identification must be explicit. In this paper a Session-Aware Recommender System approach is presented where no straightforward user information is required. The recommendation process is based only on user activity within a single session, defined as a sequence of events. This information is incorporated in the recommendation process by explicit context modeling with factorization methods and a novel approach with Recurrent Neural Network (RNN). Compared to the session modeling approach, RNN directly models the dependency of user observed sequential behavior throughout its recurrent structure. The evaluation discusses the results based on sessions from real-life system with ephemeral items (identified only by the set of their attributes) for the task of top-n best recommendations.
Naver learning to rank question answer pairs using hrde-ltcNAVER Engineering
The automatic question answering (QA) task has long been considered a primary objective of artificial intelligence.
Among the QA sub-systems, we focused on answer-ranking part. In particular, we investigated a novel neural network architecture with additional data clustering module to improve the performance in ranking answer candidates which are longer than a single sentence. This work can be used not only for the QA ranking task, but also to evaluate the relevance of next utterance with given dialogue generated from the dialogue model.
In this talk, I'll present our research results (NAACL 2018), and also its potential use cases (i.e. fake news detection). Finally, I'll conclude by introducing some issues on previous research, and by introducing recent approach in academic.
This document discusses using an evolutionary algorithm to automatically design particle swarm systems to solve tasks. It describes evolving the dynamics parameters and finite state machine structure to modify swarm behavior. The results show an evolved swarm was able to perform as well as a human-designed one on a resource collection task, even outperforming one human design. Future work could explore co-evolving multiple competing swarm species.
One-shot learning is an object categorization problem in computer vision. Whereas most machine learning based object categorization algorithms require training on hundreds or thousands of images and very large datasets, one-shot learning aims to learn information about object categories from one, or only a few, training images
This document discusses computational intelligence and supervised learning techniques for classification. It provides examples of applications in medical diagnosis and credit card approval. The goal of supervised learning is to learn from labeled training data to predict the class of new unlabeled examples. Decision trees and backpropagation neural networks are introduced as common supervised learning algorithms. Evaluation methods like holdout validation, cross-validation and performance metrics beyond accuracy are also summarized.
PPT - AutoML-Zero: Evolving Machine Learning Algorithms From ScratchJisang Yoon
1) AutoML-Zero is a framework that uses an evolutionary algorithm to evolve machine learning algorithms from basic mathematical operations, with minimal human constraints on the search space.
2) Experiments showed AutoML-Zero could find simple neural networks like linear and nonlinear regression models in difficult search spaces, outperforming random search.
3) When applied to image classification tasks on MNIST and CIFAR-10, the discovered algorithms achieved performance on par or better than standard models like logistic regression and multilayer perceptrons, trained with minimal human input.
The document examines using a nearest neighbor algorithm to rate men's suits based on color combinations. It trained the algorithm on 135 outfits rated as good, mediocre, or bad. It then tested the algorithm on 30 outfits rated by a human. When trained on 135 outfits, the algorithm incorrectly rated 36.7% of test outfits. When trained on only 68 outfits, it incorrectly rated 50% of test outfits, showing larger training data improves accuracy. It also tested using HSL color representation instead of RGB with similar results.
Machine Learning for Dummies (without mathematics)ActiveEon
It presents an introduction and the basic concepts of machine learning without mathematics. This is a short presentation for beginners in machine learning.
Policy Based reinforcement Learning for time series Anomaly detectionKishor Datta Gupta
This document discusses a policy-based reinforcement learning approach called PTAD for time series anomaly detection. PTAD formulates anomaly detection as a Markov Decision Process and uses an asynchronous actor-critic algorithm to learn a stochastic policy. The agent takes as input current and previous time series data and actions, and outputs a decision of normal or anomalous. It is rewarded based on a confusion matrix calculation. Experimental results show PTAD achieves best performance both within and across datasets by adjusting to different behaviors. The stochastic policy allows exploring precision-recall tradeoffs. While interesting, it is not compared to neural network based techniques like autoencoders.
Deep Learning Enabled Question Answering System to Automate Corporate HelpdeskSaurabh Saxena
Studied feasibility of applying state-of-the-art deep learning models like end-to-end memory networks and neural attention- based models to the problem of machine comprehension and subsequent question answering in corporate settings with huge
amount of unstructured textual data. Used pre-trained embeddings like word2vec and GLove to avoid huge training costs.
Presentation on Machine Learning and Data Miningbutest
The document discusses the differences between automatic learning/machine learning and data mining. It provides definitions for supervised vs unsupervised learning, what automated induction is, and the base components of data mining. Additionally, it outlines differences in the scientific approach between automatic learning and data mining, as well as differences from an industry perspective, including common data mining techniques used and tips for successful data mining projects.
This document describes research on using deep learning models to predict stock market movements based on news events. It presents a method to extract event representations from news articles, generalize the events, embed the events, and feed the embedded events into deep learning models. Experimental results show that using embedded events as inputs to convolutional neural networks achieved more accurate stock market predictions than baseline methods, and modeling long, mid, and short-term event effects further improved performance. The research demonstrates that deep learning can effectively capture hidden relationships between news events and stock prices.
Improving Spam Mail Filtering Using Classification Algorithms With Partition ...IRJET Journal
This document discusses improving spam mail filtering using various classification algorithms and a partition membership filter for preprocessing. It analyzes the performance of classification algorithms like JRip, Filtered Classifier, K-star, SGD, Multinomial, and Random Tree on a spam email dataset using metrics like accuracy, error rate, recall, and precision. The Random Tree algorithm achieved the best performance with 97.08% accuracy, 0.938 kappa statistics, and 0.56 seconds build time, outperforming the other algorithms. Preprocessing the data with a partition membership filter before classification further improved performance.
Few shot learning/ one shot learning/ machine learningﺁﺻﻒ ﻋﻠﯽ ﻣﯿﺮ
The document discusses few-shot learning approaches. It begins with an introduction explaining that current deep learning models require large datasets but humans can learn from just a few examples. It then discusses the problem of few-shot learning, where models must perform classification, detection, or regression on novel categories represented by only a few samples. Popular approaches discussed include meta-learning methods like MAML and prototypical networks, metric learning methods like relation networks, and data augmentation methods. The document provides an overview of the goals and techniques of few-shot learning.
Introduction to Machine Learning : Machine Learning (ML) is a type of Intelligence (AI) that allows Software applications to become more accurate at predicting outcomes without being explicitly programmed to do so. Machine Learning Algorithms use historical data as input to predict new output values.
Applying Deep Learning to Enhance Momentum Trading Strategies in StocksLawrence Takeuchi
Contact author: larrytakeuchi@gmail.com
Abstract
We use an autoencoder composed of stacked restricted Boltzmann machines to extract
features from the history of individual stock prices. Our model is able to discover an enhanced version of the momentum effect in stocks without extensive hand-engineering of input features and deliver an annualized return of 45.93% over the 1990-2009 test period
versus 10.53% for basic momentum.
Deep Learning in Recommender Systems - RecSys Summer School 2017Balázs Hidasi
This is the presentation accompanying my tutorial about deep learning methods in the recommender systems domain. The tutorial consists of a brief general overview of deep learning and the introduction of the four most prominent research direction of DL in recsys as of 2017. Presented during RecSys Summer School 2017 in Bolzano, Italy.
Steganalysis of LSB Embedded Images Using Gray Level Co-Occurrence MatrixCSCJournals
This paper proposes a steganalysis technique for both grayscale and color images. It uses the feature vectors derived from gray level co-occurrence matrix (GLCM) in spatial domain, which is sensitive to data embedding process. This GLCM matrix is derived from an image. Several combinations of diagonal elements of GLCM are considered as features. There is difference between the features of stego and non-stego images and this characteristic is used for steganalysis. Distance measures like Absolute distance, Euclidean distance and Normalized Euclidean distance are used for classification. Experimental results demonstrate that the proposed scheme outperforms the existing steganalysis techniques in attacking LSB steganographic schemes applied to spatial domain.
Machine learning testing survey, landscapes and horizons, the Cliff NotesHeemeng Foo
This document provides an overview and summary of a 37-page paper on machine learning testing. It discusses the main sections and findings of the paper, including how ML testing is a growing field but also challenging due to its data-driven nature. Several approaches to ML testing are covered such as test input generation, test oracles, test adequacy, and debugging. Charts from the paper show trends in ML testing research over time.
The paper presents a new language called UDITA for describing tests. UDITA is a Java-based language that includes non-deterministic choice operators and an interface for generating linked data structures. This allows for more efficient and effective test generation compared to previous approaches. The language aims to make test specification easier while generating tests that are faster, of higher quality, and less complex than traditional manually written or randomly generated tests.
Naver learning to rank question answer pairs using hrde-ltcNAVER Engineering
The automatic question answering (QA) task has long been considered a primary objective of artificial intelligence.
Among the QA sub-systems, we focused on answer-ranking part. In particular, we investigated a novel neural network architecture with additional data clustering module to improve the performance in ranking answer candidates which are longer than a single sentence. This work can be used not only for the QA ranking task, but also to evaluate the relevance of next utterance with given dialogue generated from the dialogue model.
In this talk, I'll present our research results (NAACL 2018), and also its potential use cases (i.e. fake news detection). Finally, I'll conclude by introducing some issues on previous research, and by introducing recent approach in academic.
This document discusses using an evolutionary algorithm to automatically design particle swarm systems to solve tasks. It describes evolving the dynamics parameters and finite state machine structure to modify swarm behavior. The results show an evolved swarm was able to perform as well as a human-designed one on a resource collection task, even outperforming one human design. Future work could explore co-evolving multiple competing swarm species.
One-shot learning is an object categorization problem in computer vision. Whereas most machine learning based object categorization algorithms require training on hundreds or thousands of images and very large datasets, one-shot learning aims to learn information about object categories from one, or only a few, training images
This document discusses computational intelligence and supervised learning techniques for classification. It provides examples of applications in medical diagnosis and credit card approval. The goal of supervised learning is to learn from labeled training data to predict the class of new unlabeled examples. Decision trees and backpropagation neural networks are introduced as common supervised learning algorithms. Evaluation methods like holdout validation, cross-validation and performance metrics beyond accuracy are also summarized.
PPT - AutoML-Zero: Evolving Machine Learning Algorithms From ScratchJisang Yoon
1) AutoML-Zero is a framework that uses an evolutionary algorithm to evolve machine learning algorithms from basic mathematical operations, with minimal human constraints on the search space.
2) Experiments showed AutoML-Zero could find simple neural networks like linear and nonlinear regression models in difficult search spaces, outperforming random search.
3) When applied to image classification tasks on MNIST and CIFAR-10, the discovered algorithms achieved performance on par or better than standard models like logistic regression and multilayer perceptrons, trained with minimal human input.
The document examines using a nearest neighbor algorithm to rate men's suits based on color combinations. It trained the algorithm on 135 outfits rated as good, mediocre, or bad. It then tested the algorithm on 30 outfits rated by a human. When trained on 135 outfits, the algorithm incorrectly rated 36.7% of test outfits. When trained on only 68 outfits, it incorrectly rated 50% of test outfits, showing larger training data improves accuracy. It also tested using HSL color representation instead of RGB with similar results.
Machine Learning for Dummies (without mathematics)ActiveEon
It presents an introduction and the basic concepts of machine learning without mathematics. This is a short presentation for beginners in machine learning.
Policy Based reinforcement Learning for time series Anomaly detectionKishor Datta Gupta
This document discusses a policy-based reinforcement learning approach called PTAD for time series anomaly detection. PTAD formulates anomaly detection as a Markov Decision Process and uses an asynchronous actor-critic algorithm to learn a stochastic policy. The agent takes as input current and previous time series data and actions, and outputs a decision of normal or anomalous. It is rewarded based on a confusion matrix calculation. Experimental results show PTAD achieves best performance both within and across datasets by adjusting to different behaviors. The stochastic policy allows exploring precision-recall tradeoffs. While interesting, it is not compared to neural network based techniques like autoencoders.
Deep Learning Enabled Question Answering System to Automate Corporate HelpdeskSaurabh Saxena
Studied feasibility of applying state-of-the-art deep learning models like end-to-end memory networks and neural attention- based models to the problem of machine comprehension and subsequent question answering in corporate settings with huge
amount of unstructured textual data. Used pre-trained embeddings like word2vec and GLove to avoid huge training costs.
Presentation on Machine Learning and Data Miningbutest
The document discusses the differences between automatic learning/machine learning and data mining. It provides definitions for supervised vs unsupervised learning, what automated induction is, and the base components of data mining. Additionally, it outlines differences in the scientific approach between automatic learning and data mining, as well as differences from an industry perspective, including common data mining techniques used and tips for successful data mining projects.
This document describes research on using deep learning models to predict stock market movements based on news events. It presents a method to extract event representations from news articles, generalize the events, embed the events, and feed the embedded events into deep learning models. Experimental results show that using embedded events as inputs to convolutional neural networks achieved more accurate stock market predictions than baseline methods, and modeling long, mid, and short-term event effects further improved performance. The research demonstrates that deep learning can effectively capture hidden relationships between news events and stock prices.
Improving Spam Mail Filtering Using Classification Algorithms With Partition ...IRJET Journal
This document discusses improving spam mail filtering using various classification algorithms and a partition membership filter for preprocessing. It analyzes the performance of classification algorithms like JRip, Filtered Classifier, K-star, SGD, Multinomial, and Random Tree on a spam email dataset using metrics like accuracy, error rate, recall, and precision. The Random Tree algorithm achieved the best performance with 97.08% accuracy, 0.938 kappa statistics, and 0.56 seconds build time, outperforming the other algorithms. Preprocessing the data with a partition membership filter before classification further improved performance.
Few shot learning/ one shot learning/ machine learningﺁﺻﻒ ﻋﻠﯽ ﻣﯿﺮ
The document discusses few-shot learning approaches. It begins with an introduction explaining that current deep learning models require large datasets but humans can learn from just a few examples. It then discusses the problem of few-shot learning, where models must perform classification, detection, or regression on novel categories represented by only a few samples. Popular approaches discussed include meta-learning methods like MAML and prototypical networks, metric learning methods like relation networks, and data augmentation methods. The document provides an overview of the goals and techniques of few-shot learning.
Introduction to Machine Learning : Machine Learning (ML) is a type of Intelligence (AI) that allows Software applications to become more accurate at predicting outcomes without being explicitly programmed to do so. Machine Learning Algorithms use historical data as input to predict new output values.
Applying Deep Learning to Enhance Momentum Trading Strategies in StocksLawrence Takeuchi
Contact author: larrytakeuchi@gmail.com
Abstract
We use an autoencoder composed of stacked restricted Boltzmann machines to extract
features from the history of individual stock prices. Our model is able to discover an enhanced version of the momentum effect in stocks without extensive hand-engineering of input features and deliver an annualized return of 45.93% over the 1990-2009 test period
versus 10.53% for basic momentum.
Deep Learning in Recommender Systems - RecSys Summer School 2017Balázs Hidasi
This is the presentation accompanying my tutorial about deep learning methods in the recommender systems domain. The tutorial consists of a brief general overview of deep learning and the introduction of the four most prominent research direction of DL in recsys as of 2017. Presented during RecSys Summer School 2017 in Bolzano, Italy.
Steganalysis of LSB Embedded Images Using Gray Level Co-Occurrence MatrixCSCJournals
This paper proposes a steganalysis technique for both grayscale and color images. It uses the feature vectors derived from gray level co-occurrence matrix (GLCM) in spatial domain, which is sensitive to data embedding process. This GLCM matrix is derived from an image. Several combinations of diagonal elements of GLCM are considered as features. There is difference between the features of stego and non-stego images and this characteristic is used for steganalysis. Distance measures like Absolute distance, Euclidean distance and Normalized Euclidean distance are used for classification. Experimental results demonstrate that the proposed scheme outperforms the existing steganalysis techniques in attacking LSB steganographic schemes applied to spatial domain.
Machine learning testing survey, landscapes and horizons, the Cliff NotesHeemeng Foo
This document provides an overview and summary of a 37-page paper on machine learning testing. It discusses the main sections and findings of the paper, including how ML testing is a growing field but also challenging due to its data-driven nature. Several approaches to ML testing are covered such as test input generation, test oracles, test adequacy, and debugging. Charts from the paper show trends in ML testing research over time.
The paper presents a new language called UDITA for describing tests. UDITA is a Java-based language that includes non-deterministic choice operators and an interface for generating linked data structures. This allows for more efficient and effective test generation compared to previous approaches. The language aims to make test specification easier while generating tests that are faster, of higher quality, and less complex than traditional manually written or randomly generated tests.
A Software Measurement Using Artificial Neural Network and Support Vector Mac...ijseajournal
Today, Software measurement are based on various techniques such that neural network, Genetic
algorithm, Fuzzy Logic etc. This study involves the efficiency of applying support vector machine using
Gaussian Radial Basis kernel function to software measurement problem to increase the performance and
accuracy. Support vector machines (SVM) are innovative approach to constructing learning machines that
Minimize generalization error. There is a close relationship between SVMs and the Radial Basis Function
(RBF) classifiers. Both have found numerous applications such as in optical character recognition, object
detection, face verification, text categorization, and so on. The result demonstrated that the accuracy and
generalization performance of SVM Gaussian Radial Basis kernel function is better than RBFN. We also
examine and summarize the several superior points of the SVM compared with RBFN.
(Structural) Feature Interactions for Variability-Intensive Systems Testing Gilles Perrouin
Presentation given in the "short talks" session in the Dagstuhl seminar 14281 on "Feature Interactions - the Next Generation" , Schloss Dagstuhl, Germany, July 2014.
Software Testing: Issues and Challenges of Artificial Intelligence & Machine ...gerogepatton
The history of Artificial Intelligence and Machine Learning dates back to 1950’s. In recent years, there has been an increase in popularity for applications that implement AI and ML technology. As with traditional development, software testing is a critical component of an efficient AI/ML application. However, the approach to development methodology used in AI/ML varies significantly from traditional development. Owing to these variations, numerous software testing challenges occur. This paper aims to recognize and to explain some of the biggest challenges that software testers face in dealing with AI/ML applications. For future research, this study has key implications. Each of the challenges outlined in this paper is ideal for further investigation and has great potential to shed light on the way to more productive software testing strategies and methodologies that can be applied to AI/ML applications.
Software Testing: Issues and Challenges of Artificial Intelligence & Machine ...gerogepatton
The history of Artificial Intelligence and Machine Learning dates back to 1950’s. In recent years, there has been an increase in popularity for applications that implement AI and ML technology. As with traditional development, software testing is a critical component of an efficient AI/ML application. However, the approach to development methodology used in AI/ML varies significantly from traditional development. Owing to these variations, numerous software testing challenges occur. This paper aims to recognize and to explain some of the biggest challenges that software testers face in dealing with AI/ML applications. For
future research, this study has key implications. Each of the challenges outlined in this paper is ideal for further investigation and has great potential to shed light on the way to more productive software testing strategies and methodologies that can be applied to AI/ML applications.
SOFTWARE TESTING: ISSUES AND CHALLENGES OF ARTIFICIAL INTELLIGENCE & MACHINE ...ijaia
The history of Artificial Intelligence and Machine Learning dates back to 1950’s. In recent years, there has
been an increase in popularity for applications that implement AI and ML technology. As with traditional
development, software testing is a critical component of an efficient AI/ML application. However, the
approach to development methodology used in AI/ML varies significantly from traditional development.
Owing to these variations, numerous software testing challenges occur. This paper aims to recognize and
to explain some of the biggest challenges that software testers face in dealing with AI/ML applications. For
future research, this study has key implications. Each of the challenges outlined in this paper is ideal for
further investigation and has great potential to shed light on the way to more productive software testing
strategies and methodologies that can be applied to AI/ML applications.
The Status of ML Algorithms for Structure-property Relationships Using Matb...Anubhav Jain
The document discusses the development of Matbench, a standardized benchmark for evaluating machine learning algorithms for materials property prediction. Matbench includes 13 standardized datasets covering a variety of materials prediction tasks. It employs a nested cross-validation procedure to evaluate algorithms and ranks submissions on an online leaderboard. This allows for reproducible evaluation and comparison of different algorithms. Matbench has provided insights into which algorithm types work best for certain prediction problems and has helped measure overall progress in the field. Future work aims to expand Matbench with more diverse datasets and evaluation procedures to better represent real-world materials design challenges.
This document summarizes a research paper that proposes a method for detecting and recognizing faces using the Viola Jones algorithm and Back Propagation Neural Network (BPNN).
The paper first discusses face detection and recognition challenges. It then provides background on Viola Jones algorithm and BPNN. The proposed methodology uses Viola Jones for face detection, converts the image to grayscale and binary, then trains segments or the whole image with BPNN. Results are analyzed using training, testing and validation curves in the MATLAB neural network tool to minimize error. In under 3 sentences, this document outlines the key techniques, proposed method, and analysis approach discussed in the source research paper.
IRJET- Deep Learning Model to Predict Hardware PerformanceIRJET Journal
This document discusses using deep learning models to predict hardware performance. Specifically, it aims to predict benchmark scores from hardware configurations, or predict configurations from scores. It explores various machine learning algorithms like linear regression, logistic regression, and multi-linear regression on hardware performance data. The best results were from backward elimination and linear regression, achieving over 80% accuracy. Data preprocessing like encoding was important. The model can help analyze hardware performance more quickly than manual methods.
IRJET- Analysis of PV Fed Vector Controlled Induction Motor DriveIRJET Journal
The document describes a project to develop a deep learning model to predict hardware performance. The model takes hardware configuration parameters like CPU, memory, etc. as input and predicts benchmark scores. The authors preprocessed data, tested various regression models like linear regression and lasso regression, and techniques like backward elimination and cross-validation. Their best model used backward elimination and linear regression, achieving 80.82% accuracy. The project aims to automate hardware performance analysis and prediction to save time compared to manual methods.
AN ENVIRONMENT FOR NON-DUPLICATE TEST GENERATION FOR WEB BASED APPLICATIONecij
Web applications quality, portability, consistency, interoperability and dependability are main parameters because software could block completely business. Internet based applications are very complex and heterogeneous also based on different kind of modules which can be implemented in a specific language, so testing is very much require. This paper represents the object oriented based environment. Web applications consist of Web related documents, class and objects and some different kinds of server like HTML, DHTML, and JAVA SCRIPT etc. This paper presents three kinds of layers of test: unit testing, integration testing and system testing. This paper also discusses the different techniques. This paper also
based on the reverse engineering techniques which can be used to analyze the software which is probably connected with a model. It also consists a method of generate the test cases. This paper also describes software used to perform some results.
This document provides an overview of a project report on simulating a single server queuing problem. The report includes an introduction to operations research, simulation, and the queuing problem. It discusses the research methodology, which involves defining the problem, developing a simulation model, validating the model, analyzing the data, and presenting findings and recommendations. The goal is to use simulation to provide optimal solutions to the queuing problem under study.
Automated Machine Learning Applied to Diverse Materials Design ProblemsAnubhav Jain
Automated Machine Learning Applied to Diverse Materials Design Problems
Anubhav Jain presented on developing standardized benchmark datasets and algorithms for automated machine learning in materials science. Matbench provides a diverse set of materials design problems for evaluating ML algorithms, including classification and regression tasks of varying sizes from experiments and DFT. Automatminer is a "black box" ML algorithm that uses genetic algorithms to automatically generate features, select models, and tune hyperparameters on a given dataset, performing comparably to specialized literature methods on small datasets but less well on large datasets. Standardized evaluations can help accelerate progress in automated ML for materials design.
The methods of exploratory testing has gained significant attention in industry and research in the last years. However, as many “buzzword" technologies, the introduction and application of exploratory testing is not straightforward. Exploratory testing it is not only black or white - scripted or exploratory - but also all shades of grey in between. Within the EASE industrial excellence center, we have executed an industrial workshop on exploratory testing, that helps providing understanding of how to choose feasible levels of exploration in exploratory testing. We will present the concepts of levels of exploration in exploratory testing, the outcomes of the workshop, along with relevant empirical research findings on exploratory testing.
This document discusses principles for software effort estimation. It begins with an introduction explaining the importance of accurate estimation. It then discusses publications in the area and lists eight key questions about effort estimation. The document provides answers to the questions in the form of twelve principles. It discusses issues like using multiple methods, improving analogy-based estimation, handling lack of local data, and determining the essential data needed. The principles promote methods that can compensate for missing size attributes, combine outlier and synonym pruning, and be aware of sampling method trade-offs.
This project aims to build a binary classifier model to label unlabeled DNA sequences as either positive (p) or negative (n) based on labeled training sequences. The team will take two approaches: 1) A k-mer approach that generates all DNA sequence fragments of length K and counts frequencies to use as attributes for classification models. 2) A PWM approach that uses motif finding tools to generate position weight matrices and score sequences to use as attributes. The approaches will be evaluated individually and combined to obtain the best performing model. Key challenges include deriving meaningful attributes from the sequence data alone. Parameters like k-mer length, number of motifs, and motif lengths will be tuned to optimize model performance.
Validation and Verification of SYSML Activity Diagrams Using HOARE Logic ijseajournal
SysML diagrams are significant medium using for supporting software lifecycle management. The existing TBFV method is designed for error detection with full automation efficiency, only for code. For verifying the correctness of SysML diagram, we applying TBFV method into SysML diagram. In this paper, we
propose a novel technique that makes use of Hoare Logic and testing to verify whether the SysML diagrams meet the requirement, called TBFV-M. This research can improve the correctness of SysML diagram, which is likely to significantly affect the reliability of the implementation. A case study is conducted to show its feasibility and used to illustrate how the proposed method is applied; and discussion on potential
challenges to TBFV-M is also presented.
Predicting Fault-Prone Files using Machine LearningGuido A. Ciollaro
This document summarizes a study that used machine learning algorithms to predict fault-prone files in nine open source Java projects containing over 18,000 files and 3 million lines of code. Six machine learning algorithms (Naive Bayes, Bayesian network, decision tree, radial basis function, simple logistic, and zeroR) were evaluated based on their ability to rank files by probability of being buggy, as determined by the FindBugs tool. The results showed that decision trees and Bayesian networks performed best at ranking the files, with decision trees outperforming other methods in previous studies. Lift curves were used to evaluate the performance of the models by plotting the number of files examined against the number of buggy files found.
本スライドは、弊社のArithmerDXソリューションの基礎的な技術についてご理解いただけるよう制作した資料となります。
Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。
Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.
本スライドは、弊社のArithmerDXチームによりArithmer Seminarで使用されたものです。
弊社のArithmerDXソリューションの基礎的な技術についてご理解いただけるよう制作した資料となります。
Arithmer Seminar is weekly held, where professionals from within our company give lectures on their respective expertise.
Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。
Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.
本スライドは、弊社のArithmerR3チームによりArithmer Seminarで使用されたものです。
弊社のArithmerR3ソリューションの基礎的な技術についてご理解いただけるよう制作した資料となります。
Arithmer Seminar is weekly held, where professionals from within our company give lectures on their respective expertise.
Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。
Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.
Weakly supervised semantic segmentation of 3D point cloudArithmer Inc.
Slide for study session given by Dr. Daisuke Sato at Arithmer inc.
It is a summary of methods for semantic segmentation for 3D pointcloud using 2D weakly-supervised learning.
Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。
Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.
本スライドは、弊社のNLPチームによりArithmer Seminarで使用されたものです。
弊社のArithmer NLPソリューションの基礎的な技術についてご理解いただけるよう制作した資料となります。
Arithmer Seminar is weekly held, where professionals from within our company give lectures on their respective expertise.
Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。
Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.
本スライドは、弊社のRoboチームによりArithmer Seminarで使用されたものです。
弊社のArithmer Roboソリューションの基礎的な技術についてご理解いただけるよう制作した資料となります。
Arithmer Seminar is weekly held, where professionals from within our company give lectures on their respective expertise.
Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。
Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.
本スライドは、弊社のNLPソリューション・AIチャットボットについて、2021年7月20日にwebセミナーを行った際の資料になります。
"Arithmer Seminar" is weekly held, where professionals from within and outside our company give lectures on their respective expertise.
Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。
Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.
本スライドは、弊社のR3チーム(3Dソリューション開発チーム)によりArithmer Seminarで使用されたものです。
弊社のArithmerR3ソリューションの基礎的な技術についてご理解いただけるよう制作した資料となります。
Arithmer Seminar is weekly held, where professionals from within our company give lectures on their respective expertise.
Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。
Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.
VIBE: Video Inference for Human Body Pose and Shape EstimationArithmer Inc.
The document describes the VIBE approach for 3D human pose and shape estimation from video. VIBE uses an adversarial learning framework with a temporal encoder network that incorporates self-attention. It regresses pose and shape parameters from video frames. A motion discriminator is trained to distinguish real from generated poses, enforcing kinematically plausible poses without 3D ground truth labels. Results show VIBE generates accurate 3D poses and shapes from in-the-wild videos.
本スライドは、弊社のInspectionチーム(画像認識開発チーム)によりArithmer Seminarで使用されたものです。
弊社のArithmerInspectionソリューションの基礎的な技術についてご理解いただけるよう制作した資料となります。
Arithmer Seminar is weekly held, where professionals from within our company give lectures on their respective expertise.
Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。
Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.
本スライドは、弊社の梅本により弊社内の技術勉強会で使用されたものです。
近年注目を集めるアーキテクチャーである「Transformer」の解説スライドとなっております。
"Arithmer Seminar" is weekly held, where professionals from within and outside our company give lectures on their respective expertise.
The slides are made by the lecturer from outside our company, and shared here with his/her permission.
Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。
Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.
本スライドは、弊社の鈴木により2021年3月25日のArithmer Seminarで使用されたものです。
弊社のNLPソリューションの基礎的な部分について、営業メンバー・非DBエンジニアに理解してもらう目的で作った資料になります。
"Arithmer Seminar" is weekly held, where professionals from within and outside our company give lectures on their respective expertise.
Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。
Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.
Introduction of Quantum Annealing and D-Wave MachinesArithmer Inc.
This slide was used for Arithmer seminar in April 2021, by Dr. Yuki Bando.
It is for introduction of quantum computer, D-wave series, and its application to optimization problems in industry.
"Arithmer Seminar" is weekly held, where professionals from within and outside our company give lectures on their respective expertise.
The slides are made by the lecturer from outside our company, and shared here with his/her permission.
Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。
Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.
本スライドは、弊社の石橋により2021年1月14日のArithmer Seminarで使用されたものです。
弊社のOCRソリューションの基礎的な部分について、営業メンバー・非DBエンジニアに理解してもらう目的で作った資料になります。
"Arithmer Seminar" is weekly held, where professionals from within and outside our company give lectures on their respective expertise.
Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。
Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.
本スライドは、弊社の上坂により2021年1月21日のArithmer Seminarで使用されたものです。
弊社のDynamicsソリューション(動画解析)の基礎的な部分について、営業メンバー・非Dynamicsエンジニアに理解してもらう目的で作った資料になります。
"Arithmer Seminar" is weekly held, where professionals from within and outside our company give lectures on their respective expertise.
Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。
Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.
本スライドは、弊社の曽根により2021年1月28日のArithmer Seminarで使用されたものです。
弊社のDBソリューションの基礎的な部分について、営業メンバー・非DBエンジニアに理解してもらう目的で作った資料になります。
"Arithmer Seminar" is weekly held, where professionals from within and outside our company give lectures on their respective expertise.
Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。
Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.
Slide for study session given by Christian Saravia at Arithmer inc.
It is a summary of methods for video summarization using attention mechanism.
Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。
Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.
Slide for study session given by Dr. Enrico Rinaldi at Arithmer inc.
It is a summary of established methods for parametric modeling of 3D human body "SMPL", which has many possible applications in apparel/health care industry.
Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。
Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.
Slide for study session given by Dr. Enrico Rinaldi at Arithmer inc.
It is a summary of recent methods for real-time instance segmentation "YOLACT", which is especially useful in robotics.
Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。
Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.
Slide for study session given by Ryosuke Sasaki at Arithmer inc.
It is a summary of recent methods for object pose estimation in robotics using deep learning.
He entered Ph.D course at Univ. of Tokyo in April 2020.
Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。
Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.
HCL Notes and Domino License Cost Reduction in the World of DLAUpanagenda
Webinar Recording: https://www.panagenda.com/webinars/hcl-notes-and-domino-license-cost-reduction-in-the-world-of-dlau/
The introduction of DLAU and the CCB & CCX licensing model caused quite a stir in the HCL community. As a Notes and Domino customer, you may have faced challenges with unexpected user counts and license costs. You probably have questions on how this new licensing approach works and how to benefit from it. Most importantly, you likely have budget constraints and want to save money where possible. Don’t worry, we can help with all of this!
We’ll show you how to fix common misconfigurations that cause higher-than-expected user counts, and how to identify accounts which you can deactivate to save money. There are also frequent patterns that can cause unnecessary cost, like using a person document instead of a mail-in for shared mailboxes. We’ll provide examples and solutions for those as well. And naturally we’ll explain the new licensing model.
Join HCL Ambassador Marc Thomas in this webinar with a special guest appearance from Franz Walder. It will give you the tools and know-how to stay on top of what is going on with Domino licensing. You will be able lower your cost through an optimized configuration and keep it low going forward.
These topics will be covered
- Reducing license cost by finding and fixing misconfigurations and superfluous accounts
- How do CCB and CCX licenses really work?
- Understanding the DLAU tool and how to best utilize it
- Tips for common problem areas, like team mailboxes, functional/test users, etc
- Practical examples and best practices to implement right away
Webinar: Designing a schema for a Data WarehouseFederico Razzoli
Are you new to data warehouses (DWH)? Do you need to check whether your data warehouse follows the best practices for a good design? In both cases, this webinar is for you.
A data warehouse is a central relational database that contains all measurements about a business or an organisation. This data comes from a variety of heterogeneous data sources, which includes databases of any type that back the applications used by the company, data files exported by some applications, or APIs provided by internal or external services.
But designing a data warehouse correctly is a hard task, which requires gathering information about the business processes that need to be analysed in the first place. These processes must be translated into so-called star schemas, which means, denormalised databases where each table represents a dimension or facts.
We will discuss these topics:
- How to gather information about a business;
- Understanding dictionaries and how to identify business entities;
- Dimensions and facts;
- Setting a table granularity;
- Types of facts;
- Types of dimensions;
- Snowflakes and how to avoid them;
- Expanding existing dimensions and facts.
Generating privacy-protected synthetic data using Secludy and MilvusZilliz
During this demo, the founders of Secludy will demonstrate how their system utilizes Milvus to store and manipulate embeddings for generating privacy-protected synthetic data. Their approach not only maintains the confidentiality of the original data but also enhances the utility and scalability of LLMs under privacy constraints. Attendees, including machine learning engineers, data scientists, and data managers, will witness first-hand how Secludy's integration with Milvus empowers organizations to harness the power of LLMs securely and efficiently.
In the rapidly evolving landscape of technologies, XML continues to play a vital role in structuring, storing, and transporting data across diverse systems. The recent advancements in artificial intelligence (AI) present new methodologies for enhancing XML development workflows, introducing efficiency, automation, and intelligent capabilities. This presentation will outline the scope and perspective of utilizing AI in XML development. The potential benefits and the possible pitfalls will be highlighted, providing a balanced view of the subject.
We will explore the capabilities of AI in understanding XML markup languages and autonomously creating structured XML content. Additionally, we will examine the capacity of AI to enrich plain text with appropriate XML markup. Practical examples and methodological guidelines will be provided to elucidate how AI can be effectively prompted to interpret and generate accurate XML markup.
Further emphasis will be placed on the role of AI in developing XSLT, or schemas such as XSD and Schematron. We will address the techniques and strategies adopted to create prompts for generating code, explaining code, or refactoring the code, and the results achieved.
The discussion will extend to how AI can be used to transform XML content. In particular, the focus will be on the use of AI XPath extension functions in XSLT, Schematron, Schematron Quick Fixes, or for XML content refactoring.
The presentation aims to deliver a comprehensive overview of AI usage in XML development, providing attendees with the necessary knowledge to make informed decisions. Whether you’re at the early stages of adopting AI or considering integrating it in advanced XML development, this presentation will cover all levels of expertise.
By highlighting the potential advantages and challenges of integrating AI with XML development tools and languages, the presentation seeks to inspire thoughtful conversation around the future of XML development. We’ll not only delve into the technical aspects of AI-powered XML development but also discuss practical implications and possible future directions.
Fueling AI with Great Data with Airbyte WebinarZilliz
This talk will focus on how to collect data from a variety of sources, leveraging this data for RAG and other GenAI use cases, and finally charting your course to productionalization.
Digital Marketing Trends in 2024 | Guide for Staying AheadWask
https://www.wask.co/ebooks/digital-marketing-trends-in-2024
Feeling lost in the digital marketing whirlwind of 2024? Technology is changing, consumer habits are evolving, and staying ahead of the curve feels like a never-ending pursuit. This e-book is your compass. Dive into actionable insights to handle the complexities of modern marketing. From hyper-personalization to the power of user-generated content, learn how to build long-term relationships with your audience and unlock the secrets to success in the ever-shifting digital landscape.
Programming Foundation Models with DSPy - Meetup SlidesZilliz
Prompting language models is hard, while programming language models is easy. In this talk, I will discuss the state-of-the-art framework DSPy for programming foundation models with its powerful optimizers and runtime constraint system.
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfChart Kalyan
A Mix Chart displays historical data of numbers in a graphical or tabular form. The Kalyan Rajdhani Mix Chart specifically shows the results of a sequence of numbers over different periods.
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfMalak Abu Hammad
Discover how MongoDB Atlas and vector search technology can revolutionize your application's search capabilities. This comprehensive presentation covers:
* What is Vector Search?
* Importance and benefits of vector search
* Practical use cases across various industries
* Step-by-step implementation guide
* Live demos with code snippets
* Enhancing LLM capabilities with vector search
* Best practices and optimization strategies
Perfect for developers, AI enthusiasts, and tech leaders. Learn how to leverage MongoDB Atlas to deliver highly relevant, context-aware search results, transforming your data retrieval process. Stay ahead in tech innovation and maximize the potential of your applications.
#MongoDB #VectorSearch #AI #SemanticSearch #TechInnovation #DataScience #LLM #MachineLearning #SearchTechnology
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceIndexBug
Imagine a world where machines not only perform tasks but also learn, adapt, and make decisions. This is the promise of Artificial Intelligence (AI), a technology that's not just enhancing our lives but revolutionizing entire industries.
UiPath Test Automation using UiPath Test Suite series, part 6DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 6. In this session, we will cover Test Automation with generative AI and Open AI.
UiPath Test Automation with generative AI and Open AI webinar offers an in-depth exploration of leveraging cutting-edge technologies for test automation within the UiPath platform. Attendees will delve into the integration of generative AI, a test automation solution, with Open AI advanced natural language processing capabilities.
Throughout the session, participants will discover how this synergy empowers testers to automate repetitive tasks, enhance testing accuracy, and expedite the software testing life cycle. Topics covered include the seamless integration process, practical use cases, and the benefits of harnessing AI-driven automation for UiPath testing initiatives. By attending this webinar, testers, and automation professionals can gain valuable insights into harnessing the power of AI to optimize their test automation workflows within the UiPath ecosystem, ultimately driving efficiency and quality in software development processes.
What will you get from this session?
1. Insights into integrating generative AI.
2. Understanding how this integration enhances test automation within the UiPath platform
3. Practical demonstrations
4. Exploration of real-world use cases illustrating the benefits of AI-driven test automation for UiPath
Topics covered:
What is generative AI
Test Automation with generative AI and Open AI.
UiPath integration with generative AI
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Introduction of Cybersecurity with OSS at Code Europe 2024Hiroshi SHIBATA
I develop the Ruby programming language, RubyGems, and Bundler, which are package managers for Ruby. Today, I will introduce how to enhance the security of your application using open-source software (OSS) examples from Ruby and RubyGems.
The first topic is CVE (Common Vulnerabilities and Exposures). I have published CVEs many times. But what exactly is a CVE? I'll provide a basic understanding of CVEs and explain how to detect and handle vulnerabilities in OSS.
Next, let's discuss package managers. Package managers play a critical role in the OSS ecosystem. I'll explain how to manage library dependencies in your application.
I'll share insights into how the Ruby and RubyGems core team works to keep our ecosystem safe. By the end of this talk, you'll have a better understanding of how to safeguard your code.
Taking AI to the Next Level in Manufacturing.pdfssuserfac0301
Read Taking AI to the Next Level in Manufacturing to gain insights on AI adoption in the manufacturing industry, such as:
1. How quickly AI is being implemented in manufacturing.
2. Which barriers stand in the way of AI adoption.
3. How data quality and governance form the backbone of AI.
4. Organizational processes and structures that may inhibit effective AI adoption.
6. Ideas and approaches to help build your organization's AI strategy.
OpenID AuthZEN Interop Read Out - AuthorizationDavid Brossard
During Identiverse 2024 and EIC 2024, members of the OpenID AuthZEN WG got together and demoed their authorization endpoints conforming to the AuthZEN API
Project Management Semester Long Project - Acuityjpupo2018
Acuity is an innovative learning app designed to transform the way you engage with knowledge. Powered by AI technology, Acuity takes complex topics and distills them into concise, interactive summaries that are easy to read & understand. Whether you're exploring the depths of quantum mechanics or seeking insight into historical events, Acuity provides the key information you need without the burden of lengthy texts.
2. Today’s topic
I would like to introduce the testing of machine learning models
based on the contents of the book [Sato et al, 2021].
But as a result of surveying papers related to the contents of the book, the contents have expanded
beyond the book.
I will skip the detail in my talk and please read this slide and search the references if you are interested in this
topic.
We focus on the following four topics:
• Metamorphic test
• Neuron coverage
• Adversarial minimum distortion (最大安全半径)
• Formal verification (網羅検証)
[Sato et al, 2021] 佐藤直人・小川秀人・來間啓伸・明神智之 (2021).『AIソフトウェアのテストー答えのない答え合わせ [4つの手法]』リックテレコム
We can access the code in [Sato et al, 2021] by purchasing it.
(based on keras & tensorflow. not pytorch)
3. Problems in testing of machine learning
Output
Behaviour
In ordinary software, (unit) test is done by checking whether the output of function in the
specified input is equal to expected one.
4. Some techniques for software tests
Equivalence class test
• Divide the input spaces to equivalence classes in which the
expected outputs are same.
• Take a representative from each equivalence class
• Create the test case for the representatives.
Boundary value test
• Divide the input spaces to equivalence classes
• Take the nearest values to neighboring classes for each class
• Create the test cases for these boundary values
• This is applicable if the input space is ordered.
Input value Expected output
0〜9 1
10〜99 2
100〜999 3
1000〜9999 4
e.g. The function which returns the
number of digits in decimal
notation
5. Problems in test for machine learning model
Can we apply the software test techniques for machine learning model?
dog
cat
Machine
Learning
Model
We do not know the entire boundary of the dog images
and cat images.
dog cat
other
?
In the vase space of images, we cannot determine the
boundaries of each category in advance.
We cannot apply the boundary test.
6. Problems in test for machine learning model
Can we apply the software test techniques for machine learning model?
dog
cat
Machine
Learning
Model
• The expected behavior of the classification depends
on the human judgements.
• Sometimes output of the model differs from the
human judgements.
Deterministic assertion test is not applicable
At most, we can make the statistical one:
Assert that the probability that the labeling of human and ML model coincides is high enough to
make it practical.
→ Precision, Recall, F1 score,…
7. Problems in test for machine learning model
If we admit the probabilistic assertion, we encounter the problem of adversarial attack.
[Goodfellow et al., 2015]
ML models are easily fooled by small perturbations unrecognizable to humans.
8. Methods of Adversarial Attacks
Fast Gradient Sign Method [Goodfellow et al, 2015] Corresponding label of x
Cost function of training
Carlini-Wagner Method [Carlini & Wagner, 2017]
Trained model
Projected Gradient Descent Method [Madry et al, 2018]
Measure of correspondence
[Goodfellow et al., 2015] Goodfellow, I. J., Shlens, J., & Szegedy, C. (2015). Explaining and Harnessing Adversarial Examples. ICLR.
[Carlini & Wagner, 2017] Carlini, N., & Wagner, D. (2017). Towards Evaluating the Robustness of Neural Networks. 2017 IEEE Symposium on Security and Privacy (SP), 39–57.
[Madry et al, 2018] Madry, A., Makelov, A., Schmidt, L., Tsipras, D., & Vladu, A. (2018). Towards Deep Learning Models Resistant to Adversarial Attacks. ICLR
9. Intermediate Summary
In ML model, the ordinary software test is difficult to apply because
(1) Tests cannot be deterministic; they can only be evaluated statistically.
(2) In the first place, it is not possible to identify the equivalence classes and their
boundaries in advance in a very high dimensional data space.
(3) Even if we accept statistical evaluation, the existence of adversarial attacks does not
guarantee the robustness of the inference.
These are due to the inductive nature of the ML model construction.
• There are no clear functional requirements. (The expected output for the input cannot be
perfectly assumed.)
• Because it learns only from a limited amount of data (though large), it is not perfectly clear
whether the trained model will make valid inferences in other cases (even if generalization
performance is improved).
10. What is the metamorphic test?
Metamorphic test itself is proposed in [Chen et al, 1998].
This technique is to uncover software error in the production phase and in the absence of test oracle.
Example: Shortest path problem on the weighted graph
undirected weighted graph with vertices V and edges E with distance function d
[Chen et al, 1998] Chen, T. Y., Cheung, S. C., & Yiu, S. M. (2020). Metamorphic Testing: A New Approach for Generating Next Test Cases. ArXiv:2002.12543
11. What is the metamorphic test?
• Of course, if the graph 𝐺 is small, we can find manually the
shortest path. So we can make the test case manually.
• But in production phase, 𝐺 is generally large. We cannot
construct the test case for such large graph!!
• How can we test for real-world input in the production phase?
→Metamorphic testing!!!
[Chen et al, 1998] Chen, T. Y., Cheung, S. C., & Yiu, S. M. (2020). Metamorphic Testing: A New Approach for Generating Next Test Cases. ArXiv:2002.12543
12. Examples of metamorphic test
Suppose that
Then the following statements should also be valid:
[Chen et al, 1998] Chen, T. Y., Cheung, S. C., & Yiu, S. M. (2020). Metamorphic Testing: A New Approach for Generating Next Test Cases. ArXiv:2002.12543
13. Examples of metamorphic test
Assertion 1
Assertion 2
[Chen et al, 1998] Chen, T. Y., Cheung, S. C., & Yiu, S. M. (2020). Metamorphic Testing: A New Approach for Generating Next Test Cases. ArXiv:2002.12543
Assertion 3
14. Formulation
the function to be tested
Ordinary test
Assert
Metamorphic test
Assert
We don’t need the
expected output.
Input
Modified
input
Output
Output
f
Metamorphic
relation
Input Output
Expected
Output
f
15. Application of metamorphic test
Application in search engine [Zhou et al, 2016]
Relation name Input modification Expected relation
MPSite If the domain of pi
is di
,
”A” → “A & site:di
”
The result still contains pi
.
MPTitle If the title of pi
is ti
,
”A” → “A & ti
”
The result still contains pi
.
MPReverseJD If A have the form “A1
& A2
& … & An
”,
“A1
& A2
& … & An
” → “An
& … & A 2
& A1
”
The results of A and reversed one have
large similarity.
(measured by Jaccard coefficient)
Search query
A
result
p1
,…,pn
Examples of metamorphic relations
Metamorphic test is designed in order to estimate the quality of search engine.
[Zhou et al, 2016] Zhou, Z. Q., Xiang, S., & Chen, T. Y. (2016). Metamorphic Testing for Software Quality Assessment: A Study of Search Engines.
IEEE Transactions on Software Engineering, 42(3), 264–284.
16. Metamorphic test in ML model
Also in machine learning system, the concept of metamorphic test is applicable.
Example in image recognition for autonomous cars (DeepTest) [Tian et al, 2018]
Applying the several transformation which is expected not to be affected to the
steering angle, the metamorphic test oracles are created.
They find the erroneous behaviors by these synthetic images.
Remark: They also use the neuron coverage for finding erroneous behavior
(explain later)
Erroneous behaviors found
by metamorphic test
[Tian et al, 2018] Tian, Y., Pei, K., Jana, S., & Ray, B. (2018). DeepTest. Proceedings of the 40th International Conference on Software Engineering, 2018-May, 303–314.
17. Other examples of metamorphic test
[Zhang et al, 2018] constructs the metamorphic test oracles by synthetic images created by GAN.
[Zhang et al, 2018] Zhang, M., Zhang, Y., Zhang, L., Liu, C., & Khurshid, S. (2018). DeepRoad: GAN-Based Metamorphic Testing and Input Validation Framework for Autonomous Driving
Systems. 2018 33rd IEEE/ACM International Conference on Automated Software Engineering (ASE), 132–142.
18. Other examples of metamorphic test
[Zhou et al, 2019] uses metamorphic test on
LIDAR data in the Baidu Apollo autonomous
car system.
By adding random data outside the ROI, they
find the fatal behavior that the pedestrian or
other car in the ROI cannot be detected by
small random noise.
[Zhou et al, 2019] Zhou, Z. Q., & Sun, L. (2019). Metamorphic testing of driverless
cars. Communications of the ACM, 62(3), 61–67.
19. Other examples of metamorphic test
[He et al, 2020] uses metamorphic test on machine translation.
I live on campus and am tall.
(erroneous behavior)
[He et al, 2020] He, P., Meister, C., & Su, Z. (2020). Structure-invariant testing for machine translation.
Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering, 961–973.
20. The number of “activated” neurons
What is neuron coverage?
Deep neural network
The number of neurons
Activation function like ReLU
21. What is neuron coverage?
NC is proposed in [Pei et al, 2017].
Idea: analogy of coverage rate
If the neuron value is large, this neuron strongly influences
the output through the downstream layers.
Low NC implies that a significant number of neurons have
no effect on the output for the test data set.
In order to test the effect on the output for all neurons, we
need to create test data that will have a high NC.
[Pei et al, 2017] Pei, K., Cao, Y., Yang, J., & Jana, S. (2017). DeepXplore. Proceedings of the 26th Symposium on Operating Systems Principles, 1–18.
22. How to create the high NC test data?
[Pei et al, 2017] proposes to use the gradient ascending and it is widely used.
Update the data in the
direction of increasing the
value of inactive neurons
23. Application of neuron coverage
DeepXplore in [Pei et al, 2017] Erroneous behaviors by changing light conditions
Make the outputs of NNs differently
Raise NC
[Pei et al, 2017] Pei, K., Cao, Y., Yang, J., & Jana, S. (2017). DeepXplore. Proceedings of the 26th Symposium on Operating Systems Principles, 1–18.
24. Application of neuron coverage
[Tian et al, 2018] DeepTest (bis.)
They use the NC for finding the erroneous behavior like DeepXplore.
Erroneous behaviors found
by metamorphic test
Metamorphic test and neuron
coverage can be combined to
find errors efficiently.
25. Is neuron coverage meaningful?
Some studies indicate that increasing NC is not necessarily meaningful for testing adversarial attack.
[Harel-Canada et al, 2020]
Generate adversarial images that also improve NC.
Carlini-Wagner type adversarial attack
(We can also choose other attacks)
Kullback-Leibler divergence
Uniform distribution in [0,1]
This term measures how different the output of the
j-th layer is from uniform distribution
This also measures whether the neurons in the
j-th layer is unbiasedly activated
= high NC
[Harel-Canada et al, 2020] Harel-Canada, F., Wang, L., Gulzar, M. A., Gu, Q., & Kim, M. (2020). Is neuron coverage a meaningful measure for testing deep neural networks?
Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 851–862.
26. Is neuron coverage meaningful?
For these high-NC adversarial images, they measures the following indices:
Naturalness measures how natural the generated image is.
• Frèchet Inception Distance: a measure of similarity between two dataset of images.
• Inception Score: a measure of naturalness based on salience and diversity
Output impartiality measures how the predictions of the model for adversarial images are biased.
Attack Success Rate is the classification error rate for adversarial test data set.
[Harel-Canada et al, 2020] Harel-Canada, F., Wang, L., Gulzar, M. A., Gu, Q., & Kim, M. (2020). Is neuron coverage a meaningful measure for testing deep neural networks?
Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 851–862.
27. Is neuron coverage meaningful?
Attack success rate Naturalness Output Impartiality
They cannot support the hypothesis that these indices is positively correlated with NC.
[Harel-Canada et al, 2020] Harel-Canada, F., Wang, L., Gulzar, M. A., Gu, Q., & Kim, M. (2020). Is neuron coverage a meaningful measure for testing deep neural networks?
Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 851–862.
28. Is neuron coverage meaningful?
Speaker’s Remark
• Contexts seems to be different between DeepTest & DeepXplore and [Harel-Canada et al, 2020].
• Deeptest & DeepXplore
• The deformation method of image is restricted and recognizable by human
• Finding erroneous behavior of the model is the main concern.
• [Harel-Canada et al, 2020]
• The deformation is unrecognizable by human.
• Finding adversarial attack is the main concern, which is more severe.
• NC+metamorphic test effectively detects erroneous behavior in response to changes in data that are
not in the test dataset but are likely to be real and recognizable by humans.
• For adversarial attack, naturalness and output impartiality are more useful indices.
29. Minimal adversarial distortion
How can we assure the robustness of ML model including adversarial attack?
In order to do it, it is sufficient to estimate how much noise is allowed to be added without change the
inference results.
Decision boundary
Decision boundary
Decision boundary
x
r
Can we calculate minimal adversarial
distortion in advance?
We can estimate the lower bound.
30. Lower bound of adversarial minimum distortion
CROWN [Zhang et al, NIPS 2018]
Moreover, these two functions are calculated explicitly as
The proof can be done by backward
induction on the layer number j and
some elementary calculations (if you
have the energy).
The proof for the simple case is given in
[Sato et al, 2021].
[Zhang et al, NIPS 2018] Zhang, H., Weng, T. W., Chen, P. Y., Hsieh, C. J., & Daniel, L. (2018). Efficient neural network robustness certification with general activation functions.
NeurIPS, 4939–4948.
31. Lower bound of adversarial minimum distortion
Prescribed range of
the neuron values
[Zhang et al, NIPS 2018] Zhang, H., Weng, T. W., Chen, P. Y., Hsieh, C. J., & Daniel, L. (2018). Efficient neural network robustness certification with general activation functions.
NeurIPS, 4939–4948.
32. Lower bound of adversarial minimum distortion
Moreover, these two functions are calculated explicitly as
Then if c is the inferenced label of the input x,
we have the lower bound of adversarial minimum distortion by
[Zhang et al, 2018]
[Zhang et al, NIPS 2018] Zhang, H., Weng, T. W., Chen, P. Y., Hsieh, C. J., & Daniel, L. (2018). Efficient neural network robustness certification with general activation functions.
NeurIPS, 4939–4948.
33. Other techniques to find the lower bound
[Boopathy et al, 2019] extends [Zhang et al, 2018] to CNN case.
The code is available as CNN-Cert on https://github.com/IBM/CNN-Cert
[Boopathy et al, 2019] Boopathy, A., Weng, T.-W., Chen, P.-Y., Liu, S., & Daniel, L. (2019). CNN-Cert: An Efficient Framework for Certifying
Robustness of Convolutional Neural Networks. Proceedings of the AAAI Conference on Artificial Intelligence, 33, 3240–3247.
[Weng et al, 2019] Weng, L., Chen, P.-Y., Nguyen, L., Squillante, M., Boopathy, A., Oseledets, I., & Daniel, L. (2019). PROVEN: Verifying
Robustness of Neural Networks with a Probabilistic Approach. PMLR.
Then the following inequality holds for any a:
34. Can we check the assertion directly?
Example: We construct the machine learning model predicting housing prices based on housing
attributes.
Can we check the following assertion?
If the living area is more than 6,000 square feet, the inferred result is more than $400,000.
At first glance, it looks like it can't be done...
• It is impossible to test on all possible data where the living area is more than 6000 square feet;
• No matter how many data are tested, there is no guarantee that the above claim will hold 100% of
the time.
However, given that we are trying to find counterexamples to the above claim, it would be nice if we
could find at least one example as follows:
Find the data where the living area is more than 6000 square feet but the inferred result is less
than or equal to $400,000.
ML model
Find the data x such that
“and”
35. Formal verification
From this logical formula, we consider eliminating f.
It may seem impossible, but for example, if f is a deep neural
network, then only affine transformations and activation functions
(ReLU) appear in the definition of f.
So, for example, when the activation function is ReLU, if you try hard, you should be able to eliminate f from
the above equation and write it down using only linear inequalities and "and/or".
In this way, we can reduce the problem to find the counterexample to the satisfiability problem:
Satisfiability problem:
For given propositional logical formula, determine if there exists the true/false assignment to atomic formula
so that the whole proposition is made true.
This problem is NP-complete but efficient algorithms are being actively researched.
36. Application of formal verification
[Huang et al, 2017] Huang, X., Kwiatkowska, M., Wang, S., & Wu, M. (2017). Safety Verification of Deep Neural Networks. In R. Majumdar & V. Kunčak (Eds.), Computer Aided
Verification (pp. 3–29). Springer International Publishing.
[Huang et al, 2017] applies formal verification to find the adversarial images.
37. Application of formal verification
[Sato et al, 2020] SATO, N., KURUMA, H., NAKAGAWA, Y., & OGAWA, H. (2020). Formal Verification of a Decision-Tree Ensemble Model and Detection of Its Violation Ranges. IEICE
Transactions on Information and Systems, E103.D(2), 363–378.
[Sato et al, 2020] applies formal verification to decision tree ensemble model and find the violation range
(the range of the values which does not satisfy the assertion)
Application result to housesales prediction problem
38. Summary of four testing method
Metamorphic test uses the relationship between the output of the base input and that of the modified input.
The erroneous behavior can be captured.
Increasing the neuron coverage is helpful to find the erroneous examples efficiently by combining
metamorphic test.
If you are worried about adversarial attack, you may check the adversarial minimum distortion. The lower
bound of this quantity can be estimated like backpropagation.
For more comprehensive search of counterexamples for some assertion, the formal verification is helpful.
Behaviorist's perspective
Focus on how the ML model
behaves.
Metamorphic test
Neuron Coverage
Anatomist’s perspective
Follow the detailed structure of the
ML model.
Adversarial minimum distortion
Formal verification
This kind of detailed
analysis is possible due to
the fact that the model
structure of DNNs and
the like is a composition
of simple functions.
39. Practical viewpoint
Metamorphic test is easy to introduce in application.
• We do not need the special libraries or special codes.
• The key is to define the metamorphic relation appropriately according to the target domain.
• Driving support system: make the images foggy, rainy, rotated, ...etc
• Chatbot: change a question into a different expression without changing the meaning.
• ...
• It is possible to use parameter search to find defects in the model without relying on neuron coverage.
CNN-Cert is available to estimate the adversarial minimum distortion.
• This is based on tensorflow and the applicable models are limited.
More important thing is how to raise the robustness and quality of ML model based on these testing
method.
• Erroneous behavior
• data augmentation (we can utilize the metamorphic test to augment the data)
• preprocessing
• Adversarial attack
• adversarial training
40. References (I)
[Sato et al, 2021] 佐藤直人・小川秀人・來間啓伸・明神智之 (2021).『AIソフトウェアのテストー答えのない答え
合わせ [4つの手法]』リックテレコム
[Goodfellow et al., 2015] Goodfellow, I. J., Shlens, J., & Szegedy, C. (2015). Explaining and Harnessing
Adversarial Examples. In Y. Bengio & Y. LeCun (Eds.), 3rd International Conference on Learning
Representations.
[Carlini & Wagner, 2017] Carlini, N., & Wagner, D. (2017). Towards Evaluating the Robustness of Neural
Networks. 2017 IEEE Symposium on Security and Privacy (SP), 39–57.
[Madry et al, 2018] Madry, A., Makelov, A., Schmidt, L., Tsipras, D., & Vladu, A. (2018). Towards Deep
Learning Models Resistant to Adversarial Attacks. 6th International Conference on Learning
Representations.
[Chen et al, 1998] Chen, T. Y., Cheung, S. C., & Yiu, S. M. (2020). Metamorphic Testing: A New Approach
for Generating Next Test Cases. ArXiv:2002.12543
[Zhou et al, 2016] Zhou, Z. Q., Xiang, S., & Chen, T. Y. (2016). Metamorphic Testing for Software Quality
Assessment: A Study of Search Engines. IEEE Transactions on Software Engineering, 42(3), 264–284.
41. References (II)
[Tian et al, 2018] Tian, Y., Pei, K., Jana, S., & Ray, B. (2018). DeepTest. Proceedings of the 40th
International Conference on Software Engineering, 2018-May, 303–314.
[Zhang et al, 2018] Zhang, M., Zhang, Y., Zhang, L., Liu, C., & Khurshid, S. (2018). DeepRoad:
GAN-Based Metamorphic Testing and Input Validation Framework for Autonomous Driving Systems.
2018 33rd IEEE/ACM International Conference on Automated Software Engineering (ASE), 132–142.
[Zhou et al, 2019] Zhou, Z. Q., & Sun, L. (2019). Metamorphic testing of driverless cars. Communications
of the ACM, 62(3), 61–67.
[He et al, 2020] He, P., Meister, C., & Su, Z. (2020). Structure-invariant testing for machine translation.
Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering, 961–973.
[Pei et al, 2017] Pei, K., Cao, Y., Yang, J., & Jana, S. (2017). DeepXplore. Proceedings of the 26th
Symposium on Operating Systems Principles, 1–18.
[Harel-Canada et al, 2020] Harel-Canada, F., Wang, L., Gulzar, M. A., Gu, Q., & Kim, M. (2020). Is
neuron coverage a meaningful measure for testing deep neural networks? Proceedings of the 28th ACM
Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of
Software Engineering, 851–862.
42. References (III)
[Zhang et al, NIPS2018] Zhang, H., Weng, T. W., Chen, P. Y., Hsieh, C. J., & Daniel, L. (2018). Efficient
neural network robustness certification with general activation functions. Advances in Neural Information
Processing Systems, 2018-Decem(NeurIPS), 4939–4948.
[Boopathy et al, 2019] Boopathy, A., Weng, T.-W., Chen, P.-Y., Liu, S., & Daniel, L. (2019). CNN-Cert: An
Efficient Framework for Certifying Robustness of Convolutional Neural Networks. Proceedings of the
AAAI Conference on Artificial Intelligence, 33, 3240–3247.
[Weng et al, 2019] Weng, L., Chen, P.-Y., Nguyen, L., Squillante, M., Boopathy, A., Oseledets, I., &
Daniel, L. (2019). PROVEN: Verifying Robustness of Neural Networks with a Probabilistic Approach. In
K. Chaudhuri & R. Salakhutdinov (Eds.), Proceedings of the 36th International Conference on Machine
Learning (Vol. 97, pp. 6727–6736). PMLR.
[Huang et al, 2017] Huang, X., Kwiatkowska, M., Wang, S., & Wu, M. (2017). Safety Verification of Deep
Neural Networks. In R. Majumdar & V. Kunčak (Eds.), Computer Aided Verification (pp. 3–29). Springer
International Publishing.
[Sato et al, 2020] SATO, N., KURUMA, H., NAKAGAWA, Y., & OGAWA, H. (2020). Formal Verification of
a Decision-Tree Ensemble Model and Detection of Its Violation Ranges. IEICE Transactions on
Information and Systems, E103.D(2), 363–378.