With the latest technologies like Machine Learning evolving rapidly, QAs must know the right strategy to test applications, as AI apps like personal assistants and smart cars have a direct impact on our lives.
2. AGENDA
1. Intro + quick agenda walkthrough (brief talk)
a. What is AI/ML?
b. How technology is shifting towards AI and ML
c. Where does a QA step in?
d. Challenges while testing AI/ML applications
Hands-On Activity:
1. Create and test a basic beer-wine classifier
2. Create an image classifier (via CLI)
a. Retrain a MobileNet
b. Generate test data
c. Create optimized graphs
d. Test your classifier
3. Real-time image classifier via Android app (OPTIONAL)
a. Retrain a MobileNet
b. Generate test data
c. Create optimized graphs
d. Test your classifier
3. PREREQUISITES
Please complete all the following steps:
● Clone the following repositories locally:
a. https://github.com/tarunmaini16/beer-wine-classifier
b. https://github.com/tarunmaini16/image-classifier
c. https://github.com/tarunmaini16/android-image-classifier
● Pull the following Docker images (optional):
a. https://cloud.docker.com/u/tarunmaini/repository/docker/tarunmaini/wine-beer-classification
b. https://cloud.docker.com/u/tarunmaini/repository/docker/tarunmaini/image-classifier
● Install Python on your system and the Python plugin in IntelliJ
● Install TensorFlow via terminal: $ pip install --upgrade "tensorflow==1.9.*"
● Android Studio setup [v3.1+]
● Android device OR virtual emulator (API level 27/28, target Android 8.1/9)
● Bring your data cables to connect a mobile device
● ADB setup
6. "Machine learning is an application of artificial intelligence (AI) that provides systems the ability to automatically learn and improve from experience without being explicitly programmed."
14. Frequent terms used in ML
● Label: what you are attempting to predict or forecast.
● Features: individual measurable properties, or the descriptive attributes, of an example.
● Feature Vector: a vector in which each dimension represents a certain feature of an example.
● Learning Rate: the step size used when updating model parameters during training; it controls how quickly the model adapts to the data. (The number of times the training data is reread is the number of epochs.)
● Hyperparameters: parameters whose values are set before the learning process begins, used to fine-tune performance, such as the learning rate or the regularization strength of a logistic regression model.
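To make the learning-rate definition concrete, here is a minimal hypothetical sketch in plain Python (toy data, not from the workshop): the learning rate scales each parameter update, and the epoch count is how many times the data is reread.

```python
# Minimal gradient-descent sketch: fit y = w * x to toy data.
# The learning rate scales each update step; epochs control how
# many times the training data is reread.
data = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]  # (x, y) pairs, true w = 2

def train(learning_rate, epochs):
    w = 0.0
    for _ in range(epochs):
        for x, y in data:
            error = w * x - y               # prediction error
            w -= learning_rate * error * x  # gradient step for squared loss
    return w

w = train(learning_rate=0.05, epochs=100)
print(round(w, 3))  # converges to ~2.0 after training
```

A much larger learning rate would make the updates overshoot and diverge; a much smaller one would need far more epochs, which is exactly the trade-off the term describes.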
19. Training data Vs Test data
● Training set: the subset of data used to train the model
● Test set: the subset of data used to test the trained model
You could imagine slicing the single data set as follows:
25. Testing the feature
● Test whether the value of each feature lies between the threshold values
● Test whether the feature importance changed with respect to the previous QA run
● Test feature suitability by measuring RAM usage, inference latency, etc.
● Test/review whether a generated feature violates data-compliance requirements
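The first two checks above can be sketched as simple assertions. This is a hypothetical sketch assuming features arrive as a dict of values with known thresholds and a stored importance snapshot from the previous QA run:

```python
# Hypothetical QA checks for generated features: values in range,
# and importance stable compared to a previous QA run.
features = {"color": 0.72, "acidity": 0.31}            # current feature values
thresholds = {"color": (0.0, 1.0), "acidity": (0.0, 1.0)}
importance_now = {"color": 0.64, "acidity": 0.36}
importance_prev = {"color": 0.60, "acidity": 0.40}

# 1. Each feature value lies between its threshold values.
for name, value in features.items():
    lo, hi = thresholds[name]
    assert lo <= value <= hi, f"{name} out of range"

# 2. Feature importance has not drifted too far since the previous run.
for name in importance_now:
    drift = abs(importance_now[name] - importance_prev[name])
    assert drift < 0.1, f"{name} importance drifted by {drift:.2f}"

print("feature checks passed")
```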
27. Some algorithmic models
It depends on the application type. Examples:
● Decision tree, Random forest → classification
● Linear regression → regression
● Naive Bayes → classification
APIs of a few libraries used to develop/test ML models:
● TensorFlow
● Cloud Vision API
● Natural Language API
● Google Speech API
37. Precision
Out of all the predictions classified as beer, how many are correctly classified as beer?
Precision = True Positives / (True Positives + False Positives)
38. Recall
Out of all the drinks actually labeled as beer, how many were correctly predicted?
Recall = True Positives / (True Positives + False Negatives)
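The two formulas can be checked with a tiny plain-Python example (toy labels for illustration, not from the workshop data):

```python
# Toy predictions for the beer/wine example: compute precision and
# recall for the "beer" class from true/predicted label pairs.
actual    = ["beer", "beer", "wine", "beer", "wine", "wine"]
predicted = ["beer", "wine", "beer", "beer", "beer", "wine"]

tp = sum(a == "beer" and p == "beer" for a, p in zip(actual, predicted))
fp = sum(a == "wine" and p == "beer" for a, p in zip(actual, predicted))
fn = sum(a == "beer" and p == "wine" for a, p in zip(actual, predicted))

precision = tp / (tp + fp)   # of everything predicted beer, how much is beer
recall    = tp / (tp + fn)   # of all actual beer, how much did we find

print(precision, recall)     # → 0.5 0.666...
```

Here two wines were misclassified as beer (false positives), so precision drops to 0.5, while one missed beer (false negative) puts recall at 2/3.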
39. Metrics used for Regression Model
● Root Mean Square Error (RMSE): a measure of accuracy used to compare the forecasting errors of different models on a particular dataset (not across datasets).
● Mean Absolute Error (MAE): the average absolute difference between the model's predictions and the actual values. (The percentage version of this idea is the Mean Absolute Percentage Error, MAPE.)
● Entropy: used as an impurity measure, e.g. when growing decision trees; it is a classification-side measure rather than a regression metric.
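Both error metrics are easy to compute by hand; a small plain-Python example with made-up numbers:

```python
import math

# Toy regression example: compute RMSE and MAE by hand.
actual    = [3.0, 5.0, 2.5, 7.0]
predicted = [2.5, 5.0, 4.0, 8.0]

errors = [p - a for p, a in zip(predicted, actual)]
rmse = math.sqrt(sum(e ** 2 for e in errors) / len(errors))  # penalizes large errors
mae  = sum(abs(e) for e in errors) / len(errors)             # average absolute miss

print(round(rmse, 3), round(mae, 3))  # → 0.935 0.75
```

Note that RMSE ≥ MAE always: squaring before averaging weights the large errors more heavily, which is why the two metrics can rank models differently.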
41. Challenges in testing
● Access to fast machines and processors
● Generating training data
● Generating test data
● Knowing the thresholds and testing with new data
● Data filtering / quality of data: enhancing data, preventing overfitting and underfitting
42. PREREQUISITES (same as slide 3 above)
"Machine Learning is the field of study that gives computers the ability to learn without being explicitly programmed." - Arthur Samuel, 1959
Machine Learning is the science of programming computers so they can “learn from data”
A computer program is said to learn from experience E with respect to some task T and some performance measure P, if its performance on T, as measured by P, improves with experience E. - Tom Mitchell, 1997
Machine learning is an application of artificial intelligence (AI) that provides systems the ability to automatically learn and improve from experience without being explicitly programmed
Machine learning is a form of AI that enables a system to learn from data rather than through explicit programming. However, machine learning is not a simple process. As the algorithms ingest training data, it is then possible to produce more precise models based on that data. A machine learning model is the output generated when you train your machine learning algorithm with data. After training, when you provide a model with an input, you will be given an output. For example, a predictive algorithm will create a predictive model. Then, when you provide the predictive model with data, you will receive a prediction based on the data that trained the model.
Artificial intelligence is the science and engineering of making computers behave in ways that, until recently, we thought required human intelligence.
Within the domain of neural networks, there is an area called Deep Learning (DL), in which neural networks have more than three layers, i.e. more than one hidden layer. The neural networks used in Deep Learning are called Deep Neural Networks (DNNs). So, Deep Learning is a technique for implementing Machine Learning.
Social Networking: Facebook automatically recognises faces and suggests tagging a friend.
Banking / Finance: Fraud detection algorithms to classify fraudulent transactions are in place.
Mobile:
-Personal Assistants
-Voice to text
-Technology
Online Shopping: Recommendations of similar products
Search Engines: Google’s autocomplete suggestions for search
Medicine : Researches on using ML for disease diagnosis. - Google’s DeepMind Health
Machine learning has the potential to automate a large portion of skilled labor, but the degree to which this affects a workforce depends on the level of difficulty involved in the job.
Education :
1.) Algorithms can analyze test results, drastically reducing the time teachers spend on grading.
2.)A student's attendance and academic history can help determine gaps in knowledge and learning disabilities.
Law:
J.P. Morgan, for example, uses a software program dubbed COIN (Contract Intelligence) to review, in seconds, documents and previous cases that would otherwise take 360,000 hours.
Transportation :
1.) Rolls-Royce and Google have teamed up to design and launch the world's first self-driving ship by 2020.
2.) NASA has successfully launched and landed an autonomous space shuttle.
Manual Labour :
Driverless trucks operate in mining pits in Australia, controlled remotely from a distant control center. Automation suits particular jobs that involve some element of danger or potential harm, such as work in factories and mining.
Healthcare:
Hospitals are currently using AI algorithms to more accurately detect tumors in radiology scans and analyze different moles for skin cancer, and machine learning is being adapted to accelerate research toward a cure for cancer.
Alexa:
voice-activated control of your smart-home (the dimming of lights, closing of blinds, locking of doors, etc., all at your command).
Supervised: Supervised learning identifies patterns in data given pre-determined features and labeled data.
Unsupervised: Unsupervised learning identifies patterns in data, which is particularly helpful for unlabeled and unstructured data.
Semi-supervised: A blend of supervised and unsupervised learning. Best in situations where there is some labeled data but not a lot.
Reinforcement: Reinforcement learning provides feedback to the algorithm as it trains; it is essentially experience-driven decision making.
Typical business uses of supervised learning include recognizing objects in images, predicting financial results, detecting fraud, and evaluating risk.
Unsupervised : Categorizing news, books, and other things, recommending items to customers.
Semi-supervised: detecting spam, classifying web content, and analyzing speech.
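The supervised/unsupervised distinction can be made concrete with a tiny scikit-learn sketch (assuming scikit-learn is installed, as the workshop's other examples do): the classifier learns from the provided labels, while k-means only ever sees the raw points.

```python
import numpy as np
from sklearn.datasets import make_blobs
from sklearn.linear_model import LogisticRegression
from sklearn.cluster import KMeans

# Two well-separated groups of labeled points.
X, y = make_blobs(n_samples=100, centers=2, random_state=0)

# Supervised: learns the mapping from features to the given labels.
clf = LogisticRegression().fit(X, y)
print("train accuracy:", clf.score(X, y))

# Unsupervised: finds structure without ever seeing the labels.
km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
print("cluster sizes:", np.bincount(km.labels_))
```

On well-separated blobs both approaches recover the two groups, but only the supervised model knows which group carries which label.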
https://semanti.ca/blog/?glossary-of-machine-learning-terms
A feature vector is a one-dimensional array of numbers used to describe a feature of an image. It can describe an entire image (global feature) or a feature present at a particular location in the image space (local feature).
Bias
The bias is an error from erroneous assumptions in the learning algorithm. High bias can cause an algorithm to miss the relevant relations between features and target outputs (underfitting)
The only steps in the process that have human intervention:
Gathering data
Preparing that data
Choosing a model
Training
Evaluation
Hyperparameter tuning
Prediction.
Features used for the beer-wine classifier:
Color
Taste (differs with acidity / alcohol content)
Things to explain here:
The approach to development and testing isn't traditional here. But does that mean NO QA for ML applications?
No. The answer is being adaptive enough to learn how to test those predictions. As of now, due to lack of QA knowledge in this area, data scientists both develop and test the models they create.
In the long term this will not work when applications scale.
So now, the problem at hand is:
How do we test predictions? [The challenge for QA]
Training set: 80%, Test set: 20%. Make sure that your test set meets the following two conditions:
Is large enough to yield statistically meaningful results.
Is representative of the data set as a whole. In other words, don't pick a test set with different characteristics than the training set.
Never train on test data. If you are seeing surprisingly good results on your evaluation metrics, it might be a sign that you are accidentally training on the test set. For example, high accuracy might indicate that test data has leaked into the training set.
For example, consider a model that predicts whether an email is spam, using the subject line, email body, and sender's email address as features. We apportion the data into training and test sets, with an 80-20 split. After training, the model achieves 99% precision on both the training set and the test set. We'd expect a lower precision on the test set, so we take another look at the data and discover that many of the examples in the test set are duplicates of examples in the training set (we neglected to scrub duplicate entries for the same spam email from our input database before splitting the data). We've inadvertently trained on some of our test data, and as a result, we're no longer accurately measuring how well our model generalizes to new data.
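The 80/20 split and the duplicate-leak check from the example above can be sketched in plain Python. This is a hypothetical sketch with toy records (not the workshop dataset); the record generator deliberately includes duplicates so the leak check has something to find:

```python
import random

# Toy dataset of (subject, label) records; ids repeat past 80, so
# some records are exact duplicates (like duplicate spam emails).
records = [(f"subject {i % 80}", i % 2) for i in range(100)]

random.seed(42)
random.shuffle(records)
split = int(len(records) * 0.8)          # 80/20 split
train, test = records[:split], records[split:]

# Leak check: no test example should also appear in the training set.
train_set = set(train)
leaks = [r for r in test if r in train_set]
print(f"train={len(train)} test={len(test)} leaked={len(leaks)}")
```

Any nonzero leak count means the evaluation no longer measures generalization: deduplicate before splitting, not after.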
Data snooping bias:
The test set has to be created immediately after receiving the dataset. Otherwise, as humans we derive a pattern from all the data, and there is a possibility of bias while training the model, which is called "data snooping bias".
A validation dataset is a dataset of examples used to tune the hyperparameters (i.e. the architecture) of a classifier. It is sometimes also called the development set or the "dev set". In artificial neural networks, a hyperparameter is, for example, the number of hidden units. The validation set, like the test set mentioned above, should follow the same probability distribution as the training dataset.
In the figure, "Tweak model" means adjusting anything about the model you can dream up—from changing the learning rate, to adding or removing features, to designing a completely new model from scratch. At the end of this workflow, you pick the model that does best on the test set.
Dividing the data set into two sets is a good idea, but not a panacea. You can greatly reduce your chances of overfitting by partitioning the data set into the three subsets shown in the following figure:
Use the validation set to evaluate results from the training set. Then, use the test set to double-check your evaluation after the model has "passed" the validation set. The following figure shows this new workflow:
In this improved workflow:
Pick the model that does best on the validation set.
Double-check that model against the test set.
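The improved workflow can be sketched in a few lines of plain Python: several candidate "models" (here just hypothetical prediction rules, standing in for real tweaked models) compete on the validation set, and only the winner touches the test set.

```python
# Sketch of the train/validation/test workflow: candidate models are
# scored on the validation set; only the winner is checked on test.
validation = [(1, 2), (2, 4), (3, 6)]   # (x, y) pairs
test       = [(4, 8), (5, 10)]

candidates = {
    "double":  lambda x: 2 * x,
    "square":  lambda x: x * x,
    "add_one": lambda x: x + 1,
}

def mse(model, data):
    return sum((model(x) - y) ** 2 for x, y in data) / len(data)

# Pick the model that does best on the validation set...
best_name = min(candidates, key=lambda n: mse(candidates[n], validation))
# ...then double-check that model against the test set.
test_error = mse(candidates[best_name], test)
print(best_name, test_error)
```

Because the test set never influences which model is picked, its error remains an honest estimate of generalization.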
Some of the ways of generating data are:
E.g., in linear regression, make_regression() takes several inputs: the number of data points to generate (n_samples), the number of input features (n_features), and the noise level in the output data (noise).
In clustering, make_blobs() from sklearn can be used to generate clustering data for any number of features (n_features), with the corresponding labels.
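A runnable sketch of both helpers (assuming scikit-learn is installed); the parameter values are arbitrary examples:

```python
from sklearn.datasets import make_regression, make_blobs

# Regression data: 100 samples, 3 input features, noisy targets.
X_reg, y_reg = make_regression(n_samples=100, n_features=3, noise=0.1)
print(X_reg.shape, y_reg.shape)   # (100, 3) (100,)

# Clustering data: 100 samples in 2 features, grouped into 3 blobs,
# with the corresponding cluster labels returned alongside.
X_blob, labels = make_blobs(n_samples=100, n_features=2, centers=3)
print(X_blob.shape, set(labels))  # (100, 2) {0, 1, 2}
```

Generated data like this is handy for QA because the ground truth is known exactly, so model predictions can be checked against it.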
Underfitting:
A statistical model or a machine learning algorithm is said to underfit when it cannot capture the underlying trend of the data (it's just like trying to fit into undersized pants!). Underfitting destroys the accuracy of our machine learning model. Its occurrence simply means that our model or algorithm does not fit the data well enough. It usually happens when we have too little data to build an accurate model, or when we try to build a linear model with non-linear data. In such cases the rules of the machine learning model are too simple to capture the data, and the model will probably make a lot of wrong predictions. Underfitting can be addressed by using a more flexible model or adding more informative features.
Overfitting:
A statistical model is said to be overfitted when it learns the training data too closely (just like fitting ourselves into oversized pants!). When a model has too much freedom relative to the data, it starts learning from the noise and inaccurate entries in the data set. The model then fails to categorize new data correctly, because it has memorized too much detail and noise. Overfitting is often caused by non-parametric and non-linear methods, because these types of machine learning algorithms have more freedom in building the model from the dataset and can therefore build unrealistic models. Solutions to avoid overfitting include using a linear algorithm if the data is linear, or constraining parameters like the maximal depth when using decision trees.
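Both failure modes can be demonstrated with a small NumPy sketch (toy sine-wave data, not from the workshop): a degree-1 polynomial underfits, while a very high-degree polynomial drives training error down by fitting the noise.

```python
import numpy as np

# Under/overfitting demo: fit polynomials of different degrees to
# noisy samples of a sine wave and compare train vs. test error.
rng = np.random.default_rng(0)
x = np.linspace(0, 1, 30)
y = np.sin(2 * np.pi * x) + rng.normal(0, 0.3, size=x.size)
x_train, y_train = x[::2], y[::2]   # every other point for training
x_test, y_test = x[1::2], y[1::2]

def fit_and_score(degree):
    coeffs = np.polyfit(x_train, y_train, degree)
    err = lambda xs, ys: float(np.mean((np.polyval(coeffs, xs) - ys) ** 2))
    return err(x_train, y_train), err(x_test, y_test)

under_train, under_test = fit_and_score(1)   # too simple: underfits
good_train, good_test = fit_and_score(4)     # reasonable fit
over_train, over_test = fit_and_score(12)    # fits the noise: overfits

print(f"deg 1:  train={under_train:.3f} test={under_test:.3f}")
print(f"deg 4:  train={good_train:.3f} test={good_test:.3f}")
print(f"deg 12: train={over_train:.3f} test={over_test:.3f}")
```

The telltale signature: training error always falls as the degree grows, but test error bottoms out at a moderate degree and then worsens again.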
What do you think was involved in building this algorithm?
Think of it as your mind reading information after it has been fed similar information!
In this exercise, we will retrain a MobileNet. MobileNet is a small, efficient convolutional neural network. "Convolutional" just means that the same calculations are performed at each location in the image.
Tensorflow: is used for acquiring data, training models, serving predictions, and refining future results
Cloud Vision API provides a REST API to understand and extract information from an image. It uses powerful machine learning models to classify images into thousands of categories, detect faces, identify adult content, emotions, OCR support and more.
Natural Language API is used to identify parts of speech and to detect multiple types of entities like persons, monuments, etc. It can also perform sentiment analysis. It currently supports three languages: English, Spanish and Japanese
Speech API is used to translate audio files into text. It is able to identify over 80 languages and their variants, and can work with most audio files
Description
TensorFlow is an open-source software library for dataflow and differentiable programming across a range of tasks. It is a symbolic math library, and is also used for machine learning applications such as neural networks
TensorFlow can train and run deep neural networks for handwritten digit classification, image recognition, word embeddings, recurrent neural networks, sequence-to-sequence models for machine translation, natural language processing, and PDE (partial differential equation) based simulations. Best of all, TensorFlow supports production prediction at scale, with the same models used for training
TensorFlow allows developers to create dataflow graphs—structures that describe how data moves through a graph, or a series of processing nodes. Each node in the graph represents a mathematical operation, and each connection or edge between nodes is a multidimensional data array, or tensor.
Decision trees can be applied to both classification & regression tasks.
For regression task, decision trees use the MSE instead of gini score.
Scikit uses CART Algorithm to grow decision trees.
Main issue with Decision trees is the sensitivity to change in training data
--------------Random Forest ------------------
Random forest is an ensemble of Decision trees.
Instead of searching for the best feature to split a node, it searches for the best feature among a random subset of features, introducing more randomness; this extra diversity trades a slightly higher bias for a lower variance.
An important quality of random forests is that they make it easy to measure the relative importance of each feature: a feature's importance reflects how much it reduces impurity on average across the trees.
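The feature-importance property is directly available in scikit-learn (assuming scikit-learn is installed); a minimal sketch on generated data where only some features are informative:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# 5 features, of which only 2 actually carry signal about the class.
X, y = make_classification(n_samples=200, n_features=5,
                           n_informative=2, random_state=0)

clf = RandomForestClassifier(n_estimators=50, random_state=0).fit(X, y)

# Importances are impurity-reduction averages and sum to 1.
for i, imp in enumerate(clf.feature_importances_):
    print(f"feature_{i}: {imp:.3f}")
```

For QA, this is a cheap way to implement the earlier check "did feature importance change with respect to the previous run": snapshot `feature_importances_` and diff it across runs.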
------Naive Bayes-----------
Naive Bayes applies Bayes' theorem with the "naive" assumption of conditional independence between every pair of features given the class label.
Despite this simplifying assumption, it trains very fast and works well in practice for text classification tasks such as spam filtering.
If you specify a small learning_rate, like 0.005, the training will take longer, but the overall precision might increase
For example, 'mobilenet_1.0_224' will pick a model that is 17 MB in size and takes 224-pixel input images, while 'mobilenet_0.25_128_quantized' will choose a much less accurate, but smaller and faster network that's 920 KB on disk and takes 128x128 images.
Should we talk about F-beta score ?
When false positives are OK but false negatives are NOT OK → optimize for recall. E.g., you cannot tell a sick person that he is healthy, but you may tell a healthy person that he is sick and needs a re-test.
When false negatives are OK but false positives are not OK → optimize for precision. E.g., an important mail going to spam is wrong, but a spam mail in the inbox might be OK.
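In answer to the F-beta question above: F-beta combines precision and recall into one number, weighting recall beta times as much as precision (beta = 1 gives the familiar F1 score). A plain-Python sketch with arbitrary example values:

```python
# F-beta score from precision and recall; beta > 1 favors recall,
# beta < 1 favors precision, beta = 1 is the harmonic mean (F1).
def f_beta(precision, recall, beta=1.0):
    b2 = beta ** 2
    return (1 + b2) * precision * recall / (b2 * precision + recall)

p, r = 0.5, 0.8
print(round(f_beta(p, r, beta=1.0), 3))  # → 0.615 (F1)
print(round(f_beta(p, r, beta=2.0), 3))  # → 0.714 (recall-weighted F2)
```

Note how F2 sits closer to the (higher) recall than F1 does, which is exactly what you want for sick/healthy-style problems where false negatives are costly.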
RMSE: used in meteorology, for example, to see how effectively a mathematical model predicts the behavior of the atmosphere. This is a regression-type problem.
So, we have data for which we are trying to achieve a prediction/output, and we have to choose the best model/algorithm to achieve accurate predictions. We therefore evaluate the model using the different metrics above.