2. AGENDA
Intro + Quick agenda walkthrough(brief talk)
a. Introduction to Machine Learning and Artificial Intelligence
b. Machine Learning vs Traditional Programming
c. Model Lifecycle
d. Model Evaluation
e. Regularization and challenges
Hands-On Activity:
a. Create classification model
b. CNN model
2
6. “Machine learning is an application
of artificial intelligence (AI) that
provides systems the ability to
automatically learn and improve
from experience without being
explicitly programmed“
6
13. ● Label: Is what you're attempting to predict or forecast
● Features: are an individual measurable property OR the descriptive attributes
● Hyperparameters : is an external parameter to the model whose value is set before the learning
process and are adjustable that must be tuned in order to obtain a model with optimal
performance.
Frequent terms used in ML
1
3
20. Training data Vs Test data
● Training set— Data subset to train a model
● Test set— Data subset to test the trained model
You could imagine slicing the single data set as follows:
2
0
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.33, random_state=42)
25. Testing the feature
● Test whether the value of features lies between the threshold values
● Test whether the feature importance changed with respect to previous QA run
● Test/Review whether the generated feature violates the data compliance related issues
2
5
32. Precision
Out of all the predictions predicted as beer , how many are correctly classified as beer ?
True Positive +False Positive
True Positive
3
2
33. Recall
Out of all the drinks labeled as beer , How many were correctly predicted ?
True Positive
True Positive +False Negative
3
3
35. Metrics used for Regression Model
● Mean Absolute Error : MAE measures the average magnitude of the errors in a set of
predictions, without considering their direction.
● Root Mean Squared Error : RMSE is a quadratic scoring rule that also measures the average
magnitude of the error. It’s the square root of the average of squared differences between
prediction and actual observation.
3
5
37. ● Feature Vectors: A feature vector is a vector in which each dimension represent a certain feature
of an example
● Learning Rate: number of time data is reread in a model to perform accurate predictions.
Other terms used in ML
3
7
38. It depends on application type.
Examples :
● Decision tree,Random forest → classification
● Linear Regression → regression
● Naive bayes algorithm → classification
APIs of few libraries used to develop/test ML models
● Tensorflow
● Cloud Vision API
● Natural Language
● Google Speech
Some algorithmic models
3
8
42. Challenges in testing
● Fast machines and processors
● Generate training data
● Generate test Data
● Know the Threshold and test with new data
● Data Filtering/quality of data - Enhancing data, Prevent overfitting & underfitting
4
2