Remove unnecessary data:
- Remove ping commands
- Remove IP addresses
- Remove drive letters
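The cleansing steps above can be sketched roughly as follows. This is a minimal illustration, assuming the training data is a list of Windows command-line strings; the slide does not give the exact rules, so the regular expressions and the function name `cleanse` are assumptions made for this example.

```python
import re

# Assumed patterns: the slides do not specify how IPs or drive letters are matched.
IPV4_RE = re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}\b")   # dotted-quad IPv4 address
DRIVE_RE = re.compile(r"\b[A-Za-z]:(?=\\|/|\s|$)")     # drive letter such as "C:"


def cleanse(commands):
    """Return command lines with ping commands, IP addresses and drive letters removed."""
    cleaned = []
    for line in commands:
        tokens = line.strip().split()
        if not tokens:
            continue  # skip blank lines
        # Drop ping commands entirely (assumed to carry no useful signal).
        if tokens[0].lower() in ("ping", "ping.exe"):
            continue
        line = IPV4_RE.sub("", line)    # strip IP addresses
        line = DRIVE_RE.sub("", line)   # strip drive letters
        line = " ".join(line.split())   # collapse leftover whitespace
        if line:
            cleaned.append(line)
    return cleaned
```

For example, `cleanse(["ping 192.168.0.1", "dir C:\\Users"])` drops the ping line and keeps the `dir` command without its drive letter.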
Data Cleansing
Copyright ©2017 JPCERT/CC All rights reserved.
Collection of training data
Data cleansing
Data analysis with Machine Learning
Evaluate and select the best algorithm
Flow of Algorithm Selection for Machine Learning
Data analysis with Machine Learning algorithms:
- Decision Tree
- Naive Bayes
- k-Nearest Neighbors
- Logistic Regression
- Support Vector Machine
Evaluate based on:
- Accuracy
- Recall
- Precision
- F1 Score
Select algorithm with best performance
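The evaluation and selection step can be illustrated with a small sketch. It assumes binary labels (1 = positive, 0 = negative) and that each candidate algorithm has already produced predictions on the same test set. The slide does not say which metric decides "best performance", so ranking by F1 score here is an assumption, and the names `scores` and `select_best` are hypothetical.

```python
def scores(y_true, y_pred):
    """Compute accuracy, recall, precision and F1 for binary labels (1 = positive)."""
    if not y_true or len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must be non-empty and of equal length")
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    accuracy = (tp + tn) / len(pairs)
    recall = tp / (tp + fn) if tp + fn else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {"accuracy": accuracy, "recall": recall,
            "precision": precision, "f1": f1}


def select_best(y_true, predictions, metric="f1"):
    """Score every algorithm's predictions and return the best name plus all scores.

    `predictions` maps an algorithm name to its list of predicted labels.
    Ranking by `metric` (default F1) is an assumption, not taken from the slides.
    """
    results = {name: scores(y_true, y_pred)
               for name, y_pred in predictions.items()}
    best = max(results, key=lambda name: results[name][metric])
    return best, results
```

In practice the predictions would come from trained models such as the five algorithms listed above; the helper only compares their outputs on a common test set.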