Botnet Detection in Software Defined Networks by Deep Learning Techniques
Authors:
Ivan Letteri
Giuseppe Della Penna
Giovanni De Gasperis
University of L’Aquila
Road Map
Botnet and Cyber Crime
● Components
○ Botmaster
○ Bots
○ Command & Control
● Architecture
○ Client-Server
○ Peer 2 Peer
○ Hybrid
- Botnet: a set of internet-connected devices (bots) controlled by an attacker, the botmaster, through a Command & Control channel; difficult to detect and eradicate due to its large size and distribution
- Architecture: determines the size and distribution of the botnet; it can be Client-Server, P2P, or a hybrid of both, in order to be more resilient and avoid detection
Software Defined Networking
● Architecture
○ Application plane
○ Control plane
○ Data plane
● OpenFlow protocol
○ Messages
○ Flow tables
● Application plane
○ Routing for traffic monitoring
○ Botnet behavioral analysis
- SDN: an emerging networking approach that centralizes the network intelligence in a single component, the Controller. The forwarding (Data Plane) is separated from the routing (Control Plane); the Controller programs the flow tables of data-plane devices through OpenFlow messages
- Idea: monitor traffic for malware behavior analysis in order to detect botnet attacks (see the polling sketch below)
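A minimal sketch of the monitoring idea: polling flow statistics from the SDN controller's northbound REST API. The endpoint path and response shape below are assumptions modeled on Floodlight-style controllers, not the setup used in the paper.

```python
import requests

# Hypothetical Floodlight-style endpoint; adjust to your controller's REST API.
CONTROLLER = "http://127.0.0.1:8080"
FLOW_STATS_URL = f"{CONTROLLER}/wm/core/switch/all/flow/json"

def poll_flow_stats():
    """Fetch the per-switch flow statistics exposed by the controller."""
    resp = requests.get(FLOW_STATS_URL, timeout=5)
    resp.raise_for_status()
    return resp.json()  # assumed shape: {switch_dpid: [flow_entry, ...]}

if __name__ == "__main__":
    stats = poll_flow_stats()
    for dpid, flows in stats.items():
        print(f"switch {dpid}: {len(flows)} active flows")
```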
State of the Art
● Tang et al.
○ NSL-KDD
○ Self Taught Learning
○ 6 SDN features
● Kalavaini et al.
○ CTU 13
○ SVM, NB, NN and Decision Trees
● Wang W. et al.
○ Convolutional NN
○ USTC-TFC2016
- Tang et al.: propose a deep learning IDS for SDN using STL and the NSL-KDD dataset
- 6 SDN features: duration, protocol type, source & destination bytes, count, and service count
- Kalavaini et al.: compare ML models such as SVM, Naive Bayes, Decision Trees and NN on the CTU-13 dataset
- Wang W. et al.: encode traffic as images to train a CNN on the customized USTC-TFC2016 dataset
The Dataset
● HogZilla Dataset
○ CTU 13
○ ISCX-IDS
○ 990k samples
○ 192 features
● Fair dataset
○ 50% bot traffic
○ 50% normal traffic
● Features Selection
○ 22 SDN features
■ 8 direct
■ 14 calculated
- HogZilla Dataset: a public dataset built by merging the preprocessed and classified CTU-13 and ISCX IDS datasets
- Fair dataset: composed of 50% botnet traffic (180K samples) and 50% normal traffic (180K samples); a balancing sketch follows below
- Features selection: all statistical features obtainable from the Controller via OpenFlow; 8 directly extracted, the remaining 14 calculated from them
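A minimal sketch of how such a fair (balanced) dataset could be extracted with pandas. The file name, label column, and class values are hypothetical; only the 180K-per-class target comes from the slides.

```python
import pandas as pd

# Hypothetical file/column names; the real HogZilla export may differ.
df = pd.read_csv("hogzilla.csv")

normal = df[df["label"] == "normal"]
botnet = df[df["label"] == "botnet"]

# Undersample each class to the same size (180K per class in the paper).
n = min(len(normal), len(botnet), 180_000)
fair = pd.concat([
    normal.sample(n=n, random_state=42),
    botnet.sample(n=n, random_state=42),
]).sample(frac=1.0, random_state=42)  # shuffle the balanced set

fair.to_csv("fair_dataset.csv", index=False)
```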
The Neural Network (MLP)
● Network implementation
○ TensorFlow
○ Keras
○ SciKitLearn
● Multi Layer Perceptron
○ 22 input neurons
○ 7 hidden layers (diamond shape)
○ Softmax activation with 2 output neurons
● Dropout
○ Random cutting of 30% of links
- Keras + TF + SciKitLearn: Keras runs on top of the TensorFlow API; the SciKitLearn library handles data manipulation
- Architecture: input layer with 22 neurons, 7 hidden layers (44, 88, 176, 88, 44, 22, 11) and 2 neurons in the output layer; a Keras sketch follows below
- Avoid overfitting via Dropout: all layers are fully connected, but only 70% of the links are randomly kept at every epoch (30% dropout)
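A sketch of the diamond-shaped MLP in Keras, following the layer sizes (22 in, 44-88-176-88-44-22-11 hidden, 2 out with softmax) and the 30% dropout stated above. The ReLU hidden activations and the placement of dropout after every hidden layer are assumptions, since the slides only specify the output activation.

```python
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense, Dropout

HIDDEN = [44, 88, 176, 88, 44, 22, 11]  # diamond-shaped hidden layers

model = Sequential()
# Input layer: the 22 SDN features described above.
model.add(Dense(HIDDEN[0], activation="relu", input_shape=(22,)))
model.add(Dropout(0.3))  # 30% of links randomly dropped
for units in HIDDEN[1:]:
    model.add(Dense(units, activation="relu"))
    model.add(Dropout(0.3))
# Two softmax outputs: 10 = botnet traffic, 01 = normal traffic.
model.add(Dense(2, activation="softmax"))

model.summary()
```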
Experimentation
● Split in 5 Train & Test sets
○ Fair dataset
○ Shuffling
○ Partitioning
■ 50%-30%-20%
■ 50% & 20% Train & Test sets
■ 30% prediction
● Best result achieved
○ 5th dataset
○ 96.52% accuracy
- Splitting: 50% and 20% for the train and test sets, 30% for empirical prediction (see the sketch below)
- Split in 5 sets: "similar" to a 5-fold cross-validation, used during testing
- Shuffling and partitioning: reduce variance and make sure the model remains general
- 5th subset for training: the best result, with 96.52% accuracy
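One way to realize the shuffle-and-partition scheme with scikit-learn; the 50/20/30 proportions follow the slides, while the file name (reused from the earlier balancing sketch), seed, and variable names are arbitrary.

```python
import pandas as pd
from sklearn.model_selection import train_test_split

fair = pd.read_csv("fair_dataset.csv")  # balanced set from the earlier sketch
X = fair.drop(columns=["label"]).values
y = (fair["label"] == "botnet").astype(int).values

# Carve off the 30% held back for the empirical prediction phase.
X_rest, X_pred, y_rest, y_pred = train_test_split(
    X, y, test_size=0.30, shuffle=True, random_state=42)

# Split the remaining 70% into 50% train / 20% test of the original data:
# 0.20 / 0.70 of the remainder becomes the test set.
X_train, X_test, y_train, y_test = train_test_split(
    X_rest, y_rest, test_size=0.20 / 0.70, shuffle=True, random_state=42)
```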
Fine-Tuning Hyperparameters
● Learning Rate
○ Step size of gradient descent
○ Increasing/decreasing it affects accuracy
● Batch Size
○ 100 samples is the best size
● Epochs
○ 25 epochs is the optimal compromise
● Optimizer
○ Adam is the best in this experiment
- Learning rate 0.001: the optimal size for the updates applied to the network weights, especially for prediction
- Batch size: a small batch requires less memory and trains faster, but estimates the gradient less accurately; 100 samples is the best compromise
- 25 epochs: trains significantly faster while reaching a good accuracy/performance compromise (96.96% prediction accuracy)
- Adam optimizer: clearly not the best optimizer for every task, but the best in this experiment (the tuned setup is sketched below)
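Putting the tuned values together in Keras: Adam with learning rate 0.001, batch size 100, 25 epochs. This assumes the `model` and the splits from the earlier sketches; the categorical cross-entropy loss is an assumption consistent with the two-neuron softmax output, as the slides do not name the loss.

```python
from tensorflow.keras.optimizers import Adam
from tensorflow.keras.utils import to_categorical

model.compile(
    optimizer=Adam(learning_rate=0.001),  # tuned learning rate
    loss="categorical_crossentropy",      # assumed, fits the softmax output
    metrics=["accuracy"],
)

history = model.fit(
    X_train, to_categorical(y_train, 2),
    validation_data=(X_test, to_categorical(y_test, 2)),
    batch_size=100,  # tuned batch size
    epochs=25,       # tuned number of epochs
)
```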
Conclusions
● Summary
○ New, big & realistic dataset
○ Derived SDN-specific features
○ MLP with 7 hidden layers
○ Intensely fine-tuned hyperparameters
○ Resulting in ~97% prediction accuracy on unknown traffic
- Future work: a new, unbiased, SDN-driven dataset built through accurate data analysis (scatter matrix, feature importance, clustering, etc.)
- An in-depth traffic analysis with our SDN framework (SDNsecKit)


Editor's Notes

  • #3 Road map: an introduction to botnets, the most-used cybercriminal "tool"; an introduction to SDN, Software Defined Networking, the new networking paradigm; the dataset, i.e., the collection of data used and why we chose this particular one; the neural network, more precisely the deep learning model we built; and the experiments conducted on the hyperparameters.
  • #4 A botnet consists of a number of internet-connected devices, each of which runs one or more bots. You can think of a botnet as a system compromised by some kind of malware, controlled by an attacker called the botmaster. @Botnets can be very difficult to detect and eradicate, also because of their large size and distribution, and because of their architecture, which can be Client-Server, P2P, or hybrid (a mix of both), chosen to be more resilient and to avoid detection.
  • #5 SDN is an emerging networking approach that enables easy and efficient (re)configuration in order to improve network performance. SDN centralizes the network intelligence in a single component, called the Controller. The Controller communicates with the elements on the data plane (for example, switches or routers) through the OpenFlow protocol for routing. @For our research we focused on the Application Plane, and the central idea is to build an application for traffic monitoring in an SDN, performing malware behavioral analysis in order to detect botnet attacks.
  • #6 Tang et al. propose a deep learning based approach for general network intrusion detection in SDN using self-taught learning (STL) and a dataset different from ours, NSL-KDD. They trained a NN on six SDN features: duration, protocol type, source bytes, destination bytes, count, and service count. @Kalavaini et al. compare several machine learning algorithms, such as SVM, naive Bayes, decision trees and neural networks, in the context of network traffic classification. They exploit the CTU-13 dataset and, surprisingly, conclude that neural networks are the worst classifiers in this context. @Wang W. et al. encode traffic data as images and then exploit a convolutional neural network for traffic classification, reaching an accuracy rate of roughly 99% on the customized USTC-TFC2016 dataset.
  • #7 Nowadays, behavioral analysis is often supported by machine learning techniques, and the core of machine learning, especially supervised learning, is the employed dataset. After many evaluations we chose the HogZilla dataset, because it combines selected parts of the well-known CTU-13 dataset @with good normal traffic from the ISCX IDS datasets. Our work was to refine the HogZilla dataset by extracting from it a @fair dataset, i.e., a balanced dataset containing exactly the same number (180,000) of normal and botnet samples; the original ~990K-sample dataset is heavily unbalanced toward normal traffic. @Our dataset has only 22 features instead of the original 192, more precisely 22 features extractable via SDN. @How we selected the 22 features: 8 features from the HogZilla dataset can be obtained directly via REST calls (REST is an architectural style) to the SDN controller, more precisely from the flow and port statistics exposed by the OpenFlow protocol, and the other 14 features are calculated from those eight, as in the sketch below.
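The paper's exact 14 derived features are not listed here, so the sketch below only illustrates the pattern: computing secondary features from counters that OpenFlow flow statistics do expose (duration, byte and packet counts). The derived feature names are hypothetical.

```python
def derive_features(flow):
    """Illustrative only: secondary features computed from direct
    OpenFlow flow-statistics counters; output names are hypothetical."""
    duration = max(flow["duration_sec"], 1e-6)  # guard against division by zero
    packets = max(flow["packet_count"], 1)
    return {
        "bytes_per_packet": flow["byte_count"] / packets,
        "packets_per_second": flow["packet_count"] / duration,
        "bytes_per_second": flow["byte_count"] / duration,
    }

sample_flow = {"duration_sec": 12, "byte_count": 48_000, "packet_count": 60}
print(derive_features(sample_flow))
```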
  • #8 Neural networks have been widely used in classification tasks, because bots are continuously evolving and thus their behavior quickly changes. To run our neural network we used TensorFlow through the Keras library, plus the other important library, SciKitLearn, for the data manipulation phase. All experiments ran on a local machine with Ubuntu Server. We used a Multi Layer Perceptron with 22 neurons in the input layer and 7 hidden layers arranged to form a diamond shape, plus 2 output neurons with a softmax activation function, where 01 is normal traffic and 10 is bot traffic; with our fine-tuning of the hyperparameters we reduce or eliminate the uncertain outputs 00 and 11. @All the layers are fully connected but unlinked randomly via Dropout set to 30% in order to avoid overfitting.
  • #9 To test our approach, @we first extract 5 different train and test sets from the fair dataset. We shuffled the dataset with the purpose of reducing variance and making sure the model remains general and overfits less, then we partitioned it into 50%-30%-20% parts. The first and last parts are used for training and testing, whereas the remaining 30% is set aside and used later for a manual validation with our algorithm, in the prediction phase. @As shown in the table, the 5th dataset obtains the best result, with 96.52% accuracy, so it is the best candidate.
  • #10 One of the principal activities of this work: an intense fine-tuning of the hyperparameters, starting from the learning rate, which controls the size of the updates applied to the network weights. @From a performance perspective, choosing the best batch size is important; we found that grouping 100 samples is the best strategy, because it is small enough to train fast and large enough to estimate the gradient accurately. @Furthermore, we varied the number of epochs: with only 25 epochs we get the highest prediction accuracy (96.96%), and training is significantly faster, a good accuracy/performance compromise. @The optimizer: Adam is clearly not always the best optimizer, but it proved the best in this particular experiment.
  • #11 The conclusion: in this paper we performed botnet detection experiments on a new dataset containing a very large amount of normal and botnet traffic samples, from which we extracted a set of botnet-specific, meaningful features that can actually be derived in an SDN environment. @Our future work in this field will include a more in-depth analysis of the SDN traffic features, to further reduce the amount of data needed to reach the current accuracy levels.