Network Analytics: Machine Learning Intrusion Detection

•Download as PPTX, PDF•

0 likes•85 views

Saksham Agrawal

Network Intrusion Detection System using Machine Learning. Final Project by Team 1 ADS Fall 2016

Data & Analytics

Network Analytics :
Intrusion Detection
using Machine
Learning

Intrusion Detection System(IDS)
• Combination of software and hardware that attempts to
perform intrusion detection
• Raise the alarm when possible intrusion or suspicious patterns are
observed
The
Internet
Attacker
Internal Network
Firewall
IDS
IDS

Why we need IDS?
• Unknown weakness or bugs
• Complex, unforeseen attacks
• Firewalls, security policies
• Using information detected
• Recover compromised system
• Understand the attack mechanism
• Detect novel attacks
• Defend our systems

Types of IDS
These are the main types of Intrusion Detection Systems:
• Host Based
• Network Based
• Stack Based
• Signature Based
• Anomaly Based

KDD Cup 99 Data Set
• Modification of DARPA 1998 data set
• DARPA 1998 data set
• Managed by Lincoln Lab.(under DARPA sponsorship)
• Simulated nine weeks of raw TCP dump data
• Attacks
• 38 different attacks against Unix/Linux machines
• DoS, Scan, Buffer overflow and so on.
• Normal traffic
• 1000’s of virtual hosts and 100’s of user automata

KDD Cup 99 Data Set
• Each connection ⇒ 41-dimensions vector
• Samples
5,tcp,smtp,SF,959,337,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,1,1,0.00,0.00,0.00,0.00,1.00,
0.00,0.00,144,192,0.70,0.02,0.01,0.01,0.00,0.00,0.00,0.00,normal
0,tcp,http,SF,54540,8314,0,0,0,2,0,1,1,0,0,0,0,0,0,0,0,0,2,2,0.00,0.00,0.00,0.00,1.0
0,0.00,0.00,118,118,1.00,0.00,0.01,0.00,0.00,0.00,0.02,0.02,back.
• Numerical: 34, Categorical: 7
• Basic feature： “duration”, “protocol”…
• Statistical feature： “number of connections to the same host as the current connection in the past two
seconds”…
• Label ⇒ “normal” or “name of attacks”

FLOW:
Pre-processing of
data in R
Pre-processing of
data in Azure ML
Filter-based
Feature Selection
Model Selection
Tune Model
Parameters
Build system for
selected model
Deploy the
selected model
Build website for
ML as a Service

Data pre-processing in R
• Assign column values to the dataset
• Transformation of labels into binomial classes

• Store the Training and testing data
in the Azure cloud storage
• Specify the categorical variables
by editing the metadata
• Convert the categorical variables
into dummy numerical variables
Data pre-processing in Azure ML

Filter-based feature selection
• Total number of features = 41
• Selected number of features = 15
• Method used = Pearson Correlation

Model Selection
• We need both accuracy and good response time!
• Evaluated different models on 10% data and then evaluated each of
them.
Model Accuracy (AUC)
Logistic Regression 0.995634
Boosted Decision Tree 0.999093
Neural Network 0.996295
Support Vector Machines 0.994526

Tune Model hyper parameters
• The model's hyper parameters are the settings and values you use
when configuring and testing the model, with the aim of finding the
best combination.
• You get an accuracy report describing the different models that
were created and their parameters, plus a trained model that you
can save for re-use.

Build System for
selected model
• Boosted Decision Tree – For
its high accuracy and good
response time
• Train the data 100% of the
training data
• Build and Deploy the model
as a web service

Place your screenshot here
Machine Learning as
a Service
• Frontend : HTML5, CSS3,
Bootstrap, jQuery
• Backend : Python Flask
• DEMO!

Viewers also liked

(Seniors) Early Assessment Program AssesibleCynthia Maravilla-Macias

TALLERPRACTICOGRUPO3DANIELA ARISTIZABAL CASTAÑEDA

TICSmarcos259

Palmetto Pitmasters PresentationDaniel Carroll

2706 05ratnawulanpooh

General music video conventionswbsmediagroup4

Lea unit 5 projectgcleaxf

CHAPTER 5 BEGowthu Cgowthu

Sample software-development-agreement (1)govmenon

Lea unit 5 projectgcleaxf

CHAPTER 1 BEGowthu Cgowthu

CHAPTER 3 BEGowthu Cgowthu

CHAPTER 2 BEGowthu Cgowthu

Titanic powerpointmbillington53

CHAPTER 4 BEGowthu Cgowthu

Kardiyak pacemakerAsena Ozay

Network_Intrusion_Detection_System_Team1Saksham Agrawal

Viewers also liked (17)

(Seniors) Early Assessment Program Assesible

TALLERPRACTICOGRUPO3

TICS

Palmetto Pitmasters Presentation

2706 05

General music video conventions

Lea unit 5 project

CHAPTER 5 BE

Sample software-development-agreement (1)

Lea unit 5 project

CHAPTER 1 BE

CHAPTER 3 BE

CHAPTER 2 BE

Titanic powerpoint

CHAPTER 4 BE

Kardiyak pacemaker

Network_Intrusion_Detection_System_Team1

Similar to Network Analytics: Machine Learning Intrusion Detection

Application of machine learning and cognitive computing in intrusion detectio...Mahdi Hosseini Moghaddam

Guiding through a typical Machine Learning PipelineMichael Gerke

How Azure Databricks helped make IoT Analytics a Reality with Janath Manohara...Databricks

VTU Open Elective 6th Sem CSE - Module 2 - Cloud ComputingSachin Gowda

Sukumar Nayak-Detailed-Cloud Risk Management and AuditSukumar Nayak

semantic searchSaikiranK15

Seminar Presentation | Network Intrusion Detection using Supervised Machine L...Jowin John Chemban

Architecture for Scale [AppFirst]AppFirst

Normalizing Empire's Traffic to Evade Anomaly-Based IDSUtku Sen

Brad stack - Digital Health and Well-Being Festival Digital Health Enterprise Zone

Managing Trustworthy Big-data Applications in the Cloud with the ATMOSPHERE P...ATMOSPHERE .

Relational cloud, A Database-as-a-Service for the CloudHossein Riasati

Smart Manufacturing Requirements forEquipment Capability and ControlKimberly Daich

Artificial neural networks Tharushi Ruwandika

Finding the needle in the haystack: how Nestle is leveraging big data to defe...Big Data Spain

Parallel Distributed Deep Learning on HPCC SystemsHPCC Systems

EM12c: Capacity Planning with OEM MetricsMaaz Anjum

malware detection ppt for vtu project and other final year projectNaveenAd4

IEEE 2014 DOTNET DATA MINING PROJECTS Trusted db a-trusted-hardware-based-dat...IEEEMEMTECHSTUDENTPROJECTS

2014 IEEE DOTNET DATA MINING PROJECT Trusteddb a-trusted-hardware-based-datab...IEEEMEMTECHSTUDENTSPROJECTS

Similar to Network Analytics: Machine Learning Intrusion Detection (20)

Application of machine learning and cognitive computing in intrusion detectio...

Guiding through a typical Machine Learning Pipeline

How Azure Databricks helped make IoT Analytics a Reality with Janath Manohara...

VTU Open Elective 6th Sem CSE - Module 2 - Cloud Computing

Sukumar Nayak-Detailed-Cloud Risk Management and Audit

semantic search

Seminar Presentation | Network Intrusion Detection using Supervised Machine L...

Architecture for Scale [AppFirst]

Normalizing Empire's Traffic to Evade Anomaly-Based IDS

Brad stack - Digital Health and Well-Being Festival

Managing Trustworthy Big-data Applications in the Cloud with the ATMOSPHERE P...

Relational cloud, A Database-as-a-Service for the Cloud

Smart Manufacturing Requirements forEquipment Capability and Control

Artificial neural networks

Finding the needle in the haystack: how Nestle is leveraging big data to defe...

Parallel Distributed Deep Learning on HPCC Systems

EM12c: Capacity Planning with OEM Metrics

malware detection ppt for vtu project and other final year project

IEEE 2014 DOTNET DATA MINING PROJECTS Trusted db a-trusted-hardware-based-dat...

2014 IEEE DOTNET DATA MINING PROJECT Trusteddb a-trusted-hardware-based-datab...

Recently uploaded

Industrialised data - the key to AI success.pdfLars Albertsson

Carero dropshipping via API with DroFx.pptxolyaivanovalion

Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H

100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate

Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408

April 2024 - Crypto Market Report's Analysismanisha194592

(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat

(ISHITA) Call Girls Service Hyderabad Call Now 8617697112 Hyderabad EscortsCall girls in Ahmedabad High profile

Smarteg dropshipping via API with DroFx.pptxolyaivanovalion

Ukraine War presentation: KNOW THE BASICSAishani27

FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg

Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083

dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach

Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth

Mature dropshipping via API with DroFx.pptxolyaivanovalion

VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor

定制英国白金汉大学毕业证（UCB毕业证书）成绩单原版一比一ffjhghh

Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda

Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh9953056974 Low Rate Call Girls In Saket, Delhi NCR

Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann

Recently uploaded (20)

Industrialised data - the key to AI success.pdf

Carero dropshipping via API with DroFx.pptx

Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf

100-Concepts-of-AI by Anupama Kate .pptx

Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps

April 2024 - Crypto Market Report's Analysis

(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service

(ISHITA) Call Girls Service Hyderabad Call Now 8617697112 Hyderabad Escorts

Smarteg dropshipping via API with DroFx.pptx

Ukraine War presentation: KNOW THE BASICS

FESE Capital Markets Fact Sheet 2024 Q1.pdf

Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call

dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt

Unveiling Insights: The Role of a Data Analyst

Mature dropshipping via API with DroFx.pptx

VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...

定制英国白金汉大学毕业证（UCB毕业证书）成绩单原版一比一

Customer Service Analytics - Make Sense of All Your Data.pptx

Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh

Generative AI on Enterprise Cloud with NiFi and Milvus

Network Analytics: Machine Learning Intrusion Detection

1. Network Analytics : Intrusion Detection using Machine Learning

2. Intrusion Detection System(IDS) • Combination of software and hardware that attempts to perform intrusion detection • Raise the alarm when possible intrusion or suspicious patterns are observed The Internet Attacker Internal Network Firewall IDS IDS

3. Why we need IDS? • Unknown weakness or bugs • Complex, unforeseen attacks • Firewalls, security policies • Using information detected • Recover compromised system • Understand the attack mechanism • Detect novel attacks • Defend our systems

4. Types of IDS These are the main types of Intrusion Detection Systems: • Host Based • Network Based • Stack Based • Signature Based • Anomaly Based

5. KDD Cup 99 Data Set • Modification of DARPA 1998 data set • DARPA 1998 data set • Managed by Lincoln Lab.(under DARPA sponsorship) • Simulated nine weeks of raw TCP dump data • Attacks • 38 different attacks against Unix/Linux machines • DoS, Scan, Buffer overflow and so on. • Normal traffic • 1000’s of virtual hosts and 100’s of user automata

6. KDD Cup 99 Data Set • Each connection ⇒ 41-dimensions vector • Samples 5,tcp,smtp,SF,959,337,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,1,1,0.00,0.00,0.00,0.00,1.00, 0.00,0.00,144,192,0.70,0.02,0.01,0.01,0.00,0.00,0.00,0.00,normal 0,tcp,http,SF,54540,8314,0,0,0,2,0,1,1,0,0,0,0,0,0,0,0,0,2,2,0.00,0.00,0.00,0.00,1.0 0,0.00,0.00,118,118,1.00,0.00,0.01,0.00,0.00,0.00,0.02,0.02,back. • Numerical: 34, Categorical: 7 • Basic feature： “duration”, “protocol”… • Statistical feature： “number of connections to the same host as the current connection in the past two seconds”… • Label ⇒ “normal” or “name of attacks”

7. FLOW: Pre-processing of data in R Pre-processing of data in Azure ML Filter-based Feature Selection Model Selection Tune Model Parameters Build system for selected model Deploy the selected model Build website for ML as a Service

8. Data pre-processing in R • Assign column values to the dataset • Transformation of labels into binomial classes

9. • Store the Training and testing data in the Azure cloud storage • Specify the categorical variables by editing the metadata • Convert the categorical variables into dummy numerical variables Data pre-processing in Azure ML

10. Filter-based feature selection • Total number of features = 41 • Selected number of features = 15 • Method used = Pearson Correlation

11. Model Selection • We need both accuracy and good response time! • Evaluated different models on 10% data and then evaluated each of them. Model Accuracy (AUC) Logistic Regression 0.995634 Boosted Decision Tree 0.999093 Neural Network 0.996295 Support Vector Machines 0.994526

12. Tune Model hyper parameters • The model's hyper parameters are the settings and values you use when configuring and testing the model, with the aim of finding the best combination. • You get an accuracy report describing the different models that were created and their parameters, plus a trained model that you can save for re-use.

13. Build System for selected model • Boosted Decision Tree – For its high accuracy and good response time • Train the data 100% of the training data • Build and Deploy the model as a web service

14. Place your screenshot here Machine Learning as a Service • Frontend : HTML5, CSS3, Bootstrap, jQuery • Backend : Python Flask • DEMO!

15. Thank you!!

Network Analytics: Machine Learning Intrusion Detection

Recommended

Recommended

More Related Content

Viewers also liked

Viewers also liked (17)

Similar to Network Analytics: Machine Learning Intrusion Detection

Similar to Network Analytics: Machine Learning Intrusion Detection (20)

Recently uploaded

Recently uploaded (20)

Network Analytics: Machine Learning Intrusion Detection