SlideShare a Scribd company logo
1 of 22
Download to read offline
On the feasibility of 40 Gbps network data capture and
retention with general purpose hardware
SAC 2018 | Pau, France
Guillermo Julián-Moreno
Rafael Leira
Jorge E. López de Vergara
Francisco Gómez-Arribas
Iván González
April 10, 2018
Naudit HPCN &
Escuela Politécnica Superior, Universidad
Autónoma de Madrid
Outline
1. Introduction
2. Design and implementation
3. Results
4. Conclusions
1
Introduction
Motivation
Why would we want to capture and store traffic?
• Online analysis and monitoring (e.g., flow records, traffic volume dashboards, IDS).
• Data retention for specialized analysis or policy requirements (e.g. GPDR).
10 GbE standard is widespread, now we are seeing higher speeds (40 GbE, but also 100
GbE)
• 10 GbE is the “last” standard that we can process with a single core.
• 40 GbE and higher speeds require parallelism: how?
2
Purpose of the system
• Receive the traffic at 40 Gbps as efficiently as possible.
• Timestamp the incoming traffic.
• Store the network frames in the disk at 40 Gbps.
• Use commercial off-the-shelf hardware to reduce costs.
3
Design and implementation
Previous architecture
DMA
tail
head
Descriptor ring Data bufferNIC
• Single thread copying frames to the intermediate buffer.
• Write files by blocks and use padding at the end of each file.
• Return the descriptor’s ownership to the card after the copy (no allocations
needed).
4
Reading from NIC and copying to buffer
1
23
4
Rx
Intermediate buffer
...
• Usual approach for parallelism is RSS queues. Problem: ensuring uniform
distribution between queues.
• Given our limited scope, switch to single queue and fixed descriptor assignments:
uniform distribution and no synchronization required for reading.
• Use a single atomic counter for the buffer write offset: as fast as possible and no
deadlocks possible.
• Add padding to the beginning and end of the files to avoid having frames
overrunning file boundaries.
5
Client reading
Kernel Userspace
Rx 0
Rx 1
Rx 2
Rx 3
1. Allocation
2. Copy Client
…
3. New data!
4. Reading
• Userspace clients get last written byte via syscalls and set their last read byte.
• RX thread 0 updates s, the space available in the buffer. No thread writes more
than ⌊s/n⌋ bytes in a batch.
6
Writing to disk
Two options for the write process:
• Regular files written in 4MB blocks. Needs a fast filesystem.
• Distributed writes between several NVMe disks with SPDK.
Features to reduce hardware requirements:
• Simple filtering system that searches “strings” of bytes at fixed positions.
• Selective storage: only store the first N bytes of each frame.
7
Results
Hardware used
Traffic generator RX Server 1 RX Server 2
CPU Intel Xeon E5-1620 v2 Intel Xeon E5-1620 v2 2 × Intel Xeon E5-2630 v4
Clock 3.70GHz 3.70GHz 2.20 GHz
Cores 4 4 2 x 10
Memory 32 GB 32 GB 2 x 64 GB
NIC Intel XL710 Intel XL710 Intel XL710
Storage SATA RAID SATA RAID 6 x NVMe
Est. cost 7,000e 7,000e 10,000e
Table 1: Specifications of the servers used for testing. HyperThreading was disabled.
8
Storage speed
0
10
20
30
40
50
2 3 4 5 6
Rate(Gbps)
Number of discs
Disk write rate
Software RAID
SPDK
Theoretical max speed
Figure 1: Performance of the NVMe disk array.
9
Traffic generation
0
5
10
15
20
25
30
35
40
100 200 300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500
Rate(Gbps)
Frame size (bytes)
Generation rate
Send rate
Theoretical max rate
Figure 2: Synthetic traffic rates achieved with our custom, DPDK-based traffic generator. We also
made a version capable of sending large PCAP files at line rate.
10
Timestamping accuracy
Frames Mean Std
All 1738 ns 3296 ns
One out of every eight 55 ns 287 ns
Table 2: Timestamping accuracy. The Intel NIC posts descriptors in batches of eight, so we have to
take that into account for the accuracy.
11
Traffic capture
0
25
50
75
100
100 200 300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500
0
5
10
15
20
25
30
35
40
Loss%
Rate(Gbps)
Frame size (bytes)
Capture rate
Lost %
Port Drop %
Send rate
Capture rate
Theoretical max rate
Figure 3: Results of the first test: retrieval of the
frames from the NIC. The bottleneck is the card
for small frame sizes.
0
25
50
75
100
100 200 300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500
0
5
10
15
20
25
30
35
40
Loss%
Rate(Gbps)
Frame size (bytes)
Capture rate
Lost %
Port Drop %
Send rate
Capture rate
Theoretical max rate
Figure 4: Results of the second test: writing of
the frames to a null device.
12
Traffic storage
0
25
50
75
100
100 200 300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500
0
5
10
15
20
25
30
35
40
Loss%
Rate(Gbps)
Frame size (bytes)
Capture rate
Lost %
Send rate
Capture rate
Theoretical max rate
Figure 5: Results of the third test: traffic storage using SPDK.
13
Traffic storage
Name Size Avg. frame size Send rate Loss %
CAIDA 222 GB 787.91 B 39.78 Gbps <0.01
University 4.3 GB 910.08 B 39.82 Gbps 0
Table 3: Performance on reception of traffic capture files.
14
Conclusions
Results
• We have created and open-sourced a system capable of capturing, timestamping
and storing network traffic at 40 Gbps.
• Not using RSS parallelism is feasible and useful in our limited-scope system.
• The one-copy mechanism and synchronization algorithms allow our system to store
line-rate traffic at frame sizes of 300 bytes and above (enough for the majority of
environments).
• We have created a testbed capable of saturating 40 GbE links for frames of size 96
bytes or greater.
15
Future work
• Improve the selective-storage approach: more effective filters (ASCII/BPF) or limits
based on RX rate.
• Detailed comparison of the disorder and timestamp inaccuracies between our
approach and RSS queues.
• Port this system to virtual machines with SR-IOV NVFs.
16
Questions?

More Related Content

What's hot

Network emulator
Network emulatorNetwork emulator
Network emulatorjeromy fu
 
Kernel Recipes 2017 - EBPF and XDP - Eric Leblond
Kernel Recipes 2017 - EBPF and XDP - Eric LeblondKernel Recipes 2017 - EBPF and XDP - Eric Leblond
Kernel Recipes 2017 - EBPF and XDP - Eric LeblondAnne Nicolas
 
Clemson: Solving the HPC Data Deluge
Clemson: Solving the HPC Data DelugeClemson: Solving the HPC Data Deluge
Clemson: Solving the HPC Data Delugeinside-BigData.com
 
Whitebox Switches Deployment Experience
Whitebox Switches Deployment ExperienceWhitebox Switches Deployment Experience
Whitebox Switches Deployment ExperienceAPNIC
 
Using Network Acceleration for an Optimized Edge Cloud Server Architecture
Using Network Acceleration for an Optimized Edge Cloud Server ArchitectureUsing Network Acceleration for an Optimized Edge Cloud Server Architecture
Using Network Acceleration for an Optimized Edge Cloud Server ArchitectureNetronome
 
Compiling P4 to XDP, IOVISOR Summit 2017
Compiling P4 to XDP, IOVISOR Summit 2017Compiling P4 to XDP, IOVISOR Summit 2017
Compiling P4 to XDP, IOVISOR Summit 2017Cheng-Chun William Tu
 
Ceph for Big Science - Dan van der Ster
Ceph for Big Science - Dan van der SterCeph for Big Science - Dan van der Ster
Ceph for Big Science - Dan van der SterCeph Community
 
Redis on NVMe SSD - Zvika Guz, Samsung
 Redis on NVMe SSD - Zvika Guz, Samsung Redis on NVMe SSD - Zvika Guz, Samsung
Redis on NVMe SSD - Zvika Guz, SamsungRedis Labs
 
Header compression and multiplexing in LISP
Header compression and multiplexing in LISPHeader compression and multiplexing in LISP
Header compression and multiplexing in LISPJose Saldana
 
Building the Internet of Things with Thingsquare and Contiki - day 2 part 2
Building the Internet of Things with Thingsquare and Contiki - day 2 part 2Building the Internet of Things with Thingsquare and Contiki - day 2 part 2
Building the Internet of Things with Thingsquare and Contiki - day 2 part 2Adam Dunkels
 
Revisão: Forwarding Metamorphosis: Fast Programmable Match-Action Processing ...
Revisão: Forwarding Metamorphosis: Fast Programmable Match-Action Processing ...Revisão: Forwarding Metamorphosis: Fast Programmable Match-Action Processing ...
Revisão: Forwarding Metamorphosis: Fast Programmable Match-Action Processing ...Bruno Castelucci
 
[RakutenTechConf2014] [D-4] The next step of LeoFS and Introducing NewDB Project
[RakutenTechConf2014] [D-4] The next step of LeoFS and Introducing NewDB Project[RakutenTechConf2014] [D-4] The next step of LeoFS and Introducing NewDB Project
[RakutenTechConf2014] [D-4] The next step of LeoFS and Introducing NewDB ProjectRakuten Group, Inc.
 
Simplemux traffic optimization
Simplemux traffic optimizationSimplemux traffic optimization
Simplemux traffic optimizationJose Saldana
 
Accelerating Ceph with iWARP RDMA over Ethernet - Brien Porter, Haodong Tang
Accelerating Ceph with iWARP RDMA over Ethernet - Brien Porter, Haodong TangAccelerating Ceph with iWARP RDMA over Ethernet - Brien Porter, Haodong Tang
Accelerating Ceph with iWARP RDMA over Ethernet - Brien Porter, Haodong TangCeph Community
 
Caching methodology and strategies
Caching methodology and strategiesCaching methodology and strategies
Caching methodology and strategiesTiep Vu
 
2009 11 06 3gpp Ietf Ipv6 Shanghai Nat64
2009 11 06 3gpp Ietf Ipv6 Shanghai Nat642009 11 06 3gpp Ietf Ipv6 Shanghai Nat64
2009 11 06 3gpp Ietf Ipv6 Shanghai Nat64yacc2000
 
Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...
Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...
Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...Netronome
 

What's hot (20)

Network emulator
Network emulatorNetwork emulator
Network emulator
 
Kernel Recipes 2017 - EBPF and XDP - Eric Leblond
Kernel Recipes 2017 - EBPF and XDP - Eric LeblondKernel Recipes 2017 - EBPF and XDP - Eric Leblond
Kernel Recipes 2017 - EBPF and XDP - Eric Leblond
 
Clemson: Solving the HPC Data Deluge
Clemson: Solving the HPC Data DelugeClemson: Solving the HPC Data Deluge
Clemson: Solving the HPC Data Deluge
 
Whitebox Switches Deployment Experience
Whitebox Switches Deployment ExperienceWhitebox Switches Deployment Experience
Whitebox Switches Deployment Experience
 
Tc basics
Tc basicsTc basics
Tc basics
 
Using Network Acceleration for an Optimized Edge Cloud Server Architecture
Using Network Acceleration for an Optimized Edge Cloud Server ArchitectureUsing Network Acceleration for an Optimized Edge Cloud Server Architecture
Using Network Acceleration for an Optimized Edge Cloud Server Architecture
 
Compiling P4 to XDP, IOVISOR Summit 2017
Compiling P4 to XDP, IOVISOR Summit 2017Compiling P4 to XDP, IOVISOR Summit 2017
Compiling P4 to XDP, IOVISOR Summit 2017
 
Ceph for Big Science - Dan van der Ster
Ceph for Big Science - Dan van der SterCeph for Big Science - Dan van der Ster
Ceph for Big Science - Dan van der Ster
 
A Baker's dozen of TCP
A Baker's dozen of TCPA Baker's dozen of TCP
A Baker's dozen of TCP
 
Redis on NVMe SSD - Zvika Guz, Samsung
 Redis on NVMe SSD - Zvika Guz, Samsung Redis on NVMe SSD - Zvika Guz, Samsung
Redis on NVMe SSD - Zvika Guz, Samsung
 
Header compression and multiplexing in LISP
Header compression and multiplexing in LISPHeader compression and multiplexing in LISP
Header compression and multiplexing in LISP
 
Building the Internet of Things with Thingsquare and Contiki - day 2 part 2
Building the Internet of Things with Thingsquare and Contiki - day 2 part 2Building the Internet of Things with Thingsquare and Contiki - day 2 part 2
Building the Internet of Things with Thingsquare and Contiki - day 2 part 2
 
Jumbo Mumbo in OpenStack
Jumbo Mumbo in OpenStackJumbo Mumbo in OpenStack
Jumbo Mumbo in OpenStack
 
Revisão: Forwarding Metamorphosis: Fast Programmable Match-Action Processing ...
Revisão: Forwarding Metamorphosis: Fast Programmable Match-Action Processing ...Revisão: Forwarding Metamorphosis: Fast Programmable Match-Action Processing ...
Revisão: Forwarding Metamorphosis: Fast Programmable Match-Action Processing ...
 
[RakutenTechConf2014] [D-4] The next step of LeoFS and Introducing NewDB Project
[RakutenTechConf2014] [D-4] The next step of LeoFS and Introducing NewDB Project[RakutenTechConf2014] [D-4] The next step of LeoFS and Introducing NewDB Project
[RakutenTechConf2014] [D-4] The next step of LeoFS and Introducing NewDB Project
 
Simplemux traffic optimization
Simplemux traffic optimizationSimplemux traffic optimization
Simplemux traffic optimization
 
Accelerating Ceph with iWARP RDMA over Ethernet - Brien Porter, Haodong Tang
Accelerating Ceph with iWARP RDMA over Ethernet - Brien Porter, Haodong TangAccelerating Ceph with iWARP RDMA over Ethernet - Brien Porter, Haodong Tang
Accelerating Ceph with iWARP RDMA over Ethernet - Brien Porter, Haodong Tang
 
Caching methodology and strategies
Caching methodology and strategiesCaching methodology and strategies
Caching methodology and strategies
 
2009 11 06 3gpp Ietf Ipv6 Shanghai Nat64
2009 11 06 3gpp Ietf Ipv6 Shanghai Nat642009 11 06 3gpp Ietf Ipv6 Shanghai Nat64
2009 11 06 3gpp Ietf Ipv6 Shanghai Nat64
 
Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...
Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...
Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...
 

Similar to On the feasibility of 40 Gbps network data capture and retention with general purpose hardware

DPDK Summit 2015 - Aspera - Charles Shiflett
DPDK Summit 2015 - Aspera - Charles ShiflettDPDK Summit 2015 - Aspera - Charles Shiflett
DPDK Summit 2015 - Aspera - Charles ShiflettJim St. Leger
 
Sunx4450 Intel7460 GigaSpaces XAP Platform Benchmark
Sunx4450 Intel7460 GigaSpaces XAP Platform BenchmarkSunx4450 Intel7460 GigaSpaces XAP Platform Benchmark
Sunx4450 Intel7460 GigaSpaces XAP Platform BenchmarkShay Hassidim
 
Fast datastacks - fast and flexible nfv solution stacks leveraging fd.io
Fast datastacks - fast and flexible nfv solution stacks leveraging fd.ioFast datastacks - fast and flexible nfv solution stacks leveraging fd.io
Fast datastacks - fast and flexible nfv solution stacks leveraging fd.ioOPNFV
 
MAT 510 – Homework AssignmentHomework Assignment 6 Due.docx
MAT 510 – Homework AssignmentHomework Assignment 6 Due.docxMAT 510 – Homework AssignmentHomework Assignment 6 Due.docx
MAT 510 – Homework AssignmentHomework Assignment 6 Due.docxjessiehampson
 
osdi20-slides_zhao.pptx
osdi20-slides_zhao.pptxosdi20-slides_zhao.pptx
osdi20-slides_zhao.pptxCive1971
 
Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)
Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)
Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)Ontico
 
Apache Spark AI Use Case in Telco: Network Quality Analysis and Prediction wi...
Apache Spark AI Use Case in Telco: Network Quality Analysis and Prediction wi...Apache Spark AI Use Case in Telco: Network Quality Analysis and Prediction wi...
Apache Spark AI Use Case in Telco: Network Quality Analysis and Prediction wi...Databricks
 
LinuxCon2009: 10Gbit/s Bi-Directional Routing on standard hardware running Linux
LinuxCon2009: 10Gbit/s Bi-Directional Routing on standard hardware running LinuxLinuxCon2009: 10Gbit/s Bi-Directional Routing on standard hardware running Linux
LinuxCon2009: 10Gbit/s Bi-Directional Routing on standard hardware running Linuxbrouer
 
High performace network of Cloud Native Taiwan User Group
High performace network of Cloud Native Taiwan User GroupHigh performace network of Cloud Native Taiwan User Group
High performace network of Cloud Native Taiwan User GroupHungWei Chiu
 
Accelerating HBase with NVMe and Bucket Cache
Accelerating HBase with NVMe and Bucket CacheAccelerating HBase with NVMe and Bucket Cache
Accelerating HBase with NVMe and Bucket CacheNicolas Poggi
 
CPN302 your-linux-ami-optimization-and-performance
CPN302 your-linux-ami-optimization-and-performanceCPN302 your-linux-ami-optimization-and-performance
CPN302 your-linux-ami-optimization-and-performanceCoburn Watson
 
Deploying flash storage for Ceph without compromising performance
Deploying flash storage for Ceph without compromising performance Deploying flash storage for Ceph without compromising performance
Deploying flash storage for Ceph without compromising performance Ceph Community
 
100G Networking Berlin.pdf
100G Networking Berlin.pdf100G Networking Berlin.pdf
100G Networking Berlin.pdfJunZhao68
 
A Dataflow Processing Chip for Training Deep Neural Networks
A Dataflow Processing Chip for Training Deep Neural NetworksA Dataflow Processing Chip for Training Deep Neural Networks
A Dataflow Processing Chip for Training Deep Neural Networksinside-BigData.com
 
Introduction to NVMe Over Fabrics-V3R
Introduction to NVMe Over Fabrics-V3RIntroduction to NVMe Over Fabrics-V3R
Introduction to NVMe Over Fabrics-V3RSimon Huang
 
Large-Scale Optimization Strategies for Typical HPC Workloads
Large-Scale Optimization Strategies for Typical HPC WorkloadsLarge-Scale Optimization Strategies for Typical HPC Workloads
Large-Scale Optimization Strategies for Typical HPC Workloadsinside-BigData.com
 
QCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureQCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureCeph Community
 
QCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureQCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitecturePatrick McGarry
 

Similar to On the feasibility of 40 Gbps network data capture and retention with general purpose hardware (20)

DPDK Summit 2015 - Aspera - Charles Shiflett
DPDK Summit 2015 - Aspera - Charles ShiflettDPDK Summit 2015 - Aspera - Charles Shiflett
DPDK Summit 2015 - Aspera - Charles Shiflett
 
100 M pps on PC.
100 M pps on PC.100 M pps on PC.
100 M pps on PC.
 
Sunx4450 Intel7460 GigaSpaces XAP Platform Benchmark
Sunx4450 Intel7460 GigaSpaces XAP Platform BenchmarkSunx4450 Intel7460 GigaSpaces XAP Platform Benchmark
Sunx4450 Intel7460 GigaSpaces XAP Platform Benchmark
 
Fast datastacks - fast and flexible nfv solution stacks leveraging fd.io
Fast datastacks - fast and flexible nfv solution stacks leveraging fd.ioFast datastacks - fast and flexible nfv solution stacks leveraging fd.io
Fast datastacks - fast and flexible nfv solution stacks leveraging fd.io
 
MAT 510 – Homework AssignmentHomework Assignment 6 Due.docx
MAT 510 – Homework AssignmentHomework Assignment 6 Due.docxMAT 510 – Homework AssignmentHomework Assignment 6 Due.docx
MAT 510 – Homework AssignmentHomework Assignment 6 Due.docx
 
osdi20-slides_zhao.pptx
osdi20-slides_zhao.pptxosdi20-slides_zhao.pptx
osdi20-slides_zhao.pptx
 
Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)
Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)
Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)
 
Apache Spark AI Use Case in Telco: Network Quality Analysis and Prediction wi...
Apache Spark AI Use Case in Telco: Network Quality Analysis and Prediction wi...Apache Spark AI Use Case in Telco: Network Quality Analysis and Prediction wi...
Apache Spark AI Use Case in Telco: Network Quality Analysis and Prediction wi...
 
LinuxCon2009: 10Gbit/s Bi-Directional Routing on standard hardware running Linux
LinuxCon2009: 10Gbit/s Bi-Directional Routing on standard hardware running LinuxLinuxCon2009: 10Gbit/s Bi-Directional Routing on standard hardware running Linux
LinuxCon2009: 10Gbit/s Bi-Directional Routing on standard hardware running Linux
 
High performace network of Cloud Native Taiwan User Group
High performace network of Cloud Native Taiwan User GroupHigh performace network of Cloud Native Taiwan User Group
High performace network of Cloud Native Taiwan User Group
 
Accelerating HBase with NVMe and Bucket Cache
Accelerating HBase with NVMe and Bucket CacheAccelerating HBase with NVMe and Bucket Cache
Accelerating HBase with NVMe and Bucket Cache
 
CPN302 your-linux-ami-optimization-and-performance
CPN302 your-linux-ami-optimization-and-performanceCPN302 your-linux-ami-optimization-and-performance
CPN302 your-linux-ami-optimization-and-performance
 
Deploying flash storage for Ceph without compromising performance
Deploying flash storage for Ceph without compromising performance Deploying flash storage for Ceph without compromising performance
Deploying flash storage for Ceph without compromising performance
 
100G Networking Berlin.pdf
100G Networking Berlin.pdf100G Networking Berlin.pdf
100G Networking Berlin.pdf
 
A Dataflow Processing Chip for Training Deep Neural Networks
A Dataflow Processing Chip for Training Deep Neural NetworksA Dataflow Processing Chip for Training Deep Neural Networks
A Dataflow Processing Chip for Training Deep Neural Networks
 
Introduction to NVMe Over Fabrics-V3R
Introduction to NVMe Over Fabrics-V3RIntroduction to NVMe Over Fabrics-V3R
Introduction to NVMe Over Fabrics-V3R
 
Large-Scale Optimization Strategies for Typical HPC Workloads
Large-Scale Optimization Strategies for Typical HPC WorkloadsLarge-Scale Optimization Strategies for Typical HPC Workloads
Large-Scale Optimization Strategies for Typical HPC Workloads
 
QNAP TS-832PX-4G.pdf
QNAP TS-832PX-4G.pdfQNAP TS-832PX-4G.pdf
QNAP TS-832PX-4G.pdf
 
QCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureQCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference Architecture
 
QCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureQCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference Architecture
 

More from Jorge E. López de Vergara Méndez

Evaluación de equipamiento de bajo coste para realizar medidas de red en ento...
Evaluación de equipamiento de bajo coste para realizar medidas de red en ento...Evaluación de equipamiento de bajo coste para realizar medidas de red en ento...
Evaluación de equipamiento de bajo coste para realizar medidas de red en ento...Jorge E. López de Vergara Méndez
 
Dictyogram: a Statistical Approach for the Definition and Visualization of Ne...
Dictyogram: a Statistical Approach for the Definition and Visualization of Ne...Dictyogram: a Statistical Approach for the Definition and Visualization of Ne...
Dictyogram: a Statistical Approach for the Definition and Visualization of Ne...Jorge E. López de Vergara Méndez
 
Análisis de Datos Funcionales para Gestión de Red: Téecnicas, Retos y Oportun...
Análisis de Datos Funcionales para Gestión de Red: Téecnicas, Retos y Oportun...Análisis de Datos Funcionales para Gestión de Red: Téecnicas, Retos y Oportun...
Análisis de Datos Funcionales para Gestión de Red: Téecnicas, Retos y Oportun...Jorge E. López de Vergara Méndez
 
MONITORIZACIÓN Y ANÁLISIS DE TRÁFICO DE RED CON APACHE HADOOP
MONITORIZACIÓN Y ANÁLISIS DE TRÁFICO DE RED CON APACHE HADOOPMONITORIZACIÓN Y ANÁLISIS DE TRÁFICO DE RED CON APACHE HADOOP
MONITORIZACIÓN Y ANÁLISIS DE TRÁFICO DE RED CON APACHE HADOOPJorge E. López de Vergara Méndez
 
Multimedia flow classification at 10 Gbps using acceleration techniques on co...
Multimedia flow classification at 10 Gbps using acceleration techniques on co...Multimedia flow classification at 10 Gbps using acceleration techniques on co...
Multimedia flow classification at 10 Gbps using acceleration techniques on co...Jorge E. López de Vergara Méndez
 
Evaluating Quality of Experience in IPTV Services Using MPEG Frame Loss Rate
Evaluating Quality of Experience in IPTV Services Using MPEG Frame Loss RateEvaluating Quality of Experience in IPTV Services Using MPEG Frame Loss Rate
Evaluating Quality of Experience in IPTV Services Using MPEG Frame Loss RateJorge E. López de Vergara Méndez
 
Integración semántica de información de distintos repositorios de medidas de red
Integración semántica de información de distintos repositorios de medidas de redIntegración semántica de información de distintos repositorios de medidas de red
Integración semántica de información de distintos repositorios de medidas de redJorge E. López de Vergara Méndez
 

More from Jorge E. López de Vergara Méndez (9)

Evaluación de equipamiento de bajo coste para realizar medidas de red en ento...
Evaluación de equipamiento de bajo coste para realizar medidas de red en ento...Evaluación de equipamiento de bajo coste para realizar medidas de red en ento...
Evaluación de equipamiento de bajo coste para realizar medidas de red en ento...
 
Dictyogram: a Statistical Approach for the Definition and Visualization of Ne...
Dictyogram: a Statistical Approach for the Definition and Visualization of Ne...Dictyogram: a Statistical Approach for the Definition and Visualization of Ne...
Dictyogram: a Statistical Approach for the Definition and Visualization of Ne...
 
Análisis de Datos Funcionales para Gestión de Red: Téecnicas, Retos y Oportun...
Análisis de Datos Funcionales para Gestión de Red: Téecnicas, Retos y Oportun...Análisis de Datos Funcionales para Gestión de Red: Téecnicas, Retos y Oportun...
Análisis de Datos Funcionales para Gestión de Red: Téecnicas, Retos y Oportun...
 
MONITORIZACIÓN Y ANÁLISIS DE TRÁFICO DE RED CON APACHE HADOOP
MONITORIZACIÓN Y ANÁLISIS DE TRÁFICO DE RED CON APACHE HADOOPMONITORIZACIÓN Y ANÁLISIS DE TRÁFICO DE RED CON APACHE HADOOP
MONITORIZACIÓN Y ANÁLISIS DE TRÁFICO DE RED CON APACHE HADOOP
 
Merging heterogeneous network measurement data
Merging heterogeneous network measurement dataMerging heterogeneous network measurement data
Merging heterogeneous network measurement data
 
Multimedia flow classification at 10 Gbps using acceleration techniques on co...
Multimedia flow classification at 10 Gbps using acceleration techniques on co...Multimedia flow classification at 10 Gbps using acceleration techniques on co...
Multimedia flow classification at 10 Gbps using acceleration techniques on co...
 
Evaluating Quality of Experience in IPTV Services Using MPEG Frame Loss Rate
Evaluating Quality of Experience in IPTV Services Using MPEG Frame Loss RateEvaluating Quality of Experience in IPTV Services Using MPEG Frame Loss Rate
Evaluating Quality of Experience in IPTV Services Using MPEG Frame Loss Rate
 
Defining ontologies for IP traffic measurements at MOI ISG
Defining ontologies for IP traffic measurements at MOI ISGDefining ontologies for IP traffic measurements at MOI ISG
Defining ontologies for IP traffic measurements at MOI ISG
 
Integración semántica de información de distintos repositorios de medidas de red
Integración semántica de información de distintos repositorios de medidas de redIntegración semántica de información de distintos repositorios de medidas de red
Integración semántica de información de distintos repositorios de medidas de red
 

Recently uploaded

Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...121011101441
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxJoão Esperancinha
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionDr.Costas Sachpazis
 
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)dollysharma2066
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...srsj9000
 
Correctly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleCorrectly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleAlluxio, Inc.
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AIabhishek36461
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxPoojaBan
 
complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...asadnawaz62
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort servicejennyeacort
 
Comparative Analysis of Text Summarization Techniques
Comparative Analysis of Text Summarization TechniquesComparative Analysis of Text Summarization Techniques
Comparative Analysis of Text Summarization Techniquesugginaramesh
 
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEINFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEroselinkalist12
 
Arduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptArduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptSAURABHKUMAR892774
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerAnamika Sarkar
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxKartikeyaDwivedi3
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfAsst.prof M.Gokilavani
 

Recently uploaded (20)

Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
 
Design and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdfDesign and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdf
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
 
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
 
Correctly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleCorrectly Loading Incremental Data at Scale
Correctly Loading Incremental Data at Scale
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AI
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptx
 
complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
 
Comparative Analysis of Text Summarization Techniques
Comparative Analysis of Text Summarization TechniquesComparative Analysis of Text Summarization Techniques
Comparative Analysis of Text Summarization Techniques
 
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEINFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
 
Arduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptArduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.ppt
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptx
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
 
young call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Serviceyoung call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Service
 

On the feasibility of 40 Gbps network data capture and retention with general purpose hardware

  • 1. On the feasibility of 40 Gbps network data capture and retention with general purpose hardware SAC 2018 | Pau, France Guillermo Julián-Moreno Rafael Leira Jorge E. López de Vergara Francisco Gómez-Arribas Iván González April 10, 2018 Naudit HPCN & Escuela Politécnica Superior, Universidad Autónoma de Madrid
  • 2. Outline 1. Introduction 2. Design and implementation 3. Results 4. Conclusions 1
  • 4. Motivation Why would we want to capture and store traffic? • Online analysis and monitoring (e.g., flow records, traffic volume dashboards, IDS). • Data retention for specialized analysis or policy requirements (e.g. GPDR). 10 GbE standard is widespread, now we are seeing higher speeds (40 GbE, but also 100 GbE) • 10 GbE is the “last” standard that we can process with a single core. • 40 GbE and higher speeds require parallelism: how? 2
  • 5. Purpose of the system • Receive the traffic at 40 Gbps as efficiently as possible. • Timestamp the incoming traffic. • Store the network frames in the disk at 40 Gbps. • Use commercial off-the-shelf hardware to reduce costs. 3
  • 7. Previous architecture DMA tail head Descriptor ring Data bufferNIC • Single thread copying frames to the intermediate buffer. • Write files by blocks and use padding at the end of each file. • Return the descriptor’s ownership to the card after the copy (no allocations needed). 4
  • 8. Reading from NIC and copying to buffer 1 23 4 Rx Intermediate buffer ... • Usual approach for parallelism is RSS queues. Problem: ensuring uniform distribution between queues. • Given our limited scope, switch to single queue and fixed descriptor assignments: uniform distribution and no synchronization required for reading. • Use a single atomic counter for the buffer write offset: as fast as possible and no deadlocks possible. • Add padding to the beginning and end of the files to avoid having frames overrunning file boundaries. 5
  • 9. Client reading Kernel Userspace Rx 0 Rx 1 Rx 2 Rx 3 1. Allocation 2. Copy Client … 3. New data! 4. Reading • Userspace clients get last written byte via syscalls and set their last read byte. • RX thread 0 updates s, the space available in the buffer. No thread writes more than ⌊s/n⌋ bytes in a batch. 6
  • 10. Writing to disk Two options for the write process: • Regular files written in 4MB blocks. Needs a fast filesystem. • Distributed writes between several NVMe disks with SPDK. Features to reduce hardware requirements: • Simple filtering system that searches “strings” of bytes at fixed positions. • Selective storage: only store the first N bytes of each frame. 7
  • 12. Hardware used Traffic generator RX Server 1 RX Server 2 CPU Intel Xeon E5-1620 v2 Intel Xeon E5-1620 v2 2 × Intel Xeon E5-2630 v4 Clock 3.70GHz 3.70GHz 2.20 GHz Cores 4 4 2 x 10 Memory 32 GB 32 GB 2 x 64 GB NIC Intel XL710 Intel XL710 Intel XL710 Storage SATA RAID SATA RAID 6 x NVMe Est. cost 7,000e 7,000e 10,000e Table 1: Specifications of the servers used for testing. HyperThreading was disabled. 8
  • 13. Storage speed 0 10 20 30 40 50 2 3 4 5 6 Rate(Gbps) Number of discs Disk write rate Software RAID SPDK Theoretical max speed Figure 1: Performance of the NVMe disk array. 9
  • 14. Traffic generation 0 5 10 15 20 25 30 35 40 100 200 300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500 Rate(Gbps) Frame size (bytes) Generation rate Send rate Theoretical max rate Figure 2: Synthetic traffic rates achieved with our custom, DPDK-based traffic generator. We also made a version capable of sending large PCAP files at line rate. 10
  • 15. Timestamping accuracy Frames Mean Std All 1738 ns 3296 ns One out of every eight 55 ns 287 ns Table 2: Timestamping accuracy. The Intel NIC posts descriptors in batches of eight, so we have to take that into account for the accuracy. 11
  • 16. Traffic capture 0 25 50 75 100 100 200 300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500 0 5 10 15 20 25 30 35 40 Loss% Rate(Gbps) Frame size (bytes) Capture rate Lost % Port Drop % Send rate Capture rate Theoretical max rate Figure 3: Results of the first test: retrieval of the frames from the NIC. The bottleneck is the card for small frame sizes. 0 25 50 75 100 100 200 300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500 0 5 10 15 20 25 30 35 40 Loss% Rate(Gbps) Frame size (bytes) Capture rate Lost % Port Drop % Send rate Capture rate Theoretical max rate Figure 4: Results of the second test: writing of the frames to a null device. 12
  • 17. Traffic storage 0 25 50 75 100 100 200 300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500 0 5 10 15 20 25 30 35 40 Loss% Rate(Gbps) Frame size (bytes) Capture rate Lost % Send rate Capture rate Theoretical max rate Figure 5: Results of the third test: traffic storage using SPDK. 13
  • 18. Traffic storage Name Size Avg. frame size Send rate Loss % CAIDA 222 GB 787.91 B 39.78 Gbps <0.01 University 4.3 GB 910.08 B 39.82 Gbps 0 Table 3: Performance on reception of traffic capture files. 14
  • 20. Results • We have created and open-sourced a system capable of capturing, timestamping and storing network traffic at 40 Gbps. • Not using RSS parallelism is feasible and useful in our limited-scope system. • The one-copy mechanism and synchronization algorithms allow our system to store line-rate traffic at frame sizes of 300 bytes and above (enough for the majority of environments). • We have created a testbed capable of saturating 40 GbE links for frames of size 96 bytes or greater. 15
  • 21. Future work • Improve the selective-storage approach: more effective filters (ASCII/BPF) or limits based on RX rate. • Detailed comparison of the disorder and timestamp inaccuracies between our approach and RSS queues. • Port this system to virtual machines with SR-IOV NVFs. 16