SlideShare a Scribd company logo
1 of 22
Download to read offline
ASPERA HIGH-SPEED TRANSFER
Moving the world’s data at maximum speed
ASPERA HIGH-SPEED FILE TRANSFER
2
80 GBIT/S OVER IP USING DPDK
Performance, Code, and Architecture
Charles Shiflett
Developer of next-generation FASP
bear @ us . IBM . com
bear@asperasoft.com
3
RAPID PROTYPING OF BARE METAL SOLUTIONS
3
Benefits of using DPDK
• Lightweight API that exposes Network and Acceleration Cards
• Rapid Iteration on high performance transport designs as
compared to equivalent code developed as Kernel Module
• Set of performance best practices
- Time, Locking, Memory Alignment, and Synchronization
- Avoids libc and system call interaction when possible
• NUMA aware solution
• Framework for High Performance Userland Direct I/O
FASP NX DESIGN GOAL
4
Goal is to bring industry closer to the data they
are using both within the data center and across
the globe.
• Effectively utilize PCIe based IO on Intel Server platform.
• Solve storage bottleneck using Direct I/O w/ multiple controllers
• Solve network bottleneck using DPDK and link aggregation
• Provide a secure transport solution using Intel AES-NI GCM
• Significantly reduce time spent getting your data set to the CPU
HARDWARE CONFIGURATION
5
5x Intel® DC P3700 NVMe SSD
2x Intel® Xeon®E5-2697 v3
2x Intel® XL710 40 GbE Ethernet QSFP+
ASPERA’S MISSION
Creating next-generation transport technologies
that move the world’s digital assets at maximum speed,
regardless of file size, transfer distance and network conditions.
6
CHALLENGES WITH TCP AND ALTERNATIVE TECHNOLOGIES
Distance degrades conditions on all networks
• Latency (or Round Trip Times) increase
• Packet losses increase
• Fast networks just as prone to degradation
TCP performance degrades with distance
• Throughput bottleneck becomes more severe with
increased latency and packet loss
TCP does not scale with bandwidth
• TCP designed for low bandwidth
• Adding more bandwidth does not improve throughput
Alternative Technologies
• TCP-based - Network latency and packet loss must be low
• UDP traffic blasters - Inefficient and waste bandwidth
• Data caching - Inappropriate for many large file transfer workflows
• Modified TCP - Improves on TCP performance but insufficient for fast networks
• Data compression - Time consuming and impractical for certain file types
• CDNs & co-lo build outs - High overhead and expensive to scale
7
FASP™: HIGH-PERFORMANCE TRANSPORT
Maximum transfer speed
• Optimal end-to-end throughput efficiency
• Transfer performance scales with bandwidth independent of transfer distance
and resilient to packet loss
Congestion Avoidance and Policy Control
• Automatic, full utilization of available bandwidth
• On-the-fly prioritization and bandwidth allocation
Uncompromising security and reliability
• Secure, user/endpoint authentication
• AES-128 cryptography in transit and at-rest
Scalable management, monitoring and control
• Real-time progress, performance and bandwidth utilization
• Detailed transfer history, logging, and manifest
Low Overhead
• Less than 0.1% overhead on 30% packet loss
• High performance with large files or large sets of small files
Resulting in
• Transfers up to thousands of times faster than FTP
• Precise and predictable transfer times
• Extreme scalability (concurrency and throughput)
8
LAN TRANSFER RESULTS IN GBIT PER SECOND
9
0
10
20
30
40
50
60
70
80
SSH
AES-128 CTR
FTP
iperf
FASP 4x10Gbit
(Intel® Xeon®
E5-2650 v2)
FASP
(Intel® Xeon®
E5-2697 v3)
Disk to Disk
Wire Utilization w/o disk
Results from SC 14 showing the relative performance of Network Transfer Technologies
TYPICAL TRANSFER APPLICATION
10
ASPERA NEXT GENERATION FASP
11
AVOIDING STALLS
• Synchronization primitives aligned to cache-line, lockless. Example for circular buffer queues;
Dequeue, claim block: block_id = __sync_fetch_and_add ( &sess->block_tail, 1 ) % block_count;
Dequeue, consume block: while ( !__sync_val_compare_and_swap( &sess->block[block_id].state, 0, 1) ) PAUSE;
• Zero copy paradigm critical to good memory performance. Memory copies are VERY expensive!
• Memory copies should be avoided at all costs, writes to Memory must be page aligned to avoid read of
destination cacheline
• Solution is to operate on datasets which fit into L3 cache. New tools provide better insight into L3 cache
utilization such that code can be optimized.
• Low latency direct I/O.
• Intel DDIO critical in bringing data into L3 from Storage, Network Adapters, and PCIe Accelerators.
• DPDK provides framework for operating on data streamed into L3 cache
• Avoid TLB misses by using Hugepages
• Avoid NUMA issues and minimize thread preemption.
• Userland IO drivers where possible.
12
EXAMPLE OF ALIGNED/UNALIGNED MEMORY COPY
13
5.7 GB/S copy performance constrained by RFO
9.0 GB/s copy avoids RFO, close to per core limit
* Per channel results via Intel Performance Counter Monitor
OPTIMIZING NETWORK THROUGHPUT
14
• Create network queues which fit into L3. Each queue tied to a specific core.
• Experiment with values to minimize memory, while retaining useful properties:
• Should support bulk allocation
• MAX_RX_BURST < rx_free_thresh < nb_rx_desc (set when assigning queue to port)
• Nb_rx_desc % rx_free_thresh == 0
• Ideally pkts_in_cache % nb_pkts == 0 and nb_pkts == 2^n-1
• PTHRESH/HTHRESH/WTHRESH values feel like black magic, but can have a huge
effect on performance. Start by copying values from example applications.
IMPROVING STORAGE THROUGHPUT
• Traditional storage is built around the idea of moving slow data from disk to memory and then from
memory to application. Memory is used to cache data to improve access speeds.
• Cache structure quickly becomes a bottleneck as transfer speeds exceed 10 gbit/s.
• While individual spinning disks are slow, JBOD’s of 100’s of disk have very high aggregate bandwidth
• Modern SSDs (especially NVMe) is very fast.
• Two solutions for fast data
• Use XFS with direct IO (and/or MMAP)
• Have shown performance at about 40gbit/s with hardware raid and direct attach disks.
• Have shown performance at about 70gbit/s with NVMe SSD.
• Limited by how many devices you can connect to PCIe data link.
• Clustered parallel storage
• Aspera is targeting IBM GPFS (Spectrum) or Lustre based solutions
• Individual nodes can be slow, but by aggregating nodes high performance is achieved
• Both offer direct I/O solutions
15
DC P3700 PERFORMANCE (SINGLE DRIVE)
16
Performance relative to block size (SI Units in MB)
2678 MB/s
1869 MB/s
0
500
1000
1500
2000
2500
3000
128 16 4 1
Write
Synchronous Read
Async Read (AIO)
2800 MB/s
ENCRYPTION IS NOT A BOTTLENECK WITH HW AES-GCM
17
0
0.5
1
1.5
2
2.5
Core i7 X 980
3.33GHz
Xeon E5-2650 v2
2.60GHz
Xeon E5-2697 v3
2.60GHz
AES 128 GCM Encryption Rate in GB/s per Core
NEW ON THE WIRE PROTOCOL
• Built around AES-128 GCM, which is similar to DTLS ( Datagram TLS ).
18
AES-128 GCM {
uint64 IV;
uint8[] payload;
uint8[16] MAC;
}
DTLSRecord {
ContentType type;
ProtocolVersion version;
uint16 epoch;
uint48 sequence_number;
uint16 length;
uint8_t[length] payload;
}
MAC {
uint8[16] hash;
}
FASP4 {
uint16 stream_id;
uint8 ver_and_pkt_type;
uint40 sequence_number;
uint8[] payload;
uint8[16] hash; // optional
}
• In the same way it is possible to encapsulate FASP NX header in UDP, will also support
Encapsulating FASP4 in DTLS.
EXAMPLE TRANSFER SESSION USING FASP NX API
19
FASP NX API’S
• Native FASP NX API
• Share hugepage sized regions of memory with client application
• Usable from VM or Native machine
• Enables Zero Copy transfer solutions of non file data
• Dynamically set rate policy, destination IP, and so on.
• Traditional Aspera (ascp) command line tools
• Similar command line to SCP, with FASP NX performance.
• Existing Aspera API
• Faspmgmt
• Web Services
• Rest based Queries
20
FUTURE DIRECTION
• Aspera’s Goal: Transfer Solution to Petascale Datasets
- 1 SI Terabyte in 2 Minutes
- 1 SI Petabyte in 1⅜ Days
- Performance improvements expected to scale relative to PCIe interconnect.
• Better integration with storage systems
- Take advantage of native APIs to avoid kernel cache.
• Better integration with network hardware
- Expected to show 100gbit/s transfers using Mellanox ConnectX®-4
- Query network switch and routers?
• Support wider use cases
- Compression, Forward Error Correcting Codecs, Berkeley Sockets API
21
THANK YOU
CHARLES SHIFLETT
bear @ us.ibm.com

More Related Content

What's hot

Accelerate Service Function Chaining Vertical Solution with DPDK
Accelerate Service Function Chaining Vertical Solution with DPDKAccelerate Service Function Chaining Vertical Solution with DPDK
Accelerate Service Function Chaining Vertical Solution with DPDKOPNFV
 
DPDK Summit 2015 - Intel - Keith Wiles
DPDK Summit 2015 - Intel - Keith WilesDPDK Summit 2015 - Intel - Keith Wiles
DPDK Summit 2015 - Intel - Keith WilesJim St. Leger
 
DPDK summit 2015: It's kind of fun to do the impossible with DPDK
DPDK summit 2015: It's kind of fun  to do the impossible with DPDKDPDK summit 2015: It's kind of fun  to do the impossible with DPDK
DPDK summit 2015: It's kind of fun to do the impossible with DPDKLagopus SDN/OpenFlow switch
 
DPDK Summit - 08 Sept 2014 - Futurewei - Jun Xu - Revisit the IP Stack in Lin...
DPDK Summit - 08 Sept 2014 - Futurewei - Jun Xu - Revisit the IP Stack in Lin...DPDK Summit - 08 Sept 2014 - Futurewei - Jun Xu - Revisit the IP Stack in Lin...
DPDK Summit - 08 Sept 2014 - Futurewei - Jun Xu - Revisit the IP Stack in Lin...Jim St. Leger
 
DPDK Summit - 08 Sept 2014 - NTT - High Performance vSwitch
DPDK Summit - 08 Sept 2014 - NTT - High Performance vSwitchDPDK Summit - 08 Sept 2014 - NTT - High Performance vSwitch
DPDK Summit - 08 Sept 2014 - NTT - High Performance vSwitchJim St. Leger
 
DPDK Summit - 08 Sept 2014 - 6WIND - High Perf Networking Leveraging the DPDK...
DPDK Summit - 08 Sept 2014 - 6WIND - High Perf Networking Leveraging the DPDK...DPDK Summit - 08 Sept 2014 - 6WIND - High Perf Networking Leveraging the DPDK...
DPDK Summit - 08 Sept 2014 - 6WIND - High Perf Networking Leveraging the DPDK...Jim St. Leger
 
DPDK Summit 2015 - HP - Al Sanders
DPDK Summit 2015 - HP - Al SandersDPDK Summit 2015 - HP - Al Sanders
DPDK Summit 2015 - HP - Al SandersJim St. Leger
 
Netsft2017 day in_life_of_nfv
Netsft2017 day in_life_of_nfvNetsft2017 day in_life_of_nfv
Netsft2017 day in_life_of_nfvIntel
 
Lagopus presentation on 14th Annual ON*VECTOR International Photonics Workshop
Lagopus presentation on 14th Annual ON*VECTOR International Photonics WorkshopLagopus presentation on 14th Annual ON*VECTOR International Photonics Workshop
Lagopus presentation on 14th Annual ON*VECTOR International Photonics WorkshopLagopus SDN/OpenFlow switch
 
Performance challenges in software networking
Performance challenges in software networkingPerformance challenges in software networking
Performance challenges in software networkingStephen Hemminger
 
Network Programming: Data Plane Development Kit (DPDK)
Network Programming: Data Plane Development Kit (DPDK)Network Programming: Data Plane Development Kit (DPDK)
Network Programming: Data Plane Development Kit (DPDK)Andriy Berestovskyy
 
DPDK in Containers Hands-on Lab
DPDK in Containers Hands-on LabDPDK in Containers Hands-on Lab
DPDK in Containers Hands-on LabMichelle Holley
 
Intel DPDK Step by Step instructions
Intel DPDK Step by Step instructionsIntel DPDK Step by Step instructions
Intel DPDK Step by Step instructionsHisaki Ohara
 
redGuardian DP100 large scale DDoS mitigation solution
redGuardian DP100 large scale DDoS mitigation solutionredGuardian DP100 large scale DDoS mitigation solution
redGuardian DP100 large scale DDoS mitigation solutionRedge Technologies
 
DPDK: Multi Architecture High Performance Packet Processing
DPDK: Multi Architecture High Performance Packet ProcessingDPDK: Multi Architecture High Performance Packet Processing
DPDK: Multi Architecture High Performance Packet ProcessingMichelle Holley
 
DPDK Summit - 08 Sept 2014 - Intel - Networking Workloads on Intel Architecture
DPDK Summit - 08 Sept 2014 - Intel - Networking Workloads on Intel ArchitectureDPDK Summit - 08 Sept 2014 - Intel - Networking Workloads on Intel Architecture
DPDK Summit - 08 Sept 2014 - Intel - Networking Workloads on Intel ArchitectureJim St. Leger
 

What's hot (20)

Accelerate Service Function Chaining Vertical Solution with DPDK
Accelerate Service Function Chaining Vertical Solution with DPDKAccelerate Service Function Chaining Vertical Solution with DPDK
Accelerate Service Function Chaining Vertical Solution with DPDK
 
DPDK Summit 2015 - Intel - Keith Wiles
DPDK Summit 2015 - Intel - Keith WilesDPDK Summit 2015 - Intel - Keith Wiles
DPDK Summit 2015 - Intel - Keith Wiles
 
DPDK summit 2015: It's kind of fun to do the impossible with DPDK
DPDK summit 2015: It's kind of fun  to do the impossible with DPDKDPDK summit 2015: It's kind of fun  to do the impossible with DPDK
DPDK summit 2015: It's kind of fun to do the impossible with DPDK
 
DPDK Summit - 08 Sept 2014 - Futurewei - Jun Xu - Revisit the IP Stack in Lin...
DPDK Summit - 08 Sept 2014 - Futurewei - Jun Xu - Revisit the IP Stack in Lin...DPDK Summit - 08 Sept 2014 - Futurewei - Jun Xu - Revisit the IP Stack in Lin...
DPDK Summit - 08 Sept 2014 - Futurewei - Jun Xu - Revisit the IP Stack in Lin...
 
DPDK Summit - 08 Sept 2014 - NTT - High Performance vSwitch
DPDK Summit - 08 Sept 2014 - NTT - High Performance vSwitchDPDK Summit - 08 Sept 2014 - NTT - High Performance vSwitch
DPDK Summit - 08 Sept 2014 - NTT - High Performance vSwitch
 
DPDK Summit - 08 Sept 2014 - 6WIND - High Perf Networking Leveraging the DPDK...
DPDK Summit - 08 Sept 2014 - 6WIND - High Perf Networking Leveraging the DPDK...DPDK Summit - 08 Sept 2014 - 6WIND - High Perf Networking Leveraging the DPDK...
DPDK Summit - 08 Sept 2014 - 6WIND - High Perf Networking Leveraging the DPDK...
 
DPDK Summit 2015 - HP - Al Sanders
DPDK Summit 2015 - HP - Al SandersDPDK Summit 2015 - HP - Al Sanders
DPDK Summit 2015 - HP - Al Sanders
 
Intel dpdk Tutorial
Intel dpdk TutorialIntel dpdk Tutorial
Intel dpdk Tutorial
 
Netsft2017 day in_life_of_nfv
Netsft2017 day in_life_of_nfvNetsft2017 day in_life_of_nfv
Netsft2017 day in_life_of_nfv
 
Lagopus presentation on 14th Annual ON*VECTOR International Photonics Workshop
Lagopus presentation on 14th Annual ON*VECTOR International Photonics WorkshopLagopus presentation on 14th Annual ON*VECTOR International Photonics Workshop
Lagopus presentation on 14th Annual ON*VECTOR International Photonics Workshop
 
Performance challenges in software networking
Performance challenges in software networkingPerformance challenges in software networking
Performance challenges in software networking
 
Network Programming: Data Plane Development Kit (DPDK)
Network Programming: Data Plane Development Kit (DPDK)Network Programming: Data Plane Development Kit (DPDK)
Network Programming: Data Plane Development Kit (DPDK)
 
DPDK in Containers Hands-on Lab
DPDK in Containers Hands-on LabDPDK in Containers Hands-on Lab
DPDK in Containers Hands-on Lab
 
DPDK In Depth
DPDK In DepthDPDK In Depth
DPDK In Depth
 
Userspace networking
Userspace networkingUserspace networking
Userspace networking
 
Dpdk applications
Dpdk applicationsDpdk applications
Dpdk applications
 
Intel DPDK Step by Step instructions
Intel DPDK Step by Step instructionsIntel DPDK Step by Step instructions
Intel DPDK Step by Step instructions
 
redGuardian DP100 large scale DDoS mitigation solution
redGuardian DP100 large scale DDoS mitigation solutionredGuardian DP100 large scale DDoS mitigation solution
redGuardian DP100 large scale DDoS mitigation solution
 
DPDK: Multi Architecture High Performance Packet Processing
DPDK: Multi Architecture High Performance Packet ProcessingDPDK: Multi Architecture High Performance Packet Processing
DPDK: Multi Architecture High Performance Packet Processing
 
DPDK Summit - 08 Sept 2014 - Intel - Networking Workloads on Intel Architecture
DPDK Summit - 08 Sept 2014 - Intel - Networking Workloads on Intel ArchitectureDPDK Summit - 08 Sept 2014 - Intel - Networking Workloads on Intel Architecture
DPDK Summit - 08 Sept 2014 - Intel - Networking Workloads on Intel Architecture
 

Viewers also liked

DPDK Summit - 08 Sept 2014 - Intel - Closing Remarks
DPDK Summit - 08 Sept 2014 - Intel - Closing RemarksDPDK Summit - 08 Sept 2014 - Intel - Closing Remarks
DPDK Summit - 08 Sept 2014 - Intel - Closing RemarksJim St. Leger
 
DPDK Summit 2015 - Intro - Tim O'Driscoll
DPDK Summit 2015 - Intro - Tim O'DriscollDPDK Summit 2015 - Intro - Tim O'Driscoll
DPDK Summit 2015 - Intro - Tim O'DriscollJim St. Leger
 
Fast and secure protocol (fasp)
Fast and secure protocol (fasp)Fast and secure protocol (fasp)
Fast and secure protocol (fasp)Vikrant Arya
 
Pharma process ficha_tecnica_february_2011
Pharma process ficha_tecnica_february_2011Pharma process ficha_tecnica_february_2011
Pharma process ficha_tecnica_february_2011adri451
 
Asegure el cumplimiento de Normativas Regulatorias, Evite Errores Costosos y ...
Asegure el cumplimiento de Normativas Regulatorias, Evite Errores Costosos y ...Asegure el cumplimiento de Normativas Regulatorias, Evite Errores Costosos y ...
Asegure el cumplimiento de Normativas Regulatorias, Evite Errores Costosos y ...Rocket Software ALM
 
Éco actu n°19
Éco actu n°19Éco actu n°19
Éco actu n°19AudéLor
 
TechShanghai2016 - 高可靠性PCB – 从设计到制造
TechShanghai2016 - 高可靠性PCB – 从设计到制造TechShanghai2016 - 高可靠性PCB – 从设计到制造
TechShanghai2016 - 高可靠性PCB – 从设计到制造Hardway Hou
 
Finlandia final
Finlandia finalFinlandia final
Finlandia finalpauroro
 
2012 kids count data workshop
2012  kids count data workshop2012  kids count data workshop
2012 kids count data workshopLaila Bell
 
Le rapport annuel 2011 de l'ICCA
Le rapport annuel 2011 de l'ICCALe rapport annuel 2011 de l'ICCA
Le rapport annuel 2011 de l'ICCAThierry Labro
 
Revista Línea Sur 5
Revista Línea Sur 5Revista Línea Sur 5
Revista Línea Sur 5cancilleriaec
 
Diagrama Visio Agosto/2013 - Proceso Brainstorming
Diagrama Visio Agosto/2013 - Proceso BrainstormingDiagrama Visio Agosto/2013 - Proceso Brainstorming
Diagrama Visio Agosto/2013 - Proceso BrainstormingNicolas Lopez Cisneros
 
компьютерные сети
компьютерные сетикомпьютерные сети
компьютерные сетиstudent_SSGA
 
Redes Sociais - Sou mais Web - julho/2009
Redes Sociais - Sou mais Web - julho/2009Redes Sociais - Sou mais Web - julho/2009
Redes Sociais - Sou mais Web - julho/2009Camila Leite
 
Mc minifittings
Mc minifittingsMc minifittings
Mc minifittingsWanechan
 
Guida lavoro estero.... cosa fare prima di partire
Guida lavoro estero.... cosa fare prima di partireGuida lavoro estero.... cosa fare prima di partire
Guida lavoro estero.... cosa fare prima di partireLinda Belloni
 

Viewers also liked (20)

DPDK Summit - 08 Sept 2014 - Intel - Closing Remarks
DPDK Summit - 08 Sept 2014 - Intel - Closing RemarksDPDK Summit - 08 Sept 2014 - Intel - Closing Remarks
DPDK Summit - 08 Sept 2014 - Intel - Closing Remarks
 
DPDK Summit 2015 - Intro - Tim O'Driscoll
DPDK Summit 2015 - Intro - Tim O'DriscollDPDK Summit 2015 - Intro - Tim O'Driscoll
DPDK Summit 2015 - Intro - Tim O'Driscoll
 
Fast and secure protocol (fasp)
Fast and secure protocol (fasp)Fast and secure protocol (fasp)
Fast and secure protocol (fasp)
 
Pharma process ficha_tecnica_february_2011
Pharma process ficha_tecnica_february_2011Pharma process ficha_tecnica_february_2011
Pharma process ficha_tecnica_february_2011
 
Asegure el cumplimiento de Normativas Regulatorias, Evite Errores Costosos y ...
Asegure el cumplimiento de Normativas Regulatorias, Evite Errores Costosos y ...Asegure el cumplimiento de Normativas Regulatorias, Evite Errores Costosos y ...
Asegure el cumplimiento de Normativas Regulatorias, Evite Errores Costosos y ...
 
Éco actu n°19
Éco actu n°19Éco actu n°19
Éco actu n°19
 
TechShanghai2016 - 高可靠性PCB – 从设计到制造
TechShanghai2016 - 高可靠性PCB – 从设计到制造TechShanghai2016 - 高可靠性PCB – 从设计到制造
TechShanghai2016 - 高可靠性PCB – 从设计到制造
 
Finlandia final
Finlandia finalFinlandia final
Finlandia final
 
2012 kids count data workshop
2012  kids count data workshop2012  kids count data workshop
2012 kids count data workshop
 
Le rapport annuel 2011 de l'ICCA
Le rapport annuel 2011 de l'ICCALe rapport annuel 2011 de l'ICCA
Le rapport annuel 2011 de l'ICCA
 
Revista Línea Sur 5
Revista Línea Sur 5Revista Línea Sur 5
Revista Línea Sur 5
 
In Store Research - SR Labs Marketing&Usability
In Store Research - SR Labs Marketing&UsabilityIn Store Research - SR Labs Marketing&Usability
In Store Research - SR Labs Marketing&Usability
 
Diagrama Visio Agosto/2013 - Proceso Brainstorming
Diagrama Visio Agosto/2013 - Proceso BrainstormingDiagrama Visio Agosto/2013 - Proceso Brainstorming
Diagrama Visio Agosto/2013 - Proceso Brainstorming
 
Content Marketing
Content MarketingContent Marketing
Content Marketing
 
компьютерные сети
компьютерные сетикомпьютерные сети
компьютерные сети
 
Bc hc
Bc hcBc hc
Bc hc
 
Redes Sociais - Sou mais Web - julho/2009
Redes Sociais - Sou mais Web - julho/2009Redes Sociais - Sou mais Web - julho/2009
Redes Sociais - Sou mais Web - julho/2009
 
Mc minifittings
Mc minifittingsMc minifittings
Mc minifittings
 
Guida lavoro estero.... cosa fare prima di partire
Guida lavoro estero.... cosa fare prima di partireGuida lavoro estero.... cosa fare prima di partire
Guida lavoro estero.... cosa fare prima di partire
 
Correo
CorreoCorreo
Correo
 

Similar to DPDK Summit 2015 - Aspera - Charles Shiflett

High performace network of Cloud Native Taiwan User Group
High performace network of Cloud Native Taiwan User GroupHigh performace network of Cloud Native Taiwan User Group
High performace network of Cloud Native Taiwan User GroupHungWei Chiu
 
Introduction to NVMe Over Fabrics-V3R
Introduction to NVMe Over Fabrics-V3RIntroduction to NVMe Over Fabrics-V3R
Introduction to NVMe Over Fabrics-V3RSimon Huang
 
Deploying flash storage for Ceph without compromising performance
Deploying flash storage for Ceph without compromising performance Deploying flash storage for Ceph without compromising performance
Deploying flash storage for Ceph without compromising performance Ceph Community
 
6 open capi_meetup_in_japan_final
6 open capi_meetup_in_japan_final6 open capi_meetup_in_japan_final
6 open capi_meetup_in_japan_finalYutaka Kawai
 
Ceph Day Amsterdam 2015 - Deploying flash storage for Ceph without compromisi...
Ceph Day Amsterdam 2015 - Deploying flash storage for Ceph without compromisi...Ceph Day Amsterdam 2015 - Deploying flash storage for Ceph without compromisi...
Ceph Day Amsterdam 2015 - Deploying flash storage for Ceph without compromisi...Ceph Community
 
Ceph Day SF 2015 - Deploying flash storage for Ceph without compromising perf...
Ceph Day SF 2015 - Deploying flash storage for Ceph without compromising perf...Ceph Day SF 2015 - Deploying flash storage for Ceph without compromising perf...
Ceph Day SF 2015 - Deploying flash storage for Ceph without compromising perf...Ceph Community
 
On the feasibility of 40 Gbps network data capture and retention with general...
On the feasibility of 40 Gbps network data capture and retention with general...On the feasibility of 40 Gbps network data capture and retention with general...
On the feasibility of 40 Gbps network data capture and retention with general...Jorge E. López de Vergara Méndez
 
OpenPOWER Acceleration of HPCC Systems
OpenPOWER Acceleration of HPCC SystemsOpenPOWER Acceleration of HPCC Systems
OpenPOWER Acceleration of HPCC SystemsHPCC Systems
 
HPC and cloud distributed computing, as a journey
HPC and cloud distributed computing, as a journeyHPC and cloud distributed computing, as a journey
HPC and cloud distributed computing, as a journeyPeter Clapham
 
Introduction to HPC & Supercomputing in AI
Introduction to HPC & Supercomputing in AIIntroduction to HPC & Supercomputing in AI
Introduction to HPC & Supercomputing in AITyrone Systems
 
Hyperscan - Mohammad Abdul Awal
Hyperscan - Mohammad Abdul AwalHyperscan - Mohammad Abdul Awal
Hyperscan - Mohammad Abdul Awalharryvanhaaren
 
Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)
Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)
Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)Ontico
 
Webinar: OpenEBS - Still Free and now FASTEST Kubernetes storage
Webinar: OpenEBS - Still Free and now FASTEST Kubernetes storageWebinar: OpenEBS - Still Free and now FASTEST Kubernetes storage
Webinar: OpenEBS - Still Free and now FASTEST Kubernetes storageMayaData Inc
 
IBM Power9 Features and Specifications
IBM Power9 Features and SpecificationsIBM Power9 Features and Specifications
IBM Power9 Features and Specificationsinside-BigData.com
 
A Dataflow Processing Chip for Training Deep Neural Networks
A Dataflow Processing Chip for Training Deep Neural NetworksA Dataflow Processing Chip for Training Deep Neural Networks
A Dataflow Processing Chip for Training Deep Neural Networksinside-BigData.com
 
Introduction to DPDK
Introduction to DPDKIntroduction to DPDK
Introduction to DPDKKernel TLV
 
QCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureQCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureCeph Community
 

Similar to DPDK Summit 2015 - Aspera - Charles Shiflett (20)

High performace network of Cloud Native Taiwan User Group
High performace network of Cloud Native Taiwan User GroupHigh performace network of Cloud Native Taiwan User Group
High performace network of Cloud Native Taiwan User Group
 
Introduction to NVMe Over Fabrics-V3R
Introduction to NVMe Over Fabrics-V3RIntroduction to NVMe Over Fabrics-V3R
Introduction to NVMe Over Fabrics-V3R
 
Deploying flash storage for Ceph without compromising performance
Deploying flash storage for Ceph without compromising performance Deploying flash storage for Ceph without compromising performance
Deploying flash storage for Ceph without compromising performance
 
6 open capi_meetup_in_japan_final
6 open capi_meetup_in_japan_final6 open capi_meetup_in_japan_final
6 open capi_meetup_in_japan_final
 
QNAP TS-832PX-4G.pdf
QNAP TS-832PX-4G.pdfQNAP TS-832PX-4G.pdf
QNAP TS-832PX-4G.pdf
 
Ceph Day Amsterdam 2015 - Deploying flash storage for Ceph without compromisi...
Ceph Day Amsterdam 2015 - Deploying flash storage for Ceph without compromisi...Ceph Day Amsterdam 2015 - Deploying flash storage for Ceph without compromisi...
Ceph Day Amsterdam 2015 - Deploying flash storage for Ceph without compromisi...
 
Ceph Day SF 2015 - Deploying flash storage for Ceph without compromising perf...
Ceph Day SF 2015 - Deploying flash storage for Ceph without compromising perf...Ceph Day SF 2015 - Deploying flash storage for Ceph without compromising perf...
Ceph Day SF 2015 - Deploying flash storage for Ceph without compromising perf...
 
On the feasibility of 40 Gbps network data capture and retention with general...
On the feasibility of 40 Gbps network data capture and retention with general...On the feasibility of 40 Gbps network data capture and retention with general...
On the feasibility of 40 Gbps network data capture and retention with general...
 
OpenPOWER Acceleration of HPCC Systems
OpenPOWER Acceleration of HPCC SystemsOpenPOWER Acceleration of HPCC Systems
OpenPOWER Acceleration of HPCC Systems
 
pps Matters
pps Matterspps Matters
pps Matters
 
HPC and cloud distributed computing, as a journey
HPC and cloud distributed computing, as a journeyHPC and cloud distributed computing, as a journey
HPC and cloud distributed computing, as a journey
 
@IBM Power roadmap 8
@IBM Power roadmap 8 @IBM Power roadmap 8
@IBM Power roadmap 8
 
Introduction to HPC & Supercomputing in AI
Introduction to HPC & Supercomputing in AIIntroduction to HPC & Supercomputing in AI
Introduction to HPC & Supercomputing in AI
 
Hyperscan - Mohammad Abdul Awal
Hyperscan - Mohammad Abdul AwalHyperscan - Mohammad Abdul Awal
Hyperscan - Mohammad Abdul Awal
 
Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)
Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)
Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)
 
Webinar: OpenEBS - Still Free and now FASTEST Kubernetes storage
Webinar: OpenEBS - Still Free and now FASTEST Kubernetes storageWebinar: OpenEBS - Still Free and now FASTEST Kubernetes storage
Webinar: OpenEBS - Still Free and now FASTEST Kubernetes storage
 
IBM Power9 Features and Specifications
IBM Power9 Features and SpecificationsIBM Power9 Features and Specifications
IBM Power9 Features and Specifications
 
A Dataflow Processing Chip for Training Deep Neural Networks
A Dataflow Processing Chip for Training Deep Neural NetworksA Dataflow Processing Chip for Training Deep Neural Networks
A Dataflow Processing Chip for Training Deep Neural Networks
 
Introduction to DPDK
Introduction to DPDKIntroduction to DPDK
Introduction to DPDK
 
QCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureQCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference Architecture
 

Recently uploaded

%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in sowetomasabamasaba
 
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...Bert Jan Schrijver
 
WSO2Con204 - Hard Rock Presentation - Keynote
WSO2Con204 - Hard Rock Presentation - KeynoteWSO2Con204 - Hard Rock Presentation - Keynote
WSO2Con204 - Hard Rock Presentation - KeynoteWSO2
 
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With SimplicityWSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With SimplicityWSO2
 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...SelfMade bd
 
%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrandmasabamasaba
 
What Goes Wrong with Language Definitions and How to Improve the Situation
What Goes Wrong with Language Definitions and How to Improve the SituationWhat Goes Wrong with Language Definitions and How to Improve the Situation
What Goes Wrong with Language Definitions and How to Improve the SituationJuha-Pekka Tolvanen
 
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...masabamasaba
 
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...masabamasaba
 
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...masabamasaba
 
WSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go PlatformlessWSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go PlatformlessWSO2
 
%in Benoni+277-882-255-28 abortion pills for sale in Benoni
%in Benoni+277-882-255-28 abortion pills for sale in Benoni%in Benoni+277-882-255-28 abortion pills for sale in Benoni
%in Benoni+277-882-255-28 abortion pills for sale in Benonimasabamasaba
 
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...WSO2
 
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...masabamasaba
 
WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...
WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...
WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...WSO2
 
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfonteinmasabamasaba
 
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...masabamasaba
 
WSO2CON 2024 Slides - Open Source to SaaS
WSO2CON 2024 Slides - Open Source to SaaSWSO2CON 2024 Slides - Open Source to SaaS
WSO2CON 2024 Slides - Open Source to SaaSWSO2
 
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...Shane Coughlan
 
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyviewmasabamasaba
 

Recently uploaded (20)

%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto
 
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
 
WSO2Con204 - Hard Rock Presentation - Keynote
WSO2Con204 - Hard Rock Presentation - KeynoteWSO2Con204 - Hard Rock Presentation - Keynote
WSO2Con204 - Hard Rock Presentation - Keynote
 
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With SimplicityWSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
 
%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand
 
What Goes Wrong with Language Definitions and How to Improve the Situation
What Goes Wrong with Language Definitions and How to Improve the SituationWhat Goes Wrong with Language Definitions and How to Improve the Situation
What Goes Wrong with Language Definitions and How to Improve the Situation
 
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
 
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
 
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
 
WSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go PlatformlessWSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go Platformless
 
%in Benoni+277-882-255-28 abortion pills for sale in Benoni
%in Benoni+277-882-255-28 abortion pills for sale in Benoni%in Benoni+277-882-255-28 abortion pills for sale in Benoni
%in Benoni+277-882-255-28 abortion pills for sale in Benoni
 
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
 
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
 
WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...
WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...
WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...
 
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
 
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
 
WSO2CON 2024 Slides - Open Source to SaaS
WSO2CON 2024 Slides - Open Source to SaaSWSO2CON 2024 Slides - Open Source to SaaS
WSO2CON 2024 Slides - Open Source to SaaS
 
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
 
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
 

DPDK Summit 2015 - Aspera - Charles Shiflett

  • 1. ASPERA HIGH-SPEED TRANSFER Moving the world’s data at maximum speed
  • 2. ASPERA HIGH-SPEED FILE TRANSFER 2 80 GBIT/S OVER IP USING DPDK Performance, Code, and Architecture Charles Shiflett Developer of next-generation FASP bear @ us . IBM . com bear@asperasoft.com
  • 3. 3 RAPID PROTYPING OF BARE METAL SOLUTIONS 3 Benefits of using DPDK • Lightweight API that exposes Network and Acceleration Cards • Rapid Iteration on high performance transport designs as compared to equivalent code developed as Kernel Module • Set of performance best practices - Time, Locking, Memory Alignment, and Synchronization - Avoids libc and system call interaction when possible • NUMA aware solution • Framework for High Performance Userland Direct I/O
  • 4. FASP NX DESIGN GOAL 4 Goal is to bring industry closer to the data they are using both within the data center and across the globe. • Effectively utilize PCIe based IO on Intel Server platform. • Solve storage bottleneck using Direct I/O w/ multiple controllers • Solve network bottleneck using DPDK and link aggregation • Provide a secure transport solution using Intel AES-NI GCM • Significantly reduce time spent getting your data set to the CPU
  • 5. HARDWARE CONFIGURATION 5 5x Intel® DC P3700 NVMe SSD 2x Intel® Xeon®E5-2697 v3 2x Intel® XL710 40 GbE Ethernet QSFP+
  • 6. ASPERA’S MISSION Creating next-generation transport technologies that move the world’s digital assets at maximum speed, regardless of file size, transfer distance and network conditions. 6
  • 7. CHALLENGES WITH TCP AND ALTERNATIVE TECHNOLOGIES Distance degrades conditions on all networks • Latency (or Round Trip Times) increase • Packet losses increase • Fast networks just as prone to degradation TCP performance degrades with distance • Throughput bottleneck becomes more severe with increased latency and packet loss TCP does not scale with bandwidth • TCP designed for low bandwidth • Adding more bandwidth does not improve throughput Alternative Technologies • TCP-based - Network latency and packet loss must be low • UDP traffic blasters - Inefficient and waste bandwidth • Data caching - Inappropriate for many large file transfer workflows • Modified TCP - Improves on TCP performance but insufficient for fast networks • Data compression - Time consuming and impractical for certain file types • CDNs & co-lo build outs - High overhead and expensive to scale 7
  • 8. FASP™: HIGH-PERFORMANCE TRANSPORT Maximum transfer speed • Optimal end-to-end throughput efficiency • Transfer performance scales with bandwidth independent of transfer distance and resilient to packet loss Congestion Avoidance and Policy Control • Automatic, full utilization of available bandwidth • On-the-fly prioritization and bandwidth allocation Uncompromising security and reliability • Secure, user/endpoint authentication • AES-128 cryptography in transit and at-rest Scalable management, monitoring and control • Real-time progress, performance and bandwidth utilization • Detailed transfer history, logging, and manifest Low Overhead • Less than 0.1% overhead on 30% packet loss • High performance with large files or large sets of small files Resulting in • Transfers up to thousands of times faster than FTP • Precise and predictable transfer times • Extreme scalability (concurrency and throughput) 8
  • 9. LAN TRANSFER RESULTS IN GBIT PER SECOND 9 0 10 20 30 40 50 60 70 80 SSH AES-128 CTR FTP iperf FASP 4x10Gbit (Intel® Xeon® E5-2650 v2) FASP (Intel® Xeon® E5-2697 v3) Disk to Disk Wire Utilization w/o disk Results from SC 14 showing the relative performance of Network Transfer Technologies
  • 12. AVOIDING STALLS • Synchronization primitives aligned to cache-line, lockless. Example for circular buffer queues; Dequeue, claim block: block_id = __sync_fetch_and_add ( &sess->block_tail, 1 ) % block_count; Dequeue, consume block: while ( !__sync_val_compare_and_swap( &sess->block[block_id].state, 0, 1) ) PAUSE; • Zero copy paradigm critical to good memory performance. Memory copies are VERY expensive! • Memory copies should be avoided at all costs, writes to Memory must be page aligned to avoid read of destination cacheline • Solution is to operate on datasets which fit into L3 cache. New tools provide better insight into L3 cache utilization such that code can be optimized. • Low latency direct I/O. • Intel DDIO critical in bringing data into L3 from Storage, Network Adapters, and PCIe Accelerators. • DPDK provides framework for operating on data streamed into L3 cache • Avoid TLB misses by using Hugepages • Avoid NUMA issues and minimize thread preemption. • Userland IO drivers where possible. 12
  • 13. EXAMPLE OF ALIGNED/UNALIGNED MEMORY COPY 13 5.7 GB/S copy performance constrained by RFO 9.0 GB/s copy avoids RFO, close to per core limit * Per channel results via Intel Performance Counter Monitor
  • 14. OPTIMIZING NETWORK THROUGHPUT 14 • Create network queues which fit into L3. Each queue tied to a specific core. • Experiment with values to minimize memory, while retaining useful properties: • Should support bulk allocation • MAX_RX_BURST < rx_free_thresh < nb_rx_desc (set when assigning queue to port) • Nb_rx_desc % rx_free_thresh == 0 • Ideally pkts_in_cache % nb_pkts == 0 and nb_pkts == 2^n-1 • PTHRESH/HTHRESH/WTHRESH values feel like black magic, but can have a huge effect on performance. Start by copying values from example applications.
  • 15. IMPROVING STORAGE THROUGHPUT • Traditional storage is built around the idea of moving slow data from disk to memory and then from memory to application. Memory is used to cache data to improve access speeds. • Cache structure quickly becomes a bottleneck as transfer speeds exceed 10 gbit/s. • While individual spinning disks are slow, JBOD’s of 100’s of disk have very high aggregate bandwidth • Modern SSDs (especially NVMe) is very fast. • Two solutions for fast data • Use XFS with direct IO (and/or MMAP) • Have shown performance at about 40gbit/s with hardware raid and direct attach disks. • Have shown performance at about 70gbit/s with NVMe SSD. • Limited by how many devices you can connect to PCIe data link. • Clustered parallel storage • Aspera is targeting IBM GPFS (Spectrum) or Lustre based solutions • Individual nodes can be slow, but by aggregating nodes high performance is achieved • Both offer direct I/O solutions 15
  • 16. DC P3700 PERFORMANCE (SINGLE DRIVE) 16 Performance relative to block size (SI Units in MB) 2678 MB/s 1869 MB/s 0 500 1000 1500 2000 2500 3000 128 16 4 1 Write Synchronous Read Async Read (AIO) 2800 MB/s
  • 17. ENCRYPTION IS NOT A BOTTLENECK WITH HW AES-GCM 17 0 0.5 1 1.5 2 2.5 Core i7 X 980 3.33GHz Xeon E5-2650 v2 2.60GHz Xeon E5-2697 v3 2.60GHz AES 128 GCM Encryption Rate in GB/s per Core
  • 18. NEW ON THE WIRE PROTOCOL • Built around AES-128 GCM, which is similar to DTLS ( Datagram TLS ). 18 AES-128 GCM { uint64 IV; uint8[] payload; uint8[16] MAC; } DTLSRecord { ContentType type; ProtocolVersion version; uint16 epoch; uint48 sequence_number; uint16 length; uint8_t[length] payload; } MAC { uint8[16] hash; } FASP4 { uint16 stream_id; uint8 ver_and_pkt_type; uint40 sequence_number; uint8[] payload; uint8[16] hash; // optional } • In the same way it is possible to encapsulate FASP NX header in UDP, will also support Encapsulating FASP4 in DTLS.
  • 19. EXAMPLE TRANSFER SESSION USING FASP NX API 19
  • 20. FASP NX API’S • Native FASP NX API • Share hugepage sized regions of memory with client application • Usable from VM or Native machine • Enables Zero Copy transfer solutions of non file data • Dynamically set rate policy, destination IP, and so on. • Traditional Aspera (ascp) command line tools • Similar command line to SCP, with FASP NX performance. • Existing Aspera API • Faspmgmt • Web Services • Rest based Queries 20
  • 21. FUTURE DIRECTION • Aspera’s Goal: Transfer Solution to Petascale Datasets - 1 SI Terabyte in 2 Minutes - 1 SI Petabyte in 1⅜ Days - Performance improvements expected to scale relative to PCIe interconnect. • Better integration with storage systems - Take advantage of native APIs to avoid kernel cache. • Better integration with network hardware - Expected to show 100gbit/s transfers using Mellanox ConnectX®-4 - Query network switch and routers? • Support wider use cases - Compression, Forward Error Correcting Codecs, Berkeley Sockets API 21