SlideShare a Scribd company logo
© 2019 Mellanox Technologies | Confidential 1
Paving the Road to Exascale
March 2019
Interconnect Your Future
Guilherme Fuhrken
gfuhrken@Mellanox.com
+55-11-99934-2297
© 2019 Mellanox Technologies | Confidential 2
Cloud &
Web 2.0
Big
Data
Enterprise
Business
Intelligence
HPC
Storage
Security
AI & Machine
Learning
Internet
of Things
Exponential Data
Growth Everywhere
Source: IDC
HPC: High-Performance Compute
AI: Artificial Intelligence
© 2019 Mellanox Technologies | Confidential 3
Higher Data Speeds
Faster Data Processing
Better Data Security
Exponential Data
Growth Everywhere
source: IDC
© 2019 Mellanox Technologies | Confidential 4
Higher Data Speeds
Faster Data Processing
Better Data Security
Adapters Switches
Cables &
Transceivers
SmartNIC System on a Chip
HPC and AI Needs the Most
Intelligent Interconnect
© 2019 Mellanox Technologies | Confidential 5
Mellanox Accelerates Leading HPC and AI Systems
World’s Top 3 Supercomputers
Summit CORAL System
World’s Fastest HPC / AI System
9.2K InfiniBand Nodes
Sierra CORAL System
#2 USA Supercomputer
8.6K InfiniBand Nodes
1 2
Wuxi Supercomputing Center
Fastest Supercomputer in China
41K InfiniBand Nodes
3
© 2019 Mellanox Technologies | Confidential 6
Mellanox Accelerates Leading HPC and AI Systems
(Examples)
JUWELS Supercomputer
2.6K InfiniBand Nodes
Fastest HPC / AI System in Japan
1.1K InfiniBand Nodes
The world's Fastest Industry
Supercomputer
1.6K InfiniBand Nodes
7 15 26
Fastest Supercomputer in Canada
Dragonfly+ Topology
1.5K InfiniBand Nodes
NASA Ames Research Center
20K InfiniBand Nodes
World’s leading Industry
Supercomputer
4.6K InfiniBand Nodes
34 5927
© 2019 Mellanox Technologies | Confidential 7
The Need for Intelligent and Faster Interconnect
CPU-Centric (Onload) Data-Centric (Offload)
Must Wait for the Data
Creates Performance Bottlenecks
Faster Data Speeds and In-Network Computing
Enable Higher Performance and Scale
GPU
CPU
GPU
CPU
Onload Network In-Network Computing
GPU
CPU
CPU
GPU
GPU
CPU
GPU
CPU
GPU
CPU
CPU
GPU
Analyze Data as it Moves!
Higher Performance and Scale
© 2019 Mellanox Technologies | Confidential 8
In-Network
Computing
Self-Healing
Technology
In-Network
Computing
Unbreakable Data Centers
Delivers Highest Application Performance
GPUDirect™ RDMA
Critical for HPC and Machine Learning ApplicationsGPU Acceleration Technology
10X Performance Acceleration
Critical for HPC and Machine Learning Applications
35X
Faster Network Recovery
5000X
10X Performance Acceleration
Performance Acceleration
In-Network Computing Delivers Highest Performance
Scalable Hierarchical
Aggregation and
Reduction Protocol
© 2019 Mellanox Technologies | Confidential 9
30%-250% Higher Return on Investment
Up to 50% Saving on Capital and Operation Expenses
Highest Applications Performance, Scalability and Productivity
InfiniBand Delivers Best Return on Investment
1.9X
Better
2X
Better
1.4X
Better
2.5X
Better
1.3X
Better
Molecular DynamicsGenomicsWeather Automotive Chemistry
© 2019 Mellanox Technologies | Confidential 10
Mellanox Supercharges Leading AI Companies
Higher ROI
Lower CapEx
& OpEx
60%
50%
Unlocking the Power of Artificial Intelligence
Cognitive Toolkit
RDMA Supercharges Leading AI Frameworks
© 2019 Mellanox Technologies | Confidential 11
Scalable Hierarchical
Aggregation and
Reduction Protocol
(SHARP)
© 2019 Mellanox Technologies | Confidential 12
Scalable Hierarchical Aggregation and Reduction Protocol (SHARP)
 Reliable Scalable General Purpose Primitive
 In-network Tree based aggregation mechanism
 Large number of groups
 Multiple simultaneous outstanding operations
 Applicable to Multiple Use-cases
 HPC Applications using MPI / SHMEM
 Distributed Machine Learning applications
 Scalable High Performance Collective Offload
 Barrier, Reduce, All-Reduce, Broadcast and more
 Sum, Min, Max, Min-loc, max-loc, OR, XOR, AND
 Integer and Floating-Point, 16/32/64 bits
SHArP Tree
SHARP Tree Aggregation Node
(Process running on HCA)
SHARP Tree Endnode
(Process running on HCA)
SHARP Tree Root
© 2019 Mellanox Technologies | Confidential 13
SHARP AllReduce Performance Advantages (128 Nodes)
SHARP enables 75% Reduction in Latency
Providing Scalable Flat LatencyScalable Hierarchical
Aggregation and
Reduction Protocol
© 2019 Mellanox Technologies | Confidential 14
Oak Ridge National Laboratory – Coral Summit Supercomputer
SHARP AllReduce Performance Advantages
SHARP Enables Highest PerformanceScalable Hierarchical
Aggregation and
Reduction Protocol
© 2019 Mellanox Technologies | Confidential 15
SHARP Performance – Application (OSU)
Network-Based Computing Laboratory
http://nowlab.cse.ohio-state.edu/
The MVAPICH2 Project
http://mvapich.cse.ohio-state.edu/
Source: Prof. DK Panda, Ohio State University
© 2019 Mellanox Technologies | Confidential 16
SHARP Performance Advantage for AI
 SHARP provides 16% Performance Increase for deep learning, initial results
 TensorFlow with Horovod running ResNet50 benchmark, HDR InfiniBand (ConnectX-6, Quantum)
16%
© 2019 Mellanox Technologies | Confidential 17
GPUDirect
© 2019 Mellanox Technologies | Confidential 18
10X Higher Performance with GPUDirect™ RDMA
Accelerates HPC and Deep Learning performance
Lowest communication latency for GPUs
GPUDirect™ RDMA
© 2019 Mellanox Technologies | Confidential 19
HDR InfiniBand
© 2019 Mellanox Technologies | Confidential 20
Highest-Performance 200Gb/s InfiniBand Solutions
Transceivers
Active Optical and Copper Cables
(10 / 25 / 40 / 50 / 56 / 100 / 200Gb/s)
40 HDR (200Gb/s) InfiniBand Ports
80 HDR100 InfiniBand Ports
Throughput of 16Tb/s, <90ns Latency
200Gb/s Adapter, 0.6us latency
215 million messages per second
(10 / 25 / 40 / 50 / 56 / 100 / 200Gb/s)
MPI, SHMEM/PGAS, UPC
For Commercial and Open Source Applications
Leverages Hardware Accelerations
System on Chip and SmartNIC
Programmable adapter
Smart Offloads
© 2019 Mellanox Technologies | Confidential 21
Leading Connectivity
ConnectX-6 HDR InfiniBand Adapter
Leading Performance
Leading Features
 200Gb/s InfiniBand and Ethernet
 HDR, HDR100, EDR (100Gb/s) and lower speeds
 200GbE, 100GbE and lower speeds
 Single and dual ports
 50Gb/s PAM4 SerDes
 200Gb/s throughput, 0.6usec latency, 215 million message per second
 PCIe Gen3 / Gen4, 32 lanes
 Integrated PCIe switch
 Multi-Host - up to 8 hosts, supporting 4 dual-socket servers
 In-network computing and memory for HPC collective offloads
 Security – Block-level encryption to storage, key management, FIPS
 Storage – NVMe Emulation, NVMe-oF target, Erasure coding, T10/DIF
© 2019 Mellanox Technologies | Confidential 22
HDR InfiniBand Switch: QM8700, 1U Series
 40 ports of HDR, 200G
 80 ports of HDR100, 100G
Superior performance
40 QSFP56 ports (50G PAM4 per lane)
 90ns latency
 390M packets per sec (64B)
 16Tb/s aggregate bandwidth
Superior resiliency
 22’’ depth
 6 fans (5+1), hot swappable
 2 power supplies (1+1), hot swappable
© 2019 Mellanox Technologies | Confidential 23
HDR InfiniBand Switch: CS8500, Modular Series
 800 ports of HDR, 200G
 1600 ports of HDR100, 100G
Superior performance
800 QSFP56 ports
 300ns latency
 320Tb/s aggregate bandwidth
 LCD Tablet IO panel
Water-cooled solution
 Liquid – Liquid 4U CDU
 Liquid – Air 42U (350mm wide) stand alone HEX
 0C – 35C (air) or 40C (water) operating air range
© 2019 Mellanox Technologies | Confidential 24
BlueField SoC
Advantages and Platforms
© 2019 Mellanox Technologies | Confidential 25
BlueField Block Diagram
 Tile Architecture - 16 ARM® A72 CPUs subsystem
 SkyMesh™ fully coherent low-latency interconnect
 8MB L2 Cache, 8 Tiles
 48KB I-Cache, 32KB D-Cache per core
 12MB L3 Last Level Cache
 ARM Frequency: 0.8GHz - 1.3GHz
 Dual Port 100g IO Controller, based on ConnectX-5
 Dual 100Gb/s Ethernet/InfiniBand, compatible with ConnectX-5
 NVMe-oF hardware accelerator
 High-end Networking Offloads: RDMA, Erasure Coding, T10-DIF
 Fully Integrated PCIe switch
 32 Bifurcated PCI Gen3/4 lanes (up to 200Gb/s)
 Root Complex or Endpoint modes
 2x16, 4x8, 8x4 or 16x2 configurations
 Hardware Accelerators, Crypto Engines
 Bulk crypto by A72 Neon ISA (AES, SHA)
 Public Key acceleration, True RNG
 Memory Controllers
 2x Channels DDR4 Memory Controllers w/ ECC
 NVDIMM-N Support
Dual VPI Ports
Ethernet/InfiniBand:
1, 10, 25,40,50,100G
32-lanes
PCIe Gen3/4
© 2019 Mellanox Technologies | Confidential 26
BlueField for Smart Solutions
 SoC: Compute, networking and PCIe connectivity
 Dual port VPI EDR/100GbE
 16 Arm cores
 32 lanes of PCIe switch gen3/4
Storage Solutions
BlueField SoC (System on Chip)
 NVMe-based storage platforms
 RDMA, NVMe over Fabrics, RAID, Signature offload
 Partner’s solutions based on BlueField storage controller
Smart Adapters
 In-network computing and collective offloads
 Co-processor running proprietary smart algorithms
 Security and privacy algorithms
© 2019 Mellanox Technologies | Confidential 27
Mellanox Ethernet
Switch Systems
© 2019 Mellanox Technologies | Confidential 28
Open Ethernet – The Freedom to Optimize
Open APIs
Automation
End-to-End
Interconnect
Network-OS
Choice
SONiC
SDK
Customer-OS
© 2019 Mellanox Technologies | Confidential 29
The only predictable 25/50/100Gb/s Ethernet switch
Spectrum: The Ultimate 25/100GbE Switch
Full wire speed, non-blocking switch
ZPL: ZeroPacketLoss for all packets sizes
 Doesn’t drop packets per RFC2544
© 2019 Mellanox Technologies | Confidential 30
SN2100 – 16x100GbE ports
(up to 32 x50GbE , 64x25/10GbE)
Ideal storage / Database 25/100GbE Switch
Open Ethernet SN2000 Series
300nsSN2700 – 169W
SN2410 – 165W
SN2100 – 94W
 Predictable Performance
 Fair Traffic Distribution for Cloud
 Best-in-Class Throughput, Latency, Power Consumption
 Zero Packet Loss
SN2700 – 32x100GbE (up to 64 x 50/25/10GbE)
The Ideal 100GbE ToR / Aggregation
SN2410 – 8x100GbE + 48x25GbE
25GbE  100GbE ToR
Energy efficiency
SN2010 – 4x100GbE + 18x10/25GbE
Ideal Hyperconverged Switch
10/25GbE  100GbE half width ToR
© 2019 Mellanox Technologies | Confidential 31
Ideal Spine/Super-spine
High Density Leaf/200GbE Spine/Super-spine
Ideal Leaf
SN3800 – 64x100GbE (QSFP-28) 2U
Open Ethernet SN3000 Series
Compact ½ U Switch
Introducing speeds from 1GbE through 400GbE
SN3700 – 32x200GbE (QSFP-56) 1U
SN3510 – 48x25/50GbE (SFP-56) +
6x400GbE (QSFP-DD) 1U
SN3200 – 16x400GbE (QSFP-DD) ½U
© 2019 Mellanox Technologies | Confidential 32
End-to-End Solutions
for All Platforms
Highest Performance and Scalability for Intel, AMD, IBM Power,
NVIDIA, Arm and FPGA-based Compute and Storage Platforms at
10, 20, 25, 40, 50, 100, 200 and 400Gb/s Speeds
Unleashing the Power of All Compute Architectures
X86
Open
POWER
GPU ARM FPGA
© 2019 Mellanox Technologies | Confidential 33
Thank You
Guilherme Fuhrken
gfuhrken@Mellanox.com
+55-11-99934-2297

More Related Content

What's hot

Announcing the Mellanox ConnectX-5 100G InfiniBand Adapter
Announcing the Mellanox ConnectX-5 100G InfiniBand AdapterAnnouncing the Mellanox ConnectX-5 100G InfiniBand Adapter
Announcing the Mellanox ConnectX-5 100G InfiniBand Adapter
inside-BigData.com
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
inside-BigData.com
 
Accelerated Any-Scale Solutions from DDN
Accelerated Any-Scale Solutions from DDNAccelerated Any-Scale Solutions from DDN
Accelerated Any-Scale Solutions from DDN
inside-BigData.com
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
inside-BigData.com
 
Interconnect Your Future with Connect-IB
Interconnect Your Future with Connect-IBInterconnect Your Future with Connect-IB
Interconnect Your Future with Connect-IB
Mellanox Technologies
 
Interconnect Your Future
Interconnect Your FutureInterconnect Your Future
Interconnect Your Future
Mellanox Technologies
 
Mellanox IBM
Mellanox IBMMellanox IBM
Mellanox IBM
IBM Danmark
 
Xilinx Data Center Strategy and CCIX
Xilinx Data Center Strategy and CCIXXilinx Data Center Strategy and CCIX
Xilinx Data Center Strategy and CCIX
Yoshihiro Horie
 
Open power topics20191023
Open power topics20191023Open power topics20191023
Open power topics20191023
Yutaka Kawai
 
Open Source 5G/Edge Automation via ONAP
Open Source 5G/Edge Automation via ONAPOpen Source 5G/Edge Automation via ONAP
Open Source 5G/Edge Automation via ONAP
Liz Warner
 
Accelerating 5G enterprise networks with edge computing and latency assurance
Accelerating 5G enterprise networks with edge computing and latency assuranceAccelerating 5G enterprise networks with edge computing and latency assurance
Accelerating 5G enterprise networks with edge computing and latency assurance
ADVA
 
Mellanox's Technological Advantage
Mellanox's Technological AdvantageMellanox's Technological Advantage
Mellanox's Technological Advantage
Mellanox Technologies
 
Iccbn Presentation - optical
Iccbn Presentation - opticalIccbn Presentation - optical
Iccbn Presentation - optical
roysoumya
 
Development, test, and characterization of MEC platforms with Teranium and Dr...
Development, test, and characterization of MEC platforms with Teranium and Dr...Development, test, and characterization of MEC platforms with Teranium and Dr...
Development, test, and characterization of MEC platforms with Teranium and Dr...
Michelle Holley
 
Interconnect Your Future With Mellanox
Interconnect Your Future With MellanoxInterconnect Your Future With Mellanox
Interconnect Your Future With Mellanox
Mellanox Technologies
 
Xilinx Edge Compute using Power 9 /OpenPOWER systems
Xilinx Edge Compute using Power 9 /OpenPOWER systemsXilinx Edge Compute using Power 9 /OpenPOWER systems
Xilinx Edge Compute using Power 9 /OpenPOWER systems
Ganesan Narayanasamy
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
inside-BigData.com
 
White Box Hardware Challenges in the 5G & IoT Hyperconnected Era
White Box Hardware Challenges in the 5G & IoT Hyperconnected EraWhite Box Hardware Challenges in the 5G & IoT Hyperconnected Era
White Box Hardware Challenges in the 5G & IoT Hyperconnected Era
Charo Sanchez
 
Scaling Arm from One to One Trillion
Scaling Arm from One to One TrillionScaling Arm from One to One Trillion
Scaling Arm from One to One Trillion
Eric Van Hensbergen
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
inside-BigData.com
 

What's hot (20)

Announcing the Mellanox ConnectX-5 100G InfiniBand Adapter
Announcing the Mellanox ConnectX-5 100G InfiniBand AdapterAnnouncing the Mellanox ConnectX-5 100G InfiniBand Adapter
Announcing the Mellanox ConnectX-5 100G InfiniBand Adapter
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
 
Accelerated Any-Scale Solutions from DDN
Accelerated Any-Scale Solutions from DDNAccelerated Any-Scale Solutions from DDN
Accelerated Any-Scale Solutions from DDN
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
 
Interconnect Your Future with Connect-IB
Interconnect Your Future with Connect-IBInterconnect Your Future with Connect-IB
Interconnect Your Future with Connect-IB
 
Interconnect Your Future
Interconnect Your FutureInterconnect Your Future
Interconnect Your Future
 
Mellanox IBM
Mellanox IBMMellanox IBM
Mellanox IBM
 
Xilinx Data Center Strategy and CCIX
Xilinx Data Center Strategy and CCIXXilinx Data Center Strategy and CCIX
Xilinx Data Center Strategy and CCIX
 
Open power topics20191023
Open power topics20191023Open power topics20191023
Open power topics20191023
 
Open Source 5G/Edge Automation via ONAP
Open Source 5G/Edge Automation via ONAPOpen Source 5G/Edge Automation via ONAP
Open Source 5G/Edge Automation via ONAP
 
Accelerating 5G enterprise networks with edge computing and latency assurance
Accelerating 5G enterprise networks with edge computing and latency assuranceAccelerating 5G enterprise networks with edge computing and latency assurance
Accelerating 5G enterprise networks with edge computing and latency assurance
 
Mellanox's Technological Advantage
Mellanox's Technological AdvantageMellanox's Technological Advantage
Mellanox's Technological Advantage
 
Iccbn Presentation - optical
Iccbn Presentation - opticalIccbn Presentation - optical
Iccbn Presentation - optical
 
Development, test, and characterization of MEC platforms with Teranium and Dr...
Development, test, and characterization of MEC platforms with Teranium and Dr...Development, test, and characterization of MEC platforms with Teranium and Dr...
Development, test, and characterization of MEC platforms with Teranium and Dr...
 
Interconnect Your Future With Mellanox
Interconnect Your Future With MellanoxInterconnect Your Future With Mellanox
Interconnect Your Future With Mellanox
 
Xilinx Edge Compute using Power 9 /OpenPOWER systems
Xilinx Edge Compute using Power 9 /OpenPOWER systemsXilinx Edge Compute using Power 9 /OpenPOWER systems
Xilinx Edge Compute using Power 9 /OpenPOWER systems
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
 
White Box Hardware Challenges in the 5G & IoT Hyperconnected Era
White Box Hardware Challenges in the 5G & IoT Hyperconnected EraWhite Box Hardware Challenges in the 5G & IoT Hyperconnected Era
White Box Hardware Challenges in the 5G & IoT Hyperconnected Era
 
Scaling Arm from One to One Trillion
Scaling Arm from One to One TrillionScaling Arm from One to One Trillion
Scaling Arm from One to One Trillion
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
 

Similar to Mellnox Interconnect presentation in OpenPOWER Brazil workshop

Mellanox Announces HDR 200 Gb/s InfiniBand Solutions
Mellanox Announces HDR 200 Gb/s InfiniBand SolutionsMellanox Announces HDR 200 Gb/s InfiniBand Solutions
Mellanox Announces HDR 200 Gb/s InfiniBand Solutions
inside-BigData.com
 
Co-Design Architecture for Exascale
Co-Design Architecture for ExascaleCo-Design Architecture for Exascale
Co-Design Architecture for Exascale
inside-BigData.com
 
Mellanox Announcements at SC15
Mellanox Announcements at SC15Mellanox Announcements at SC15
Mellanox Announcements at SC15
inside-BigData.com
 
Interconnect your future
Interconnect your futureInterconnect your future
Interconnect your future
inside-BigData.com
 
Interconnect Your Future: Paving the Road to Exascale
Interconnect Your Future: Paving the Road to ExascaleInterconnect Your Future: Paving the Road to Exascale
Interconnect Your Future: Paving the Road to Exascale
inside-BigData.com
 
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
Linaro
 
Advanced Networking: The Critical Path for HPC, Cloud, Machine Learning and more
Advanced Networking: The Critical Path for HPC, Cloud, Machine Learning and moreAdvanced Networking: The Critical Path for HPC, Cloud, Machine Learning and more
Advanced Networking: The Critical Path for HPC, Cloud, Machine Learning and more
inside-BigData.com
 
Energy-efficient data processing in smart networks
Energy-efficient data processing in smart networksEnergy-efficient data processing in smart networks
Energy-efficient data processing in smart networks
ADVA
 
Advancing Applications Performance With InfiniBand
Advancing Applications Performance With InfiniBandAdvancing Applications Performance With InfiniBand
Advancing Applications Performance With InfiniBand
Mellanox Technologies
 
Industry Brief: Tectonic Shift - HPC Networks Converge
Industry Brief: Tectonic Shift - HPC Networks ConvergeIndustry Brief: Tectonic Shift - HPC Networks Converge
Industry Brief: Tectonic Shift - HPC Networks Converge
IT Brand Pulse
 
Open coud networking at full speed - Avi Alkobi
Open coud networking at full speed - Avi AlkobiOpen coud networking at full speed - Avi Alkobi
Open coud networking at full speed - Avi Alkobi
OpenInfra Days Poland 2019
 
Huawei WiFi6 AIFabric.pptx
Huawei WiFi6 AIFabric.pptxHuawei WiFi6 AIFabric.pptx
Huawei WiFi6 AIFabric.pptx
helberth73
 
Take Your Automated Campus to the Next Level
Take Your Automated Campus to the Next LevelTake Your Automated Campus to the Next Level
Take Your Automated Campus to the Next Level
Extreme Networks
 
Ppt Dc 09
Ppt Dc 09Ppt Dc 09
Ppt Dc 09
mattknowles
 
HPC DAY 2017 | The network part in accelerating Machine-Learning and Big-Data
HPC DAY 2017 | The network part in accelerating Machine-Learning and Big-DataHPC DAY 2017 | The network part in accelerating Machine-Learning and Big-Data
HPC DAY 2017 | The network part in accelerating Machine-Learning and Big-Data
HPC DAY
 
InfiniBand in the Enterprise Data Center.pdf
InfiniBand in the Enterprise Data Center.pdfInfiniBand in the Enterprise Data Center.pdf
InfiniBand in the Enterprise Data Center.pdf
bui thequan
 
Storage, Cloud, Web 2.0, Big Data Driving Growth
Storage, Cloud, Web 2.0, Big Data Driving GrowthStorage, Cloud, Web 2.0, Big Data Driving Growth
Storage, Cloud, Web 2.0, Big Data Driving Growth
Mellanox Technologies
 
High Performance Communication for Oracle using InfiniBand
High Performance Communication for Oracle using InfiniBandHigh Performance Communication for Oracle using InfiniBand
High Performance Communication for Oracle using InfiniBandwebhostingguy
 
Supermicro AI Pod that’s Super Simple, Super Scalable, and Super Affordable
Supermicro AI Pod that’s Super Simple, Super Scalable, and Super AffordableSupermicro AI Pod that’s Super Simple, Super Scalable, and Super Affordable
Supermicro AI Pod that’s Super Simple, Super Scalable, and Super Affordable
Rebekah Rodriguez
 
Netronome Corporate Brochure
Netronome Corporate BrochureNetronome Corporate Brochure
Netronome Corporate Brochure
Netronome
 

Similar to Mellnox Interconnect presentation in OpenPOWER Brazil workshop (20)

Mellanox Announces HDR 200 Gb/s InfiniBand Solutions
Mellanox Announces HDR 200 Gb/s InfiniBand SolutionsMellanox Announces HDR 200 Gb/s InfiniBand Solutions
Mellanox Announces HDR 200 Gb/s InfiniBand Solutions
 
Co-Design Architecture for Exascale
Co-Design Architecture for ExascaleCo-Design Architecture for Exascale
Co-Design Architecture for Exascale
 
Mellanox Announcements at SC15
Mellanox Announcements at SC15Mellanox Announcements at SC15
Mellanox Announcements at SC15
 
Interconnect your future
Interconnect your futureInterconnect your future
Interconnect your future
 
Interconnect Your Future: Paving the Road to Exascale
Interconnect Your Future: Paving the Road to ExascaleInterconnect Your Future: Paving the Road to Exascale
Interconnect Your Future: Paving the Road to Exascale
 
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
 
Advanced Networking: The Critical Path for HPC, Cloud, Machine Learning and more
Advanced Networking: The Critical Path for HPC, Cloud, Machine Learning and moreAdvanced Networking: The Critical Path for HPC, Cloud, Machine Learning and more
Advanced Networking: The Critical Path for HPC, Cloud, Machine Learning and more
 
Energy-efficient data processing in smart networks
Energy-efficient data processing in smart networksEnergy-efficient data processing in smart networks
Energy-efficient data processing in smart networks
 
Advancing Applications Performance With InfiniBand
Advancing Applications Performance With InfiniBandAdvancing Applications Performance With InfiniBand
Advancing Applications Performance With InfiniBand
 
Industry Brief: Tectonic Shift - HPC Networks Converge
Industry Brief: Tectonic Shift - HPC Networks ConvergeIndustry Brief: Tectonic Shift - HPC Networks Converge
Industry Brief: Tectonic Shift - HPC Networks Converge
 
Open coud networking at full speed - Avi Alkobi
Open coud networking at full speed - Avi AlkobiOpen coud networking at full speed - Avi Alkobi
Open coud networking at full speed - Avi Alkobi
 
Huawei WiFi6 AIFabric.pptx
Huawei WiFi6 AIFabric.pptxHuawei WiFi6 AIFabric.pptx
Huawei WiFi6 AIFabric.pptx
 
Take Your Automated Campus to the Next Level
Take Your Automated Campus to the Next LevelTake Your Automated Campus to the Next Level
Take Your Automated Campus to the Next Level
 
Ppt Dc 09
Ppt Dc 09Ppt Dc 09
Ppt Dc 09
 
HPC DAY 2017 | The network part in accelerating Machine-Learning and Big-Data
HPC DAY 2017 | The network part in accelerating Machine-Learning and Big-DataHPC DAY 2017 | The network part in accelerating Machine-Learning and Big-Data
HPC DAY 2017 | The network part in accelerating Machine-Learning and Big-Data
 
InfiniBand in the Enterprise Data Center.pdf
InfiniBand in the Enterprise Data Center.pdfInfiniBand in the Enterprise Data Center.pdf
InfiniBand in the Enterprise Data Center.pdf
 
Storage, Cloud, Web 2.0, Big Data Driving Growth
Storage, Cloud, Web 2.0, Big Data Driving GrowthStorage, Cloud, Web 2.0, Big Data Driving Growth
Storage, Cloud, Web 2.0, Big Data Driving Growth
 
High Performance Communication for Oracle using InfiniBand
High Performance Communication for Oracle using InfiniBandHigh Performance Communication for Oracle using InfiniBand
High Performance Communication for Oracle using InfiniBand
 
Supermicro AI Pod that’s Super Simple, Super Scalable, and Super Affordable
Supermicro AI Pod that’s Super Simple, Super Scalable, and Super AffordableSupermicro AI Pod that’s Super Simple, Super Scalable, and Super Affordable
Supermicro AI Pod that’s Super Simple, Super Scalable, and Super Affordable
 
Netronome Corporate Brochure
Netronome Corporate BrochureNetronome Corporate Brochure
Netronome Corporate Brochure
 

More from Ganesan Narayanasamy

Chip Design Curriculum development Residency program
Chip Design Curriculum development Residency programChip Design Curriculum development Residency program
Chip Design Curriculum development Residency program
Ganesan Narayanasamy
 
Basics of Digital Design and Verilog
Basics of Digital Design and VerilogBasics of Digital Design and Verilog
Basics of Digital Design and Verilog
Ganesan Narayanasamy
 
180 nm Tape out experience using Open POWER ISA
180 nm Tape out experience using Open POWER ISA180 nm Tape out experience using Open POWER ISA
180 nm Tape out experience using Open POWER ISA
Ganesan Narayanasamy
 
Workload Transformation and Innovations in POWER Architecture
Workload Transformation and Innovations in POWER Architecture Workload Transformation and Innovations in POWER Architecture
Workload Transformation and Innovations in POWER Architecture
Ganesan Narayanasamy
 
OpenPOWER Workshop at IIT Roorkee
OpenPOWER Workshop at IIT RoorkeeOpenPOWER Workshop at IIT Roorkee
OpenPOWER Workshop at IIT Roorkee
Ganesan Narayanasamy
 
Deep Learning Use Cases using OpenPOWER systems
Deep Learning Use Cases using OpenPOWER systemsDeep Learning Use Cases using OpenPOWER systems
Deep Learning Use Cases using OpenPOWER systems
Ganesan Narayanasamy
 
IBM BOA for POWER
IBM BOA for POWER IBM BOA for POWER
IBM BOA for POWER
Ganesan Narayanasamy
 
OpenPOWER System Marconi100
OpenPOWER System Marconi100OpenPOWER System Marconi100
OpenPOWER System Marconi100
Ganesan Narayanasamy
 
OpenPOWER Latest Updates
OpenPOWER Latest UpdatesOpenPOWER Latest Updates
OpenPOWER Latest Updates
Ganesan Narayanasamy
 
POWER10 innovations for HPC
POWER10 innovations for HPCPOWER10 innovations for HPC
POWER10 innovations for HPC
Ganesan Narayanasamy
 
Deeplearningusingcloudpakfordata
DeeplearningusingcloudpakfordataDeeplearningusingcloudpakfordata
Deeplearningusingcloudpakfordata
Ganesan Narayanasamy
 
OpenCAPI-based Image Analysis Pipeline for 18 GB/s kilohertz-framerate X-ray ...
OpenCAPI-based Image Analysis Pipeline for 18 GB/s kilohertz-framerate X-ray ...OpenCAPI-based Image Analysis Pipeline for 18 GB/s kilohertz-framerate X-ray ...
OpenCAPI-based Image Analysis Pipeline for 18 GB/s kilohertz-framerate X-ray ...
Ganesan Narayanasamy
 
AI in healthcare and Automobile Industry using OpenPOWER/IBM POWER9 systems
AI in healthcare and Automobile Industry using OpenPOWER/IBM POWER9 systemsAI in healthcare and Automobile Industry using OpenPOWER/IBM POWER9 systems
AI in healthcare and Automobile Industry using OpenPOWER/IBM POWER9 systems
Ganesan Narayanasamy
 
AI in healthcare - Use Cases
AI in healthcare - Use Cases AI in healthcare - Use Cases
AI in healthcare - Use Cases
Ganesan Narayanasamy
 
AI in Health Care using IBM Systems/OpenPOWER systems
AI in Health Care using IBM Systems/OpenPOWER systemsAI in Health Care using IBM Systems/OpenPOWER systems
AI in Health Care using IBM Systems/OpenPOWER systems
Ganesan Narayanasamy
 
AI in Healh Care using IBM POWER systems
AI in Healh Care using IBM POWER systems AI in Healh Care using IBM POWER systems
AI in Healh Care using IBM POWER systems
Ganesan Narayanasamy
 
Poster from NUS
Poster from NUSPoster from NUS
Poster from NUS
Ganesan Narayanasamy
 
SAP HANA on POWER9 systems
SAP HANA on POWER9 systemsSAP HANA on POWER9 systems
SAP HANA on POWER9 systems
Ganesan Narayanasamy
 
Graphical Structure Learning accelerated with POWER9
Graphical Structure Learning accelerated with POWER9Graphical Structure Learning accelerated with POWER9
Graphical Structure Learning accelerated with POWER9
Ganesan Narayanasamy
 
AI in the enterprise
AI in the enterprise AI in the enterprise
AI in the enterprise
Ganesan Narayanasamy
 

More from Ganesan Narayanasamy (20)

Chip Design Curriculum development Residency program
Chip Design Curriculum development Residency programChip Design Curriculum development Residency program
Chip Design Curriculum development Residency program
 
Basics of Digital Design and Verilog
Basics of Digital Design and VerilogBasics of Digital Design and Verilog
Basics of Digital Design and Verilog
 
180 nm Tape out experience using Open POWER ISA
180 nm Tape out experience using Open POWER ISA180 nm Tape out experience using Open POWER ISA
180 nm Tape out experience using Open POWER ISA
 
Workload Transformation and Innovations in POWER Architecture
Workload Transformation and Innovations in POWER Architecture Workload Transformation and Innovations in POWER Architecture
Workload Transformation and Innovations in POWER Architecture
 
OpenPOWER Workshop at IIT Roorkee
OpenPOWER Workshop at IIT RoorkeeOpenPOWER Workshop at IIT Roorkee
OpenPOWER Workshop at IIT Roorkee
 
Deep Learning Use Cases using OpenPOWER systems
Deep Learning Use Cases using OpenPOWER systemsDeep Learning Use Cases using OpenPOWER systems
Deep Learning Use Cases using OpenPOWER systems
 
IBM BOA for POWER
IBM BOA for POWER IBM BOA for POWER
IBM BOA for POWER
 
OpenPOWER System Marconi100
OpenPOWER System Marconi100OpenPOWER System Marconi100
OpenPOWER System Marconi100
 
OpenPOWER Latest Updates
OpenPOWER Latest UpdatesOpenPOWER Latest Updates
OpenPOWER Latest Updates
 
POWER10 innovations for HPC
POWER10 innovations for HPCPOWER10 innovations for HPC
POWER10 innovations for HPC
 
Deeplearningusingcloudpakfordata
DeeplearningusingcloudpakfordataDeeplearningusingcloudpakfordata
Deeplearningusingcloudpakfordata
 
OpenCAPI-based Image Analysis Pipeline for 18 GB/s kilohertz-framerate X-ray ...
OpenCAPI-based Image Analysis Pipeline for 18 GB/s kilohertz-framerate X-ray ...OpenCAPI-based Image Analysis Pipeline for 18 GB/s kilohertz-framerate X-ray ...
OpenCAPI-based Image Analysis Pipeline for 18 GB/s kilohertz-framerate X-ray ...
 
AI in healthcare and Automobile Industry using OpenPOWER/IBM POWER9 systems
AI in healthcare and Automobile Industry using OpenPOWER/IBM POWER9 systemsAI in healthcare and Automobile Industry using OpenPOWER/IBM POWER9 systems
AI in healthcare and Automobile Industry using OpenPOWER/IBM POWER9 systems
 
AI in healthcare - Use Cases
AI in healthcare - Use Cases AI in healthcare - Use Cases
AI in healthcare - Use Cases
 
AI in Health Care using IBM Systems/OpenPOWER systems
AI in Health Care using IBM Systems/OpenPOWER systemsAI in Health Care using IBM Systems/OpenPOWER systems
AI in Health Care using IBM Systems/OpenPOWER systems
 
AI in Healh Care using IBM POWER systems
AI in Healh Care using IBM POWER systems AI in Healh Care using IBM POWER systems
AI in Healh Care using IBM POWER systems
 
Poster from NUS
Poster from NUSPoster from NUS
Poster from NUS
 
SAP HANA on POWER9 systems
SAP HANA on POWER9 systemsSAP HANA on POWER9 systems
SAP HANA on POWER9 systems
 
Graphical Structure Learning accelerated with POWER9
Graphical Structure Learning accelerated with POWER9Graphical Structure Learning accelerated with POWER9
Graphical Structure Learning accelerated with POWER9
 
AI in the enterprise
AI in the enterprise AI in the enterprise
AI in the enterprise
 

Recently uploaded

Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Product School
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
Elena Simperl
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
Alison B. Lowndes
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
Frank van Harmelen
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 

Recently uploaded (20)

Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 

Mellnox Interconnect presentation in OpenPOWER Brazil workshop

  • 1. © 2019 Mellanox Technologies | Confidential 1 Paving the Road to Exascale March 2019 Interconnect Your Future Guilherme Fuhrken gfuhrken@Mellanox.com +55-11-99934-2297
  • 2. © 2019 Mellanox Technologies | Confidential 2 Cloud & Web 2.0 Big Data Enterprise Business Intelligence HPC Storage Security AI & Machine Learning Internet of Things Exponential Data Growth Everywhere Source: IDC HPC: High-Performance Compute AI: Artificial Intelligence
  • 3. © 2019 Mellanox Technologies | Confidential 3 Higher Data Speeds Faster Data Processing Better Data Security Exponential Data Growth Everywhere source: IDC
  • 4. © 2019 Mellanox Technologies | Confidential 4 Higher Data Speeds Faster Data Processing Better Data Security Adapters Switches Cables & Transceivers SmartNIC System on a Chip HPC and AI Needs the Most Intelligent Interconnect
  • 5. © 2019 Mellanox Technologies | Confidential 5 Mellanox Accelerates Leading HPC and AI Systems World’s Top 3 Supercomputers Summit CORAL System World’s Fastest HPC / AI System 9.2K InfiniBand Nodes Sierra CORAL System #2 USA Supercomputer 8.6K InfiniBand Nodes 1 2 Wuxi Supercomputing Center Fastest Supercomputer in China 41K InfiniBand Nodes 3
  • 6. © 2019 Mellanox Technologies | Confidential 6 Mellanox Accelerates Leading HPC and AI Systems (Examples) JUWELS Supercomputer 2.6K InfiniBand Nodes Fastest HPC / AI System in Japan 1.1K InfiniBand Nodes The world's Fastest Industry Supercomputer 1.6K InfiniBand Nodes 7 15 26 Fastest Supercomputer in Canada Dragonfly+ Topology 1.5K InfiniBand Nodes NASA Ames Research Center 20K InfiniBand Nodes World’s leading Industry Supercomputer 4.6K InfiniBand Nodes 34 5927
  • 7. © 2019 Mellanox Technologies | Confidential 7 The Need for Intelligent and Faster Interconnect CPU-Centric (Onload) Data-Centric (Offload) Must Wait for the Data Creates Performance Bottlenecks Faster Data Speeds and In-Network Computing Enable Higher Performance and Scale GPU CPU GPU CPU Onload Network In-Network Computing GPU CPU CPU GPU GPU CPU GPU CPU GPU CPU CPU GPU Analyze Data as it Moves! Higher Performance and Scale
  • 8. © 2019 Mellanox Technologies | Confidential 8 In-Network Computing Self-Healing Technology In-Network Computing Unbreakable Data Centers Delivers Highest Application Performance GPUDirect™ RDMA Critical for HPC and Machine Learning ApplicationsGPU Acceleration Technology 10X Performance Acceleration Critical for HPC and Machine Learning Applications 35X Faster Network Recovery 5000X 10X Performance Acceleration Performance Acceleration In-Network Computing Delivers Highest Performance Scalable Hierarchical Aggregation and Reduction Protocol
  • 9. © 2019 Mellanox Technologies | Confidential 9 30%-250% Higher Return on Investment Up to 50% Saving on Capital and Operation Expenses Highest Applications Performance, Scalability and Productivity InfiniBand Delivers Best Return on Investment 1.9X Better 2X Better 1.4X Better 2.5X Better 1.3X Better Molecular DynamicsGenomicsWeather Automotive Chemistry
  • 10. © 2019 Mellanox Technologies | Confidential 10 Mellanox Supercharges Leading AI Companies Higher ROI Lower CapEx & OpEx 60% 50% Unlocking the Power of Artificial Intelligence Cognitive Toolkit RDMA Supercharges Leading AI Frameworks
  • 11. © 2019 Mellanox Technologies | Confidential 11 Scalable Hierarchical Aggregation and Reduction Protocol (SHARP)
  • 12. © 2019 Mellanox Technologies | Confidential 12 Scalable Hierarchical Aggregation and Reduction Protocol (SHARP)  Reliable Scalable General Purpose Primitive  In-network Tree based aggregation mechanism  Large number of groups  Multiple simultaneous outstanding operations  Applicable to Multiple Use-cases  HPC Applications using MPI / SHMEM  Distributed Machine Learning applications  Scalable High Performance Collective Offload  Barrier, Reduce, All-Reduce, Broadcast and more  Sum, Min, Max, Min-loc, max-loc, OR, XOR, AND  Integer and Floating-Point, 16/32/64 bits SHArP Tree SHARP Tree Aggregation Node (Process running on HCA) SHARP Tree Endnode (Process running on HCA) SHARP Tree Root
  • 13. © 2019 Mellanox Technologies | Confidential 13 SHARP AllReduce Performance Advantages (128 Nodes) SHARP enables 75% Reduction in Latency Providing Scalable Flat LatencyScalable Hierarchical Aggregation and Reduction Protocol
  • 14. © 2019 Mellanox Technologies | Confidential 14 Oak Ridge National Laboratory – Coral Summit Supercomputer SHARP AllReduce Performance Advantages SHARP Enables Highest PerformanceScalable Hierarchical Aggregation and Reduction Protocol
  • 15. © 2019 Mellanox Technologies | Confidential 15 SHARP Performance – Application (OSU) Network-Based Computing Laboratory http://nowlab.cse.ohio-state.edu/ The MVAPICH2 Project http://mvapich.cse.ohio-state.edu/ Source: Prof. DK Panda, Ohio State University
  • 16. © 2019 Mellanox Technologies | Confidential 16 SHARP Performance Advantage for AI  SHARP provides 16% Performance Increase for deep learning, initial results  TensorFlow with Horovod running ResNet50 benchmark, HDR InfiniBand (ConnectX-6, Quantum) 16%
  • 17. © 2019 Mellanox Technologies | Confidential 17 GPUDirect
  • 18. © 2019 Mellanox Technologies | Confidential 18 10X Higher Performance with GPUDirect™ RDMA Accelerates HPC and Deep Learning performance Lowest communication latency for GPUs GPUDirect™ RDMA
  • 19. © 2019 Mellanox Technologies | Confidential 19 HDR InfiniBand
  • 20. © 2019 Mellanox Technologies | Confidential 20 Highest-Performance 200Gb/s InfiniBand Solutions Transceivers Active Optical and Copper Cables (10 / 25 / 40 / 50 / 56 / 100 / 200Gb/s) 40 HDR (200Gb/s) InfiniBand Ports 80 HDR100 InfiniBand Ports Throughput of 16Tb/s, <90ns Latency 200Gb/s Adapter, 0.6us latency 215 million messages per second (10 / 25 / 40 / 50 / 56 / 100 / 200Gb/s) MPI, SHMEM/PGAS, UPC For Commercial and Open Source Applications Leverages Hardware Accelerations System on Chip and SmartNIC Programmable adapter Smart Offloads
  • 21. © 2019 Mellanox Technologies | Confidential 21 Leading Connectivity ConnectX-6 HDR InfiniBand Adapter Leading Performance Leading Features  200Gb/s InfiniBand and Ethernet  HDR, HDR100, EDR (100Gb/s) and lower speeds  200GbE, 100GbE and lower speeds  Single and dual ports  50Gb/s PAM4 SerDes  200Gb/s throughput, 0.6usec latency, 215 million message per second  PCIe Gen3 / Gen4, 32 lanes  Integrated PCIe switch  Multi-Host - up to 8 hosts, supporting 4 dual-socket servers  In-network computing and memory for HPC collective offloads  Security – Block-level encryption to storage, key management, FIPS  Storage – NVMe Emulation, NVMe-oF target, Erasure coding, T10/DIF
  • 22. © 2019 Mellanox Technologies | Confidential 22 HDR InfiniBand Switch: QM8700, 1U Series  40 ports of HDR, 200G  80 ports of HDR100, 100G Superior performance 40 QSFP56 ports (50G PAM4 per lane)  90ns latency  390M packets per sec (64B)  16Tb/s aggregate bandwidth Superior resiliency  22’’ depth  6 fans (5+1), hot swappable  2 power supplies (1+1), hot swappable
  • 23. © 2019 Mellanox Technologies | Confidential 23 HDR InfiniBand Switch: CS8500, Modular Series  800 ports of HDR, 200G  1600 ports of HDR100, 100G Superior performance 800 QSFP56 ports  300ns latency  320Tb/s aggregate bandwidth  LCD Tablet IO panel Water-cooled solution  Liquid – Liquid 4U CDU  Liquid – Air 42U (350mm wide) stand alone HEX  0C – 35C (air) or 40C (water) operating air range
  • 24. © 2019 Mellanox Technologies | Confidential 24 BlueField SoC Advantages and Platforms
  • 25. © 2019 Mellanox Technologies | Confidential 25 BlueField Block Diagram  Tile Architecture - 16 ARM® A72 CPUs subsystem  SkyMesh™ fully coherent low-latency interconnect  8MB L2 Cache, 8 Tiles  48KB I-Cache, 32KB D-Cache per core  12MB L3 Last Level Cache  ARM Frequency: 0.8GHz - 1.3GHz  Dual Port 100g IO Controller, based on ConnectX-5  Dual 100Gb/s Ethernet/InfiniBand, compatible with ConnectX-5  NVMe-oF hardware accelerator  High-end Networking Offloads: RDMA, Erasure Coding, T10-DIF  Fully Integrated PCIe switch  32 Bifurcated PCI Gen3/4 lanes (up to 200Gb/s)  Root Complex or Endpoint modes  2x16, 4x8, 8x4 or 16x2 configurations  Hardware Accelerators, Crypto Engines  Bulk crypto by A72 Neon ISA (AES, SHA)  Public Key acceleration, True RNG  Memory Controllers  2x Channels DDR4 Memory Controllers w/ ECC  NVDIMM-N Support Dual VPI Ports Ethernet/InfiniBand: 1, 10, 25,40,50,100G 32-lanes PCIe Gen3/4
  • 26. © 2019 Mellanox Technologies | Confidential 26 BlueField for Smart Solutions  SoC: Compute, networking and PCIe connectivity  Dual port VPI EDR/100GbE  16 Arm cores  32 lanes of PCIe switch gen3/4 Storage Solutions BlueField SoC (System on Chip)  NVMe-based storage platforms  RDMA, NVMe over Fabrics, RAID, Signature offload  Partner’s solutions based on BlueField storage controller Smart Adapters  In-network computing and collective offloads  Co-processor running proprietary smart algorithms  Security and privacy algorithms
  • 27. © 2019 Mellanox Technologies | Confidential 27 Mellanox Ethernet Switch Systems
  • 28. © 2019 Mellanox Technologies | Confidential 28 Open Ethernet – The Freedom to Optimize Open APIs Automation End-to-End Interconnect Network-OS Choice SONiC SDK Customer-OS
  • 29. © 2019 Mellanox Technologies | Confidential 29 The only predictable 25/50/100Gb/s Ethernet switch Spectrum: The Ultimate 25/100GbE Switch Full wire speed, non-blocking switch ZPL: ZeroPacketLoss for all packets sizes  Doesn’t drop packets per RFC2544
  • 30. © 2019 Mellanox Technologies | Confidential 30 SN2100 – 16x100GbE ports (up to 32 x50GbE , 64x25/10GbE) Ideal storage / Database 25/100GbE Switch Open Ethernet SN2000 Series 300nsSN2700 – 169W SN2410 – 165W SN2100 – 94W  Predictable Performance  Fair Traffic Distribution for Cloud  Best-in-Class Throughput, Latency, Power Consumption  Zero Packet Loss SN2700 – 32x100GbE (up to 64 x 50/25/10GbE) The Ideal 100GbE ToR / Aggregation SN2410 – 8x100GbE + 48x25GbE 25GbE  100GbE ToR Energy efficiency SN2010 – 4x100GbE + 18x10/25GbE Ideal Hyperconverged Switch 10/25GbE  100GbE half width ToR
  • 31. © 2019 Mellanox Technologies | Confidential 31 Ideal Spine/Super-spine High Density Leaf/200GbE Spine/Super-spine Ideal Leaf SN3800 – 64x100GbE (QSFP-28) 2U Open Ethernet SN3000 Series Compact ½ U Switch Introducing speeds from 1GbE through 400GbE SN3700 – 32x200GbE (QSFP-56) 1U SN3510 – 48x25/50GbE (SFP-56) + 6x400GbE (QSFP-DD) 1U SN3200 – 16x400GbE (QSFP-DD) ½U
  • 32. © 2019 Mellanox Technologies | Confidential 32 End-to-End Solutions for All Platforms Highest Performance and Scalability for Intel, AMD, IBM Power, NVIDIA, Arm and FPGA-based Compute and Storage Platforms at 10, 20, 25, 40, 50, 100, 200 and 400Gb/s Speeds Unleashing the Power of All Compute Architectures X86 Open POWER GPU ARM FPGA
  • 33. © 2019 Mellanox Technologies | Confidential 33 Thank You Guilherme Fuhrken gfuhrken@Mellanox.com +55-11-99934-2297

Editor's Notes

  1. According to a study published by IDC in 2017, there would be a staggering 163Zb of data generated globally by 2025 (Zettabyte has 21 zeros)
  2. Now it might be easier to understand the value of Mellanox. We have been leading the HPC industry through many generations of technology, introducing capabilities into the network that are specifically designed to achieve maximum application performance and scalability. Here are just four of the latest acceleration enhancements we have added to the already robust, industry standards-based InfiniBand specification. SHARP is one of the most significant improvements introduced – for HPC and Machine Learning. SHARP transforms the network into a compute resource. While other networks move data from one compute element to another, we can actually process and compute data as it traverses the network. Effectively, we turn network into a powerful co-processor to deliver 10X application performance. -- Our network adapters include additional enhancements to HPC infrastructures by providing hardware-based Tag Matching offloads, out-of-order Write and Read operations, as well as additional support for atomic operations. -- SHIELD, is an innovative interconnect technology that improves data center resiliency by speeding up fault recovery by 5000 times, enabling interconnect autonomous self-healing capabilities. SHIELD provides the ability for interconnect components to exchange real-time information and to make immediate smart decisions that overcome issues and optimize data flows. -- And of course, GPUDirect RDMA – supported by nearly all HPC workloads that use GPUs, including nearly every machine learning framework and acceleration library. It provides a significant decrease in GPU-to-GPU communication latency and completely offloads the CPU, removing it from all GPU-to-GPU communications across the network. All of these technologies are essential for high-performance computing, artificial intelligence, and any type of big data workload.