SlideShare a Scribd company logo
1 of 23
Download to read offline
What is 3D Torus

The switchless interconnection topology
A demanding future for HPC
• Supercomputers are asked to process demanding computational
  loads (process and data)
•
• Processor power is paramount but a key aspect of parallel computers
  is the communication network that interconnects the computing nodes

• Together with speed, HPC systems are increasingly asked to be more
  available

• One additional challenge with large systems is scalability, so the ability
  to add nodes to a cluster without affecting performance and reliability
  or affecting them as little as possible

• It is also paramount for future machines to consume less energy
Conceptual difference

 Switched Infiniband network   Switchless Torus network
3D Torus topology
• Connecting nodes using a 3D
  Torus configuration means than
  each node in a cluster is
  connected to the adjacent ones via
  short cabling
• The signal is routed directly from
  one node to the other with no need
  of switches. 3D means that the
  communication takes places in 6
  different “directions”: X+, X-, Y+,
  Y-, Z+, Z-
• In practical terms, each node can
  be connected to 6 other nodes: in
  this way, the graph of the
  connections resembles a tri-
  dimensional matrix
3D Torus topology
• Such configuration allows the addition of nodes to a system without
  degrading performance.
• Each new node is joint as an addition of a grid, linked to it with no
  extensive cabling or switching
• Scaling linearly, with little or no performance loss, is strictly true for
  those problems that heavily rely on next neighbor communication
• the addition of a node in a large system happens with much less
  working and potential troubles
• Being the connections between nodes short and direct, the latency of
  the links is very low
3D Torus advantages
•   High speed and low latency
•   Linear scalability. Switchless
    configuration that avoids bottlenecks
    and allows hardware cost reduction
•   Improvement of MTBF
•   Regular and hidden wiring, leading
    to less cabling
•   Lower energy usage for
    communication
•   Good match between physical
    communication channels and local
    pattern algorithms
•   Less energy consumed
3D Torus applications




Place, Date
3D Torus applications
•   The maximization of the performance of
    the 3D Torus takes place with a subset of
    problems which is specific but rather
    large.

•   These are local pattern problems, which
    typically deal with modeling systems
    whose functioning/reaction depends on
    adjacent systems.
Example: Lattice QCD
• Computer simulations of Lattice QCD
  (the theory of strong interactions e.g.
  inside protons) is one of the great
  challenges for massively parallel
  supercomputers and requires a
  communication network with high
  bandwidth and low latency.

    •   The equations governing Lattice QDC
        describe local interactions (each degree
        of freedom interacts with its nearest
        neighbors) and this results in a well
        balanced computational task in which
        each degree of freedom (the value of a
        field on a space-time point) obeys the
        same equations, which are coupled to a
        small number of other degrees of
        freedom residing nearby.
Example: Fluid Dynamics
• Fluid Dynamics in turbulent regime
  shares the same opportunity of being
  “easily” put on a supercomputer in
  the formulation defined by what are
  known as Lattice Boltzmann Methods
  (LB).

• This is a scientific field which is both
  intriguing from the point of view of
  fundamental science and relevant to
  many technological applications.
Additional applications
• Many Monte Carlo simulations and
  embarrassingly parallel problems can
  exploit the full performance advantage
  of the 3D Torus architecture

• Problems that require all to all
  dialogue between nodes may exploit
  less the full performance of the 3D
  torus interconnection

• However, independently from the type
  of application and problem, the 3D
  torus still bears the massive
  advantage of scalability and
  serviceability
Eurotech Aurora 3D Torus
Aurora Torus peculiarities
•   Unified network architecture:
     – the 3D Torus coexists with an Infiniband
        network.
     – Both local and global MPI calls can be
        processed efficiently
     – Dedicated synchronization network
     – Gigabit Ethernet
•   FPGA driven Torus. Based the result of
    the work of Aurora Science researchers
    who acquired experienced with Janus
    and QPACE
•   Full duplex communication links
     – Allowing sub-tori to create subdomain
•   The length of cables kept very short due
    to smart backplane design
Aurora 3D Torus Network

• Aurora Science implementation
   – Based on FTNW (Pisanti, Schifano, Simma)
       • http://sourceforge.net/projects/ftnw
   – GPL licensed
   – Optimized for nearest-neighbor communication
   – Proven technology in LQDC communities

• Extoll implementation on Aurora
   – Licensed
   – Optimized for all-to-all communication for wide range of
     applications
   – Future interconnect paving the way to exascale computing
Aurora FPGA: 3D Torus network processor




    PCIe 2.0 x8
Aurora S– 3D Torus network

             CPU                CPU

       PCIe2 x8                        PCIe2 x8
       40 Gbps                         40 Gbps




                    FPGA

             phy phy phy phy phy phy

             4x     4x     4x   4x     4x       4x



       X+     X-         Y+      Y-      Z+          Z-
       10     10          10     10       10          10
      Gbps   Gbps        Gbps   Gbps     Gbps        Gbps
Aurora systems
AURORA, a high density – highly efficient family
of supercomputers


     One Aurora Rack

       256 nodes,
       512 CPUs,
       3072 cores,                           48U


  100 TFLOPS @ 100kW


  Entirely liquid cooled
Aurora identity card

 High computational power

  Liquid cooling

 Energy efficiency

 Reliability and availability

 Scalability

 Unified network architecture

 Compatibility
Aurora identity card

 High computational power

  Liquid cooling

 Energy efficiency

 Reliability and availability

 Scalability

 Unified network architecture

 Compatibility
Unified Network Architecture

                        Ultra low latency       Regular, massive, local
                        High bandwidth          patterns
           3D Torus     Nearest neighbor
                        Unlimited scalability




               select   Very low latency        Irregular, long distance
                        High bandwidth          patterns (Molecular
           Infiniband   Switched network        Dynamics)
                        Multiple services       Storage (SAN)
                                                Monitoring (IPMI)



                        Very fast channel       Net processor synch
                        Global commands         Thread synch
            Synch       Subdomain manag.        Global clock
                        Low/high level synch    System coordination
                                                Debugging
Eurotech HPC Principles
High performance. We want our customer run their simulations and
applications as quick as the world latest technologies allow.

Energy efficiency and Green. We built products to allow our customer to save
on energy bills and leverage sustainability.


Scalability. We want our solutions to scale linearly and our customer to grow
according to their needs and budget availability

Availability. Intelligent design, quality, support readiness and preventive
maintenance to increase the availability of our HPC systems during their
lifetime.

Cost effectiveness. We concentrate a lot of our efforts to deliver advanced
technology at competitive prices and to allow our customers reducing the total
cost of ownership.

Versatility and compatibility. We designed our products to tackle different
problems in the most effective way possible
www.eurotech.com/aurora

More Related Content

What's hot

Client server s/w Engineering
Client server s/w EngineeringClient server s/w Engineering
Client server s/w EngineeringRajan Shah
 
Quality of service(qos) by M.BILAL.SATTI
Quality of service(qos) by M.BILAL.SATTIQuality of service(qos) by M.BILAL.SATTI
Quality of service(qos) by M.BILAL.SATTIMuhammad Bilal Satti
 
819 Static Channel Allocation
819 Static Channel Allocation819 Static Channel Allocation
819 Static Channel Allocationtechbed
 
Cloud computing notes unit II
Cloud computing notes unit II Cloud computing notes unit II
Cloud computing notes unit II NANDINI SHARMA
 
ARM CoAP Tutorial
ARM CoAP TutorialARM CoAP Tutorial
ARM CoAP Tutorialzdshelby
 
TCP- Transmission Control Protocol
TCP-  Transmission Control Protocol TCP-  Transmission Control Protocol
TCP- Transmission Control Protocol Akhil .B
 
distribution layer
distribution layerdistribution layer
distribution layererick chuwa
 
Introduction to Internet Governance and Cyber-security
Introduction to Internet Governance and Cyber-securityIntroduction to Internet Governance and Cyber-security
Introduction to Internet Governance and Cyber-securityGlenn McKnight
 
Leaky Bucket & Tocken Bucket - Traffic shaping
Leaky Bucket & Tocken Bucket - Traffic shapingLeaky Bucket & Tocken Bucket - Traffic shaping
Leaky Bucket & Tocken Bucket - Traffic shapingVimal Dewangan
 
An overview of grid monitoring
An overview of grid monitoringAn overview of grid monitoring
An overview of grid monitoringManoj Prabhakar
 
Point-to-Point Protocol(PPP) CCN ppt
Point-to-Point Protocol(PPP) CCN pptPoint-to-Point Protocol(PPP) CCN ppt
Point-to-Point Protocol(PPP) CCN pptNiaz Shaikh
 
IoT Meets the Cloud: The Origins of Edge Computing
IoT Meets the Cloud:  The Origins of Edge ComputingIoT Meets the Cloud:  The Origins of Edge Computing
IoT Meets the Cloud: The Origins of Edge ComputingMaria Gorlatova
 
Unit i introduction to grid computing
Unit i   introduction to grid computingUnit i   introduction to grid computing
Unit i introduction to grid computingsudha kar
 
Data communication and networks by B. Forouzan
Data communication and networks by B. ForouzanData communication and networks by B. Forouzan
Data communication and networks by B. ForouzanPreethi T G
 

What's hot (20)

Client server s/w Engineering
Client server s/w EngineeringClient server s/w Engineering
Client server s/w Engineering
 
Quality of service(qos) by M.BILAL.SATTI
Quality of service(qos) by M.BILAL.SATTIQuality of service(qos) by M.BILAL.SATTI
Quality of service(qos) by M.BILAL.SATTI
 
819 Static Channel Allocation
819 Static Channel Allocation819 Static Channel Allocation
819 Static Channel Allocation
 
Module 3-cloud computing
Module 3-cloud computingModule 3-cloud computing
Module 3-cloud computing
 
SNMP
SNMPSNMP
SNMP
 
Ethernet
EthernetEthernet
Ethernet
 
Congestion control
Congestion controlCongestion control
Congestion control
 
Network layer tanenbaum
Network layer tanenbaumNetwork layer tanenbaum
Network layer tanenbaum
 
Cloud computing notes unit II
Cloud computing notes unit II Cloud computing notes unit II
Cloud computing notes unit II
 
ARM CoAP Tutorial
ARM CoAP TutorialARM CoAP Tutorial
ARM CoAP Tutorial
 
TCP- Transmission Control Protocol
TCP-  Transmission Control Protocol TCP-  Transmission Control Protocol
TCP- Transmission Control Protocol
 
Medium access control unit 3-33
Medium access control  unit 3-33Medium access control  unit 3-33
Medium access control unit 3-33
 
distribution layer
distribution layerdistribution layer
distribution layer
 
Introduction to Internet Governance and Cyber-security
Introduction to Internet Governance and Cyber-securityIntroduction to Internet Governance and Cyber-security
Introduction to Internet Governance and Cyber-security
 
Leaky Bucket & Tocken Bucket - Traffic shaping
Leaky Bucket & Tocken Bucket - Traffic shapingLeaky Bucket & Tocken Bucket - Traffic shaping
Leaky Bucket & Tocken Bucket - Traffic shaping
 
An overview of grid monitoring
An overview of grid monitoringAn overview of grid monitoring
An overview of grid monitoring
 
Point-to-Point Protocol(PPP) CCN ppt
Point-to-Point Protocol(PPP) CCN pptPoint-to-Point Protocol(PPP) CCN ppt
Point-to-Point Protocol(PPP) CCN ppt
 
IoT Meets the Cloud: The Origins of Edge Computing
IoT Meets the Cloud:  The Origins of Edge ComputingIoT Meets the Cloud:  The Origins of Edge Computing
IoT Meets the Cloud: The Origins of Edge Computing
 
Unit i introduction to grid computing
Unit i   introduction to grid computingUnit i   introduction to grid computing
Unit i introduction to grid computing
 
Data communication and networks by B. Forouzan
Data communication and networks by B. ForouzanData communication and networks by B. Forouzan
Data communication and networks by B. Forouzan
 

Viewers also liked

Hardware accelerated switching with Linux @ SWLUG Talks May 2014
Hardware accelerated switching with Linux @ SWLUG Talks May 2014Hardware accelerated switching with Linux @ SWLUG Talks May 2014
Hardware accelerated switching with Linux @ SWLUG Talks May 2014Nat Morris
 
Demystifying Networking Webinar Series- Routing on the Host
Demystifying Networking Webinar Series- Routing on the HostDemystifying Networking Webinar Series- Routing on the Host
Demystifying Networking Webinar Series- Routing on the HostCumulus Networks
 
Topology presentation
Topology  presentationTopology  presentation
Topology presentationJobaida Nahar
 
Simplifying OpenStack Networks with Routing on the Host: Gerard Chami + Scott...
Simplifying OpenStack Networks with Routing on the Host: Gerard Chami + Scott...Simplifying OpenStack Networks with Routing on the Host: Gerard Chami + Scott...
Simplifying OpenStack Networks with Routing on the Host: Gerard Chami + Scott...OpenStack
 
Building Scalable Data Center Networks
Building Scalable Data Center NetworksBuilding Scalable Data Center Networks
Building Scalable Data Center NetworksCumulus Networks
 
Morphology of Modern Data Center Networks - YaC 2013
Morphology of Modern Data Center Networks - YaC 2013Morphology of Modern Data Center Networks - YaC 2013
Morphology of Modern Data Center Networks - YaC 2013Cumulus Networks
 
Microsoft Project Olympus AI Accelerator Chassis (HGX-1)
Microsoft Project Olympus AI Accelerator Chassis (HGX-1)Microsoft Project Olympus AI Accelerator Chassis (HGX-1)
Microsoft Project Olympus AI Accelerator Chassis (HGX-1)inside-BigData.com
 
Erikson's Psychosocial Stages of Developmetn
Erikson's Psychosocial Stages of DevelopmetnErikson's Psychosocial Stages of Developmetn
Erikson's Psychosocial Stages of Developmetnsanko1sm
 

Viewers also liked (8)

Hardware accelerated switching with Linux @ SWLUG Talks May 2014
Hardware accelerated switching with Linux @ SWLUG Talks May 2014Hardware accelerated switching with Linux @ SWLUG Talks May 2014
Hardware accelerated switching with Linux @ SWLUG Talks May 2014
 
Demystifying Networking Webinar Series- Routing on the Host
Demystifying Networking Webinar Series- Routing on the HostDemystifying Networking Webinar Series- Routing on the Host
Demystifying Networking Webinar Series- Routing on the Host
 
Topology presentation
Topology  presentationTopology  presentation
Topology presentation
 
Simplifying OpenStack Networks with Routing on the Host: Gerard Chami + Scott...
Simplifying OpenStack Networks with Routing on the Host: Gerard Chami + Scott...Simplifying OpenStack Networks with Routing on the Host: Gerard Chami + Scott...
Simplifying OpenStack Networks with Routing on the Host: Gerard Chami + Scott...
 
Building Scalable Data Center Networks
Building Scalable Data Center NetworksBuilding Scalable Data Center Networks
Building Scalable Data Center Networks
 
Morphology of Modern Data Center Networks - YaC 2013
Morphology of Modern Data Center Networks - YaC 2013Morphology of Modern Data Center Networks - YaC 2013
Morphology of Modern Data Center Networks - YaC 2013
 
Microsoft Project Olympus AI Accelerator Chassis (HGX-1)
Microsoft Project Olympus AI Accelerator Chassis (HGX-1)Microsoft Project Olympus AI Accelerator Chassis (HGX-1)
Microsoft Project Olympus AI Accelerator Chassis (HGX-1)
 
Erikson's Psychosocial Stages of Developmetn
Erikson's Psychosocial Stages of DevelopmetnErikson's Psychosocial Stages of Developmetn
Erikson's Psychosocial Stages of Developmetn
 

Similar to What is 3d torus

Dcn invited ecoc2018_short
Dcn invited ecoc2018_shortDcn invited ecoc2018_short
Dcn invited ecoc2018_shortShuangyi Yan
 
PLNOG 13: Alexis Dacquay: Handling high-bandwidth-consumption applications in...
PLNOG 13: Alexis Dacquay: Handling high-bandwidth-consumption applications in...PLNOG 13: Alexis Dacquay: Handling high-bandwidth-consumption applications in...
PLNOG 13: Alexis Dacquay: Handling high-bandwidth-consumption applications in...PROIDEA
 
Data Networks: Next-Generation Optical Access toward 10 Gb/s Everywhere
Data Networks: Next-Generation Optical Access toward 10 Gb/s EverywhereData Networks: Next-Generation Optical Access toward 10 Gb/s Everywhere
Data Networks: Next-Generation Optical Access toward 10 Gb/s EverywhereXi'an Jiaotong-Liverpool University
 
Hardware virtualized flexible network for wireless data center optical interc...
Hardware virtualized flexible network for wireless data center optical interc...Hardware virtualized flexible network for wireless data center optical interc...
Hardware virtualized flexible network for wireless data center optical interc...ieeepondy
 
Maxwell siuc hpc_description_tutorial
Maxwell siuc hpc_description_tutorialMaxwell siuc hpc_description_tutorial
Maxwell siuc hpc_description_tutorialmadhuinturi
 
Building the foundations of Ultra-RELIABLE and Low-LATENCY Wireless Communica...
Building the foundations of Ultra-RELIABLE and Low-LATENCY Wireless Communica...Building the foundations of Ultra-RELIABLE and Low-LATENCY Wireless Communica...
Building the foundations of Ultra-RELIABLE and Low-LATENCY Wireless Communica...3G4G
 
Interconnect Your Future With Mellanox
Interconnect Your Future With MellanoxInterconnect Your Future With Mellanox
Interconnect Your Future With MellanoxMellanox Technologies
 
A Scalable, Commodity Data Center Network Architecture
A Scalable, Commodity Data Center Network ArchitectureA Scalable, Commodity Data Center Network Architecture
A Scalable, Commodity Data Center Network ArchitectureHiroshi Ono
 
Trends and challenges in IP based SOC design
Trends and challenges in IP based SOC designTrends and challenges in IP based SOC design
Trends and challenges in IP based SOC designAishwaryaRavishankar8
 
Cloud interconnection networks basic .pptx
Cloud interconnection networks basic .pptxCloud interconnection networks basic .pptx
Cloud interconnection networks basic .pptxRahulBhole12
 
40 Powers of 10 - Simulating the Universe with the DiRAC HPC Facility
40 Powers of 10 - Simulating the Universe with the DiRAC HPC Facility40 Powers of 10 - Simulating the Universe with the DiRAC HPC Facility
40 Powers of 10 - Simulating the Universe with the DiRAC HPC Facilityinside-BigData.com
 
Hyper Transport Technology
Hyper Transport TechnologyHyper Transport Technology
Hyper Transport Technologynayakslideshare
 
Navigating dc architectures tech&sales
Navigating dc architectures tech&salesNavigating dc architectures tech&sales
Navigating dc architectures tech&salesEric Zhaohui Ji
 
Network on Chip Architecture and Routing Techniques: A survey
Network on Chip Architecture and Routing Techniques: A surveyNetwork on Chip Architecture and Routing Techniques: A survey
Network on Chip Architecture and Routing Techniques: A surveyIJRES Journal
 
Lecture 1_Introduction to Networking_1.ppt
Lecture 1_Introduction to Networking_1.pptLecture 1_Introduction to Networking_1.ppt
Lecture 1_Introduction to Networking_1.pptflyinimohamed
 
1-introduction-to-computer-networking-converted 2.pptx
1-introduction-to-computer-networking-converted 2.pptx1-introduction-to-computer-networking-converted 2.pptx
1-introduction-to-computer-networking-converted 2.pptxYashwant Srikrishnan
 
Evaluating UCIe based multi-die SoC to meet timing and power
Evaluating UCIe based multi-die SoC to meet timing and power Evaluating UCIe based multi-die SoC to meet timing and power
Evaluating UCIe based multi-die SoC to meet timing and power Deepak Shankar
 
Hyper Transport Technology
Hyper Transport TechnologyHyper Transport Technology
Hyper Transport TechnologyRohan Khude
 

Similar to What is 3d torus (20)

Dcn invited ecoc2018_short
Dcn invited ecoc2018_shortDcn invited ecoc2018_short
Dcn invited ecoc2018_short
 
PLNOG 13: Alexis Dacquay: Handling high-bandwidth-consumption applications in...
PLNOG 13: Alexis Dacquay: Handling high-bandwidth-consumption applications in...PLNOG 13: Alexis Dacquay: Handling high-bandwidth-consumption applications in...
PLNOG 13: Alexis Dacquay: Handling high-bandwidth-consumption applications in...
 
Data Networks: Next-Generation Optical Access toward 10 Gb/s Everywhere
Data Networks: Next-Generation Optical Access toward 10 Gb/s EverywhereData Networks: Next-Generation Optical Access toward 10 Gb/s Everywhere
Data Networks: Next-Generation Optical Access toward 10 Gb/s Everywhere
 
Hardware virtualized flexible network for wireless data center optical interc...
Hardware virtualized flexible network for wireless data center optical interc...Hardware virtualized flexible network for wireless data center optical interc...
Hardware virtualized flexible network for wireless data center optical interc...
 
Maxwell siuc hpc_description_tutorial
Maxwell siuc hpc_description_tutorialMaxwell siuc hpc_description_tutorial
Maxwell siuc hpc_description_tutorial
 
Building the foundations of Ultra-RELIABLE and Low-LATENCY Wireless Communica...
Building the foundations of Ultra-RELIABLE and Low-LATENCY Wireless Communica...Building the foundations of Ultra-RELIABLE and Low-LATENCY Wireless Communica...
Building the foundations of Ultra-RELIABLE and Low-LATENCY Wireless Communica...
 
Interconnect Your Future With Mellanox
Interconnect Your Future With MellanoxInterconnect Your Future With Mellanox
Interconnect Your Future With Mellanox
 
A Scalable, Commodity Data Center Network Architecture
A Scalable, Commodity Data Center Network ArchitectureA Scalable, Commodity Data Center Network Architecture
A Scalable, Commodity Data Center Network Architecture
 
Trends and challenges in IP based SOC design
Trends and challenges in IP based SOC designTrends and challenges in IP based SOC design
Trends and challenges in IP based SOC design
 
Cloud interconnection networks basic .pptx
Cloud interconnection networks basic .pptxCloud interconnection networks basic .pptx
Cloud interconnection networks basic .pptx
 
40 Powers of 10 - Simulating the Universe with the DiRAC HPC Facility
40 Powers of 10 - Simulating the Universe with the DiRAC HPC Facility40 Powers of 10 - Simulating the Universe with the DiRAC HPC Facility
40 Powers of 10 - Simulating the Universe with the DiRAC HPC Facility
 
UDT.pptx
UDT.pptxUDT.pptx
UDT.pptx
 
Hyper Transport Technology
Hyper Transport TechnologyHyper Transport Technology
Hyper Transport Technology
 
Navigating dc architectures tech&sales
Navigating dc architectures tech&salesNavigating dc architectures tech&sales
Navigating dc architectures tech&sales
 
Network on Chip Architecture and Routing Techniques: A survey
Network on Chip Architecture and Routing Techniques: A surveyNetwork on Chip Architecture and Routing Techniques: A survey
Network on Chip Architecture and Routing Techniques: A survey
 
Lecture 1_Introduction to Networking_1.ppt
Lecture 1_Introduction to Networking_1.pptLecture 1_Introduction to Networking_1.ppt
Lecture 1_Introduction to Networking_1.ppt
 
1-introduction-to-computer-networking-converted 2.pptx
1-introduction-to-computer-networking-converted 2.pptx1-introduction-to-computer-networking-converted 2.pptx
1-introduction-to-computer-networking-converted 2.pptx
 
Evaluating UCIe based multi-die SoC to meet timing and power
Evaluating UCIe based multi-die SoC to meet timing and power Evaluating UCIe based multi-die SoC to meet timing and power
Evaluating UCIe based multi-die SoC to meet timing and power
 
Hyper Transport Technology
Hyper Transport TechnologyHyper Transport Technology
Hyper Transport Technology
 
Новые коммутаторы QFX10000. Технология JunOS Fusion
Новые коммутаторы QFX10000. Технология JunOS FusionНовые коммутаторы QFX10000. Технология JunOS Fusion
Новые коммутаторы QFX10000. Технология JunOS Fusion
 

More from Eurotech Aurora

Aurora Departmental HPC Systems
Aurora Departmental HPC SystemsAurora Departmental HPC Systems
Aurora Departmental HPC SystemsEurotech Aurora
 
Fpga computing 14 03 2013
Fpga computing 14 03 2013Fpga computing 14 03 2013
Fpga computing 14 03 2013Eurotech Aurora
 
Aurora hpc energy efficiency
Aurora hpc energy efficiencyAurora hpc energy efficiency
Aurora hpc energy efficiencyEurotech Aurora
 
Liquid cooling hot water cooling
Liquid cooling   hot water coolingLiquid cooling   hot water cooling
Liquid cooling hot water coolingEurotech Aurora
 
Eurotech aurora (eurora) - most efficient hpc
Eurotech   aurora (eurora) - most efficient hpcEurotech   aurora (eurora) - most efficient hpc
Eurotech aurora (eurora) - most efficient hpcEurotech Aurora
 
Green it economics_aurora_tco_paper
Green it economics_aurora_tco_paperGreen it economics_aurora_tco_paper
Green it economics_aurora_tco_paperEurotech Aurora
 
Aurora hpc solutions value
Aurora hpc solutions valueAurora hpc solutions value
Aurora hpc solutions valueEurotech Aurora
 

More from Eurotech Aurora (7)

Aurora Departmental HPC Systems
Aurora Departmental HPC SystemsAurora Departmental HPC Systems
Aurora Departmental HPC Systems
 
Fpga computing 14 03 2013
Fpga computing 14 03 2013Fpga computing 14 03 2013
Fpga computing 14 03 2013
 
Aurora hpc energy efficiency
Aurora hpc energy efficiencyAurora hpc energy efficiency
Aurora hpc energy efficiency
 
Liquid cooling hot water cooling
Liquid cooling   hot water coolingLiquid cooling   hot water cooling
Liquid cooling hot water cooling
 
Eurotech aurora (eurora) - most efficient hpc
Eurotech   aurora (eurora) - most efficient hpcEurotech   aurora (eurora) - most efficient hpc
Eurotech aurora (eurora) - most efficient hpc
 
Green it economics_aurora_tco_paper
Green it economics_aurora_tco_paperGreen it economics_aurora_tco_paper
Green it economics_aurora_tco_paper
 
Aurora hpc solutions value
Aurora hpc solutions valueAurora hpc solutions value
Aurora hpc solutions value
 

Recently uploaded

Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 

Recently uploaded (20)

Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 

What is 3d torus

  • 1. What is 3D Torus The switchless interconnection topology
  • 2. A demanding future for HPC • Supercomputers are asked to process demanding computational loads (process and data) • • Processor power is paramount but a key aspect of parallel computers is the communication network that interconnects the computing nodes • Together with speed, HPC systems are increasingly asked to be more available • One additional challenge with large systems is scalability, so the ability to add nodes to a cluster without affecting performance and reliability or affecting them as little as possible • It is also paramount for future machines to consume less energy
  • 3. Conceptual difference Switched Infiniband network Switchless Torus network
  • 4. 3D Torus topology • Connecting nodes using a 3D Torus configuration means than each node in a cluster is connected to the adjacent ones via short cabling • The signal is routed directly from one node to the other with no need of switches. 3D means that the communication takes places in 6 different “directions”: X+, X-, Y+, Y-, Z+, Z- • In practical terms, each node can be connected to 6 other nodes: in this way, the graph of the connections resembles a tri- dimensional matrix
  • 5. 3D Torus topology • Such configuration allows the addition of nodes to a system without degrading performance. • Each new node is joint as an addition of a grid, linked to it with no extensive cabling or switching • Scaling linearly, with little or no performance loss, is strictly true for those problems that heavily rely on next neighbor communication • the addition of a node in a large system happens with much less working and potential troubles • Being the connections between nodes short and direct, the latency of the links is very low
  • 6. 3D Torus advantages • High speed and low latency • Linear scalability. Switchless configuration that avoids bottlenecks and allows hardware cost reduction • Improvement of MTBF • Regular and hidden wiring, leading to less cabling • Lower energy usage for communication • Good match between physical communication channels and local pattern algorithms • Less energy consumed
  • 8. 3D Torus applications • The maximization of the performance of the 3D Torus takes place with a subset of problems which is specific but rather large. • These are local pattern problems, which typically deal with modeling systems whose functioning/reaction depends on adjacent systems.
  • 9. Example: Lattice QCD • Computer simulations of Lattice QCD (the theory of strong interactions e.g. inside protons) is one of the great challenges for massively parallel supercomputers and requires a communication network with high bandwidth and low latency. • The equations governing Lattice QDC describe local interactions (each degree of freedom interacts with its nearest neighbors) and this results in a well balanced computational task in which each degree of freedom (the value of a field on a space-time point) obeys the same equations, which are coupled to a small number of other degrees of freedom residing nearby.
  • 10. Example: Fluid Dynamics • Fluid Dynamics in turbulent regime shares the same opportunity of being “easily” put on a supercomputer in the formulation defined by what are known as Lattice Boltzmann Methods (LB). • This is a scientific field which is both intriguing from the point of view of fundamental science and relevant to many technological applications.
  • 11. Additional applications • Many Monte Carlo simulations and embarrassingly parallel problems can exploit the full performance advantage of the 3D Torus architecture • Problems that require all to all dialogue between nodes may exploit less the full performance of the 3D torus interconnection • However, independently from the type of application and problem, the 3D torus still bears the massive advantage of scalability and serviceability
  • 13. Aurora Torus peculiarities • Unified network architecture: – the 3D Torus coexists with an Infiniband network. – Both local and global MPI calls can be processed efficiently – Dedicated synchronization network – Gigabit Ethernet • FPGA driven Torus. Based the result of the work of Aurora Science researchers who acquired experienced with Janus and QPACE • Full duplex communication links – Allowing sub-tori to create subdomain • The length of cables kept very short due to smart backplane design
  • 14. Aurora 3D Torus Network • Aurora Science implementation – Based on FTNW (Pisanti, Schifano, Simma) • http://sourceforge.net/projects/ftnw – GPL licensed – Optimized for nearest-neighbor communication – Proven technology in LQDC communities • Extoll implementation on Aurora – Licensed – Optimized for all-to-all communication for wide range of applications – Future interconnect paving the way to exascale computing
  • 15. Aurora FPGA: 3D Torus network processor PCIe 2.0 x8
  • 16. Aurora S– 3D Torus network CPU CPU PCIe2 x8 PCIe2 x8 40 Gbps 40 Gbps FPGA phy phy phy phy phy phy 4x 4x 4x 4x 4x 4x X+ X- Y+ Y- Z+ Z- 10 10 10 10 10 10 Gbps Gbps Gbps Gbps Gbps Gbps
  • 18. AURORA, a high density – highly efficient family of supercomputers One Aurora Rack 256 nodes, 512 CPUs, 3072 cores, 48U 100 TFLOPS @ 100kW Entirely liquid cooled
  • 19. Aurora identity card High computational power Liquid cooling Energy efficiency Reliability and availability Scalability Unified network architecture Compatibility
  • 20. Aurora identity card High computational power Liquid cooling Energy efficiency Reliability and availability Scalability Unified network architecture Compatibility
  • 21. Unified Network Architecture Ultra low latency Regular, massive, local High bandwidth patterns 3D Torus Nearest neighbor Unlimited scalability select Very low latency Irregular, long distance High bandwidth patterns (Molecular Infiniband Switched network Dynamics) Multiple services Storage (SAN) Monitoring (IPMI) Very fast channel Net processor synch Global commands Thread synch Synch Subdomain manag. Global clock Low/high level synch System coordination Debugging
  • 22. Eurotech HPC Principles High performance. We want our customer run their simulations and applications as quick as the world latest technologies allow. Energy efficiency and Green. We built products to allow our customer to save on energy bills and leverage sustainability. Scalability. We want our solutions to scale linearly and our customer to grow according to their needs and budget availability Availability. Intelligent design, quality, support readiness and preventive maintenance to increase the availability of our HPC systems during their lifetime. Cost effectiveness. We concentrate a lot of our efforts to deliver advanced technology at competitive prices and to allow our customers reducing the total cost of ownership. Versatility and compatibility. We designed our products to tackle different problems in the most effective way possible