SlideShare a Scribd company logo
Rev PA1Rev PA1 1
Keren Bergman
Lightwave Research Lab, Columbia University
New York, NY, USA
Flexibly Scalable High Performance Architectures 
with Embedded Photonics 
Rev PA1Rev PA1 3
Performance/Communications Trends for Top 10 (2010-2018)
Sunway TaihuLight (Nov 2017) B/F = 0.004; Summit HPC (June 2018) B/F = 0.0005  8X decrease
Rev PA1Rev PA1 4
Performance and the Data Movement Energy Budget
• GFLOPs/Watt = GFlop/second / Joule/second = GFlop/Joule
• 14 GFLOPs/W (Summit)  72 pJ/FLOP
• Target: 50 GFLOPs/W  20 pJ/FLOP
• Energy per bit total budget (200 bits/FLOP):
14 GFLOPs/W: 72 pJ/FLOP 0.36 pJ/bit
50 GFLOPs/W: 20 pJ/FLOP 0.1 pJ/bit
• Scaling performance under ultra-tight energy budget:
– Raise cache hit rates (expanded caches, more reuse)
– Improve memory access (read, write) energy efficiency
– Improve data movement energy efficiency:
• Novel interconnect technologies and architectures
Rev PA1Rev PA1 8
What can photonics bring to the table?
The Photonic Opportunity for Data Movement
Reduce Energy Consumption             Eliminate Bandwidth Taper
R. Lucas et al., “Top ten exascale research challenges,” DOE ASCAC subcommittee Report, 2014
Rev PA1Rev PA1 9
Maximizing the system‐wide energy efficient data movement benefits of photonics:
Approach 3 Key Concepts:
Rev PA1Rev PA1 12
clk
data
clk
data
clk
data
clk
dataTIA
Heater
clk gen
Silicon waveguide
Sensitivity of receiver @
10 Gb/s or 25 Gb/s
Coupler
Loss
Demux Array Penalty
Coupler
Loss
Coupler
Loss
Modulator Array Penalty
Available
Laser
power
Maximum Budget
Extra
Budget
Optical Power
Key Concept #1: Global Link Optimization
Max Bandwidth Density / Min Energy Design
Rev PA1Rev PA1 14
Passive Alignment – High Density Optical Fiber IO
• Demonstrated robustness to temperature
and fabrication variations.
• Temperature fluctuations of >80 degrees
have penalty of less than 0.6 dB on the
coupling efficiency.
• Designed and fabricated the funnel for
inverse taper coupling.
• Plug-and-play fiber to waveguide coupling
Rev PA1Rev PA1 16
Energy Optimized Performance – 800G
10Gb/s 25Gb/s
Power Penalty 15 dB 13.5 dB
-2.5 dBm -1.5 dBmLaser Power (dB)
0.56 mW 0.71 mWLaser Power (mW)
-17.5 dBm -15 dBmReceiver Sensitivity
10 Gb/s
2.2 pJ/b
25 Gb/s
3.2 pJ/b
• Aggregate bandwidth: 800G per link
• 8 groups of 10G x 10 vs 4 groups of 25G x 8
Rev PA1Rev PA1 17
Scaling chip ‘escape’ bandwidth density
~22mm
• 18 NVLink 2.0 ports  9 per long edge top/bottom
• 50GB/s per port (25GB/s each Tx/Rx)
• 1 NVLink ~ 2mm of linear edge
• 50GB/s per 2mm  200Gb/s/mm
NVIDIA NVSwitch
Each fiber ‘pin’ carries
80 s (scalable)Dense WDM Silicon Photonic:
• 250um fiber pitch
• 8 fiber links ~ over 2mm linear edge
• 80 s per fiber link; each  at 10Gb/s = 800 Gb/s per link
• 6.4 Tb/s per 2mm  3.2 Tb/s/mm
Rev PA1Rev PA1 18
• OC‐MCM: Optically Connected Multi‐chip Module
– Optical communication among interposers/MCM 
– Enables fully flexible and scalable architectures
Approach: Deep Disaggregation
…network of Resources
CMP(s), GPUs
on interposer
Key Concept #2: 
Ubiquitous Optically Connected‐MCM
Rev PA1Rev PA1 19
Electronic ICs
Photonic IC
(to be packaged)
Silicon Interposer
(100m thick)Silicon Interposer
(100m thick)
Photonic IC
Electronic ICs
Interconnected 2.5D/3D node (MCM)
Active interposer 3D
TI TIAs  8x8 Switch  Lasers
Rev PA1Rev PA1 20
Current server
Current rack
Disaggregated
rack
Pool and
compose
Key Concept #3: Adaptive, Flexible Connectivity with Bandwidth Steering
Only “Power Up” Needed Optical Links
Inserting photonic switching – directly integrated in MCM – fully flexible connectivity
Rev PA1Rev PA1 21
Standard (‘Vanilla’) Fat Tree Architecture - Challenges
1 2 3 4 5 6 7 8
9 10 11 12
13 14 Electronic Packet
Switch (EPS)
Server
Pod 1 Pod 2Spine
Aggregation
ToR
• In a standard Fat Tree, ToR EPSs are only connected to their parent aggregation EPSs
• Inter-pod traffic (defragmentation) needs to traverse (+ hops) through spine to connect between pods
• Spine often oversubscribed, energy consuming large routers
Pre-Publication accepted to SC19: G. Michelogiannakis et al. “Bandwidth Steering for HPC using Silicon Nanophotonics”
Rev PA1Rev PA1 23
1 2 3 4 5 6 7 8 A B C D E F G HSiP Switch SiP Switch SiP Switch SiP Switch
Pod 1 Pod 2Spine
Aggregation
ToR 1 2 3 4 5 6 7 8
9 10 11 12
13 14
Electronic Packet
Switch (EPS)
Server
• Embedded SiP switches – flexible network within Fat Tree - direct ToR/Aggregation switch connections across pods
• Bypassing need to travel through congested higher layer links
• Lower latency - by providing a direct path that reduces hop count, and lowers utilization of upper layer
Flexible Fat Tree: Bandwidth steering across Pods
Rev PA1Rev PA1 24
Throughput of Upper Layer Links in Standard Fat-Tree Topology
Throughput of Upper Layer Links in Bandwidth-Steered Topology
56 72
25% execution
time
difference
System Experimental Results – Standard Fat Tree vs Flexible Fat Tree
• Operating skeletonized
GTC application with MPI
• Standard Fat Tree:
• all upper layer links used
• runtime = 72 sec
• Flexible PINE Fat Tree:
• direct lower layer connectivity
• only 4 upper layer links utilized
• runtime = 56 sec
Pre-Publication accepted to SC19: G. Michelogiannakis et al. “Bandwidth Steering for HPC using Silicon Nanophotonics”
Rev PA1Rev PA1 25
1 2 3 4 5 6 7 8 A B C D E F G H
4x4
SiP Switch 1
4x4
SiP Switch 2
4x4
SiP Switch 3
4x4
SiP Switch 4
Pod 1 Pod 2Spine
Aggregation
ToR 1 2 3 4 5 6 7 8
9 10 11 12
13 14
Electronic Packet
Switch (EPS)
Server
Flexible Fat Tree: Removal of Upper Layer Links
• PINE Flexible Fat Tree – provides direct connectivity for traffic at the lower layer
• Remove spine layer links to reduce energy consumption
Rev PA1Rev PA1 26
Throughput of Upper Layer Links in Fat-Tree Topology with Some Upper Links Removed
Throughput of Upper Layer Links in Bandwidth-Steered Topology with
Some Upper Links Removed
115
69% execution time
difference
56
System Results: Standard Fat Tree vs Flexible Fat Tree with Links Removed
• Flexible Fat Tree:
• direct lower layer connectivity
• Can remove upper layer EPSs =
reduced energy consumption
• Standard Fat Tree: all remaining
upper layer links congested
• runtime = 115 sec (vs 72 sec)
• Flexible Fat Tree:
• unaffected by removal of links,
remains at 56 sec runtime
Rev PA1Rev PA1 28
Summary: • Data Movement is Critical to any Future Performance Scaling
– Power Consumption
– Bandwidth Density (and Cost)
• Photonics: System‐Wide Tb/s per ‘wire’ and 1 pJ/bit
– Ultra bandwidth dense WDM photonic links
– Energy efficiency and cost via co‐integration 2.5D and 3D platforms
– High bandwidth Optically Connected Memory/GPUs/CPUs
• Deeply disaggregated Architectures
– Optical connectivity for flexibly assembled interconnectivity topologies
• Computer architecture landscape is changing rapidly ‐ Data Analytics, AI
– Optical bandwidth steering, adaptable architectures for scalability
– Ultimate energy efficiency – use only required resources for needed time period

More Related Content

What's hot

Satellite Downlink Budget Analysis
Satellite Downlink Budget AnalysisSatellite Downlink Budget Analysis
Satellite Downlink Budget AnalysisPartho Choudhury
 
Graphene based nano anteenas for terahertz communication
Graphene based nano anteenas for terahertz communicationGraphene based nano anteenas for terahertz communication
Graphene based nano anteenas for terahertz communication
Abdul Qudoos
 
Earth Station and Satellite Antennas
Earth Station and Satellite AntennasEarth Station and Satellite Antennas
Earth Station and Satellite Antennas
furkansleyman
 
Satellite link design
Satellite link designSatellite link design
Satellite link design
Satellite link designSatellite link design
Satellite link design
RAVIKIRAN ANANDE
 
Nonlinear Impact of Diverse Optical Routing in Uncompensated 120 Gb/s PM-QPSK...
Nonlinear Impact of Diverse Optical Routing in Uncompensated 120 Gb/s PM-QPSK...Nonlinear Impact of Diverse Optical Routing in Uncompensated 120 Gb/s PM-QPSK...
Nonlinear Impact of Diverse Optical Routing in Uncompensated 120 Gb/s PM-QPSK...
ADVA
 
Zipper Modulation
Zipper Modulation  Zipper Modulation
Zipper Modulation
Jörgen Larsson
 
Relay selection for geographical forwarding in sleep wake cycling wireless se...
Relay selection for geographical forwarding in sleep wake cycling wireless se...Relay selection for geographical forwarding in sleep wake cycling wireless se...
Relay selection for geographical forwarding in sleep wake cycling wireless se...
JPINFOTECH JAYAPRAKASH
 
about message coalescing
about message coalescingabout message coalescing
about message coalescing
yyooooon
 
Design of the satellite link
Design of the satellite linkDesign of the satellite link
Design of the satellite link
AJAL A J
 
UPLINK, DOWNLINK AND OVERALL LINK PERFORMANCE INTER-SATELLITE LINKS
UPLINK, DOWNLINK ANDOVERALL LINK PERFORMANCE INTER-SATELLITE LINKSUPLINK, DOWNLINK ANDOVERALL LINK PERFORMANCE INTER-SATELLITE LINKS
UPLINK, DOWNLINK AND OVERALL LINK PERFORMANCE INTER-SATELLITE LINKS
Ahmed Ayman
 
Link Adapatation in Mobile Satellite Links: Field Trial Results using SDR and...
Link Adapatation in Mobile Satellite Links: Field Trial Results using SDR and...Link Adapatation in Mobile Satellite Links: Field Trial Results using SDR and...
Link Adapatation in Mobile Satellite Links: Field Trial Results using SDR and...
Anxo Tato Arias
 
Time from the Sky
Time from the SkyTime from the Sky
Time from the Sky
3G4G
 
IEEE BTS Symposium 2015 Repack Considerations by K.Pelletier
IEEE BTS Symposium 2015 Repack Considerations by K.PelletierIEEE BTS Symposium 2015 Repack Considerations by K.Pelletier
IEEE BTS Symposium 2015 Repack Considerations by K.Pelletier
kmsavage
 
WE3.L09 - RAIN EFFECT ON POLARIMETRIC SAR OBSERVATION
WE3.L09 - RAIN EFFECT ON POLARIMETRIC SAR OBSERVATIONWE3.L09 - RAIN EFFECT ON POLARIMETRIC SAR OBSERVATION
WE3.L09 - RAIN EFFECT ON POLARIMETRIC SAR OBSERVATIONgrssieee
 
Earth station and satellite antenna
Earth station and satellite antennaEarth station and satellite antenna
Earth station and satellite antenna
Andrew Huang
 
Optical Antenna
Optical AntennaOptical Antenna
Optical Antenna
Srinivas Vasamsetti
 
20GHz DQPSK Optical Modulator Electrical Bias Optimiser
20GHz DQPSK Optical Modulator Electrical Bias Optimiser20GHz DQPSK Optical Modulator Electrical Bias Optimiser
20GHz DQPSK Optical Modulator Electrical Bias Optimiser
Ben Larcombe
 

What's hot (20)

Satellite Downlink Budget Analysis
Satellite Downlink Budget AnalysisSatellite Downlink Budget Analysis
Satellite Downlink Budget Analysis
 
Graphene based nano anteenas for terahertz communication
Graphene based nano anteenas for terahertz communicationGraphene based nano anteenas for terahertz communication
Graphene based nano anteenas for terahertz communication
 
Earth Station and Satellite Antennas
Earth Station and Satellite AntennasEarth Station and Satellite Antennas
Earth Station and Satellite Antennas
 
Satellite link design
Satellite link designSatellite link design
Satellite link design
 
Satellite link design
Satellite link designSatellite link design
Satellite link design
 
Radio Link budget
Radio Link budgetRadio Link budget
Radio Link budget
 
Nonlinear Impact of Diverse Optical Routing in Uncompensated 120 Gb/s PM-QPSK...
Nonlinear Impact of Diverse Optical Routing in Uncompensated 120 Gb/s PM-QPSK...Nonlinear Impact of Diverse Optical Routing in Uncompensated 120 Gb/s PM-QPSK...
Nonlinear Impact of Diverse Optical Routing in Uncompensated 120 Gb/s PM-QPSK...
 
Zipper Modulation
Zipper Modulation  Zipper Modulation
Zipper Modulation
 
Relay selection for geographical forwarding in sleep wake cycling wireless se...
Relay selection for geographical forwarding in sleep wake cycling wireless se...Relay selection for geographical forwarding in sleep wake cycling wireless se...
Relay selection for geographical forwarding in sleep wake cycling wireless se...
 
about message coalescing
about message coalescingabout message coalescing
about message coalescing
 
Design of the satellite link
Design of the satellite linkDesign of the satellite link
Design of the satellite link
 
Satellite antennas
Satellite antennasSatellite antennas
Satellite antennas
 
UPLINK, DOWNLINK AND OVERALL LINK PERFORMANCE INTER-SATELLITE LINKS
UPLINK, DOWNLINK ANDOVERALL LINK PERFORMANCE INTER-SATELLITE LINKSUPLINK, DOWNLINK ANDOVERALL LINK PERFORMANCE INTER-SATELLITE LINKS
UPLINK, DOWNLINK AND OVERALL LINK PERFORMANCE INTER-SATELLITE LINKS
 
Link Adapatation in Mobile Satellite Links: Field Trial Results using SDR and...
Link Adapatation in Mobile Satellite Links: Field Trial Results using SDR and...Link Adapatation in Mobile Satellite Links: Field Trial Results using SDR and...
Link Adapatation in Mobile Satellite Links: Field Trial Results using SDR and...
 
Time from the Sky
Time from the SkyTime from the Sky
Time from the Sky
 
IEEE BTS Symposium 2015 Repack Considerations by K.Pelletier
IEEE BTS Symposium 2015 Repack Considerations by K.PelletierIEEE BTS Symposium 2015 Repack Considerations by K.Pelletier
IEEE BTS Symposium 2015 Repack Considerations by K.Pelletier
 
WE3.L09 - RAIN EFFECT ON POLARIMETRIC SAR OBSERVATION
WE3.L09 - RAIN EFFECT ON POLARIMETRIC SAR OBSERVATIONWE3.L09 - RAIN EFFECT ON POLARIMETRIC SAR OBSERVATION
WE3.L09 - RAIN EFFECT ON POLARIMETRIC SAR OBSERVATION
 
Earth station and satellite antenna
Earth station and satellite antennaEarth station and satellite antenna
Earth station and satellite antenna
 
Optical Antenna
Optical AntennaOptical Antenna
Optical Antenna
 
20GHz DQPSK Optical Modulator Electrical Bias Optimiser
20GHz DQPSK Optical Modulator Electrical Bias Optimiser20GHz DQPSK Optical Modulator Electrical Bias Optimiser
20GHz DQPSK Optical Modulator Electrical Bias Optimiser
 

Similar to Flexibly Scalable High Performance Architectures with Embedded Photonics

Silicon Photonics for Extreme Computing - Challenges and Opportunities
Silicon Photonics for Extreme Computing - Challenges and OpportunitiesSilicon Photonics for Extreme Computing - Challenges and Opportunities
Silicon Photonics for Extreme Computing - Challenges and Opportunities
inside-BigData.com
 
PLNOG 13: Alexis Dacquay: Handling high-bandwidth-consumption applications in...
PLNOG 13: Alexis Dacquay: Handling high-bandwidth-consumption applications in...PLNOG 13: Alexis Dacquay: Handling high-bandwidth-consumption applications in...
PLNOG 13: Alexis Dacquay: Handling high-bandwidth-consumption applications in...
PROIDEA
 
A 64Gb/s PAM-4 Transmitter with 4-Tap FFE and 2.26pJ/b Energy Efficiency in 2...
A 64Gb/s PAM-4 Transmitter with 4-Tap FFE and 2.26pJ/b Energy Efficiency in 2...A 64Gb/s PAM-4 Transmitter with 4-Tap FFE and 2.26pJ/b Energy Efficiency in 2...
A 64Gb/s PAM-4 Transmitter with 4-Tap FFE and 2.26pJ/b Energy Efficiency in 2...
aiclab
 
To HDR and Beyond
To HDR and BeyondTo HDR and Beyond
To HDR and Beyond
inside-BigData.com
 
A 45Gb/s PAM-4 Transmitter Delivering 1.3Vppd Output Swing with 1V supply in ...
A 45Gb/s PAM-4 Transmitter Delivering 1.3Vppd Output Swing with 1V supply in ...A 45Gb/s PAM-4 Transmitter Delivering 1.3Vppd Output Swing with 1V supply in ...
A 45Gb/s PAM-4 Transmitter Delivering 1.3Vppd Output Swing with 1V supply in ...
aiclab
 
ODSA BoW: Basic, Fast, Turbo Die to Die Open Interface Solutions
ODSA BoW: Basic, Fast, Turbo Die to Die Open Interface SolutionsODSA BoW: Basic, Fast, Turbo Die to Die Open Interface Solutions
ODSA BoW: Basic, Fast, Turbo Die to Die Open Interface Solutions
jennimenni
 
OODSA - BoW: Basic, Fast, Turbo Die-to Die Open Interface Solutions
OODSA - BoW: Basic, Fast, Turbo Die-to Die Open Interface SolutionsOODSA - BoW: Basic, Fast, Turbo Die-to Die Open Interface Solutions
OODSA - BoW: Basic, Fast, Turbo Die-to Die Open Interface Solutions
ODSA Workgroup
 
Linkmeup v23-compass-eos
Linkmeup v23-compass-eosLinkmeup v23-compass-eos
Linkmeup v23-compass-eos
eucariot
 
ENRZ Advanced Modulation for Low Latency Applications
ENRZ Advanced Modulation for Low Latency ApplicationsENRZ Advanced Modulation for Low Latency Applications
ENRZ Advanced Modulation for Low Latency Applications
Deborah Porchivina
 
Secure Optical LAN: TechNet Augusta 2015
Secure Optical LAN: TechNet Augusta 2015Secure Optical LAN: TechNet Augusta 2015
Secure Optical LAN: TechNet Augusta 2015
AFCEA International
 
Experiences with High-bandwidth Networks
Experiences with High-bandwidth NetworksExperiences with High-bandwidth Networks
Experiences with High-bandwidth Networks
balmanme
 
Streaming exa-scale data over 100Gbps networks
Streaming exa-scale data over 100Gbps networksStreaming exa-scale data over 100Gbps networks
Streaming exa-scale data over 100Gbps networks
balmanme
 
Techniques and Challenges in Designing Wideband Power Amplifiers Using GaN an...
Techniques and Challenges in Designing Wideband Power Amplifiers Using GaN an...Techniques and Challenges in Designing Wideband Power Amplifiers Using GaN an...
Techniques and Challenges in Designing Wideband Power Amplifiers Using GaN an...
NXP Admin
 
Techniques and Challenges in Designing Wideband Power Amplifiers Using GaN an...
Techniques and Challenges in Designing Wideband Power Amplifiers Using GaN an...Techniques and Challenges in Designing Wideband Power Amplifiers Using GaN an...
Techniques and Challenges in Designing Wideband Power Amplifiers Using GaN an...
Lisa Bradley
 
Tomorrow’s Data Centre, Today
Tomorrow’s Data Centre, TodayTomorrow’s Data Centre, Today
Tomorrow’s Data Centre, Today
Panduit
 
ECOC Panel on OIF CEI 56G
ECOC Panel on OIF CEI 56GECOC Panel on OIF CEI 56G
ECOC Panel on OIF CEI 56G
Deborah Porchivina
 
Prof. Emanuel Cohen, Technion
Prof. Emanuel Cohen, TechnionProf. Emanuel Cohen, Technion
Prof. Emanuel Cohen, Technion
chiportal
 
Next Generation Fiber Structured Cabling and Migration to 40/100g
Next Generation Fiber Structured Cabling and Migration to 40/100gNext Generation Fiber Structured Cabling and Migration to 40/100g
Next Generation Fiber Structured Cabling and Migration to 40/100g
Panduit
 
OIF 2015 FOE Architecture Presentation
OIF 2015 FOE Architecture PresentationOIF 2015 FOE Architecture Presentation
OIF 2015 FOE Architecture Presentation
Deborah Porchivina
 

Similar to Flexibly Scalable High Performance Architectures with Embedded Photonics (20)

Silicon Photonics for Extreme Computing - Challenges and Opportunities
Silicon Photonics for Extreme Computing - Challenges and OpportunitiesSilicon Photonics for Extreme Computing - Challenges and Opportunities
Silicon Photonics for Extreme Computing - Challenges and Opportunities
 
PLNOG 13: Alexis Dacquay: Handling high-bandwidth-consumption applications in...
PLNOG 13: Alexis Dacquay: Handling high-bandwidth-consumption applications in...PLNOG 13: Alexis Dacquay: Handling high-bandwidth-consumption applications in...
PLNOG 13: Alexis Dacquay: Handling high-bandwidth-consumption applications in...
 
A 64Gb/s PAM-4 Transmitter with 4-Tap FFE and 2.26pJ/b Energy Efficiency in 2...
A 64Gb/s PAM-4 Transmitter with 4-Tap FFE and 2.26pJ/b Energy Efficiency in 2...A 64Gb/s PAM-4 Transmitter with 4-Tap FFE and 2.26pJ/b Energy Efficiency in 2...
A 64Gb/s PAM-4 Transmitter with 4-Tap FFE and 2.26pJ/b Energy Efficiency in 2...
 
To HDR and Beyond
To HDR and BeyondTo HDR and Beyond
To HDR and Beyond
 
A 45Gb/s PAM-4 Transmitter Delivering 1.3Vppd Output Swing with 1V supply in ...
A 45Gb/s PAM-4 Transmitter Delivering 1.3Vppd Output Swing with 1V supply in ...A 45Gb/s PAM-4 Transmitter Delivering 1.3Vppd Output Swing with 1V supply in ...
A 45Gb/s PAM-4 Transmitter Delivering 1.3Vppd Output Swing with 1V supply in ...
 
ODSA BoW: Basic, Fast, Turbo Die to Die Open Interface Solutions
ODSA BoW: Basic, Fast, Turbo Die to Die Open Interface SolutionsODSA BoW: Basic, Fast, Turbo Die to Die Open Interface Solutions
ODSA BoW: Basic, Fast, Turbo Die to Die Open Interface Solutions
 
OODSA - BoW: Basic, Fast, Turbo Die-to Die Open Interface Solutions
OODSA - BoW: Basic, Fast, Turbo Die-to Die Open Interface SolutionsOODSA - BoW: Basic, Fast, Turbo Die-to Die Open Interface Solutions
OODSA - BoW: Basic, Fast, Turbo Die-to Die Open Interface Solutions
 
Linkmeup v23-compass-eos
Linkmeup v23-compass-eosLinkmeup v23-compass-eos
Linkmeup v23-compass-eos
 
ENRZ Advanced Modulation for Low Latency Applications
ENRZ Advanced Modulation for Low Latency ApplicationsENRZ Advanced Modulation for Low Latency Applications
ENRZ Advanced Modulation for Low Latency Applications
 
Secure Optical LAN: TechNet Augusta 2015
Secure Optical LAN: TechNet Augusta 2015Secure Optical LAN: TechNet Augusta 2015
Secure Optical LAN: TechNet Augusta 2015
 
Experiences with High-bandwidth Networks
Experiences with High-bandwidth NetworksExperiences with High-bandwidth Networks
Experiences with High-bandwidth Networks
 
Streaming exa-scale data over 100Gbps networks
Streaming exa-scale data over 100Gbps networksStreaming exa-scale data over 100Gbps networks
Streaming exa-scale data over 100Gbps networks
 
Techniques and Challenges in Designing Wideband Power Amplifiers Using GaN an...
Techniques and Challenges in Designing Wideband Power Amplifiers Using GaN an...Techniques and Challenges in Designing Wideband Power Amplifiers Using GaN an...
Techniques and Challenges in Designing Wideband Power Amplifiers Using GaN an...
 
Techniques and Challenges in Designing Wideband Power Amplifiers Using GaN an...
Techniques and Challenges in Designing Wideband Power Amplifiers Using GaN an...Techniques and Challenges in Designing Wideband Power Amplifiers Using GaN an...
Techniques and Challenges in Designing Wideband Power Amplifiers Using GaN an...
 
PMC_6G
PMC_6GPMC_6G
PMC_6G
 
Tomorrow’s Data Centre, Today
Tomorrow’s Data Centre, TodayTomorrow’s Data Centre, Today
Tomorrow’s Data Centre, Today
 
ECOC Panel on OIF CEI 56G
ECOC Panel on OIF CEI 56GECOC Panel on OIF CEI 56G
ECOC Panel on OIF CEI 56G
 
Prof. Emanuel Cohen, Technion
Prof. Emanuel Cohen, TechnionProf. Emanuel Cohen, Technion
Prof. Emanuel Cohen, Technion
 
Next Generation Fiber Structured Cabling and Migration to 40/100g
Next Generation Fiber Structured Cabling and Migration to 40/100gNext Generation Fiber Structured Cabling and Migration to 40/100g
Next Generation Fiber Structured Cabling and Migration to 40/100g
 
OIF 2015 FOE Architecture Presentation
OIF 2015 FOE Architecture PresentationOIF 2015 FOE Architecture Presentation
OIF 2015 FOE Architecture Presentation
 

More from inside-BigData.com

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
inside-BigData.com
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
inside-BigData.com
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
inside-BigData.com
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
inside-BigData.com
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
inside-BigData.com
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
inside-BigData.com
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
inside-BigData.com
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
inside-BigData.com
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
inside-BigData.com
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
inside-BigData.com
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
inside-BigData.com
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
inside-BigData.com
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
inside-BigData.com
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
inside-BigData.com
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
inside-BigData.com
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
inside-BigData.com
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
inside-BigData.com
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
inside-BigData.com
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
inside-BigData.com
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
inside-BigData.com
 

More from inside-BigData.com (20)

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
 

Recently uploaded

LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Product School
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Thierry Lestable
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
RTTS
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 

Recently uploaded (20)

LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 

Flexibly Scalable High Performance Architectures with Embedded Photonics

  • 1. Rev PA1Rev PA1 1 Keren Bergman Lightwave Research Lab, Columbia University New York, NY, USA Flexibly Scalable High Performance Architectures  with Embedded Photonics 
  • 2. Rev PA1Rev PA1 3 Performance/Communications Trends for Top 10 (2010-2018) Sunway TaihuLight (Nov 2017) B/F = 0.004; Summit HPC (June 2018) B/F = 0.0005  8X decrease
  • 3. Rev PA1Rev PA1 4 Performance and the Data Movement Energy Budget • GFLOPs/Watt = GFlop/second / Joule/second = GFlop/Joule • 14 GFLOPs/W (Summit)  72 pJ/FLOP • Target: 50 GFLOPs/W  20 pJ/FLOP • Energy per bit total budget (200 bits/FLOP): 14 GFLOPs/W: 72 pJ/FLOP 0.36 pJ/bit 50 GFLOPs/W: 20 pJ/FLOP 0.1 pJ/bit • Scaling performance under ultra-tight energy budget: – Raise cache hit rates (expanded caches, more reuse) – Improve memory access (read, write) energy efficiency – Improve data movement energy efficiency: • Novel interconnect technologies and architectures
  • 4. Rev PA1Rev PA1 8 What can photonics bring to the table? The Photonic Opportunity for Data Movement Reduce Energy Consumption             Eliminate Bandwidth Taper R. Lucas et al., “Top ten exascale research challenges,” DOE ASCAC subcommittee Report, 2014
  • 5. Rev PA1Rev PA1 9 Maximizing the system‐wide energy efficient data movement benefits of photonics: Approach 3 Key Concepts:
  • 6. Rev PA1Rev PA1 12 clk data clk data clk data clk dataTIA Heater clk gen Silicon waveguide Sensitivity of receiver @ 10 Gb/s or 25 Gb/s Coupler Loss Demux Array Penalty Coupler Loss Coupler Loss Modulator Array Penalty Available Laser power Maximum Budget Extra Budget Optical Power Key Concept #1: Global Link Optimization Max Bandwidth Density / Min Energy Design
  • 7. Rev PA1Rev PA1 14 Passive Alignment – High Density Optical Fiber IO • Demonstrated robustness to temperature and fabrication variations. • Temperature fluctuations of >80 degrees have penalty of less than 0.6 dB on the coupling efficiency. • Designed and fabricated the funnel for inverse taper coupling. • Plug-and-play fiber to waveguide coupling
  • 8. Rev PA1Rev PA1 16 Energy Optimized Performance – 800G 10Gb/s 25Gb/s Power Penalty 15 dB 13.5 dB -2.5 dBm -1.5 dBmLaser Power (dB) 0.56 mW 0.71 mWLaser Power (mW) -17.5 dBm -15 dBmReceiver Sensitivity 10 Gb/s 2.2 pJ/b 25 Gb/s 3.2 pJ/b • Aggregate bandwidth: 800G per link • 8 groups of 10G x 10 vs 4 groups of 25G x 8
  • 9. Rev PA1Rev PA1 17 Scaling chip ‘escape’ bandwidth density ~22mm • 18 NVLink 2.0 ports  9 per long edge top/bottom • 50GB/s per port (25GB/s each Tx/Rx) • 1 NVLink ~ 2mm of linear edge • 50GB/s per 2mm  200Gb/s/mm NVIDIA NVSwitch Each fiber ‘pin’ carries 80 s (scalable)Dense WDM Silicon Photonic: • 250um fiber pitch • 8 fiber links ~ over 2mm linear edge • 80 s per fiber link; each  at 10Gb/s = 800 Gb/s per link • 6.4 Tb/s per 2mm  3.2 Tb/s/mm
  • 10. Rev PA1Rev PA1 18 • OC‐MCM: Optically Connected Multi‐chip Module – Optical communication among interposers/MCM  – Enables fully flexible and scalable architectures Approach: Deep Disaggregation …network of Resources CMP(s), GPUs on interposer Key Concept #2:  Ubiquitous Optically Connected‐MCM
  • 11. Rev PA1Rev PA1 19 Electronic ICs Photonic IC (to be packaged) Silicon Interposer (100m thick)Silicon Interposer (100m thick) Photonic IC Electronic ICs Interconnected 2.5D/3D node (MCM) Active interposer 3D TI TIAs  8x8 Switch  Lasers
  • 12. Rev PA1Rev PA1 20 Current server Current rack Disaggregated rack Pool and compose Key Concept #3: Adaptive, Flexible Connectivity with Bandwidth Steering Only “Power Up” Needed Optical Links Inserting photonic switching – directly integrated in MCM – fully flexible connectivity
  • 13. Rev PA1Rev PA1 21 Standard (‘Vanilla’) Fat Tree Architecture - Challenges 1 2 3 4 5 6 7 8 9 10 11 12 13 14 Electronic Packet Switch (EPS) Server Pod 1 Pod 2Spine Aggregation ToR • In a standard Fat Tree, ToR EPSs are only connected to their parent aggregation EPSs • Inter-pod traffic (defragmentation) needs to traverse (+ hops) through spine to connect between pods • Spine often oversubscribed, energy consuming large routers Pre-Publication accepted to SC19: G. Michelogiannakis et al. “Bandwidth Steering for HPC using Silicon Nanophotonics”
  • 14. Rev PA1Rev PA1 23 1 2 3 4 5 6 7 8 A B C D E F G HSiP Switch SiP Switch SiP Switch SiP Switch Pod 1 Pod 2Spine Aggregation ToR 1 2 3 4 5 6 7 8 9 10 11 12 13 14 Electronic Packet Switch (EPS) Server • Embedded SiP switches – flexible network within Fat Tree - direct ToR/Aggregation switch connections across pods • Bypassing need to travel through congested higher layer links • Lower latency - by providing a direct path that reduces hop count, and lowers utilization of upper layer Flexible Fat Tree: Bandwidth steering across Pods
  • 15. Rev PA1Rev PA1 24 Throughput of Upper Layer Links in Standard Fat-Tree Topology Throughput of Upper Layer Links in Bandwidth-Steered Topology 56 72 25% execution time difference System Experimental Results – Standard Fat Tree vs Flexible Fat Tree • Operating skeletonized GTC application with MPI • Standard Fat Tree: • all upper layer links used • runtime = 72 sec • Flexible PINE Fat Tree: • direct lower layer connectivity • only 4 upper layer links utilized • runtime = 56 sec Pre-Publication accepted to SC19: G. Michelogiannakis et al. “Bandwidth Steering for HPC using Silicon Nanophotonics”
  • 16. Rev PA1Rev PA1 25 1 2 3 4 5 6 7 8 A B C D E F G H 4x4 SiP Switch 1 4x4 SiP Switch 2 4x4 SiP Switch 3 4x4 SiP Switch 4 Pod 1 Pod 2Spine Aggregation ToR 1 2 3 4 5 6 7 8 9 10 11 12 13 14 Electronic Packet Switch (EPS) Server Flexible Fat Tree: Removal of Upper Layer Links • PINE Flexible Fat Tree – provides direct connectivity for traffic at the lower layer • Remove spine layer links to reduce energy consumption
  • 17. Rev PA1Rev PA1 26 Throughput of Upper Layer Links in Fat-Tree Topology with Some Upper Links Removed Throughput of Upper Layer Links in Bandwidth-Steered Topology with Some Upper Links Removed 115 69% execution time difference 56 System Results: Standard Fat Tree vs Flexible Fat Tree with Links Removed • Flexible Fat Tree: • direct lower layer connectivity • Can remove upper layer EPSs = reduced energy consumption • Standard Fat Tree: all remaining upper layer links congested • runtime = 115 sec (vs 72 sec) • Flexible Fat Tree: • unaffected by removal of links, remains at 56 sec runtime
  • 18. Rev PA1Rev PA1 28 Summary: • Data Movement is Critical to any Future Performance Scaling – Power Consumption – Bandwidth Density (and Cost) • Photonics: System‐Wide Tb/s per ‘wire’ and 1 pJ/bit – Ultra bandwidth dense WDM photonic links – Energy efficiency and cost via co‐integration 2.5D and 3D platforms – High bandwidth Optically Connected Memory/GPUs/CPUs • Deeply disaggregated Architectures – Optical connectivity for flexibly assembled interconnectivity topologies • Computer architecture landscape is changing rapidly ‐ Data Analytics, AI – Optical bandwidth steering, adaptable architectures for scalability – Ultimate energy efficiency – use only required resources for needed time period