SlideShare a Scribd company logo
1 of 55
Datacenters
CS 168, Fall 2014
Sylvia Ratnasamy
http://inst.eecs.berkeley.edu/~cs168/
What you need to know
• Characteristics of a datacenter environment
– goals, constraints, workloads, etc.
• How and why DC networks are different (vs. WAN)
– e.g., latency, geo, autonomy, …
• How traditional solutions fare in this environment
– e.g., IP, Ethernet, TCP, ARP, DHCP
• Specific design approaches we cover in class
– next lecture
Disclaimer
• Material is emerging (not established) wisdom
• Material is incomplete
– many details on how and why datacenter networks
operate aren’t public
Plan
Today
• Characteristics and goals of datacenter networks
• Focus on differences relative to the Internet
Next lecture
• Emerging solutions
• Servers organized in racks
What goes into a datacenter (network)?
• Servers organized in racks
• Each rack has a `Top of Rack’ (ToR) switch
What goes into a datacenter (network)?
What goes into a datacenter (network)?
• Servers organized in racks
• Each rack has a `Top of Rack’ (ToR) switch
• `Aggregation switches interconnect ToR switches
What goes into a datacenter (network)?
• Servers organized in racks
• Each rack has a `Top of Rack’ (ToR) switch
• `Aggregation switches interconnect ToR switches
• Connected to the outside via `core’ switches
– note: blurry line between aggregation and core
• With 2x redundancy for fault-tolerance
E.g., Brocade Reference Design
CR CR
AR AR AR AR
. . .
S
S
Internet
S
S
…
S
S
…
. . .
~ 40-80 servers/rack
E.g., Cisco Reference Design
Datacenters have been around for a while
1961, Information Processing Center at the National Bank of Arizona
What’s new?
What’s new?
• Scale
• Applications
– Large-scale computations (“big data”)
– Customer-facing, revenue generating services
• Service model
– Clouds (jargon: SaaS, PaaS, DaaS, IaaS, …)
– Multi-tenancy
SCALE!
How big exactly?
• 1M servers/site [Microsoft/Amazon/Google]
• > $1B to build one site [Facebook]
• >$20M/month/site operational costs [Microsoft ’09]
But only O(10-100) sites
Implications (1)
• Scale
Need scalable designs (duh): e.g., avoid flooding
Low cost designs: e.g., use commodity technology
 High utilization (efficiency): e.g., >80% avg. utilization
• Contrast: avg. utilization on Internet links often ~30%
 Tolerate frequent failure
• Large number of (low cost) components
 Automate
Implications (2)
• Service model: clouds / multi-tenancy
performance guarantees
isolation guarantees
portability
• How?
–“network virtualization” (lecture on SDN)
Applications
• Common theme: parallelism
–Applications decomposed into tasks
–Running in parallel on different machines
• Two common paradigms
–“Partition Aggregate”
–Map Reduce
18
Partition-Aggregate
19
Router /
Load Balancer
Web Server Web Server Web Server
Data
Cache
Data
Cache
Database Database
Front-End
Proxy
Front-End
Proxy
user requests from the Internet
“North – South” Traffic
• Interactive / query-response exchange between
external clients and datacenter
• Handled by front-end (web) servers, mid-tier
application servers, and back-end databases
Map-Reduce
21
Distributed
Storage
Distributed
Storage
Map
Tasks
Reduce
Tasks
Map-Reduce
22
Distributed
Storage
Distributed
Storage
Map
Tasks
Reduce
Tasks
Often doesn’t
cross the
network
Always goes over
the network
Some fraction
(typically 2/3)
crosses the network
“East-West” Traffic
CR CR
AR AR AR AR
S
S
S
S
…
S
S
…
. .
.
S
S
S
S
…
S
S
…
“East-West” Traffic
• Traffic between servers in the datacenter
• Communication within “big data” computations
• Traffic may shift on small timescales (< minutes)
Common traffic pattern:
“Elephants” and “Mice”
55% of flows
3% of bytes
5% of flows
35% of bytes
<100KB >10MB
• Web search, data mining (Microsoft) [Alizadeh 2010]
Implications (3)
• Applications
High bandwidth any-to-any communication
(“bisection bandwidth”)
Low latency is critical
Worst-case (“tail”) latency is critical
High Bandwidth
• Ideal: Each server can talk to any other server
at its full access link rate
• Conceptually: DC network as one giant switch
DC Network: Just a Giant Switch!
Slides from: Alizadeh, HotNets 2012
H1 H2 H3 H4 H5 H6 H7 H8 H9
H1 H2 H3 H4 H5 H6 H7 H8 H9
DC Network: Just a Giant Switch!
29
Slides from: Alizadeh, HotNets 2012
H1
H2
H3
H4
H5
H6
H7
H8
H9
H1
H2
H3
H4
H5
H6
H7
H8
H9
H1
H2
H3
H4
H5
H6
H7
H8
H9
TX RX
DC Network: Just a Giant Switch!
Slides from: Alizadeh, HotNets 2012
DC Network: Just a Giant Switch!
H1
H2
H3
H4
H5
H6
H7
H8
H9
H1
H2
H3
H4
H5
H6
H7
H8
H9
TX RX
Slides from: Alizadeh, HotNets 2012
High Bandwidth
• Ideal: Each server can talk to any other server at
its full access link rate
• Conceptually: DC network as one giant switch
– Would require a 10 Pbits/sec switch!
• 1M ports (one port/server)
• 10Gbps per port
• Practical approach: build a network of switches
(“fabric”) with high “bisection bandwidth”
– Each switch has practical #ports and link speeds
Bisection Bandwidth
• Partition a network into two equal parts
• Minimum bandwidth between the partitions
is the bisection bandwidth
• Full bisection bandwidth: bisection bandwidth
in an N node network is N/2 times the
bandwidth of a single link
nodes of any two halves can communicate at full
speed with each other.
Achieving Full Bisection Bandwidth
CR CR
AR AR AR AR
. . .
S
S
Internet
S
S
…
S
S
…
. . .
~ 40 servers/rack
10Gbps
links
O(40x10) Gbps
O(40x10x100)
Gbps
“scale up” approach
Achieving Full Bisection Bandwidth
• Problem: “Scaling up” a traditional tree
topology is expensive!
– requires non-commodity / impractical / link and
switch components
• Solutions?
– Over-subscribe (i.e., provision less than full BBW)
– Better topologies
Oversubscription
Need techniques to avoid congesting
oversubscribed links!
10G
20G
40G
10G
10G
20G
Better topologies?
10G
20G
40G
“scale up” approach “scale out” approach
“scale out” approach
• E.g., `Clos’ topology
– Multi-stage network
– All switches have k ports
– k/2 ports up, k/2 down
• E.g., with 3 stages, k=48
– k3/4 hosts = 27,648 servers
• All links have same speed
Better topologies?
Challenges in scale-out designs?
• Topology offers high bisection bandwidth
• All other system components must be able to
exploit this available capacity
– Routing must use all paths
– Transport protocol must
fill all pipes (fast)
Low Latency
Two (related) issues:
1) Very low RTTs within the DC (approaching 1μsec)
Implications?
– BW x delay: 10Gbps x 1μsec = 10000 bits = 2.5 packets
– Consider TX 500B @ 10Gbps = 0.4μs per hop = 2μs if a
packet traverses 5 hops and waits behind one packet at
every hop
– What does this mean for buffering and switch design?
– What does this mean for congestion control?
Low Latency
Two (related) issues:
1) Very low RTTs within the DC (approaching 1μsec)
2) Applications want low latency
– predictable / guaranteed bounds on flow completion time,
including the worst-case!
(recall: `best effort’ vs. `guaranteed service’ debates)
– How is still an open question
What’s different about DC networks?
Characteristics
• Huge scale
– ~20,000 switches/routers
– contrast: AT&T ~500 routers
What’s different about DC networks?
Characteristics
• Huge scale
• Limited geographic scope
– High bandwidth: 10/40/100G (Contrast: DSL/WiFi)
– Very low RTT: 1-10s μsecs. (Contrast: 100s msecs)
What’s different about DC networks?
Characteristics
• Huge scale
• Limited geographic scope
• Limited heterogeneity
– link speeds, technologies, latencies, …
What’s different about DC networks?
Characteristics
• Huge scale
• Limited geographic scope
• Limited heterogeneity
• Regular/planned topologies (e.g., trees)
– Contrast: ad-hoc evolution of wide-area topologies
What’s different about DC networks?
Goals
• Extreme bisection bandwidth requirements
– recall: all that east-west traffic
– target: any server can communicate at its full link speed
– How: next lecture
What’s different about DC networks?
Goals
• Extreme bisection bandwidth requirements
• Extreme latency requirements
– real money on the line
– current target: 1μs RTTs
– how? Next lecture
What’s different about DC networks?
Goals
• Extreme bisection bandwidth requirements
• Extreme latency requirements
• Predictable, deterministic performance
– “your packet will reach in Xms, or not at all”
– “your VM will always see at least YGbps throughput”
– How is still an open question
What’s different about DC networks?
Goals
• Extreme bisection bandwidth requirements
• Extreme latency requirements
• Predictable, deterministic performance
• Differentiating between tenants is key
– e.g., “No traffic between VMs of tenant A and tenant B”
– “Tenant X cannot consume more than XGbps”
– “Tenant Y’s traffic is low priority”
– How: lecture on SDN (Nov 24)
What’s different about DC networks?
Goals
• Extreme bisection bandwidth requirements
• Extreme latency requirements
• Predictable, deterministic performance
• Differentiating between tenants is key
• Scalability (of course)
What’s different about DC networks?
Goals
• Extreme bisection bandwidth requirements
• Extreme latency requirements
• Predictable, deterministic performance
• Differentiating between tenants is key
• Scalability (of course)
• Cost/efficiency
– focus on commodity solutions, ease of management
What’s different about DC networks?
New degrees of (design) freedom
• Single administrative domain
– Can deviate from standards, invent your own, etc.
– “Green field” deployment is still feasible
What’s different about DC networks?
New degrees of (design) freedom
• Single administrative domain
• Control over network and endpoint(s)
– can change (say) addressing, congestion control, etc.
– can add mechanisms for security/policy/etc. at the
endpoints (typically in the hypervisor)
What’s different about DC networks?
New degrees of (design) freedom
• Single administrative domain
• Control over network and endpoint(s)
• Control over the placement of traffic source/sink
– e.g., map-reduce scheduler chooses where tasks run
– Can control what traffic crosses which links
Summary
• Recap: datacenters
– new characteristics and goals
– some liberating, some constraining
– scalability is the baseline requirement
– more emphasis on performance
– less emphasis on heterogeneity
– less emphasis on interoperability

More Related Content

Similar to Lecture notes - Data Centers________.pptx

FATTREE: A scalable Commodity Data Center Network Architecture
FATTREE: A scalable Commodity Data Center Network ArchitectureFATTREE: A scalable Commodity Data Center Network Architecture
FATTREE: A scalable Commodity Data Center Network ArchitectureAnkita Mahajan
 
An Architecture for Data Intensive Service Enabled by Next Generation Optical...
An Architecture for Data Intensive Service Enabled by Next Generation Optical...An Architecture for Data Intensive Service Enabled by Next Generation Optical...
An Architecture for Data Intensive Service Enabled by Next Generation Optical...Tal Lavian Ph.D.
 
VL2: A scalable and flexible Data Center Network
VL2: A scalable and flexible Data Center NetworkVL2: A scalable and flexible Data Center Network
VL2: A scalable and flexible Data Center NetworkAnkita Mahajan
 
Warehouse scale computer
Warehouse scale computerWarehouse scale computer
Warehouse scale computerHassan A-j
 
EEDC Intelligent Placement of Datacenters
EEDC Intelligent Placement of DatacentersEEDC Intelligent Placement of Datacenters
EEDC Intelligent Placement of DatacentersRoger Rafanell Mas
 
ITN3052_04_Switched_Networks.pdf
ITN3052_04_Switched_Networks.pdfITN3052_04_Switched_Networks.pdf
ITN3052_04_Switched_Networks.pdfssuser2d7235
 
ECS19 - Ingo Gegenwarth - Running Exchange in large environment
ECS19 - Ingo Gegenwarth -  Running Exchangein large environmentECS19 - Ingo Gegenwarth -  Running Exchangein large environment
ECS19 - Ingo Gegenwarth - Running Exchange in large environmentEuropean Collaboration Summit
 
Sept 2017 internetworking
Sept 2017   internetworkingSept 2017   internetworking
Sept 2017 internetworkingshahin raj
 
High performance browser networking ch1,2,3
High performance browser networking ch1,2,3High performance browser networking ch1,2,3
High performance browser networking ch1,2,3Seung-Bum Lee
 
End-to-End Reactive Data Access Using R2DBC with RSocket and Proteus
End-to-End Reactive Data Access Using R2DBC with RSocket and ProteusEnd-to-End Reactive Data Access Using R2DBC with RSocket and Proteus
End-to-End Reactive Data Access Using R2DBC with RSocket and ProteusVMware Tanzu
 
CellSDN: Software-Defined Cellular Core networks
CellSDN: Software-Defined Cellular Core networksCellSDN: Software-Defined Cellular Core networks
CellSDN: Software-Defined Cellular Core networksOpen Networking Summits
 
Building a cost-effective and high-performing public cloud
Building a cost-effective and high-performing public cloudBuilding a cost-effective and high-performing public cloud
Building a cost-effective and high-performing public cloudcloudprovider
 
6 intelligent-placement-of-datacenters
6 intelligent-placement-of-datacenters6 intelligent-placement-of-datacenters
6 intelligent-placement-of-datacenterszafargilani
 
Congection control and Internet working
Congection control and Internet workingCongection control and Internet working
Congection control and Internet workingTharuniDiddekunta
 

Similar to Lecture notes - Data Centers________.pptx (20)

FATTREE: A scalable Commodity Data Center Network Architecture
FATTREE: A scalable Commodity Data Center Network ArchitectureFATTREE: A scalable Commodity Data Center Network Architecture
FATTREE: A scalable Commodity Data Center Network Architecture
 
An Architecture for Data Intensive Service Enabled by Next Generation Optical...
An Architecture for Data Intensive Service Enabled by Next Generation Optical...An Architecture for Data Intensive Service Enabled by Next Generation Optical...
An Architecture for Data Intensive Service Enabled by Next Generation Optical...
 
Deco1
Deco1Deco1
Deco1
 
VL2: A scalable and flexible Data Center Network
VL2: A scalable and flexible Data Center NetworkVL2: A scalable and flexible Data Center Network
VL2: A scalable and flexible Data Center Network
 
Warehouse scale computer
Warehouse scale computerWarehouse scale computer
Warehouse scale computer
 
Topology.ppt
Topology.pptTopology.ppt
Topology.ppt
 
EEDC Intelligent Placement of Datacenters
EEDC Intelligent Placement of DatacentersEEDC Intelligent Placement of Datacenters
EEDC Intelligent Placement of Datacenters
 
Software Defined Networking: Primer
Software Defined Networking: Primer Software Defined Networking: Primer
Software Defined Networking: Primer
 
ITN3052_04_Switched_Networks.pdf
ITN3052_04_Switched_Networks.pdfITN3052_04_Switched_Networks.pdf
ITN3052_04_Switched_Networks.pdf
 
Disruptive Technologies
Disruptive TechnologiesDisruptive Technologies
Disruptive Technologies
 
ECS19 - Ingo Gegenwarth - Running Exchange in large environment
ECS19 - Ingo Gegenwarth -  Running Exchangein large environmentECS19 - Ingo Gegenwarth -  Running Exchangein large environment
ECS19 - Ingo Gegenwarth - Running Exchange in large environment
 
Sept 2017 internetworking
Sept 2017   internetworkingSept 2017   internetworking
Sept 2017 internetworking
 
High performance browser networking ch1,2,3
High performance browser networking ch1,2,3High performance browser networking ch1,2,3
High performance browser networking ch1,2,3
 
End-to-End Reactive Data Access Using R2DBC with RSocket and Proteus
End-to-End Reactive Data Access Using R2DBC with RSocket and ProteusEnd-to-End Reactive Data Access Using R2DBC with RSocket and Proteus
End-to-End Reactive Data Access Using R2DBC with RSocket and Proteus
 
CellSDN: Software-Defined Cellular Core networks
CellSDN: Software-Defined Cellular Core networksCellSDN: Software-Defined Cellular Core networks
CellSDN: Software-Defined Cellular Core networks
 
Alternative metrics
Alternative metricsAlternative metrics
Alternative metrics
 
Building a cost-effective and high-performing public cloud
Building a cost-effective and high-performing public cloudBuilding a cost-effective and high-performing public cloud
Building a cost-effective and high-performing public cloud
 
6 intelligent-placement-of-datacenters
6 intelligent-placement-of-datacenters6 intelligent-placement-of-datacenters
6 intelligent-placement-of-datacenters
 
Congection control and Internet working
Congection control and Internet workingCongection control and Internet working
Congection control and Internet working
 
Chapter 10.pptx
Chapter 10.pptxChapter 10.pptx
Chapter 10.pptx
 

Recently uploaded

IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024Mark Billinghurst
 
Introduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptxIntroduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptxk795866
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfAsst.prof M.Gokilavani
 
Introduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxIntroduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxvipinkmenon1
 
Churning of Butter, Factors affecting .
Churning of Butter, Factors affecting  .Churning of Butter, Factors affecting  .
Churning of Butter, Factors affecting .Satyam Kumar
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerAnamika Sarkar
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionDr.Costas Sachpazis
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort servicejennyeacort
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSCAESB
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLDeelipZope
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidNikhilNagaraju
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx959SahilShah
 
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 

Recently uploaded (20)

IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024
 
Introduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptxIntroduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptx
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girls
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
 
Introduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxIntroduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptx
 
Churning of Butter, Factors affecting .
Churning of Butter, Factors affecting  .Churning of Butter, Factors affecting  .
Churning of Butter, Factors affecting .
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
 
Design and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdfDesign and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdf
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentation
 
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCL
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfid
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx
 
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 

Lecture notes - Data Centers________.pptx

  • 1. Datacenters CS 168, Fall 2014 Sylvia Ratnasamy http://inst.eecs.berkeley.edu/~cs168/
  • 2. What you need to know • Characteristics of a datacenter environment – goals, constraints, workloads, etc. • How and why DC networks are different (vs. WAN) – e.g., latency, geo, autonomy, … • How traditional solutions fare in this environment – e.g., IP, Ethernet, TCP, ARP, DHCP • Specific design approaches we cover in class – next lecture
  • 3. Disclaimer • Material is emerging (not established) wisdom • Material is incomplete – many details on how and why datacenter networks operate aren’t public
  • 4. Plan Today • Characteristics and goals of datacenter networks • Focus on differences relative to the Internet Next lecture • Emerging solutions
  • 5. • Servers organized in racks What goes into a datacenter (network)?
  • 6. • Servers organized in racks • Each rack has a `Top of Rack’ (ToR) switch What goes into a datacenter (network)?
  • 7. What goes into a datacenter (network)? • Servers organized in racks • Each rack has a `Top of Rack’ (ToR) switch • `Aggregation switches interconnect ToR switches
  • 8. What goes into a datacenter (network)? • Servers organized in racks • Each rack has a `Top of Rack’ (ToR) switch • `Aggregation switches interconnect ToR switches • Connected to the outside via `core’ switches – note: blurry line between aggregation and core • With 2x redundancy for fault-tolerance
  • 10. CR CR AR AR AR AR . . . S S Internet S S … S S … . . . ~ 40-80 servers/rack E.g., Cisco Reference Design
  • 11. Datacenters have been around for a while 1961, Information Processing Center at the National Bank of Arizona
  • 13. What’s new? • Scale • Applications – Large-scale computations (“big data”) – Customer-facing, revenue generating services • Service model – Clouds (jargon: SaaS, PaaS, DaaS, IaaS, …) – Multi-tenancy
  • 15. How big exactly? • 1M servers/site [Microsoft/Amazon/Google] • > $1B to build one site [Facebook] • >$20M/month/site operational costs [Microsoft ’09] But only O(10-100) sites
  • 16. Implications (1) • Scale Need scalable designs (duh): e.g., avoid flooding Low cost designs: e.g., use commodity technology  High utilization (efficiency): e.g., >80% avg. utilization • Contrast: avg. utilization on Internet links often ~30%  Tolerate frequent failure • Large number of (low cost) components  Automate
  • 17. Implications (2) • Service model: clouds / multi-tenancy performance guarantees isolation guarantees portability • How? –“network virtualization” (lecture on SDN)
  • 18. Applications • Common theme: parallelism –Applications decomposed into tasks –Running in parallel on different machines • Two common paradigms –“Partition Aggregate” –Map Reduce 18
  • 19. Partition-Aggregate 19 Router / Load Balancer Web Server Web Server Web Server Data Cache Data Cache Database Database Front-End Proxy Front-End Proxy user requests from the Internet
  • 20. “North – South” Traffic • Interactive / query-response exchange between external clients and datacenter • Handled by front-end (web) servers, mid-tier application servers, and back-end databases
  • 23. “East-West” Traffic CR CR AR AR AR AR S S S S … S S … . . . S S S S … S S …
  • 24. “East-West” Traffic • Traffic between servers in the datacenter • Communication within “big data” computations • Traffic may shift on small timescales (< minutes)
  • 25. Common traffic pattern: “Elephants” and “Mice” 55% of flows 3% of bytes 5% of flows 35% of bytes <100KB >10MB • Web search, data mining (Microsoft) [Alizadeh 2010]
  • 26. Implications (3) • Applications High bandwidth any-to-any communication (“bisection bandwidth”) Low latency is critical Worst-case (“tail”) latency is critical
  • 27. High Bandwidth • Ideal: Each server can talk to any other server at its full access link rate • Conceptually: DC network as one giant switch
  • 28. DC Network: Just a Giant Switch! Slides from: Alizadeh, HotNets 2012 H1 H2 H3 H4 H5 H6 H7 H8 H9
  • 29. H1 H2 H3 H4 H5 H6 H7 H8 H9 DC Network: Just a Giant Switch! 29 Slides from: Alizadeh, HotNets 2012
  • 31. DC Network: Just a Giant Switch! H1 H2 H3 H4 H5 H6 H7 H8 H9 H1 H2 H3 H4 H5 H6 H7 H8 H9 TX RX Slides from: Alizadeh, HotNets 2012
  • 32. High Bandwidth • Ideal: Each server can talk to any other server at its full access link rate • Conceptually: DC network as one giant switch – Would require a 10 Pbits/sec switch! • 1M ports (one port/server) • 10Gbps per port • Practical approach: build a network of switches (“fabric”) with high “bisection bandwidth” – Each switch has practical #ports and link speeds
  • 33. Bisection Bandwidth • Partition a network into two equal parts • Minimum bandwidth between the partitions is the bisection bandwidth • Full bisection bandwidth: bisection bandwidth in an N node network is N/2 times the bandwidth of a single link nodes of any two halves can communicate at full speed with each other.
  • 34. Achieving Full Bisection Bandwidth CR CR AR AR AR AR . . . S S Internet S S … S S … . . . ~ 40 servers/rack 10Gbps links O(40x10) Gbps O(40x10x100) Gbps “scale up” approach
  • 35. Achieving Full Bisection Bandwidth • Problem: “Scaling up” a traditional tree topology is expensive! – requires non-commodity / impractical / link and switch components • Solutions? – Over-subscribe (i.e., provision less than full BBW) – Better topologies
  • 36. Oversubscription Need techniques to avoid congesting oversubscribed links! 10G 20G 40G 10G 10G 20G
  • 37. Better topologies? 10G 20G 40G “scale up” approach “scale out” approach
  • 38. “scale out” approach • E.g., `Clos’ topology – Multi-stage network – All switches have k ports – k/2 ports up, k/2 down • E.g., with 3 stages, k=48 – k3/4 hosts = 27,648 servers • All links have same speed Better topologies?
  • 39. Challenges in scale-out designs? • Topology offers high bisection bandwidth • All other system components must be able to exploit this available capacity – Routing must use all paths – Transport protocol must fill all pipes (fast)
  • 40. Low Latency Two (related) issues: 1) Very low RTTs within the DC (approaching 1μsec) Implications? – BW x delay: 10Gbps x 1μsec = 10000 bits = 2.5 packets – Consider TX 500B @ 10Gbps = 0.4μs per hop = 2μs if a packet traverses 5 hops and waits behind one packet at every hop – What does this mean for buffering and switch design? – What does this mean for congestion control?
  • 41. Low Latency Two (related) issues: 1) Very low RTTs within the DC (approaching 1μsec) 2) Applications want low latency – predictable / guaranteed bounds on flow completion time, including the worst-case! (recall: `best effort’ vs. `guaranteed service’ debates) – How is still an open question
  • 42. What’s different about DC networks? Characteristics • Huge scale – ~20,000 switches/routers – contrast: AT&T ~500 routers
  • 43. What’s different about DC networks? Characteristics • Huge scale • Limited geographic scope – High bandwidth: 10/40/100G (Contrast: DSL/WiFi) – Very low RTT: 1-10s μsecs. (Contrast: 100s msecs)
  • 44. What’s different about DC networks? Characteristics • Huge scale • Limited geographic scope • Limited heterogeneity – link speeds, technologies, latencies, …
  • 45. What’s different about DC networks? Characteristics • Huge scale • Limited geographic scope • Limited heterogeneity • Regular/planned topologies (e.g., trees) – Contrast: ad-hoc evolution of wide-area topologies
  • 46. What’s different about DC networks? Goals • Extreme bisection bandwidth requirements – recall: all that east-west traffic – target: any server can communicate at its full link speed – How: next lecture
  • 47. What’s different about DC networks? Goals • Extreme bisection bandwidth requirements • Extreme latency requirements – real money on the line – current target: 1μs RTTs – how? Next lecture
  • 48. What’s different about DC networks? Goals • Extreme bisection bandwidth requirements • Extreme latency requirements • Predictable, deterministic performance – “your packet will reach in Xms, or not at all” – “your VM will always see at least YGbps throughput” – How is still an open question
  • 49. What’s different about DC networks? Goals • Extreme bisection bandwidth requirements • Extreme latency requirements • Predictable, deterministic performance • Differentiating between tenants is key – e.g., “No traffic between VMs of tenant A and tenant B” – “Tenant X cannot consume more than XGbps” – “Tenant Y’s traffic is low priority” – How: lecture on SDN (Nov 24)
  • 50. What’s different about DC networks? Goals • Extreme bisection bandwidth requirements • Extreme latency requirements • Predictable, deterministic performance • Differentiating between tenants is key • Scalability (of course)
  • 51. What’s different about DC networks? Goals • Extreme bisection bandwidth requirements • Extreme latency requirements • Predictable, deterministic performance • Differentiating between tenants is key • Scalability (of course) • Cost/efficiency – focus on commodity solutions, ease of management
  • 52. What’s different about DC networks? New degrees of (design) freedom • Single administrative domain – Can deviate from standards, invent your own, etc. – “Green field” deployment is still feasible
  • 53. What’s different about DC networks? New degrees of (design) freedom • Single administrative domain • Control over network and endpoint(s) – can change (say) addressing, congestion control, etc. – can add mechanisms for security/policy/etc. at the endpoints (typically in the hypervisor)
  • 54. What’s different about DC networks? New degrees of (design) freedom • Single administrative domain • Control over network and endpoint(s) • Control over the placement of traffic source/sink – e.g., map-reduce scheduler chooses where tasks run – Can control what traffic crosses which links
  • 55. Summary • Recap: datacenters – new characteristics and goals – some liberating, some constraining – scalability is the baseline requirement – more emphasis on performance – less emphasis on heterogeneity – less emphasis on interoperability

Editor's Notes

  1. -- incast? -- one big switch abstraction…