SlideShare a Scribd company logo
1 of 28
Data Center Traffic and
Measurements
Hakim Weatherspoon
Assistant Professor, Dept of Computer Science
CS 5413: High Performance Systems and Networking
November 10, 2014
Slides from SIGCOMM Internet Measurement Conference (IMC) 2010 presentation of
“Analysis and Network Traffic Characteristics of Data Centers in the wild”
Goals for Today
• Analysis and Network Traffic Characteristics of
Data Centers in the wild
– T. Benson, A. Akella, and D. A. Maltz. In Proceedings of
the 10th ACM SIGCOMM conference on Internet
measurement (IMC), pp. 267-280. ACM, 2010.
The Importance of Data Centers
• “A 1-millisecond advantage in trading applications
can be worth $100 million a year to a major
brokerage firm”
• Internal users
– Line-of-Business apps
– Production test beds
• External users
– Web portals
– Web services
– Multimedia applications
– Chat/IM
The Case for Understanding Data Center Traffic
• Better understanding  better techniques
• Better traffic engineering techniques
– Avoid data losses
– Improve app performance
• Better Quality of Service techniques
– Better control over jitter
– Allow multimedia apps
• Better energy saving techniques
– Reduce data center’s energy footprint
– Reduce operating expenditures
• Initial stab network level traffic + app relationships
Take aways and Insights Gained
• 75% of traffic stays within a rack (Clouds)
– Applications are not uniformly placed
• Half packets are small (< 200B)
– Keep alive integral in application design
• At most 25% of core links highly utilized
– Effective routing algorithm to reduce utilization
– Load balance across paths and migrate VMs
• Questioned popular assumptions
– Do we need more bisection? No
– Is centralization feasible? Yes
Canonical Data Center Architecture
Core (L3)
Edge (L2)
Top-of-Rack
Aggregation (L2)
Application
servers
Dataset: Data Centers Studied
DC Role DC
Name
Location Number
Devices
Universities EDU1 US-Mid 22
EDU2 US-Mid 36
EDU3 US-Mid 11
Private
Enterprise
PRV1 US-Mid 97
PRV2 US-West 100
Commercial
Clouds
CLD1 US-West 562
CLD2 US-West 763
CLD3 US-East 612
CLD4 S. America 427
CLD5 S. America 427
 10 data centers
 3 classes
 Universities
 Private enterprise
 Clouds
 Internal users
 Univ/priv
 Small
 Local to campus
 External users
 Clouds
 Large
 Globally diverse
Dataset: Collection
• SNMP
– Poll SNMP MIBs
– Bytes-in/bytes-out/discards
– > 10 Days
– Averaged over 5 mins
• Packet Traces
– Cisco port span
– 12 hours
• Topology
– Cisco Discovery Protocol
DC
Name
SNMP Packet
Traces
Topology
EDU1 Yes Yes Yes
EDU2 Yes Yes Yes
EDU3 Yes Yes Yes
PRV1 Yes Yes Yes
PRV2 Yes Yes Yes
CLD1 Yes No No
CLD2 Yes No No
CLD3 Yes No No
CLD4 Yes No No
CLD5 Yes No No
Canonical Data Center Architecture
Core (L3)
Edge (L2)
Top-of-Rack
Aggregation (L2)
Application
servers
Packet
Sniffers
SNMP & Topology
From ALL Links
Applications
• Start at bottom
– Analyze running applications
– Use packet traces
• BroID tool for identification
– Quantify amount of traffic from each app
Applications
• Differences between various bars
• Clustering of applications
– PRV2_2 hosts secured portions of applications
– PRV2_3 hosts unsecure portions of applications
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
PRV2_1 PRV2_2 PRV2_3 PRV2_4 EDU1 EDU2 EDU3
AFS
NCP
SMB
LDAP
HTTPS
HTTP
OTHER
Analyzing Packet Traces
• Transmission patterns of the applications
• Properties of packet crucial for
– Understanding effectiveness of techniques
• ON-OFF traffic at edges
– Binned in 15 and 100 m. secs
– We observe that ON-OFF persists 13
• Understanding arrival process
– Range of acceptable models
• What is the arrival process?
– Heavy-tail for the 3 distributions
• ON, OFF times, Inter-arrival,
– Lognormal across all data
centers
• Different from Pareto of WAN
– Need new models
14
Data
Center
Off Period
Dist
ON periods
Dist
Inter-arrival
Dist
Prv2_1 Lognormal Lognormal Lognormal
Prv2_2 Lognormal Lognormal Lognormal
Prv2_3 Lognormal Lognormal Lognormal
Prv2_4 Lognormal Lognormal Lognormal
EDU1 Lognormal Weibull Weibull
EDU2 Lognormal Weibull Weibull
EDU3 Lognormal Weibull Weibull
Data Center Traffic is Bursty
Packet Size Distribution
• Bimodal (200B and 1400B)
• Small packets
– TCP acknowledgements
– Keep alive packets
• Persistent connections  important to apps
Canonical Data Center Architecture
Core (L3)
Edge (L2)
Top-of-Rack
Aggregation (L2)
Application
servers
Intra-Rack Versus Extra-Rack
• Quantify amount of traffic using interconnect
– Perspective for interconnect analysis
Edge
Application
servers
Extra-Rack
Intra-Rack
Extra-Rack = Sum of Uplinks
Intra-Rack = Sum of Server Links – Extra-Rack
Intra-Rack Versus Extra-Rack Results
• Clouds: most traffic stays within a rack (75%)
– Colocation of apps and dependent components
• Other DCs: > 50% leaves the rack
– Un-optimized placement
0
10
20
30
40
50
60
70
80
90
100
EDU1 EDU2 EDU3 PRV1 PRV2 CLD1 CLD2 CLD3 CLD4 CLD5
Extra-Rack
Inter-Rack
Extra-Rack Traffic on DC Interconnect
• Utilization: core > agg > edge
– Aggregation of many unto few
• Tail of core utilization differs
– Hot-spots  links with > 70% util
– Prevalence of hot-spots differs across data centers
• Low persistence: PRV2, EDU1, EDU2, EDU3, CLD1, CLD3
• High persistence/low prevalence: PRV1, CLD2
– 2-8% are hotspots > 50%
• High persistence/high prevalence: CLD4, CLD5
– 15% are hotspots > 50%
Persistence of Core Hot-Spots
Prevalence of Core Hot-Spots
• Low persistence: very few concurrent hotspots
• High persistence: few concurrent hotspots
• High prevalence: < 25% are hotspots at any time
0 10 20 30 40 50
Time (in Hours)
0.6%
0.0%
0.0%
0.0%
6.0%
24.0%
Observations from Interconnect
• Links utils low at edge and agg
• Core most utilized
– Hot-spots exists (> 70% utilization)
– < 25% links are hotspots
– Loss occurs on less utilized links (< 70%)
• Implicating momentary bursts
• Time-of-Day variations exists
– Variation an order of magnitude larger at core
• Apply these results to evaluate DC design
requirements
Assumption 1: Larger Bisection
• Need for larger bisection
– VL2 [Sigcomm ‘09], Monsoon [Presto ‘08],Fat-Tree
[Sigcomm ‘08], Portland [Sigcomm ‘09], Hedera [NSDI ’10]
– Congestion at oversubscribed core links
Argument for Larger Bisection
• Need for larger bisection
– VL2 [Sigcomm ‘09], Monsoon [Presto ‘08],Fat-Tree
[Sigcomm ‘08], Portland [Sigcomm ‘09], Hedera [NSDI ’10]
– Congestion at oversubscribed core links
– Increase core links and eliminate congestion
Core
Edge
Aggregation
Application
servers
Bisection
Links
(bottleneck)
App
Links
If Σ traffic (App ) > 1 then more device are
Σ capacity(Bisection needed at the bisection
Calculating Bisection Bandwidth
• Given our data: current applications and DC design
– NO, more bisection is not required
– Aggregate bisection is only 30% utilized
• Need to better utilize existing network
– Load balance across paths
– Migrate VMs across racks
Bisection Demand
Insights Gained
• 75% of traffic stays within a rack (Clouds)
– Applications are not uniformly placed
• Half packets are small (< 200B)
– Keep alive integral in application design
• At most 25% of core links highly utilized
– Effective routing algorithm to reduce utilization
– Load balance across paths and migrate VMs
• Questioned popular assumptions
– Do we need more bisection? No
– Is centralization feasible? Yes
Related Works
• IMC ‘09 [Kandula`09]
– Traffic is unpredictable
– Most traffic stays within a rack
• Cloud measurements [Wang’10,Li’10]
– Study application performance
– End-2-End measurements
Before Next time
• Project Interim report
– Due Monday, November 24.
– And meet with groups, TA, and professor
• Fractus Upgrade: Should be back online
• Required review and reading for Wednesday, November
12
– SoNIC: Precise Realtime Software Access and Control of Wired Networks, K.
Lee, H. Wang and H. Weatherspoon. USENIX symposium on Networked
Systems Design and Implementation (NSDI), April 2013, pages 213-225.
– https://www.usenix.org/system/files/conference/nsdi13/nsdi13-final138.pdf
• Check piazza: http://piazza.com/cornell/fall2014/cs5413
• Check website for updated schedule

More Related Content

Similar to 20-datacenter-measurements.pptx

ONF & iSDX Webinar
ONF & iSDX WebinarONF & iSDX Webinar
ONF & iSDX WebinarKatie Hyman
 
An Architecture for Data Intensive Service Enabled by Next Generation Optical...
An Architecture for Data Intensive Service Enabled by Next Generation Optical...An Architecture for Data Intensive Service Enabled by Next Generation Optical...
An Architecture for Data Intensive Service Enabled by Next Generation Optical...Tal Lavian Ph.D.
 
Velocity San Jose 2017: Traffic shifts: Avoiding disasters at scale
Velocity San Jose 2017: Traffic shifts: Avoiding disasters at scaleVelocity San Jose 2017: Traffic shifts: Avoiding disasters at scale
Velocity San Jose 2017: Traffic shifts: Avoiding disasters at scaleMichael Kehoe
 
Data Centre Network Optimization
Data Centre Network OptimizationData Centre Network Optimization
Data Centre Network OptimizationIJAEMSJORNAL
 
PLNOG 3: John Evans - Best Practices in Network Planning
PLNOG 3: John Evans - Best Practices in Network PlanningPLNOG 3: John Evans - Best Practices in Network Planning
PLNOG 3: John Evans - Best Practices in Network PlanningPROIDEA
 
SDN and NFV Value in Business Services - A Presentation By Cox Communications
SDN and NFV Value in Business Services - A Presentation By Cox CommunicationsSDN and NFV Value in Business Services - A Presentation By Cox Communications
SDN and NFV Value in Business Services - A Presentation By Cox CommunicationsCisco Service Provider
 
PacketCloud: an Open Platform for Elastic In-network Services.
PacketCloud: an Open Platform for Elastic In-network Services. PacketCloud: an Open Platform for Elastic In-network Services.
PacketCloud: an Open Platform for Elastic In-network Services. yeung2000
 
Cloud Analytics Engine Value - Juniper Networks
Cloud Analytics Engine Value - Juniper Networks Cloud Analytics Engine Value - Juniper Networks
Cloud Analytics Engine Value - Juniper Networks Juniper Networks
 
Introduction to SDN
Introduction to SDNIntroduction to SDN
Introduction to SDNNetCraftsmen
 
What is Your Edge From the Cloud to the Edge, Extending Your Reach
What is Your Edge From the Cloud to the Edge, Extending Your ReachWhat is Your Edge From the Cloud to the Edge, Extending Your Reach
What is Your Edge From the Cloud to the Edge, Extending Your ReachSUSE
 
MPLS/SDN 2013 Intercloud Standardization and Testbeds - Sill
MPLS/SDN 2013 Intercloud Standardization and Testbeds - SillMPLS/SDN 2013 Intercloud Standardization and Testbeds - Sill
MPLS/SDN 2013 Intercloud Standardization and Testbeds - SillAlan Sill
 
Presentation on Data Center Use-Case & Trends
Presentation on Data Center Use-Case & TrendsPresentation on Data Center Use-Case & Trends
Presentation on Data Center Use-Case & TrendsAmod Dani
 
A Platform for Large-Scale Grid Data Service on Dynamic High-Performance Netw...
A Platform for Large-Scale Grid Data Service on Dynamic High-Performance Netw...A Platform for Large-Scale Grid Data Service on Dynamic High-Performance Netw...
A Platform for Large-Scale Grid Data Service on Dynamic High-Performance Netw...Tal Lavian Ph.D.
 
Network Fundamentals: Ch4 - Transport Layer
Network Fundamentals: Ch4 - Transport LayerNetwork Fundamentals: Ch4 - Transport Layer
Network Fundamentals: Ch4 - Transport LayerAbdelkhalik Mosa
 
Traffic Optimization in Multi-Layered WANs using SDN
Traffic Optimization in Multi-Layered WANs using SDN Traffic Optimization in Multi-Layered WANs using SDN
Traffic Optimization in Multi-Layered WANs using SDN Infinera
 
ThousandEyes at Network Field Day 12
ThousandEyes at Network Field Day 12ThousandEyes at Network Field Day 12
ThousandEyes at Network Field Day 12ThousandEyes
 

Similar to 20-datacenter-measurements.pptx (20)

Link_NwkingforDevOps
Link_NwkingforDevOpsLink_NwkingforDevOps
Link_NwkingforDevOps
 
ONF & iSDX Webinar
ONF & iSDX WebinarONF & iSDX Webinar
ONF & iSDX Webinar
 
An Architecture for Data Intensive Service Enabled by Next Generation Optical...
An Architecture for Data Intensive Service Enabled by Next Generation Optical...An Architecture for Data Intensive Service Enabled by Next Generation Optical...
An Architecture for Data Intensive Service Enabled by Next Generation Optical...
 
Velocity San Jose 2017: Traffic shifts: Avoiding disasters at scale
Velocity San Jose 2017: Traffic shifts: Avoiding disasters at scaleVelocity San Jose 2017: Traffic shifts: Avoiding disasters at scale
Velocity San Jose 2017: Traffic shifts: Avoiding disasters at scale
 
Data Centre Network Optimization
Data Centre Network OptimizationData Centre Network Optimization
Data Centre Network Optimization
 
PLNOG 3: John Evans - Best Practices in Network Planning
PLNOG 3: John Evans - Best Practices in Network PlanningPLNOG 3: John Evans - Best Practices in Network Planning
PLNOG 3: John Evans - Best Practices in Network Planning
 
Network cost services
Network cost servicesNetwork cost services
Network cost services
 
SDN and NFV Value in Business Services - A Presentation By Cox Communications
SDN and NFV Value in Business Services - A Presentation By Cox CommunicationsSDN and NFV Value in Business Services - A Presentation By Cox Communications
SDN and NFV Value in Business Services - A Presentation By Cox Communications
 
PacketCloud: an Open Platform for Elastic In-network Services.
PacketCloud: an Open Platform for Elastic In-network Services. PacketCloud: an Open Platform for Elastic In-network Services.
PacketCloud: an Open Platform for Elastic In-network Services.
 
Cloud Analytics Engine Value - Juniper Networks
Cloud Analytics Engine Value - Juniper Networks Cloud Analytics Engine Value - Juniper Networks
Cloud Analytics Engine Value - Juniper Networks
 
Introduction to SDN
Introduction to SDNIntroduction to SDN
Introduction to SDN
 
What is Your Edge From the Cloud to the Edge, Extending Your Reach
What is Your Edge From the Cloud to the Edge, Extending Your ReachWhat is Your Edge From the Cloud to the Edge, Extending Your Reach
What is Your Edge From the Cloud to the Edge, Extending Your Reach
 
MPLS/SDN 2013 Intercloud Standardization and Testbeds - Sill
MPLS/SDN 2013 Intercloud Standardization and Testbeds - SillMPLS/SDN 2013 Intercloud Standardization and Testbeds - Sill
MPLS/SDN 2013 Intercloud Standardization and Testbeds - Sill
 
Presentation on Data Center Use-Case & Trends
Presentation on Data Center Use-Case & TrendsPresentation on Data Center Use-Case & Trends
Presentation on Data Center Use-Case & Trends
 
Lambda Data Grid
Lambda Data GridLambda Data Grid
Lambda Data Grid
 
A Platform for Large-Scale Grid Data Service on Dynamic High-Performance Netw...
A Platform for Large-Scale Grid Data Service on Dynamic High-Performance Netw...A Platform for Large-Scale Grid Data Service on Dynamic High-Performance Netw...
A Platform for Large-Scale Grid Data Service on Dynamic High-Performance Netw...
 
Network Fundamentals: Ch4 - Transport Layer
Network Fundamentals: Ch4 - Transport LayerNetwork Fundamentals: Ch4 - Transport Layer
Network Fundamentals: Ch4 - Transport Layer
 
Simplifying Wired Network Deployments with Software-Defined Networking (SDN)
Simplifying Wired Network Deployments with Software-Defined Networking (SDN)Simplifying Wired Network Deployments with Software-Defined Networking (SDN)
Simplifying Wired Network Deployments with Software-Defined Networking (SDN)
 
Traffic Optimization in Multi-Layered WANs using SDN
Traffic Optimization in Multi-Layered WANs using SDN Traffic Optimization in Multi-Layered WANs using SDN
Traffic Optimization in Multi-Layered WANs using SDN
 
ThousandEyes at Network Field Day 12
ThousandEyes at Network Field Day 12ThousandEyes at Network Field Day 12
ThousandEyes at Network Field Day 12
 

Recently uploaded

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 

Recently uploaded (20)

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 

20-datacenter-measurements.pptx

  • 1. Data Center Traffic and Measurements Hakim Weatherspoon Assistant Professor, Dept of Computer Science CS 5413: High Performance Systems and Networking November 10, 2014 Slides from SIGCOMM Internet Measurement Conference (IMC) 2010 presentation of “Analysis and Network Traffic Characteristics of Data Centers in the wild”
  • 2. Goals for Today • Analysis and Network Traffic Characteristics of Data Centers in the wild – T. Benson, A. Akella, and D. A. Maltz. In Proceedings of the 10th ACM SIGCOMM conference on Internet measurement (IMC), pp. 267-280. ACM, 2010.
  • 3. The Importance of Data Centers • “A 1-millisecond advantage in trading applications can be worth $100 million a year to a major brokerage firm” • Internal users – Line-of-Business apps – Production test beds • External users – Web portals – Web services – Multimedia applications – Chat/IM
  • 4. The Case for Understanding Data Center Traffic • Better understanding  better techniques • Better traffic engineering techniques – Avoid data losses – Improve app performance • Better Quality of Service techniques – Better control over jitter – Allow multimedia apps • Better energy saving techniques – Reduce data center’s energy footprint – Reduce operating expenditures • Initial stab network level traffic + app relationships
  • 5. Take aways and Insights Gained • 75% of traffic stays within a rack (Clouds) – Applications are not uniformly placed • Half packets are small (< 200B) – Keep alive integral in application design • At most 25% of core links highly utilized – Effective routing algorithm to reduce utilization – Load balance across paths and migrate VMs • Questioned popular assumptions – Do we need more bisection? No – Is centralization feasible? Yes
  • 6. Canonical Data Center Architecture Core (L3) Edge (L2) Top-of-Rack Aggregation (L2) Application servers
  • 7. Dataset: Data Centers Studied DC Role DC Name Location Number Devices Universities EDU1 US-Mid 22 EDU2 US-Mid 36 EDU3 US-Mid 11 Private Enterprise PRV1 US-Mid 97 PRV2 US-West 100 Commercial Clouds CLD1 US-West 562 CLD2 US-West 763 CLD3 US-East 612 CLD4 S. America 427 CLD5 S. America 427  10 data centers  3 classes  Universities  Private enterprise  Clouds  Internal users  Univ/priv  Small  Local to campus  External users  Clouds  Large  Globally diverse
  • 8. Dataset: Collection • SNMP – Poll SNMP MIBs – Bytes-in/bytes-out/discards – > 10 Days – Averaged over 5 mins • Packet Traces – Cisco port span – 12 hours • Topology – Cisco Discovery Protocol DC Name SNMP Packet Traces Topology EDU1 Yes Yes Yes EDU2 Yes Yes Yes EDU3 Yes Yes Yes PRV1 Yes Yes Yes PRV2 Yes Yes Yes CLD1 Yes No No CLD2 Yes No No CLD3 Yes No No CLD4 Yes No No CLD5 Yes No No
  • 9. Canonical Data Center Architecture Core (L3) Edge (L2) Top-of-Rack Aggregation (L2) Application servers Packet Sniffers SNMP & Topology From ALL Links
  • 10. Applications • Start at bottom – Analyze running applications – Use packet traces • BroID tool for identification – Quantify amount of traffic from each app
  • 11. Applications • Differences between various bars • Clustering of applications – PRV2_2 hosts secured portions of applications – PRV2_3 hosts unsecure portions of applications 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% PRV2_1 PRV2_2 PRV2_3 PRV2_4 EDU1 EDU2 EDU3 AFS NCP SMB LDAP HTTPS HTTP OTHER
  • 12. Analyzing Packet Traces • Transmission patterns of the applications • Properties of packet crucial for – Understanding effectiveness of techniques • ON-OFF traffic at edges – Binned in 15 and 100 m. secs – We observe that ON-OFF persists 13
  • 13. • Understanding arrival process – Range of acceptable models • What is the arrival process? – Heavy-tail for the 3 distributions • ON, OFF times, Inter-arrival, – Lognormal across all data centers • Different from Pareto of WAN – Need new models 14 Data Center Off Period Dist ON periods Dist Inter-arrival Dist Prv2_1 Lognormal Lognormal Lognormal Prv2_2 Lognormal Lognormal Lognormal Prv2_3 Lognormal Lognormal Lognormal Prv2_4 Lognormal Lognormal Lognormal EDU1 Lognormal Weibull Weibull EDU2 Lognormal Weibull Weibull EDU3 Lognormal Weibull Weibull Data Center Traffic is Bursty
  • 14. Packet Size Distribution • Bimodal (200B and 1400B) • Small packets – TCP acknowledgements – Keep alive packets • Persistent connections  important to apps
  • 15. Canonical Data Center Architecture Core (L3) Edge (L2) Top-of-Rack Aggregation (L2) Application servers
  • 16. Intra-Rack Versus Extra-Rack • Quantify amount of traffic using interconnect – Perspective for interconnect analysis Edge Application servers Extra-Rack Intra-Rack Extra-Rack = Sum of Uplinks Intra-Rack = Sum of Server Links – Extra-Rack
  • 17. Intra-Rack Versus Extra-Rack Results • Clouds: most traffic stays within a rack (75%) – Colocation of apps and dependent components • Other DCs: > 50% leaves the rack – Un-optimized placement 0 10 20 30 40 50 60 70 80 90 100 EDU1 EDU2 EDU3 PRV1 PRV2 CLD1 CLD2 CLD3 CLD4 CLD5 Extra-Rack Inter-Rack
  • 18. Extra-Rack Traffic on DC Interconnect • Utilization: core > agg > edge – Aggregation of many unto few • Tail of core utilization differs – Hot-spots  links with > 70% util – Prevalence of hot-spots differs across data centers
  • 19. • Low persistence: PRV2, EDU1, EDU2, EDU3, CLD1, CLD3 • High persistence/low prevalence: PRV1, CLD2 – 2-8% are hotspots > 50% • High persistence/high prevalence: CLD4, CLD5 – 15% are hotspots > 50% Persistence of Core Hot-Spots
  • 20. Prevalence of Core Hot-Spots • Low persistence: very few concurrent hotspots • High persistence: few concurrent hotspots • High prevalence: < 25% are hotspots at any time 0 10 20 30 40 50 Time (in Hours) 0.6% 0.0% 0.0% 0.0% 6.0% 24.0%
  • 21. Observations from Interconnect • Links utils low at edge and agg • Core most utilized – Hot-spots exists (> 70% utilization) – < 25% links are hotspots – Loss occurs on less utilized links (< 70%) • Implicating momentary bursts • Time-of-Day variations exists – Variation an order of magnitude larger at core • Apply these results to evaluate DC design requirements
  • 22. Assumption 1: Larger Bisection • Need for larger bisection – VL2 [Sigcomm ‘09], Monsoon [Presto ‘08],Fat-Tree [Sigcomm ‘08], Portland [Sigcomm ‘09], Hedera [NSDI ’10] – Congestion at oversubscribed core links
  • 23. Argument for Larger Bisection • Need for larger bisection – VL2 [Sigcomm ‘09], Monsoon [Presto ‘08],Fat-Tree [Sigcomm ‘08], Portland [Sigcomm ‘09], Hedera [NSDI ’10] – Congestion at oversubscribed core links – Increase core links and eliminate congestion
  • 24. Core Edge Aggregation Application servers Bisection Links (bottleneck) App Links If Σ traffic (App ) > 1 then more device are Σ capacity(Bisection needed at the bisection Calculating Bisection Bandwidth
  • 25. • Given our data: current applications and DC design – NO, more bisection is not required – Aggregate bisection is only 30% utilized • Need to better utilize existing network – Load balance across paths – Migrate VMs across racks Bisection Demand
  • 26. Insights Gained • 75% of traffic stays within a rack (Clouds) – Applications are not uniformly placed • Half packets are small (< 200B) – Keep alive integral in application design • At most 25% of core links highly utilized – Effective routing algorithm to reduce utilization – Load balance across paths and migrate VMs • Questioned popular assumptions – Do we need more bisection? No – Is centralization feasible? Yes
  • 27. Related Works • IMC ‘09 [Kandula`09] – Traffic is unpredictable – Most traffic stays within a rack • Cloud measurements [Wang’10,Li’10] – Study application performance – End-2-End measurements
  • 28. Before Next time • Project Interim report – Due Monday, November 24. – And meet with groups, TA, and professor • Fractus Upgrade: Should be back online • Required review and reading for Wednesday, November 12 – SoNIC: Precise Realtime Software Access and Control of Wired Networks, K. Lee, H. Wang and H. Weatherspoon. USENIX symposium on Networked Systems Design and Implementation (NSDI), April 2013, pages 213-225. – https://www.usenix.org/system/files/conference/nsdi13/nsdi13-final138.pdf • Check piazza: http://piazza.com/cornell/fall2014/cs5413 • Check website for updated schedule

Editor's Notes

  1. Rack of application servers Point out 3 layers (core, gag, edge), show how 2-Tier and 3-Tier are different Show links captured by SNMP Show links captured by packet sniffers
  2. Data from 10 date centers -- 5 clouds, 3 uni, 2 enterprise Big data centers used for clouds Enterprise/uni data located closer to premise of users
  3. Rack of application servers Point out 3 layers (core, gag, edge), show how 2-Tier and 3-Tier are different Show links captured by SNMP Show links captured by packet sniffers
  4. App mix found in the dc impact the results we look at next
  5. Important for understanding effective assumptions and types of techniques that can be used
  6. What are you curvingfitting for log normal parameters or the actual distributions Not fit == different We == conventionally
  7. Rack of application servers Point out 3 layers (core, gag, edge), show how 2-Tier and 3-Tier are different Show links captured by SNMP Show links captured by packet sniffers
  8. intra-rack - traffic within extra-rack – traffic leaving Use equation to show how derived
  9. intra-rack - traffic within extra-rack – traffic leaving Use equation to show how derived