Using SD-WAN over the internet required understanding latency variation on different paths, This presents results compairing various global city pairs and how latency varies, along with a comparison to AWS backbone performance for the same paths.
How to move a mission critical system to 4 AWS regions in one year?Wojciech Gawroński
A year ago our team was challenged to enhance the scope and scale of an existing platform, that is providing significant revenue for our client. As the designers and maintainers of that solution, we decided to leverage AWS cloud during that transition. In the presentation, I would like to discuss how we have tackled that migration - with the assumption that we had to move in a limited resource, hybrid cloud environment - working in close cooperation with teams responsible for other parts of the system. As I stated previously - it was a challenge - and I would like to talk what problems we have solved during that process. Also, what services we have leveraged to smooth the transition. And last, but not least - I would like to present how we have maintained the delivery pipeline, automation and massive pile of CloudFormation templates and why AWS Lambda is an excellent glue for any operational work you have to do in the cloud. Our hard work paid off. In October 2017 we have deployed our system into 4th AWS region. Bare with me during the talk, and you will learn how we achieved that
In 2014, AWS launched more than 400 new services and features further allowing more than a million customers, including fast-growing startups, large enterprises, and government agencies across 190 countries, to innovate quickly, lower IT costs and scale applications globally. In this session Glenn picks his Top 10 AWS launches in 2014 and the potential impact these new services can have for your business to drive innovation and growth while lowering IT costs.
As the Head of Architecture for Amazon Web Services, APAC, Glenn is responsible for creating architectural best practices and working with customers on how they use the cloud for business transformation and supporting customers that are making strategic investments with AWS.
Reasons to attend:
- Keep up with the rapidly evolving AWS Cloud platform.
- Learn about the new features and services that have become available during 2014.
- Get the very latest details on Service updates and announcements.
How to move a mission critical system to 4 AWS regions in one year?Wojciech Gawroński
A year ago our team was challenged to enhance the scope and scale of an existing platform, that is providing significant revenue for our client. As the designers and maintainers of that solution, we decided to leverage AWS cloud during that transition. In the presentation, I would like to discuss how we have tackled that migration - with the assumption that we had to move in a limited resource, hybrid cloud environment - working in close cooperation with teams responsible for other parts of the system. As I stated previously - it was a challenge - and I would like to talk what problems we have solved during that process. Also, what services we have leveraged to smooth the transition. And last, but not least - I would like to present how we have maintained the delivery pipeline, automation and massive pile of CloudFormation templates and why AWS Lambda is an excellent glue for any operational work you have to do in the cloud. Our hard work paid off. In October 2017 we have deployed our system into 4th AWS region. Bare with me during the talk, and you will learn how we achieved that
In 2014, AWS launched more than 400 new services and features further allowing more than a million customers, including fast-growing startups, large enterprises, and government agencies across 190 countries, to innovate quickly, lower IT costs and scale applications globally. In this session Glenn picks his Top 10 AWS launches in 2014 and the potential impact these new services can have for your business to drive innovation and growth while lowering IT costs.
As the Head of Architecture for Amazon Web Services, APAC, Glenn is responsible for creating architectural best practices and working with customers on how they use the cloud for business transformation and supporting customers that are making strategic investments with AWS.
Reasons to attend:
- Keep up with the rapidly evolving AWS Cloud platform.
- Learn about the new features and services that have become available during 2014.
- Get the very latest details on Service updates and announcements.
Multi cloud network leveraging sd-wan reference architectureMatsuo Sawahashi
A lot of clients use multiple public clouds as a result of selecting a cloud that meets their requirements.
In connecting on-premise DCs and clouds, as a result of connecting with individual lines, it has become a complicated, lacking flexibility, and high cost network.
This presentation produces a simple and flexible with low cost multi-cloud networking reference architecture leveraging a cloud exchange service with SD-WAN (Software Defined Wide Area Network).
Enterprise Service Delivery from the AWS Cloud (ARC208) | AWS re:Invent 2013Amazon Web Services
(Presented by Citrix)
As we move to a world where all users are mobile and apps are increasingly delivered from the cloud, security, compliance, and user experience service-level expectations are higher than ever, necessitating that IT look beyond traditional methods for delivering applications. However, there are intelligent cloud networking and provisioning solutions on AWS that can be leveraged to create a service delivery model that addresses the new paradigm. Learn how Citrix NetScaler VPX on AWS provides full application visibility and control through a combination of customer case studies and demos.
In this session, you learn how to:
-Deploy Citrix application delivery technologies (NetScaler, NetScaler Gateway, CloudBridge) into AWS
-Optimize next-gen web applications delivered from AWS, using traffic management and application acceleration capabilities
-Provide global application availability across on-premises data centers and multiple AWS regions using CloudBridge, global server load balancing, and Amazon Route 53 DNS
Unified Management of Wi-Fi & LAN with Ruckus Unleashed and Cloud Solutions
During this webinar we covered three major technologies under unified management umbrella:
• Fast, efficient, low-latency wired infrastructure able to support most demanding customer environments.
• Cutting edge best on the market Wi-Fi including new Wi-Fi6 (802.11ax) standard that is recently gaining its market momentum and becomes available in easy to deploy,
scalable networks with robust security and unified wired and wireless management.
• Cloud Management built upon SmartZone controller and taking everything best out of it.
All this with an intuitive interface making administrative tasks fast and efficient without the complexity of typical controller-based deployments.
A Day in the Life of a Cloud Network Engineer at Netflix - NET303 - re:Invent...Amazon Web Services
Netflix is big and dynamic. At Netflix, IP addresses mean nothing in the cloud. This is a big challenge with Amazon VPC Flow Logs. VPC Flow Log entries only present network-level information (L3 and L4), which is virtually meaningless. Our goal is to map each IP address back to an application, at scale, to derive true network-level insight within Amazon VPC. In this session, the Cloud Network Engineering team discusses the temporal nature of IP address utilization in AWS and the problem with looking at OSI Layer 3 and Layer 4 information in the cloud.
Analytical, prototyping, model-based systems engineering and custom discrete-event model development of automotive networks are inaccurate, expensive, and takes too long to do detailed routing analysis, Quality-of-Service (QoS) trade-off, and bandwidth exploration. To capture the nuances of QoS, scheduling, buffer management, and network topologies, these solutions require a considerable amount of time, costs, and customization. To achieve the reliability of wiring harness, the latency and bandwidth measurements of automotive networks must be accurate, tested for failure conditions, and simulated for security breaches, traffic spikes, and translations.
QConPlus'21 - Beating the Speed of Light with Intelligent Request RoutingSergey Fedorov
Network request latency is crucial for many Internet applications. For Netflix it matters even outside video streaming - lower latencies to our AWS cloud endpoints mean smoother browsing experience for hundreds of millions of members. The catch - Netflix service is used on hundreds of millions of devices all around the world, connecting to our data centers over the open Internet - an ever-changing global network with many possible paths, distributed ownership and lack of centralized control.
This talk is both about API acceleration technology and data-driven approach to building distributed systems at a global scale that are safe to deploy and easy to maintain. While this talk demonstrates Netflix’s journey, the main principles and techniques can easily be applied and practiced by every owner of Internet-based services.
From this talk you’ll learn:
- how to build the Internet latency map for your customers;
- how to leverage the knowledge of network protocols and edge infrastructure to do the impossible - beat the speed of light;
- how to use a data-driven approach to evolve your client-server interactions;
- how to do that with a small team, on a tight schedule and minimal risk to your users.
OpWorks is a customizable, web-based solution for facility operators, superintendents, and managers. OpWorks lets you create a single user interface for data entry and reporting, and assists with making better facility management and operating decisions.
Pace of Innovation at AWS - London Summit Enteprise Track RePlayAmazon Web Services
In this session we will present some of the stories around innovation in the Amazon Web Services Datacenter including how it partners with other vendors in the hardware space.
Telstra Programmable Networks & Scaling a Serverless Team with AutomationPolarSeven Pty Ltd
Hosted by PolarSeven Cloud Consulting - http://polarseven.com
Our monthly AWS User Group Sydney presentation night.
http://www.meetup.com/AWS-Sydney/
Introductions and What's New In AWS - by PolarSeven
Session 1:
Telstra
https://www.telstra.com.au/
A new kind of network that will adapt with you in real time, offer a new level of control and flexibility and evolve to suit your business. Peter Galloway, formerly with AWS & Cisco, and now with Telstra will be decoding TPN, SDN, NFV and what it means for AWS.
Watch the video presentation here
https://youtu.be/nV8NRrVfb-w
Session 2:
A Cloud Guru
https://acloud.guru/
Dan Parker, Software Developer at A Cloud Guru is going to talk about automation inside serverless teams. He’ll show you how they’ve fully automated the creation & destruction of serverless environments using Ansible, CloudFormation & Serverless Framework.
Watch the video presentation here
https://youtu.be/tC4CxoNVkkw
As enterprises move to the cloud, robust connectivity is often an early consideration. AWS Direct Connect provides a more consistent network experience for accessing your AWS resources, typically with greater bandwidth and reduced network costs. This session dives deep into the features of AWS Direct Connect and VPNs. We discuss deployment architectures and demonstrate the process from start to finish. We’ll show you how to configure public and private virtual interfaces, configure routers, use VPN backup, and provide secure communication between sites by using the AWS VPN CloudHub.
Azure vs AWS Best Practices: What You Need to KnowRightScale
Azure is now the clear #2 in public cloud behind AWS. While some cloud users are evaluating Azure vs. AWS, many enterprises are planning to use both cloud providers. But there are some notable differences between how the two clouds operate and the best practices for deploying workloads in each.
The Azure vs. AWS Best Practices: What You Need to Know webinar will cover:
Recent and coming enhancements for Azure.
Azure vs. AWS differences for compute, networking, and storage.
Best practices for cloud deployments in Azure and AWS.
How to use both Azure and AWS.
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
Multi cloud network leveraging sd-wan reference architectureMatsuo Sawahashi
A lot of clients use multiple public clouds as a result of selecting a cloud that meets their requirements.
In connecting on-premise DCs and clouds, as a result of connecting with individual lines, it has become a complicated, lacking flexibility, and high cost network.
This presentation produces a simple and flexible with low cost multi-cloud networking reference architecture leveraging a cloud exchange service with SD-WAN (Software Defined Wide Area Network).
Enterprise Service Delivery from the AWS Cloud (ARC208) | AWS re:Invent 2013Amazon Web Services
(Presented by Citrix)
As we move to a world where all users are mobile and apps are increasingly delivered from the cloud, security, compliance, and user experience service-level expectations are higher than ever, necessitating that IT look beyond traditional methods for delivering applications. However, there are intelligent cloud networking and provisioning solutions on AWS that can be leveraged to create a service delivery model that addresses the new paradigm. Learn how Citrix NetScaler VPX on AWS provides full application visibility and control through a combination of customer case studies and demos.
In this session, you learn how to:
-Deploy Citrix application delivery technologies (NetScaler, NetScaler Gateway, CloudBridge) into AWS
-Optimize next-gen web applications delivered from AWS, using traffic management and application acceleration capabilities
-Provide global application availability across on-premises data centers and multiple AWS regions using CloudBridge, global server load balancing, and Amazon Route 53 DNS
Unified Management of Wi-Fi & LAN with Ruckus Unleashed and Cloud Solutions
During this webinar we covered three major technologies under unified management umbrella:
• Fast, efficient, low-latency wired infrastructure able to support most demanding customer environments.
• Cutting edge best on the market Wi-Fi including new Wi-Fi6 (802.11ax) standard that is recently gaining its market momentum and becomes available in easy to deploy,
scalable networks with robust security and unified wired and wireless management.
• Cloud Management built upon SmartZone controller and taking everything best out of it.
All this with an intuitive interface making administrative tasks fast and efficient without the complexity of typical controller-based deployments.
A Day in the Life of a Cloud Network Engineer at Netflix - NET303 - re:Invent...Amazon Web Services
Netflix is big and dynamic. At Netflix, IP addresses mean nothing in the cloud. This is a big challenge with Amazon VPC Flow Logs. VPC Flow Log entries only present network-level information (L3 and L4), which is virtually meaningless. Our goal is to map each IP address back to an application, at scale, to derive true network-level insight within Amazon VPC. In this session, the Cloud Network Engineering team discusses the temporal nature of IP address utilization in AWS and the problem with looking at OSI Layer 3 and Layer 4 information in the cloud.
Analytical, prototyping, model-based systems engineering and custom discrete-event model development of automotive networks are inaccurate, expensive, and takes too long to do detailed routing analysis, Quality-of-Service (QoS) trade-off, and bandwidth exploration. To capture the nuances of QoS, scheduling, buffer management, and network topologies, these solutions require a considerable amount of time, costs, and customization. To achieve the reliability of wiring harness, the latency and bandwidth measurements of automotive networks must be accurate, tested for failure conditions, and simulated for security breaches, traffic spikes, and translations.
QConPlus'21 - Beating the Speed of Light with Intelligent Request RoutingSergey Fedorov
Network request latency is crucial for many Internet applications. For Netflix it matters even outside video streaming - lower latencies to our AWS cloud endpoints mean smoother browsing experience for hundreds of millions of members. The catch - Netflix service is used on hundreds of millions of devices all around the world, connecting to our data centers over the open Internet - an ever-changing global network with many possible paths, distributed ownership and lack of centralized control.
This talk is both about API acceleration technology and data-driven approach to building distributed systems at a global scale that are safe to deploy and easy to maintain. While this talk demonstrates Netflix’s journey, the main principles and techniques can easily be applied and practiced by every owner of Internet-based services.
From this talk you’ll learn:
- how to build the Internet latency map for your customers;
- how to leverage the knowledge of network protocols and edge infrastructure to do the impossible - beat the speed of light;
- how to use a data-driven approach to evolve your client-server interactions;
- how to do that with a small team, on a tight schedule and minimal risk to your users.
OpWorks is a customizable, web-based solution for facility operators, superintendents, and managers. OpWorks lets you create a single user interface for data entry and reporting, and assists with making better facility management and operating decisions.
Pace of Innovation at AWS - London Summit Enteprise Track RePlayAmazon Web Services
In this session we will present some of the stories around innovation in the Amazon Web Services Datacenter including how it partners with other vendors in the hardware space.
Telstra Programmable Networks & Scaling a Serverless Team with AutomationPolarSeven Pty Ltd
Hosted by PolarSeven Cloud Consulting - http://polarseven.com
Our monthly AWS User Group Sydney presentation night.
http://www.meetup.com/AWS-Sydney/
Introductions and What's New In AWS - by PolarSeven
Session 1:
Telstra
https://www.telstra.com.au/
A new kind of network that will adapt with you in real time, offer a new level of control and flexibility and evolve to suit your business. Peter Galloway, formerly with AWS & Cisco, and now with Telstra will be decoding TPN, SDN, NFV and what it means for AWS.
Watch the video presentation here
https://youtu.be/nV8NRrVfb-w
Session 2:
A Cloud Guru
https://acloud.guru/
Dan Parker, Software Developer at A Cloud Guru is going to talk about automation inside serverless teams. He’ll show you how they’ve fully automated the creation & destruction of serverless environments using Ansible, CloudFormation & Serverless Framework.
Watch the video presentation here
https://youtu.be/tC4CxoNVkkw
As enterprises move to the cloud, robust connectivity is often an early consideration. AWS Direct Connect provides a more consistent network experience for accessing your AWS resources, typically with greater bandwidth and reduced network costs. This session dives deep into the features of AWS Direct Connect and VPNs. We discuss deployment architectures and demonstrate the process from start to finish. We’ll show you how to configure public and private virtual interfaces, configure routers, use VPN backup, and provide secure communication between sites by using the AWS VPN CloudHub.
Azure vs AWS Best Practices: What You Need to KnowRightScale
Azure is now the clear #2 in public cloud behind AWS. While some cloud users are evaluating Azure vs. AWS, many enterprises are planning to use both cloud providers. But there are some notable differences between how the two clouds operate and the best practices for deploying workloads in each.
The Azure vs. AWS Best Practices: What You Need to Know webinar will cover:
Recent and coming enhancements for Azure.
Azure vs. AWS differences for compute, networking, and storage.
Best practices for cloud deployments in Azure and AWS.
How to use both Azure and AWS.
Similar to Internet Backbone Latency Variability Performance Study Results (20)
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
Explore our comprehensive data analysis project presentation on predicting product ad campaign performance. Learn how data-driven insights can optimize your marketing strategies and enhance campaign effectiveness. Perfect for professionals and students looking to understand the power of data analysis in advertising. for more details visit: https://bostoninstituteofanalytics.org/data-science-and-artificial-intelligence/
Adjusting primitives for graph : SHORT REPORT / NOTESSubhajit Sahu
Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
3. Methodology: Tools used
TOOL DESCRIPTION
Cedexis • Cedexis Measures Internet Health
• Collects 14 billion RUM data points per day
• Data collection methodology
Catchpoint • Catchpoint is a leading monitoring system
• Data collection methodology
• We set up tests from last mile agents to Speedtest last mile servers
Speedtest • Ookla/Speedtest is a leading last mile performance tool
• Used Catchpoint to test against Speedtest servers
4. Methodology: Definition of “Response”
Response = Send + Wait
• Excludes all one-time setup/negotiations like DNS and Connect
• Response is a better measure of real Internet response than Ping
DNS SERVER
WEB SERVER
CLIENT CLIENT1. DNS Lookup 2. Connect 3. TLS 4. Send 5. Wait 6. Load
a b a b c a a b c
RESPONSE
6. Methodology
Calculate CORE by subtracting the last mile
FIRST
MILE
MIDDLE
MILE
LAST
MILE
X
Y
Core
Response
Core
Response Long Haul
(Cedexis)
Response Last Mile
(Catchpoint/Speedtest) =-
7. Results
Long Haul (Cedexis) Last Mile (Catchpoint/Speedtest) Core
End User Location End User ISP Origin Server
Median
(ms)
SD
(ms)
Variance
(ms2)
Median
(ms)
SD
(ms)
Variance
(ms2)
Median
(ms)
SD
(ms)
Diff Variances
(ms2)
Bangalore Atria SJC AWS 224 125 15625 3 5.88 34.57 221 125 15590
Bangalore Atria London AWS 148 92 8464 3 5.88 34.57 145 92 8429
Bangalore Atria Tokyo AWS 119 99 9801 3 5.88 34.57 116 99 9766
Bangalore Atria Sydney AWS 295 106 11236 3 5.88 34.57 292 106 11201
DC Cox SJC AWS 101 119 14161 16 6.08 36.97 85 119 14124
DC Cox Tokyo AWS 186 112 12544 16 6.08 36.97 170 112 12507
DC Cox Sydney AWS 264 163 26569 16 6.08 36.97 248 163 26532
Tokyo Ucom SJC AWS 105 102 10404 4 7.46 55.65 101 102 10348
Tokyo Ucom London AWS 205 98 9604 4 7.46 55.65 201 98 9548
Tokyo Ucom Sydney AWS 191 68 4624 4 7.46 55.65 187 68 4568
London BT SJC AWS 174 123 15129 10 12 144.00 164 122 14985
London BT Tokyo AWS 271 160 25600 10 12 144.00 261 160 25456
London BT Sydney AWS 348 182 33124 10 12 144.00 338 182 32980
Melbourne Singtel/Optus SJC AWS 198 92 8464 6 5.2 27.04 192 92 8437
Melbourne Singtel/Optus London AWS 337 191 36481 6 5.2 27.04 331 191 36454
SF Comcast London AWS 166 134 17956 17 9.08 82.45 149 134 17874
SF Comcast Virginia AWS 89 103 10609 17 9.08 82.45 72 103 10527
8. Why the Problem in the Middle Mile?
• $$$ 20X* investment
• paid for by customers
• $ 6X* investment
• least cost peering and routing
• $$$ 50X* investment
• paid for by customers
FIRST
MILE
LAST
MILE
* Source: Akamai
Telia
Carrier
GTT
LEVEL 3
9. Example: Core & Last Mile Traceroutes
FIRST
MILE
MIDDLE
MILE
LAST
MILE
12. Middle Mile (Core) Median (ms) SD (ms)
City Backbone Origin Server Part 1 Part 2 Part 1 Part 2
Bangalore Tata SJC AWS 221 224 125 153
Bangalore Tata London AWS 145 153 92 50
Bangalore Tata Tokyo AWS 116 114 99 66
Bangalore Tata Sydney AWS 292 300 106 154
DC Comcast SJC AWS 85 74 119 35
DC Comcast Tokyo AWS 170 191 112 118
DC Comcast Sydney AWS 248 250 163 346
Tokyo NTT SJC AWS 101 109 102 70
Tokyo NTT London AWS 201 230 98 81
Tokyo NTT Sydney AWS 187 110 68 84
London BT SJC AWS 164 181 122 228
London BT Tokyo AWS 261 271 160 273
London BT Sydney AWS 338 336 182 346
Melbourne Telstra SJC AWS 192 168 92 9
Melbourne Telstra London AWS 331 307 191 1725
SF Level3 London AWS 149 145 134 68
SF Level3 Virginia AWS 72 76 103 71
Results
This one is interesting.
Telstra is doing something
special with SJC AWS
15. Part 2 Backbone - AWS Part 3 AWS – AWS Part 2 Backbone – AWS Part 3 AWS - AWS
Backbone Agent AWS Agent Origin Server Median (ms) Median (ms) SD (ms) SD (ms)
Tata Mumbai AWS SJC AWS 224 240 152.86 6.18
Tata Mumbai AWS London AWS 153 113 50.13 3.63
Tata Mumbai AWS Tokyo AWS 114 121 65.78 4.41
Tata Mumbai AWS Sydney AWS 300 228 154.17 9.43
Comcast DC AWS SJC AWS 74 61 34.54 9.91
Comcast DC AWS Tokyo AWS 191 172 118.08 7.63
Comcast DC AWS Sydney AWS 250 205 345.50 5.22
NTT Tokyo AWS SJC AWS 109 113 69.81 6.70
NTT Tokyo AWS London AWS 230 247 81.13 6.19
NTT Tokyo AWS Sydney AWS 110 104 83.60 11.78
BT London AWS SJC AWS 181 137 227.80 10.51
BT London AWS Tokyo AWS 271 247 272.98 122.24
BT London AWS Sydney AWS 336 281 345.50 16.87
Telstra Sydney AWS SJC AWS 168 147 9.36 10.59
Telstra Sydney AWS London AWS 307 280 1725.37 16.77
Level3 SJC AWS London AWS 145 140 68.11 52.87
Level3 SJC AWS Virginia AWS 76 63 71.17 91.82
DC Azure SJC AWS 73 9.47
Results
Poor stability
AWS-to-AWS on-net
is significantly more
consistent/reliable
AWS-to-AWS on-net has better/lower latency
16. Conclusions
• Internet variability is most dramatic in the core
• On high latency paths, to provide the stable performance
needed by latency sensitive applications, use a private
network such as:
• SD-CORE accessed via internet VPN to local POPs, or
• MPLS, which requires fiber or wire connections
• SD-WAN using two internet connections can mitigate much
of the public internet variability, depending on overall path
17. Conclusions
• Understand that latency varies on the internet
• Look at your application requirements for latency and packet
loss
• If a median latency is 70ms with a SD of 34 ms is adequate
for your application performance, you don’t need MPLS with
two internet circuits.
• If median latency for a path is 307 ms with an SD 1,725 ms,
you might not want to depend purely on the internet
• There are differences in internet backbones.
This project began as a result of the widespread marketing of SD-WAN as a way to replace for MPLS. I have seen plenty of SD-WAN implementation work very well using internet connectivity only. But I also don’t believe the hype when it comes to making recommendations to enterprise clients that are asking for my advice. So while working with a client with offices in India and performing analysis with Netflow and ThousandEyes, I had statistical evidence of this variability. This lead to the further study that I will summarized today.
RUM data points are “Real Use Monitoring” data points.
Cedexis Radar collects data from more than 50,000 networks daily with fees from 130 service providers.
Real user monitoring (RUM) means fully understanding how internet performance impacts customer satisfaction and engagement.
Cedexis Radar gathers RUM data from each step between the client and any of the clouds, data centers, and CDNs hosting your applications
to build a holistic picture of internet health. Every request creates more data, continuously updating this unique real-time virtual map of the web.
A small piece of nonblocking RUM specific JavaScript is inserts into the designated web pages. When an end user visits a RUM enabled page, the RUM JavaScript will collect performance data and beacon it back to Akamai via a 1x1 pixel. The data is then processed and stored for visualization within the portal.
Catchpoint has a global network of 700+ monitoring nodes spanning backbone, broadband, cloud, enterprise, last mile, and wireless. Comprehensively detect issues across third party services, CDNs, DNS, APIs, cloud providers, networks, systems, and more.
Ookla/Speedtest uses up to four HTTP threads during the download and upload portions of the test. ... After the pre-test, if the connection speed is at least 4 megabits per second, then Speedtest.net will use four threads. Otherwise, it will default to two threads.
So how did we use these tools?
When measuring Response, we are measuring the send and wait time. This is better than ping, which can have a low priority response and not be an accurate measure of response.
But we exclude DNS and connect, whose response times can vary, unrelated to our measurement goals and therefore make our results invalid.
So how did we calculate the Internet Core performance?
The CORE or middle mile, was calculated by measuring the Long Haul response (using Cedexis) and subtracting the Last Mile response, using Catchpoint and Speedtest, which are the tools I described in slide 2.
The results are quite striking. For example:
Bangalore to San Jose: 221ms with a SD of 125 ms. So latency could be as high as 346 ms
London to Sydney: 338 ms with a SD of 182 ms. So latency could be as high as 520 ms
Even a relatively short path like SF to Virginia: 72ms with an SD of 103ms. Or as high as 175ms!
For latency sensitive applications like voice or video would you feel comfortable with depending 100% on the internet for these long paths?
Definition
Variance (symbolized by S2) is a measure of how spread out a data set is. It is calculated as the average squared deviation of each number from the mean of a data set.
Standard deviation (the square root of the variance, symbolized by S).
Why is the problem in the middle mile?
There is far more investment in the first and last mile.
Least cost peering and routing is going to take the more cost effective path, unless you are willing pay.
While you surely cannot read this. The point is this:
With this 216 ms path from India to San Jose.
The last mile traceroute has 5 hops, zero packet loss and two ASNs
The core traceroute has 18 hops across 6 ASNs and 3 countries.
Think of the potential BGP changes across this path
So let’s look at the Core measurements
Part 2 = Internet Core
Part 3 = AWS backbone
DC Azure to SJC AWS is slower than DC AWS to SJC AWS