SlideShare a Scribd company logo
1 of 22
Analyses and Takeaways
Featured Speakers
IanWaters
EMEA Marketing Director
Mike Hicks
Principal Solution Analyst
3
© 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
BeforeWe Begin…
• If you have any questions, please type them in the Questions window.
• If you have any audio problems, please chat us for help.
• A recording of this presentation will be sent to you in a few days.
• Interested in more outage analysis and Internet insights? Check out the ThousandEyes blog and The
Internet Report podcast.
Anatomy of an Outage
• Understanding different types of
Internet outages is important to mitigate
their impact.
• Outages can vary in blast radius, be
planned or unplanned, and have varying
MTTR.
• Network outages depend on where the
problem occurs, with transit network
incidents impacting multiple providers.
• Tracking outages can help teams
identify patterns and prevent customer
service disruptions.
© 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
© 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
Outage and Degradation Impacts
BGP
ISP
CDN
DNS SaaS Apps
Services
APIs
Data Center
Cloud
DDoS
Protection
SSE
RISKAND
COMPLIANCE
Is our traffic
getting routed out
of region?
SERVICE
AVAILABILITY
Which cloud
regions are
impacted?
SITUATIONAL
AWARENESS
Are regional ISPs
spoofing our DNS
records?
SERVICE
RECOVERY
Did we
successfully
cut over to
our DDoS
mitigation
service?
NETWORK
SECURITY
Are SASE routing
policies working
as we expect?
CUSTOMER
SUPPORT
Is an Internet
outage preventing
users from reaching
our service?
WORKFORCE
PRODUCTIVITY
Will our Salesforce
dev updates degrade
performance
for some global users?
$32,000
$120,000
$3,500
3474
REVENUE
PROTECTION
Is the payment
gateway down
or just
unreachable?
2023 Outages by the Numbers: ISP Compared to CSP
• ThousandEyes reported an increase in cloud service provider (CSP) outages in 2023.
• CSP outages are the second most common type of disruption after ISP outages.
• The ratio of CSP outages to ISP outages increased in 2023.
© 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
2023 Outages by the Numbers:
U.S.-centric Outages in Relation to Global Outages
• U.S.-centric outages increased to 37% in 2023 from 34% in 2022.
• Smaller, contained outages are becoming more common.
• Localized outages have different impacts and require different responses compared to global outages.
© 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
2023 Outages by the Numbers: Application Outages
• The number and frequency of application outages have been on the rise over the past year.
• Application-related disruptions can have a bigger impact than network outages, though they are not as common.
© 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
Connections Are Complex
Branch Office
Employee
BYOD
Corp devices
IOT
Cameras and
sensors
IoT
VDI
People, places,
and things
Edge
BYOD
Data Center
IOT
Core
network
Mobile
networks
Core
network
Peering
Access
networks
Wireless
network
Wireless
gateway
DNS
Cloud
and SaaS
Cloud
providers
Datacenter
infrastructure
Cloud connectivity
Direct
connect
ISP transit
providers
SaaS
onramp
© 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
8
3
9
3
5
4
6
8
6
Correlate
performance
across every layer
Time Correlated
Microsoft
(1/25)
Outlook
(2/7)
Virgin Media
(4/4)
AWS
(6/13)
Slack (8/2)
Square
(9/8)
Workday +
Cloudflare (11/2)
2023 Outage Timeline
Purple = Application Outage
Red = Network Outage
Blue = Infrastructure Outage
Bookmark the InternetOutages
Timeline for outage updates
throughout the year.
© 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
Microsoft 365 (1/25/23)
• Microsoft started experiencing service related issues around 07:05
AM (UTC).
• The disruption was triggered by an external BGP change
by Microsoft that impacted connected service providers
• Microsoft BGP prefixes were withdrawn completely
but then almost immediately re-advertised.
• Affected smaller (/24) prefixes and summary prefixes (/12).
• Cascading impact on global routing tables, causing significant
churn.
• Prefixes were either withdrawn or re-advertised to
transit providers.
• Large amount of packet loss were seen as well as
HTTP and DNS timeouts.
• Timeouts seen in the application “Response,” further indicating the
effect of the network on service availability.
© 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
Microsoft Outlook (2/7/23)
• Starting around 03:55 UTC,
Outlook became unavailable.
• Network path was working
properly, butThousandEyes
observed elevated server response
timeouts and slow page loading.
• Majority of the errors were HTTP
server timeouts, indicating an
application issue.
• Incident was mostly concentrated
in the U.S. and lasted ~2 hours.
© 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
Virgin Media (4/4/23)
• From approximately 00:30 to 17:30 UTC,
two outages impacted the reachability of
Virgin Media UK’s network and services.
• The first incident began at approximately
00:30 UTC and appeared to coincide with a
series of BGP route withdrawals.
• Second incident was shorter, but the
networks experienced similar BGP and
reachability issues.
• Outages were overnight and due to the
repeat nature, could indicate maintenance
issues.
© 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
AWS (6/13/23)
• Outage impacted services within US-EAST-1.
• Lasted two hours and increased latency,
server timeouts, and HTTP server errors were
observed.
• AWS console access was also affected,
making troubleshooting difficult.
• AWS confirmed the issue was due to a
capacity management subsystem failure.
• Organizations leveraging cloud services, such
as those offered byAWS, should be aware of
the relationships in their digital ecosystem,
regardless of whether those relationships are
services or networks.
© 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
Slack (8/2/23)
• Application outage that lasted from 4:01 PM
to 6 PM (UTC).
• Network paths and accessibility were
unaffected.
• Initially could be seen as HTTP 500 errors and
higher-than-normal page load times.
• During the outage, users were unable to
upload files or share screenshots.
• Root cause—work on a “routine database
cluster migration”—that accidentally reduced
database capacity to the point that it could
not support a regularly scheduled job
running.
© 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
Square (9/8/23)
• Outage lasted over 18 hours.
• Backend issue that prevented the
platform from processing payment
transactions.
• Users reported various problems,
from terminal connections dropping
out, to payments appearing to
complete but then not showing up
in business accounts.
• ThousandEyes observed
intermittent dropouts and 503
‘service unavailable’ errors.
© 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
Workday + Cloudflare (11/2/23)
• Cloudflare andWorkday
experienced a major outage due to
multiple infrastructure provider
failures.
• DR resources took 6 hours to come
online and full resolution took 36
hours.
• Initial cause was a partial mains
power outage at a Flexential data
center in Portland.
• Further generator and grid failures
resulted in a complete power loss
and ungraceful shutdown.
© 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
Takeaways
• Understanding how your application works is important for quickly identifying failures and making improvements.
• Just because your application is working doesn't mean it's functioning optimally.
• Knowing how all parts of the service work together is crucial for ongoing design and future optimizations.
• Improved visibility and operational optimizations can prevent outages and minimize their impact.
• Tracking different categories of outages and degradations over time can be helpful.
© 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
• Subscribe to our blog to keep up-to-date!
thousandeyes.com/blog/
• Tune in toThe Internet Report Podcast.
https://www.thousandeyes.com/the-internet-report/
20
© 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
Next Steps
• New tutorial videos on our features
thousandeyes.com/resources/?cat=tutorial
• New Getting Started Guides
docs.thousandeyes.com/product-documentation/getting-started
Blog and
Podcast
Learning
Resources
Support
Community
• Still have questions? Ask us on the ThousandEyes Support
Community AMA: http://bit.ly/2023Outages
Q&A
21
© 2023 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
22
© 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.

More Related Content

Similar to The Top Outages of 2023: Analyses and Takeaways

Outage Analysis: March 5th/6th 2024 Meta, Comcast, and LinkedIn
Outage Analysis: March 5th/6th 2024 Meta, Comcast, and LinkedInOutage Analysis: March 5th/6th 2024 Meta, Comcast, and LinkedIn
Outage Analysis: March 5th/6th 2024 Meta, Comcast, and LinkedInThousandEyes
 
Microsoft Outage Analysis
Microsoft Outage AnalysisMicrosoft Outage Analysis
Microsoft Outage AnalysisThousandEyes
 
Introduction to ThousandEyes
Introduction to ThousandEyesIntroduction to ThousandEyes
Introduction to ThousandEyesThousandEyes
 
How to Evaluate, Rollout and Operationalize Your SD-WAN Projects
How to Evaluate, Rollout and Operationalize Your SD-WAN ProjectsHow to Evaluate, Rollout and Operationalize Your SD-WAN Projects
How to Evaluate, Rollout and Operationalize Your SD-WAN ProjectsThousandEyes
 
How to Evaluate, Rollout, and Operationalize Your SD-WAN Projects
How to Evaluate, Rollout, and Operationalize Your SD-WAN ProjectsHow to Evaluate, Rollout, and Operationalize Your SD-WAN Projects
How to Evaluate, Rollout, and Operationalize Your SD-WAN ProjectsThousandEyes
 
Visualizing Your Network Health - Driving Visibility in Increasingly Complex...
Visualizing Your Network Health -  Driving Visibility in Increasingly Complex...Visualizing Your Network Health -  Driving Visibility in Increasingly Complex...
Visualizing Your Network Health - Driving Visibility in Increasingly Complex...DellNMS
 
Optimizing and Troubleshooting Digital Experience for a Hybrid Workforce
Optimizing and Troubleshooting Digital Experience for a Hybrid WorkforceOptimizing and Troubleshooting Digital Experience for a Hybrid Workforce
Optimizing and Troubleshooting Digital Experience for a Hybrid WorkforceThousandEyes
 
EMEA Optimizing and Troubleshooting Digital Experience for a Hybrid Workforce
EMEA Optimizing and Troubleshooting Digital Experience for a Hybrid WorkforceEMEA Optimizing and Troubleshooting Digital Experience for a Hybrid Workforce
EMEA Optimizing and Troubleshooting Digital Experience for a Hybrid WorkforceThousandEyes
 
Introduction To ThousandEyes
Introduction To ThousandEyesIntroduction To ThousandEyes
Introduction To ThousandEyesThousandEyes
 
0328apjcintrotothousandeyeswebinar-230328233735-4df10d7f.pdf
0328apjcintrotothousandeyeswebinar-230328233735-4df10d7f.pdf0328apjcintrotothousandeyeswebinar-230328233735-4df10d7f.pdf
0328apjcintrotothousandeyeswebinar-230328233735-4df10d7f.pdfSaurabh Chauhan
 
Optimizing and Troubleshooting Digital Experience for a Hybrid Workforce
Optimizing and Troubleshooting Digital Experience for a Hybrid WorkforceOptimizing and Troubleshooting Digital Experience for a Hybrid Workforce
Optimizing and Troubleshooting Digital Experience for a Hybrid WorkforceThousandEyes
 
Getting Started with ThousandEyes Proof of Concepts
Getting Started with ThousandEyes Proof of ConceptsGetting Started with ThousandEyes Proof of Concepts
Getting Started with ThousandEyes Proof of ConceptsThousandEyes
 
Introduction to ThousandEyes
Introduction to ThousandEyesIntroduction to ThousandEyes
Introduction to ThousandEyesThousandEyes
 
The Top Outages of 2021: Analysis and Takeaways
The Top Outages of 2021: Analysis and TakeawaysThe Top Outages of 2021: Analysis and Takeaways
The Top Outages of 2021: Analysis and TakeawaysThousandEyes
 
Visualizing Your Network Health - Know your Network
Visualizing Your Network Health - Know your NetworkVisualizing Your Network Health - Know your Network
Visualizing Your Network Health - Know your NetworkDellNMS
 
Introduction to ThousandEyes
Introduction to ThousandEyesIntroduction to ThousandEyes
Introduction to ThousandEyesThousandEyes
 
Introduction to ThousandEyes
Introduction to ThousandEyesIntroduction to ThousandEyes
Introduction to ThousandEyesThousandEyes
 
Introduction to ThousandEyes
Introduction to ThousandEyesIntroduction to ThousandEyes
Introduction to ThousandEyesThousandEyes
 
06_08_emea_how_to_evaluate_rollout_and_operationalize_your_sdwan_projects_web...
06_08_emea_how_to_evaluate_rollout_and_operationalize_your_sdwan_projects_web...06_08_emea_how_to_evaluate_rollout_and_operationalize_your_sdwan_projects_web...
06_08_emea_how_to_evaluate_rollout_and_operationalize_your_sdwan_projects_web...ThousandEyes
 
Introduction to ThousandEyes
Introduction to ThousandEyesIntroduction to ThousandEyes
Introduction to ThousandEyesThousandEyes
 

Similar to The Top Outages of 2023: Analyses and Takeaways (20)

Outage Analysis: March 5th/6th 2024 Meta, Comcast, and LinkedIn
Outage Analysis: March 5th/6th 2024 Meta, Comcast, and LinkedInOutage Analysis: March 5th/6th 2024 Meta, Comcast, and LinkedIn
Outage Analysis: March 5th/6th 2024 Meta, Comcast, and LinkedIn
 
Microsoft Outage Analysis
Microsoft Outage AnalysisMicrosoft Outage Analysis
Microsoft Outage Analysis
 
Introduction to ThousandEyes
Introduction to ThousandEyesIntroduction to ThousandEyes
Introduction to ThousandEyes
 
How to Evaluate, Rollout and Operationalize Your SD-WAN Projects
How to Evaluate, Rollout and Operationalize Your SD-WAN ProjectsHow to Evaluate, Rollout and Operationalize Your SD-WAN Projects
How to Evaluate, Rollout and Operationalize Your SD-WAN Projects
 
How to Evaluate, Rollout, and Operationalize Your SD-WAN Projects
How to Evaluate, Rollout, and Operationalize Your SD-WAN ProjectsHow to Evaluate, Rollout, and Operationalize Your SD-WAN Projects
How to Evaluate, Rollout, and Operationalize Your SD-WAN Projects
 
Visualizing Your Network Health - Driving Visibility in Increasingly Complex...
Visualizing Your Network Health -  Driving Visibility in Increasingly Complex...Visualizing Your Network Health -  Driving Visibility in Increasingly Complex...
Visualizing Your Network Health - Driving Visibility in Increasingly Complex...
 
Optimizing and Troubleshooting Digital Experience for a Hybrid Workforce
Optimizing and Troubleshooting Digital Experience for a Hybrid WorkforceOptimizing and Troubleshooting Digital Experience for a Hybrid Workforce
Optimizing and Troubleshooting Digital Experience for a Hybrid Workforce
 
EMEA Optimizing and Troubleshooting Digital Experience for a Hybrid Workforce
EMEA Optimizing and Troubleshooting Digital Experience for a Hybrid WorkforceEMEA Optimizing and Troubleshooting Digital Experience for a Hybrid Workforce
EMEA Optimizing and Troubleshooting Digital Experience for a Hybrid Workforce
 
Introduction To ThousandEyes
Introduction To ThousandEyesIntroduction To ThousandEyes
Introduction To ThousandEyes
 
0328apjcintrotothousandeyeswebinar-230328233735-4df10d7f.pdf
0328apjcintrotothousandeyeswebinar-230328233735-4df10d7f.pdf0328apjcintrotothousandeyeswebinar-230328233735-4df10d7f.pdf
0328apjcintrotothousandeyeswebinar-230328233735-4df10d7f.pdf
 
Optimizing and Troubleshooting Digital Experience for a Hybrid Workforce
Optimizing and Troubleshooting Digital Experience for a Hybrid WorkforceOptimizing and Troubleshooting Digital Experience for a Hybrid Workforce
Optimizing and Troubleshooting Digital Experience for a Hybrid Workforce
 
Getting Started with ThousandEyes Proof of Concepts
Getting Started with ThousandEyes Proof of ConceptsGetting Started with ThousandEyes Proof of Concepts
Getting Started with ThousandEyes Proof of Concepts
 
Introduction to ThousandEyes
Introduction to ThousandEyesIntroduction to ThousandEyes
Introduction to ThousandEyes
 
The Top Outages of 2021: Analysis and Takeaways
The Top Outages of 2021: Analysis and TakeawaysThe Top Outages of 2021: Analysis and Takeaways
The Top Outages of 2021: Analysis and Takeaways
 
Visualizing Your Network Health - Know your Network
Visualizing Your Network Health - Know your NetworkVisualizing Your Network Health - Know your Network
Visualizing Your Network Health - Know your Network
 
Introduction to ThousandEyes
Introduction to ThousandEyesIntroduction to ThousandEyes
Introduction to ThousandEyes
 
Introduction to ThousandEyes
Introduction to ThousandEyesIntroduction to ThousandEyes
Introduction to ThousandEyes
 
Introduction to ThousandEyes
Introduction to ThousandEyesIntroduction to ThousandEyes
Introduction to ThousandEyes
 
06_08_emea_how_to_evaluate_rollout_and_operationalize_your_sdwan_projects_web...
06_08_emea_how_to_evaluate_rollout_and_operationalize_your_sdwan_projects_web...06_08_emea_how_to_evaluate_rollout_and_operationalize_your_sdwan_projects_web...
06_08_emea_how_to_evaluate_rollout_and_operationalize_your_sdwan_projects_web...
 
Introduction to ThousandEyes
Introduction to ThousandEyesIntroduction to ThousandEyes
Introduction to ThousandEyes
 

More from ThousandEyes

How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
New ThousandEyes Product Features and Release Highlights: March 2024
New ThousandEyes Product Features and Release Highlights: March 2024New ThousandEyes Product Features and Release Highlights: March 2024
New ThousandEyes Product Features and Release Highlights: March 2024ThousandEyes
 
Assure Patient and Clinician Digital Experiences with ThousandEyes for Health...
Assure Patient and Clinician Digital Experiences with ThousandEyes for Health...Assure Patient and Clinician Digital Experiences with ThousandEyes for Health...
Assure Patient and Clinician Digital Experiences with ThousandEyes for Health...ThousandEyes
 
AMER Introduction to ThousandEyes Webinar
AMER Introduction to ThousandEyes WebinarAMER Introduction to ThousandEyes Webinar
AMER Introduction to ThousandEyes WebinarThousandEyes
 
New ThousandEyes Product Features and Release Highlights: February 2024
New ThousandEyes Product Features and Release Highlights: February 2024New ThousandEyes Product Features and Release Highlights: February 2024
New ThousandEyes Product Features and Release Highlights: February 2024ThousandEyes
 
Enhancing SaaS Performance: A Hands-on Workshop for Partners
Enhancing SaaS Performance: A Hands-on Workshop for PartnersEnhancing SaaS Performance: A Hands-on Workshop for Partners
Enhancing SaaS Performance: A Hands-on Workshop for PartnersThousandEyes
 
ThousandEyes Enterprise Digital Workshop - Spanish
ThousandEyes Enterprise Digital Workshop - SpanishThousandEyes Enterprise Digital Workshop - Spanish
ThousandEyes Enterprise Digital Workshop - SpanishThousandEyes
 
ThousandEyes Enterprise Digital Workshop - German
ThousandEyes Enterprise Digital Workshop - GermanThousandEyes Enterprise Digital Workshop - German
ThousandEyes Enterprise Digital Workshop - GermanThousandEyes
 
ThousandEyes Enterprise Digital Workshop
ThousandEyes Enterprise Digital WorkshopThousandEyes Enterprise Digital Workshop
ThousandEyes Enterprise Digital WorkshopThousandEyes
 
Introduction to ThousandEyes and Meraki MX for Partners
Introduction to ThousandEyes and Meraki MX for PartnersIntroduction to ThousandEyes and Meraki MX for Partners
Introduction to ThousandEyes and Meraki MX for PartnersThousandEyes
 
Level-up Your Cloud Visibility Into AWS With ThousandEyes
Level-up Your Cloud Visibility Into AWS With ThousandEyesLevel-up Your Cloud Visibility Into AWS With ThousandEyes
Level-up Your Cloud Visibility Into AWS With ThousandEyesThousandEyes
 
Level-up Your Cloud Visibility Into AWS With ThousandEyes
Level-up Your Cloud Visibility Into AWS With ThousandEyesLevel-up Your Cloud Visibility Into AWS With ThousandEyes
Level-up Your Cloud Visibility Into AWS With ThousandEyesThousandEyes
 
Level-up Your Cloud Visibility Into AWS With ThousandEyes
Level-up Your Cloud Visibility Into AWS With ThousandEyesLevel-up Your Cloud Visibility Into AWS With ThousandEyes
Level-up Your Cloud Visibility Into AWS With ThousandEyesThousandEyes
 
How Financial Institutions Can Deliver Seamless Customer Digital Engagements
How Financial Institutions Can Deliver Seamless Customer Digital EngagementsHow Financial Institutions Can Deliver Seamless Customer Digital Engagements
How Financial Institutions Can Deliver Seamless Customer Digital EngagementsThousandEyes
 
New ThousandEyes Product Features and Release Highlights: November 2023
New ThousandEyes Product Features and Release Highlights: November 2023New ThousandEyes Product Features and Release Highlights: November 2023
New ThousandEyes Product Features and Release Highlights: November 2023ThousandEyes
 
New ThousandEyes Product Features and Release Highlights: October 2023
New ThousandEyes Product Features and Release Highlights: October 2023New ThousandEyes Product Features and Release Highlights: October 2023
New ThousandEyes Product Features and Release Highlights: October 2023ThousandEyes
 
Introduction to ThousandEyes and Meraki MX for Partners
Introduction to ThousandEyes and Meraki MX for PartnersIntroduction to ThousandEyes and Meraki MX for Partners
Introduction to ThousandEyes and Meraki MX for PartnersThousandEyes
 
Introduction to ThousandEyes and Meraki MX for Partners in Spanish
Introduction to ThousandEyes and Meraki MX for Partners in SpanishIntroduction to ThousandEyes and Meraki MX for Partners in Spanish
Introduction to ThousandEyes and Meraki MX for Partners in SpanishThousandEyes
 

More from ThousandEyes (20)

How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
New ThousandEyes Product Features and Release Highlights: March 2024
New ThousandEyes Product Features and Release Highlights: March 2024New ThousandEyes Product Features and Release Highlights: March 2024
New ThousandEyes Product Features and Release Highlights: March 2024
 
Assure Patient and Clinician Digital Experiences with ThousandEyes for Health...
Assure Patient and Clinician Digital Experiences with ThousandEyes for Health...Assure Patient and Clinician Digital Experiences with ThousandEyes for Health...
Assure Patient and Clinician Digital Experiences with ThousandEyes for Health...
 
AMER Introduction to ThousandEyes Webinar
AMER Introduction to ThousandEyes WebinarAMER Introduction to ThousandEyes Webinar
AMER Introduction to ThousandEyes Webinar
 
New ThousandEyes Product Features and Release Highlights: February 2024
New ThousandEyes Product Features and Release Highlights: February 2024New ThousandEyes Product Features and Release Highlights: February 2024
New ThousandEyes Product Features and Release Highlights: February 2024
 
Enhancing SaaS Performance: A Hands-on Workshop for Partners
Enhancing SaaS Performance: A Hands-on Workshop for PartnersEnhancing SaaS Performance: A Hands-on Workshop for Partners
Enhancing SaaS Performance: A Hands-on Workshop for Partners
 
ThousandEyes Enterprise Digital Workshop - Spanish
ThousandEyes Enterprise Digital Workshop - SpanishThousandEyes Enterprise Digital Workshop - Spanish
ThousandEyes Enterprise Digital Workshop - Spanish
 
ThousandEyes Enterprise Digital Workshop - German
ThousandEyes Enterprise Digital Workshop - GermanThousandEyes Enterprise Digital Workshop - German
ThousandEyes Enterprise Digital Workshop - German
 
ThousandEyes Enterprise Digital Workshop
ThousandEyes Enterprise Digital WorkshopThousandEyes Enterprise Digital Workshop
ThousandEyes Enterprise Digital Workshop
 
Introduction to ThousandEyes and Meraki MX for Partners
Introduction to ThousandEyes and Meraki MX for PartnersIntroduction to ThousandEyes and Meraki MX for Partners
Introduction to ThousandEyes and Meraki MX for Partners
 
Level-up Your Cloud Visibility Into AWS With ThousandEyes
Level-up Your Cloud Visibility Into AWS With ThousandEyesLevel-up Your Cloud Visibility Into AWS With ThousandEyes
Level-up Your Cloud Visibility Into AWS With ThousandEyes
 
Level-up Your Cloud Visibility Into AWS With ThousandEyes
Level-up Your Cloud Visibility Into AWS With ThousandEyesLevel-up Your Cloud Visibility Into AWS With ThousandEyes
Level-up Your Cloud Visibility Into AWS With ThousandEyes
 
Level-up Your Cloud Visibility Into AWS With ThousandEyes
Level-up Your Cloud Visibility Into AWS With ThousandEyesLevel-up Your Cloud Visibility Into AWS With ThousandEyes
Level-up Your Cloud Visibility Into AWS With ThousandEyes
 
How Financial Institutions Can Deliver Seamless Customer Digital Engagements
How Financial Institutions Can Deliver Seamless Customer Digital EngagementsHow Financial Institutions Can Deliver Seamless Customer Digital Engagements
How Financial Institutions Can Deliver Seamless Customer Digital Engagements
 
New ThousandEyes Product Features and Release Highlights: November 2023
New ThousandEyes Product Features and Release Highlights: November 2023New ThousandEyes Product Features and Release Highlights: November 2023
New ThousandEyes Product Features and Release Highlights: November 2023
 
New ThousandEyes Product Features and Release Highlights: October 2023
New ThousandEyes Product Features and Release Highlights: October 2023New ThousandEyes Product Features and Release Highlights: October 2023
New ThousandEyes Product Features and Release Highlights: October 2023
 
Introduction to ThousandEyes and Meraki MX for Partners
Introduction to ThousandEyes and Meraki MX for PartnersIntroduction to ThousandEyes and Meraki MX for Partners
Introduction to ThousandEyes and Meraki MX for Partners
 
Introduction to ThousandEyes and Meraki MX for Partners in Spanish
Introduction to ThousandEyes and Meraki MX for Partners in SpanishIntroduction to ThousandEyes and Meraki MX for Partners in Spanish
Introduction to ThousandEyes and Meraki MX for Partners in Spanish
 

Recently uploaded

UI5ers live - Custom Controls wrapping 3rd-party libs.pptx
UI5ers live - Custom Controls wrapping 3rd-party libs.pptxUI5ers live - Custom Controls wrapping 3rd-party libs.pptx
UI5ers live - Custom Controls wrapping 3rd-party libs.pptxAndreas Kunz
 
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...Akihiro Suda
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaHanief Utama
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Cizo Technology Services
 
What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...Technogeeks
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Angel Borroy López
 
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...OnePlan Solutions
 
Cyber security and its impact on E commerce
Cyber security and its impact on E commerceCyber security and its impact on E commerce
Cyber security and its impact on E commercemanigoyal112
 
Understanding Flamingo - DeepMind's VLM Architecture
Understanding Flamingo - DeepMind's VLM ArchitectureUnderstanding Flamingo - DeepMind's VLM Architecture
Understanding Flamingo - DeepMind's VLM Architecturerahul_net
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作qr0udbr0
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationBradBedford3
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...confluent
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsAhmed Mohamed
 
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)jennyeacort
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesPhilip Schwarz
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024StefanoLambiase
 
Salesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZSalesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZABSYZ Inc
 
Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Mater
 

Recently uploaded (20)

UI5ers live - Custom Controls wrapping 3rd-party libs.pptx
UI5ers live - Custom Controls wrapping 3rd-party libs.pptxUI5ers live - Custom Controls wrapping 3rd-party libs.pptx
UI5ers live - Custom Controls wrapping 3rd-party libs.pptx
 
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief Utama
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
 
Advantages of Odoo ERP 17 for Your Business
Advantages of Odoo ERP 17 for Your BusinessAdvantages of Odoo ERP 17 for Your Business
Advantages of Odoo ERP 17 for Your Business
 
2.pdf Ejercicios de programación competitiva
2.pdf Ejercicios de programación competitiva2.pdf Ejercicios de programación competitiva
2.pdf Ejercicios de programación competitiva
 
What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
 
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
 
Cyber security and its impact on E commerce
Cyber security and its impact on E commerceCyber security and its impact on E commerce
Cyber security and its impact on E commerce
 
Understanding Flamingo - DeepMind's VLM Architecture
Understanding Flamingo - DeepMind's VLM ArchitectureUnderstanding Flamingo - DeepMind's VLM Architecture
Understanding Flamingo - DeepMind's VLM Architecture
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion Application
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML Diagrams
 
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a series
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
 
Salesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZSalesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZ
 
Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)
 

The Top Outages of 2023: Analyses and Takeaways

  • 2. Featured Speakers IanWaters EMEA Marketing Director Mike Hicks Principal Solution Analyst
  • 3. 3 © 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved. BeforeWe Begin… • If you have any questions, please type them in the Questions window. • If you have any audio problems, please chat us for help. • A recording of this presentation will be sent to you in a few days. • Interested in more outage analysis and Internet insights? Check out the ThousandEyes blog and The Internet Report podcast.
  • 4. Anatomy of an Outage • Understanding different types of Internet outages is important to mitigate their impact. • Outages can vary in blast radius, be planned or unplanned, and have varying MTTR. • Network outages depend on where the problem occurs, with transit network incidents impacting multiple providers. • Tracking outages can help teams identify patterns and prevent customer service disruptions. © 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
  • 5. © 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved. Outage and Degradation Impacts BGP ISP CDN DNS SaaS Apps Services APIs Data Center Cloud DDoS Protection SSE RISKAND COMPLIANCE Is our traffic getting routed out of region? SERVICE AVAILABILITY Which cloud regions are impacted? SITUATIONAL AWARENESS Are regional ISPs spoofing our DNS records? SERVICE RECOVERY Did we successfully cut over to our DDoS mitigation service? NETWORK SECURITY Are SASE routing policies working as we expect? CUSTOMER SUPPORT Is an Internet outage preventing users from reaching our service? WORKFORCE PRODUCTIVITY Will our Salesforce dev updates degrade performance for some global users? $32,000 $120,000 $3,500 3474 REVENUE PROTECTION Is the payment gateway down or just unreachable?
  • 6. 2023 Outages by the Numbers: ISP Compared to CSP • ThousandEyes reported an increase in cloud service provider (CSP) outages in 2023. • CSP outages are the second most common type of disruption after ISP outages. • The ratio of CSP outages to ISP outages increased in 2023. © 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
  • 7. 2023 Outages by the Numbers: U.S.-centric Outages in Relation to Global Outages • U.S.-centric outages increased to 37% in 2023 from 34% in 2022. • Smaller, contained outages are becoming more common. • Localized outages have different impacts and require different responses compared to global outages. © 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
  • 8. 2023 Outages by the Numbers: Application Outages • The number and frequency of application outages have been on the rise over the past year. • Application-related disruptions can have a bigger impact than network outages, though they are not as common. © 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
  • 9. Connections Are Complex Branch Office Employee BYOD Corp devices IOT Cameras and sensors IoT VDI People, places, and things Edge BYOD Data Center IOT Core network Mobile networks Core network Peering Access networks Wireless network Wireless gateway DNS Cloud and SaaS Cloud providers Datacenter infrastructure Cloud connectivity Direct connect ISP transit providers SaaS onramp © 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
  • 11. Microsoft (1/25) Outlook (2/7) Virgin Media (4/4) AWS (6/13) Slack (8/2) Square (9/8) Workday + Cloudflare (11/2) 2023 Outage Timeline Purple = Application Outage Red = Network Outage Blue = Infrastructure Outage Bookmark the InternetOutages Timeline for outage updates throughout the year. © 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
  • 12. Microsoft 365 (1/25/23) • Microsoft started experiencing service related issues around 07:05 AM (UTC). • The disruption was triggered by an external BGP change by Microsoft that impacted connected service providers • Microsoft BGP prefixes were withdrawn completely but then almost immediately re-advertised. • Affected smaller (/24) prefixes and summary prefixes (/12). • Cascading impact on global routing tables, causing significant churn. • Prefixes were either withdrawn or re-advertised to transit providers. • Large amount of packet loss were seen as well as HTTP and DNS timeouts. • Timeouts seen in the application “Response,” further indicating the effect of the network on service availability. © 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
  • 13. Microsoft Outlook (2/7/23) • Starting around 03:55 UTC, Outlook became unavailable. • Network path was working properly, butThousandEyes observed elevated server response timeouts and slow page loading. • Majority of the errors were HTTP server timeouts, indicating an application issue. • Incident was mostly concentrated in the U.S. and lasted ~2 hours. © 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
  • 14. Virgin Media (4/4/23) • From approximately 00:30 to 17:30 UTC, two outages impacted the reachability of Virgin Media UK’s network and services. • The first incident began at approximately 00:30 UTC and appeared to coincide with a series of BGP route withdrawals. • Second incident was shorter, but the networks experienced similar BGP and reachability issues. • Outages were overnight and due to the repeat nature, could indicate maintenance issues. © 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
  • 15. AWS (6/13/23) • Outage impacted services within US-EAST-1. • Lasted two hours and increased latency, server timeouts, and HTTP server errors were observed. • AWS console access was also affected, making troubleshooting difficult. • AWS confirmed the issue was due to a capacity management subsystem failure. • Organizations leveraging cloud services, such as those offered byAWS, should be aware of the relationships in their digital ecosystem, regardless of whether those relationships are services or networks. © 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
  • 16. Slack (8/2/23) • Application outage that lasted from 4:01 PM to 6 PM (UTC). • Network paths and accessibility were unaffected. • Initially could be seen as HTTP 500 errors and higher-than-normal page load times. • During the outage, users were unable to upload files or share screenshots. • Root cause—work on a “routine database cluster migration”—that accidentally reduced database capacity to the point that it could not support a regularly scheduled job running. © 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
  • 17. Square (9/8/23) • Outage lasted over 18 hours. • Backend issue that prevented the platform from processing payment transactions. • Users reported various problems, from terminal connections dropping out, to payments appearing to complete but then not showing up in business accounts. • ThousandEyes observed intermittent dropouts and 503 ‘service unavailable’ errors. © 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
  • 18. Workday + Cloudflare (11/2/23) • Cloudflare andWorkday experienced a major outage due to multiple infrastructure provider failures. • DR resources took 6 hours to come online and full resolution took 36 hours. • Initial cause was a partial mains power outage at a Flexential data center in Portland. • Further generator and grid failures resulted in a complete power loss and ungraceful shutdown. © 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
  • 19. Takeaways • Understanding how your application works is important for quickly identifying failures and making improvements. • Just because your application is working doesn't mean it's functioning optimally. • Knowing how all parts of the service work together is crucial for ongoing design and future optimizations. • Improved visibility and operational optimizations can prevent outages and minimize their impact. • Tracking different categories of outages and degradations over time can be helpful. © 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
  • 20. • Subscribe to our blog to keep up-to-date! thousandeyes.com/blog/ • Tune in toThe Internet Report Podcast. https://www.thousandeyes.com/the-internet-report/ 20 © 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved. Next Steps • New tutorial videos on our features thousandeyes.com/resources/?cat=tutorial • New Getting Started Guides docs.thousandeyes.com/product-documentation/getting-started Blog and Podcast Learning Resources Support Community • Still have questions? Ask us on the ThousandEyes Support Community AMA: http://bit.ly/2023Outages
  • 21. Q&A 21 © 2023 Cisco Systems, Inc. and/or its affiliates. All rights reserved.
  • 22. 22 © 2024 Cisco Systems, Inc. and/or its affiliates. All rights reserved.