Measure Twice, Alert Once (or Less)
How Priceline Leverages Catchpoint
© Catchpoint. Confidential. All rights reserved.
priceline.com is part of the Priceline
Group – sister companies are Agoda,
Booking.com, Kayak, Opentable,
Rentalcars.com
priceline.com has been using Catchpoint
since 2010; several sister companies have
since joined us
Your Presenter
Tom Gerhard
Technical Fellow (Logging and monitoring), priceline.com
5 Years
Record-level data storage for
in-depth historical analysis and
flexible, multi-dimensional analytics
One Focus
To help businesses deliver
amazing end user experiences
Best Analytics
Deepest, fastest analytics, with raw data
to slice and dice on any dimension and
pre-built or customizable visualizations
to consume data your preferred way
700+
Vantage points comprising the
world’s largest footprint to test
where your end users are, across
geographies, consumer ISPs,
mobile, and backbone networks
440+
Customers including leaders like
Google, Honeywell, Qualtrics,
Priceline, Kate Spade, Comcast,
Zscaler, and LinkedIn
HQ
In New York, NY
(Offices in LA, Seattle, Boston,
London, Hamburg, and Bangalore)
50+
Vantage points in China’s 20
largest cities, on all 4 major ISPs
and top cloud providers
18
Monitor types – the most
extensive monitor types for
complete coverage
About Catchpoint
Extensive Monitor Types for Complete Coverage
Intro/Overview
How we use Catchpoint
• Identify key pages (and infrastructure, such as DNS)
• Create tests, alerts, and reports
• Sit back and enjoy (maybe not!)
That’s not what this talk is about!
We’re going to discuss some non-traditional applications for synthetic
monitoring.
Situation: We do our primary alerting from a set of backbone nodes.
When they alert, the #1 question is “Is it us, or the internet?”
Solution: “First mile” nodes; one in each of our data centers.
Run the same test on each of the first mile nodes, using overrides to
route the test to the local (to the node) data center.
When an alert fires, it’s a simple matter to compare the backbone results
with the first mile test.
#1 First Mile Baseline Tests
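The backbone vs. first mile comparison can be sketched as a small triage function. This is only an illustration: the `RunSummary` shape, the 50% threshold, and the wording of the verdicts are assumptions, not anything Catchpoint provides.

```python
from dataclasses import dataclass


@dataclass
class RunSummary:
    """Outcome of one test over a short window (hypothetical shape)."""
    failures: int
    runs: int

    @property
    def failure_rate(self) -> float:
        return self.failures / self.runs if self.runs else 0.0


def triage(backbone: RunSummary, first_mile: RunSummary, threshold: float = 0.5) -> str:
    """Compare the same test as seen from backbone nodes and from a first mile node."""
    backbone_bad = backbone.failure_rate >= threshold
    first_mile_bad = first_mile.failure_rate >= threshold
    if backbone_bad and first_mile_bad:
        return "likely us: failing from inside the data center too"
    if backbone_bad:
        return "likely the internet: first mile is healthy"
    if first_mile_bad:
        return "check the first mile node or its outbound connection"
    return "healthy"


if __name__ == "__main__":
    # Backbone nodes fail 8 of 10 runs while the first mile node is clean.
    print(triage(RunSummary(8, 10), RunSummary(0, 10)))
```

The third verdict covers the case where only the first mile test fails, which usually points at the node's own connectivity rather than at the site.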
First Mile Node
Test Setup
If you have third-party objects, they won’t observe your overrides; if your test
results depend on them, they could go south.
Consider using zones and alerting only on your core zone(s)
What Can Go Wrong?
(Settings -> Zones)
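To picture what alerting only on core zones means, here is a minimal sketch that groups failed requests by hostname and ignores third-party failures. The zone names, the hostnames, and the `should_alert` helper are all made up for illustration; in Catchpoint the zones themselves are defined in the portal.

```python
from urllib.parse import urlparse

# Hypothetical zone definitions: zone name -> hostname suffixes.
ZONES = {
    "core": ("example.com",),
    "third_party": ("ads.example.net", "tracker.example.org"),
}


def zone_for(url: str) -> str:
    """Return the first zone whose suffix matches the URL's hostname."""
    host = urlparse(url).hostname or ""
    for zone, suffixes in ZONES.items():
        if any(host == s or host.endswith("." + s) for s in suffixes):
            return zone
    return "other"


def should_alert(failed_urls, alert_zones=("core",)) -> bool:
    """Alert only if at least one failed request belongs to an alerting zone."""
    return any(zone_for(u) in alert_zones for u in failed_urls)


if __name__ == "__main__":
    print(should_alert(["https://ads.example.net/tag.js"]))
    print(should_alert(["https://ads.example.net/tag.js", "https://www.example.com/"]))
```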
Inadvertent testing of your outbound connection
• Most likely will be obvious if your first mile tests fail and your backbone
tests don’t
• If they all fail, you probably have a real issue, and the first mile tests have
given you additional data points
You don’t have an Enterprise node? Consider using a node close to your
data center.
What Can Go Wrong?
Situation: Infrastructure changes are being contemplated. What will the
impact be?
Solution: Run variant tests using overrides.
Examples:
• CDN Evaluation
• Measure impact of third-party content
• Worst-case testing of non-responding objects
#2 What-if Scenarios
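Conceptually, a host override behaves like a lookup table consulted before connecting. The sketch below assumes a DNS-style override (same hostname in the request, different connect target); the function name and the sample address are hypothetical and this is not Catchpoint's implementation.

```python
from urllib.parse import urlsplit


def apply_overrides(url: str, overrides: dict[str, str]) -> tuple[str, str]:
    """Return (connect_target, host_header) for a request.

    Hosts listed in `overrides` are routed to the alternate target;
    everything else (third-party objects, for example) is left alone.
    """
    host = urlsplit(url).hostname or ""
    return overrides.get(host, host), host


if __name__ == "__main__":
    overrides = {"www.example.com": "203.0.113.10"}  # documentation-range IP
    print(apply_overrides("https://www.example.com/home", overrides))
    print(apply_overrides("https://ads.example.net/t.js", overrides))
```

The second call shows why third-party objects do not follow your overrides: their hostnames are simply not in the table.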
Change in CDN configuration, comparison of CDNs, etc.
#2a CDN Evaluation
Your CDN’s DNS alias or IP
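Once a control test and an override variant have both run for a while, comparing them is simple arithmetic. A minimal sketch, assuming response times exported as plain lists of milliseconds; the numbers are invented for the example, not measurements.

```python
from statistics import median


def compare(control_ms, variant_ms):
    """Compare median response time of a control test and an override variant."""
    c, v = median(control_ms), median(variant_ms)
    return {
        "control_median": c,
        "variant_median": v,
        # Negative means the variant was faster than the control.
        "change_pct": round((v - c) / c * 100, 1),
    }


if __name__ == "__main__":
    print(compare([100, 120, 110], [90, 99, 95]))
```

Medians are used here because a handful of slow runs would otherwise dominate a short evaluation window.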
Example: Block Google Publisher Tags:
#2b Impact of Third-Party Content
… or delay using (Request Delay)
Move specific objects to another domain (perhaps a CDN?)
#2c Other Overrides
Can also use ${randomlist(a,b)} macro to substitute from a list
CDN’s DNS alias or IP
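The effect of substituting from a list can be emulated locally to see what a test would request. This sketch assumes the macro picks one listed item at random each time it is evaluated; it mimics the idea only and says nothing about how Catchpoint evaluates the macro internally. The hostnames are placeholders.

```python
import random
import re

# Matches the macro form shown on the slide: ${randomlist(a,b)}
_MACRO = re.compile(r"\$\{randomlist\(([^)]*)\)\}")


def expand_randomlist(template: str, rng: random.Random) -> str:
    """Replace each ${randomlist(...)} occurrence with one randomly chosen item."""
    def pick(match: re.Match) -> str:
        options = [o.strip() for o in match.group(1).split(",")]
        return rng.choice(options)

    return _MACRO.sub(pick, template)


if __name__ == "__main__":
    rng = random.Random()
    url = "https://${randomlist(cdn-a.example.com,cdn-b.example.com)}/img.png"
    print(expand_randomlist(url, rng))
```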
Situation: You’d like to enable all the add-on features, but worry about point
consumption, and you only need the extra information occasionally
Solution: Run a stripped-down test at your alerting frequency and a shadow
test on fewer nodes and/or less frequently with additional features
(screenshots, filmstrips, headers, etc.)
#3 Test Budget Management
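A back-of-the-envelope calculation shows why the split pays off. The per-run costs below (1 for a stripped-down run, 3 for a run with add-ons) are placeholders, not Catchpoint's actual point pricing; substitute your own contract's figures.

```python
def daily_runs(nodes: int, frequency_minutes: int) -> int:
    """Number of test runs per day across all nodes."""
    return nodes * (24 * 60 // frequency_minutes)


def daily_cost(nodes: int, frequency_minutes: int, cost_per_run: int) -> int:
    """Daily consumption for one test, given a hypothetical cost per run."""
    return daily_runs(nodes, frequency_minutes) * cost_per_run


if __name__ == "__main__":
    # Everything enabled on the alerting test: 10 nodes, every 5 minutes.
    all_in_one = daily_cost(10, 5, cost_per_run=3)
    # Stripped-down alerting test plus an hourly shadow test on 2 nodes.
    split = daily_cost(10, 5, cost_per_run=1) + daily_cost(2, 60, cost_per_run=3)
    print(all_in_one, split)
```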
Situation: You have internal or external customers with a Service Level
Agreement (SLA)
Solution: Use a test configured to provide the required data, and deliver a
daily/weekly/monthly report as needed.
#4 SLA Reporting
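The numbers in a typical SLA report reduce to availability and a response-time percentile. A minimal sketch, assuming runs exported as `(succeeded, response_ms)` pairs and a nearest-rank percentile; the field names and the choice of the 95th percentile are assumptions, so use whatever your SLA actually specifies.

```python
import math


def sla_report(runs, target_ms, percentile=95):
    """Summarize runs given as (succeeded: bool, response_ms: float) pairs."""
    if not runs:
        raise ValueError("no runs in the reporting window")
    ok_times = sorted(ms for succeeded, ms in runs if succeeded)
    availability = 100.0 * len(ok_times) / len(runs)
    if ok_times:
        # Nearest-rank percentile over successful runs only.
        rank = max(1, math.ceil(percentile / 100 * len(ok_times)))
        pct_ms = ok_times[rank - 1]
    else:
        pct_ms = None
    return {
        "availability_pct": round(availability, 2),
        "percentile_ms": pct_ms,
        "meets_target": pct_ms is not None and pct_ms <= target_ms,
    }


if __name__ == "__main__":
    sample = [(True, 100), (True, 200), (True, 300), (False, 0)]
    print(sla_report(sample, target_ms=250))
```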
Problem: You want to avoid the “frog in a pot of cold water” effect.
Performance may degrade at a slow, but steady pace that doesn’t trip alerts
based on breakouts from patterns.
Solution: Define KPIs and set up alerts with absolute values for them, or
periodically review long-term trends.
#5 Long-term Trending
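A toy simulation makes the point. Here a response time creeps up 1% per day: an alert that compares each day to the previous week never fires, while an absolute KPI limit does. The window, the 20% jump, and the 1500 ms limit are arbitrary illustrative values, and the relative rule is a simplified stand-in for pattern-based alerting.

```python
def relative_alert(series, window=7, jump_pct=20.0):
    """True if any point exceeds the mean of the preceding window by jump_pct."""
    for i in range(window, len(series)):
        baseline = sum(series[i - window:i]) / window
        if series[i] > baseline * (1 + jump_pct / 100):
            return True
    return False


def absolute_alert(series, kpi_limit):
    """True if any point exceeds a fixed KPI limit."""
    return any(value > kpi_limit for value in series)


if __name__ == "__main__":
    # 60 days of response times, starting at 1000 ms and growing 1% per day.
    drift = [1000 * 1.01 ** day for day in range(60)]
    print(relative_alert(drift), absolute_alert(drift, kpi_limit=1500))
```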
Example: Boiling a Frog?
Catchpoint Insight: To learn it is to love it
Favorite use cases:
• Identify hosts behind the load balancer
• Capture host- and load balancer-reported response times
• Capture CDN routing information
• Use JavaScript to capture specialized metrics
Free Advice
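The first two use cases amount to pulling values out of response headers. The sketch below shows that idea with plain regular expressions; the header names `X-Backend-Server` and `X-Response-Time-Ms` are hypothetical (your load balancer decides what it emits), and this is not Insight configuration syntax.

```python
import re

# Hypothetical headers a load balancer or application might add.
PATTERNS = {
    "backend_host": re.compile(r"^X-Backend-Server:\s*(\S+)", re.I | re.M),
    "backend_ms": re.compile(r"^X-Response-Time-Ms:\s*(\d+)", re.I | re.M),
}


def extract_insights(raw_headers: str) -> dict:
    """Return the captured value for every pattern that matches the raw headers."""
    found = {}
    for name, pattern in PATTERNS.items():
        match = pattern.search(raw_headers)
        if match:
            found[name] = match.group(1)
    return found


if __name__ == "__main__":
    headers = "HTTP/1.1 200 OK\r\nX-Backend-Server: web-07\r\nX-Response-Time-Ms: 42\r\n"
    print(extract_insights(headers))
```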
Get the most out of your deployment by taking advantage of integrations that are available.
Free Advice
Our favorites:
Slack – Alert webhook API
Splunk – Use the Catchpoint Splunk add-on to pull test data for key tests into
Splunk so it can be combined with other data for display.
Single Sign-on – We use Okta to manage access to the Catchpoint portal,
simplifying administration and compliance.
Free Advice
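If you relay alerts yourself, a Slack incoming webhook only needs a JSON body with a `text` field. The sketch below builds such a request with the standard library; the function name, its arguments, and the placeholder URL are assumptions, and the shape of the alert payload Catchpoint sends is not covered here.

```python
import json
import urllib.request


def build_slack_request(webhook_url: str, test_name: str, node: str, detail: str) -> urllib.request.Request:
    """Build (but do not send) a POST for a Slack incoming webhook."""
    payload = {"text": f"{test_name} alert on {node}: {detail}"}
    return urllib.request.Request(
        webhook_url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    req = build_slack_request(
        "https://hooks.slack.example/services/PLACEHOLDER",  # not a real webhook
        "Homepage", "New York backbone", "response time above limit",
    )
    print(req.get_method(), req.data.decode("utf-8"))
    # To actually send it: urllib.request.urlopen(req, timeout=5)
```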
Your monitoring and alerting strategy needs a holistic view of your operation,
from as many perspectives as possible.
None of these techniques, by themselves, is a silver bullet. It’s important to
expect that something not visible from one perspective may be revealed from
another, and that they can also act as sanity checks against each other.
How does it all stack up?
Any Questions?
JohnPollard-hybrid-app-RailsConf2024.pptx
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
 
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
 
Navigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern EnterpriseNavigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern Enterprise
 

Measure Twice, Alert Once (or Less)

  • 1. Measure Twice, Alert Once (or Less): How Priceline Leverages Catchpoint
  • 2. Your Presenter: Tom Gerhard, Technical Fellow (Logging and Monitoring), priceline.com. priceline.com is part of the Priceline Group; its sister companies are Agoda, Booking.com, Kayak, OpenTable, and Rentalcars.com. priceline.com has been using Catchpoint since 2010; several sister companies have since joined us. © Catchpoint. Confidential. All rights reserved.
  • 3. About Catchpoint. One Focus: to help businesses deliver amazing end user experiences. Best Analytics: deepest, fastest analytics, with raw data to slice and dice on any dimension and pre-built or customizable visualizations to consume data your preferred way. 5 Years: record-level data storage for in-depth historical analysis and flexible, multi-dimensional analytics. 700+: vantage points comprising the world’s largest footprint to test where your end users are, across geographies, consumer ISPs, mobile, and backbone networks. 50+: vantage points in China’s 20 largest cities, on all 4 major ISPs and top cloud providers. 18: monitor types, the most extensive set for complete coverage. 440+: customers including leaders like Google, Honeywell, Qualtrics, Priceline, Kate Spade, Comcast, Zscaler, and LinkedIn. HQ: New York, NY (offices in LA, Seattle, Boston, London, Hamburg, and Bangalore).
  • 4. Extensive Monitor Types for Complete Coverage
  • 5. Intro/Overview. How we use Catchpoint: identify key pages (and infrastructure, such as DNS); create tests, alerts, and reports; sit back and enjoy (maybe not!). That’s not what this talk is about! We’re going to discuss some non-traditional applications for synthetic monitoring.
  • 6. #1 First Mile Baseline Tests. Situation: We do our primary alerting from a set of backbone nodes. When they alert, the #1 question is “Is it us, or the internet?” Solution: “First mile” nodes, one in each of our data centers. Run the same test on each of the first mile nodes, using overrides to route the test to the data center local to the node. When an alert fires, it’s a simple matter to compare the backbone results with the first mile test.
  • 7. First Mile Node
  • 8. Test Setup
  • 11. What Can Go Wrong? If you have third-party objects, they won’t observe your overrides; if your test results depend on them, the results could go south. Consider using zones (Settings -> Zones) and alerting only on your core zone(s).
  • 12. What Can Go Wrong? Inadvertent testing of your outbound connection. This will most likely be obvious if your first mile tests fail and your backbone tests don’t. If they all fail, you probably have a real issue, and the first mile tests have given you additional data points. You don’t have an Enterprise node? Consider using a node close to your data center.
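The comparison described in slides 6 and 12 can be sketched as a small decision table. This is a minimal illustration, not Catchpoint functionality: the `Run` record, the `"backbone"` and `"first_mile"` labels, and the 50% failure threshold are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class Run:
    node_type: str  # hypothetical label: "backbone" or "first_mile"
    ok: bool        # True if the test run succeeded


def failure_rate(runs, node_type):
    """Fraction of failed runs for one node type, or None if there are no runs."""
    selected = [r for r in runs if r.node_type == node_type]
    if not selected:
        return None
    return sum(not r.ok for r in selected) / len(selected)


def triage(runs, threshold=0.5):
    """Answer "is it us, or the internet?" from recent runs (threshold is an assumption)."""
    backbone = failure_rate(runs, "backbone")
    first_mile = failure_rate(runs, "first_mile")
    if backbone is None or first_mile is None:
        return "insufficient data"
    backbone_bad = backbone >= threshold
    first_mile_bad = first_mile >= threshold
    if backbone_bad and first_mile_bad:
        return "probably a real issue on our side"
    if backbone_bad:
        return "probably the internet"
    if first_mile_bad:
        # Slide 12: first mile failing alone suggests the test is
        # inadvertently exercising the outbound connection.
        return "check the first mile test path"
    return "healthy"
```

For example, `triage([Run("backbone", False), Run("first_mile", True)])` points at the internet rather than the data center.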
  • 13. #2 What-if Scenarios. Situation: Infrastructure changes are being contemplated. What will the impact be? Solution: Run variant tests using overrides. Examples: CDN evaluation; measuring the impact of third-party content; worst-case testing of non-responding objects.
  • 14. #2a CDN Evaluation. Change in CDN configuration, comparison of CDNs, etc. Override target: your CDN’s DNS alias or IP.
  • 15. #2b Impact of Third-Party Content. Example: block Google publisher tags, or delay them using Request Delay.
  • 16. #2c Other Overrides. Move specific objects to another domain (perhaps a CDN?), pointing them at the CDN’s DNS alias or IP. You can also use the ${randomlist(a,b)} macro to substitute from a list.
  • 17. #3 Test Budget Management. Situation: You’d like to enable all the add-on features, but you worry about point consumption and only need the extra information occasionally. Solution: Run a stripped-down test at your alerting frequency, plus a shadow test with additional features (screenshots, filmstrips, headers, etc.) on fewer nodes and/or less frequently.
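A rough back-of-the-envelope comparison shows why the shadow-test split saves budget. The cost model below (points per run multiplied by runs per day) and every number in it are hypothetical; Catchpoint's actual point accounting is not described in the slides and may differ.

```python
def daily_points(nodes, interval_minutes, points_per_run):
    """Hypothetical cost model: each run on each node costs points_per_run."""
    runs_per_node = 24 * 60 / interval_minutes
    return nodes * runs_per_node * points_per_run


# Option A: one full-featured test on every node at alerting frequency.
# All values are made up for illustration.
full_featured = daily_points(nodes=10, interval_minutes=5, points_per_run=3)

# Option B: stripped-down alerting test plus a sparse, full-featured shadow test.
stripped = daily_points(nodes=10, interval_minutes=5, points_per_run=1)
shadow = daily_points(nodes=2, interval_minutes=30, points_per_run=3)
split_total = stripped + shadow

print(f"full-featured: {full_featured:.0f} points/day")
print(f"stripped + shadow: {split_total:.0f} points/day")
```

Under these assumed numbers the split configuration uses well under half the points while still collecting the extra detail on a sample of runs.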
  • 18. #4 SLA Reporting. Situation: You have internal or external customers with a Service Level Agreement (SLA). Solution: Use a test configured to provide the required data, and provide a daily, weekly, or monthly report as needed.
  • 19. #5 Long-term Trending. Problem: You want to avoid the “frog in a pot of cold water” effect. Performance may degrade at a slow but steady pace that doesn’t trip alerts based on breakouts from patterns. Solution: Define KPIs and set up alerts with absolute values for them, or periodically review long-term trends.
  • 20. Example: Boiling a Frog? (Chart of a test’s long-term trend.)
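Both remedies from slide 19 can be illustrated in a few lines: an absolute KPI check and a slope check over a long window. This is a sketch under assumptions; the input (one median response time per day, in milliseconds) and both thresholds are hypothetical, and the least-squares slope is one simple way to quantify drift, not a method named in the talk.

```python
def slope_per_day(values):
    """Least-squares slope of values against their day index (0, 1, 2, ...)."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two data points")
    mean_x = (n - 1) / 2
    mean_y = sum(values) / n
    numerator = sum((i - mean_x) * (v - mean_y) for i, v in enumerate(values))
    denominator = sum((i - mean_x) ** 2 for i in range(n))
    return numerator / denominator


def check_trend(daily_medians_ms, kpi_limit_ms, max_slope_ms_per_day):
    """Return findings for an absolute KPI breach and for slow upward drift."""
    findings = []
    if daily_medians_ms[-1] > kpi_limit_ms:
        findings.append("absolute KPI breached")
    if slope_per_day(daily_medians_ms) > max_slope_ms_per_day:
        findings.append("slow upward drift")
    return findings
```

A series that creeps up by 5 ms per day never looks like a breakout on any single day, but `check_trend` flags the drift long before the absolute limit is reached.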
  • 21. Free Advice. Catchpoint Insight: to learn it is to love it. Favorite use cases: identify hosts behind the load balancer; capture host- and load balancer-reported response times; capture CDN routing information; use JavaScript to capture specialized metrics.
  • 22. Free Advice. Get the most out of your deployment by taking advantage of the integrations that are available.
  • 23. Free Advice. Our favorites: Slack: Alert webhook API. Splunk: Use the Catchpoint Splunk add-on to pull test data for key tests into Splunk so it can be combined with other data for display. Single Sign-on: We use Okta to manage access to the Catchpoint portal, simplifying administration and compliance.
  • 24. How does it all stack up? Your monitoring and alerting strategy needs a holistic view of your operation, from as many perspectives as possible. None of these techniques, by itself, is a silver bullet. Expect that something not visible from one perspective may be revealed from another, and that the perspectives can also act as sanity checks against each other.
  • 25. Any Questions?