SlideShare a Scribd company logo
1 of 69
Download to read offline
LISA19@szelechoski @kimschles
Sarah Zelechoski and Kim Schlesinger
LISA19 - Portland
Fuzzy Lines: Aligning Teams to
Monitor Your Application Ecosystem
LISA19@szelechoski @kimschles
OPS
DEV
APP CONFIG SECRETS CONTAINERIZATION
APP BUILD, INTEGRATION, TEST & DEPLOY
APP HEALTH, SCALING, PERFORMANCE, TUNING
APP UPTIME & MONITORING
INFRASTRUCTURE UPTIME & MONITORING
CLOUD PROVIDER NETWORKING INSTANCES
DNS IAM RBAC SCALING
INFRASTRUCTURE AS CODE & CLUSTER CONFIG
LISA19@szelechoski @kimschles
LISA19@szelechoski @kimschles
Separate ops and dev teams must build
relationships between people, use
processes to increase communication,
and leverage a set of shared tools.
LISA19@szelechoski @kimschles
LISA19@szelechoski @kimschles
whoami
Kim (she/her)
● Site Reliability Engineer
● Background in education and
web development
● Interested in how inclusive
cultures impact business
outcomes
LISA19@szelechoski @kimschles
whoami
Sarah Z (she/her)
● VP of Engineering
● Lead @ Fairwinds SREs & Devs
● Background in operations
● DevOps and engineering culture
enthusiast
● Focused on building strong teams
LISA19@szelechoski @kimschles
Agenda
People Process Tools
LISA19@szelechoski @kimschles
Agenda
People Process Tools
LISA19@szelechoski @kimschles
People : Desired Outcomes
● Both Dev and Ops teams are willing participants in a strong
partnership
● Shared goals and direction are key to both teams’ identities
● Teams are thoughtful of and accountable to each other for
their actions
LISA19@szelechoski @kimschles
People: Recommendation 1
Develop a Group Narrative
Who are we?
LISA19@szelechoski @kimschles
Develop a Group Narrative
● Start with a history lesson
○ Understand existing struggles
○ Share impetus for change
● Create a relatable vision
○ Know what problems you intend to solve together
○ Envision a better future
○ Insert symbolism as reminders
● Everyone should be able to tell the story
○ Culture is learned and passed on by members
LISA19@szelechoski @kimschles
LISA19@szelechoski @kimschles
LISA19@szelechoski @kimschles
People: Recommendation 2
Commit to Shared Values
How do we work together?
LISA19@szelechoski @kimschles
Commit to Shared Values
● Ask yourselves challenging questions
● Focus on human interactions
○ How people should treat each other
○ Not skills or capabilities
● Keep it short & make it memorable
○ List of 3 or 4
○ Acronyms are useful
● Require commitment and active participation
○ All levels of the organization
LISA19@szelechoski @kimschles
LISA19@szelechoski @kimschles
People: Recommendation 3
Self Regulation
We’re all in this together
LISA19@szelechoski @kimschles
Self Regulation
● Create simple prompts that allow individuals to enforce
shared values
○ Grassroots
○ Frictionless, simple
● Set the expectation that feedback is welcome and necessary
○ Introspection not conflict
LISA19@szelechoski @kimschles
🐝
LISA19@szelechoski @kimschles
LISA19@szelechoski @kimschles
We don’t do that here.
LISA19@szelechoski @kimschles
LISA19@szelechoski @kimschles
Agenda
People Process Tools
LISA19@szelechoski @kimschles
Process: Desired Outcomes
● Both teams understand and maintain their individual
responsibilities, while also helping in the gray area
● Dev and Ops teams communicate openly and often
● All work happens in a transparent fashion and understanding
is built through sharing context
LISA19@szelechoski @kimschles
Process: Recommendation 1
Define Shared Responsibilities
LISA19@szelechoski @kimschles
Define Shared Responsibility
● Define areas of responsibility, not tasks
○ Who is responsible for what vs. Who can do what
○ There will be a gray area
● Where lines are fuzzy, help each other
○ Make it ok to ask for assistance
○ Do your due diligence
LISA19@szelechoski @kimschles
OPS
DEV
APP CONFIG SECRETS CONTAINERIZATION
APP BUILD, INTEGRATION, TEST & DEPLOY
APP HEALTH, SCALING, PERFORMANCE, TUNING
APP UPTIME & MONITORING
INFRASTRUCTURE UPTIME & MONITORING
CLOUD PROVIDER NETWORKING INSTANCES
DNS IAM RBAC SCALING
INFRASTRUCTURE AS CODE & CLUSTER CONFIG
LISA19@szelechoski @kimschles
OPS
DEV
APP CONFIG SECRETS CONTAINERIZATION
APP BUILD, INTEGRATION, TEST & DEPLOY
APP HEALTH, SCALING, PERFORMANCE, TUNING
APP UPTIME & MONITORING
INFRASTRUCTURE UPTIME & MONITORING
CLOUD PROVIDER NETWORKING INSTANCES
DNS IAM RBAC SCALING
INFRASTRUCTURE AS CODE & CLUSTER CONFIG
LISA19@szelechoski @kimschles
Define Shared Responsibility
● Define areas of responsibility, not tasks
○ Who is responsible for what vs. Who can do what
○ There will be a gray area
● Where lines are fuzzy, help each other
○ Make it ok to ask for assistance
○ Do your due diligence
●
LISA19@szelechoski @kimschles
Where lines are fuzzy,
help each other
LISA19@szelechoski @kimschles
Process: Recommendation 2
Shared Slack Channels
LISA19@szelechoski @kimschles
Shared Slack Channels
● Targeted discourse and conversation
○ Paired real-time or asynchronous troubleshooting
● Being open and public
○ Opens the relationship
○ People feel like they can ask for help
○ Inherent awareness
○ Postmortems
LISA19@szelechoski @kimschles
Process: Recommendation 3
Weekly Syncs
LISA19@szelechoski @kimschles
Weekly Syncs
● Increase transparency and understanding
○ Teams are doing work that affects each other
○ Important to know what is happening outside your own bubble
○ Understand why certain work is happening; why decisions are being made
● Sync priorities across teams
○ Align to ensure collaboration
● Share the impact and value of work
○ Work for each other
LISA19@szelechoski @kimschles
Agenda
People Process Tools
LISA19@szelechoski @kimschles
Tools: Desired Outcomes
● Monitor your infrastructure system and workloads, both ops
and dev
● Increase confidence in monitoring
● Decrease time to resolution
LISA19@szelechoski @kimschles
Tools: Recommendation 1
Shared Monitoring Platform
LISA19@szelechoski @kimschles
Ops monitors and alerts
SlackPagerDuty
LISA19@szelechoski @kimschles
Ops monitors and alerts Dev monitors and alerts
??SlackPagerDuty
LISA19@szelechoski @kimschles
Shared Monitoring Platform
Ops monitors and alerts Dev monitors and alerts
??SlackPagerDuty
LISA19@szelechoski @kimschles
LISA19@szelechoski @kimschles
LISA19@szelechoski @kimschles
LISA19@szelechoski @kimschles
LISA19@szelechoski @kimschles
LISA19@szelechoski @kimschles
The Tale of the
Phantom Scaling
👻
LISA19@szelechoski @kimschles
Classic Elastic Load Balancer Application Load Balancer
LISA19@szelechoski @kimschles
“something unusual going
on with launch today...”
LISA19@szelechoski @kimschles
LISA19@szelechoski @kimschles
LISA19@szelechoski @kimschles
dev
ops
LISA19@szelechoski @kimschles
dev
ops
LISA19@szelechoski @kimschles
Benefits of a Shared Monitoring Platform
● Supports communication in regular meetings and shared
slack channels
● Single pane of glass for both teams
● Team-specific dashboards
● Decreased time to resolution for issues
LISA19@szelechoski @kimschles
Datadog Alternatives
● Honeycomb
● Sensu
● Sysdig
● New Relic
LISA19@szelechoski @kimschles
Tools: Recommendation 2
Monitors as Code
LISA19@szelechoski @kimschles
Monitor Families
● aws-quotas
● aws
● elasticsearch
● gcp-quotas
● gcp
● istio
● kubernetes
● papertrail
● rds
LISA19@szelechoski @kimschles
Kubernetes Monitors
● Cluster disk usage
● Cluster disk usage high
● Cluster memory
● Cluster network errors
● Cronjob failed to start
● Deployment replica alert
● External DNS registry errors
● External DNS source errors
● High node I/O wait time
● HPA failures
● Job failure
● Kube state metrics missing
● Kubelet health
● Nginx config reload failure
● Node not ready
● NTP off
● Pod crashes
● Pods pending
● System load average high
LISA19@szelechoski @kimschles
LISA19@szelechoski @kimschles
LISA19@szelechoski @kimschles
Benefits of Monitors as Code
● Repeatable
● Familiar
● Transparency and vulnerability by sharing work with others
● Collaboration via PRs and code reviews
● More accessible for people who use screen readers
LISA19@szelechoski @kimschles
You can navigate the fuzzy line
between ops and dev by focusing on
the relationship between teams.
LISA19@szelechoski @kimschles
Recap
LISA19@szelechoski @kimschles
People
Develop a Group Narrative
Commit to Shared Values
Self Regulation
LISA19@szelechoski @kimschles
Process
Define Shared Responsibilities
Shared Slack Channels
Weekly Syncs
LISA19@szelechoski @kimschles
Tools
Shared Monitoring Platform
Monitors as Code
LISA19@szelechoski @kimschles
We’re Hiring!
https://www.fairwinds.com/careers
LISA19@szelechoski @kimschles
Resources
● Fairwinds
● How to Setup a Shared Slack Channel
● Datadog
● Terraform: Datadog Monitor Resource
LISA19@szelechoski @kimschles
Thank You!
Sarah Zelechoski
@szelechoski
🌈✨
Kim Schlesinger
@kimschles
😄
kimschlesinger.com

More Related Content

Similar to Fuzzy lines: aligning teams to monitor your application ecosystem

ALN_Nepal-Agile_for_the_real_world
ALN_Nepal-Agile_for_the_real_worldALN_Nepal-Agile_for_the_real_world
ALN_Nepal-Agile_for_the_real_world
Roland Leibundgut
 

Similar to Fuzzy lines: aligning teams to monitor your application ecosystem (20)

Crash Course: Managing Software People And Teams
Crash Course: Managing Software People And TeamsCrash Course: Managing Software People And Teams
Crash Course: Managing Software People And Teams
 
Tackle Your Everyday Business Problems Like an Architect, Melissa Shepard
Tackle Your Everyday Business Problems Like an Architect, Melissa ShepardTackle Your Everyday Business Problems Like an Architect, Melissa Shepard
Tackle Your Everyday Business Problems Like an Architect, Melissa Shepard
 
Evidence-based Change - for Human Capital Analytics at Copenhagen Business Sc...
Evidence-based Change - for Human Capital Analytics at Copenhagen Business Sc...Evidence-based Change - for Human Capital Analytics at Copenhagen Business Sc...
Evidence-based Change - for Human Capital Analytics at Copenhagen Business Sc...
 
The CAP Theorem of Humans
The CAP Theorem of HumansThe CAP Theorem of Humans
The CAP Theorem of Humans
 
Open Source Product Management with KEMP Tech's PM
Open Source Product Management with KEMP Tech's PMOpen Source Product Management with KEMP Tech's PM
Open Source Product Management with KEMP Tech's PM
 
The Latest in DevOps: Elite Performance, Productivity, and Scaling - Google
The Latest in DevOps: Elite Performance, Productivity, and Scaling - GoogleThe Latest in DevOps: Elite Performance, Productivity, and Scaling - Google
The Latest in DevOps: Elite Performance, Productivity, and Scaling - Google
 
Craig Peters: Running Great Review Meetings
Craig Peters: Running Great Review MeetingsCraig Peters: Running Great Review Meetings
Craig Peters: Running Great Review Meetings
 
Aligning Your Organization's Strategic Direction, Roadmaps, and Technology, A...
Aligning Your Organization's Strategic Direction, Roadmaps, and Technology, A...Aligning Your Organization's Strategic Direction, Roadmaps, and Technology, A...
Aligning Your Organization's Strategic Direction, Roadmaps, and Technology, A...
 
Agile Leadership: A Different Mindset
Agile Leadership: A Different MindsetAgile Leadership: A Different Mindset
Agile Leadership: A Different Mindset
 
Related searches at LinkedIn
Related searches at LinkedInRelated searches at LinkedIn
Related searches at LinkedIn
 
(PROJEKTURA) agileadria agile for corporations
(PROJEKTURA) agileadria agile for corporations(PROJEKTURA) agileadria agile for corporations
(PROJEKTURA) agileadria agile for corporations
 
Kickstart CTA preparations #sfarchsummit19
Kickstart CTA preparations #sfarchsummit19Kickstart CTA preparations #sfarchsummit19
Kickstart CTA preparations #sfarchsummit19
 
Collaboration Hacks - Corona Edition
Collaboration Hacks - Corona EditionCollaboration Hacks - Corona Edition
Collaboration Hacks - Corona Edition
 
ALN_Nepal-Agile_for_the_real_world
ALN_Nepal-Agile_for_the_real_worldALN_Nepal-Agile_for_the_real_world
ALN_Nepal-Agile_for_the_real_world
 
Agile Tour London 2018: DASHBOARDS AND CULTURE – HOW OPENNESS CHANGES YOUR BE...
Agile Tour London 2018: DASHBOARDS AND CULTURE – HOW OPENNESS CHANGES YOUR BE...Agile Tour London 2018: DASHBOARDS AND CULTURE – HOW OPENNESS CHANGES YOUR BE...
Agile Tour London 2018: DASHBOARDS AND CULTURE – HOW OPENNESS CHANGES YOUR BE...
 
How Calance Simplified Project Management for LAUSD
How Calance Simplified Project Management for LAUSDHow Calance Simplified Project Management for LAUSD
How Calance Simplified Project Management for LAUSD
 
WebVisions2016 Presumptive Design Workshop
WebVisions2016 Presumptive Design WorkshopWebVisions2016 Presumptive Design Workshop
WebVisions2016 Presumptive Design Workshop
 
Docs are everyone's business
Docs are everyone's business   Docs are everyone's business
Docs are everyone's business
 
SMX West 2018 #SMXInsights
SMX West 2018 #SMXInsightsSMX West 2018 #SMXInsights
SMX West 2018 #SMXInsights
 
Best of Show - #SMXInsights from SMX West 2018
Best of Show - #SMXInsights from SMX West 2018Best of Show - #SMXInsights from SMX West 2018
Best of Show - #SMXInsights from SMX West 2018
 

Recently uploaded

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Recently uploaded (20)

Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 

Fuzzy lines: aligning teams to monitor your application ecosystem