SlideShare a Scribd company logo
Automating SRE from “Hello World”
to Enterprise Scale with
Andreas Grabner
DevOps Activist at Dynatrace
DevRel for Keptn
@grabnerandi, https://www.linkedin.com/in/grabnerandi
Star us @ https://github.com/keptn/keptn
Followus @keptnProject
Slack Us @ https://slack.keptn.sh
Visit us @ https://keptn.sh
5
Confidential
Problem #1
„Our DevOps teams is building monolithic automation that becomes
more complex than the microservices we deploy!“
Confidential 6
Too complexautomationpipelines are too hard to maintain!
ChristianHeckelmann
Senior DevOps Engineer
2800
projects
966
CI/CDs
„I am constantlyreacting to
‚Pipeline Broken – please fix!‘“
7
Confidential
Problem #2
„Our DevOps team builds and then copy/pastes tool integrations across
all our automation code due to lack of standards!“
Confidential 8
Duplicated Pipeline Code hard to maintain, extend & optimize
pipeline {
stages {
stage ('Depl oy to dev na mespac e') {
ste ps {
c ontain er(‘he lm’) {
}
}
}
stage ('Run tests' ) {
ste ps {
c ontain er(‘jm eter’) {
}
}
}
stage (‘Eval uate p erform ance’) {
ste ps {
c ontain er(‘cu rl’) {
}
}
}
if (e valuat ion.pa ssed) {
sta ge('De ploy t o stag ing') {
s teps {
conta iner(‘ helm’) {
}
}
}
}
}
}
pipeline {
stages {
stage ('Depl oy to dev na mespac e') {
ste ps {
c ontain er(‘ku stomiz e’) {
}
}
}
stage ('Run tests' ) {
ste ps {
c ontain er(‘jm eter’) {
}
}
}
stage (‘Eval uate p erform ance’) {
ste ps {
c ontain er(‘cu rl’) {
}
}
}
if (e valuat ion.pa ssed) {
sta ge('De ploy t o stag ing') {
s teps {
conta iner(‘ helm’) {
}
}
}
}
}
}
1 Service =
1 Pipeline
1 Project = x Pipelines n Teams = n*x Pipelines
pipeline {
stages {
stage ('Depl oy to dev na mespac e') {
ste ps {
c ontain er(‘he lm’) {
}
}
}
stage ('Run tests' ) {
ste ps {
c ontain er(‘jm eter’) {
}
}
}
stage (‘Eval uate p erform ance’) {
ste ps {
c ontain er(‘cu rl’) {
}
}
}
if (e valuat ion.pa ssed) {
sta ge('De ploy t o stag ing') {
s teps {
conta iner(‘ helm’) {
}
}
}
}
}
}
25
services
96
workloads
Dieter Ladenhauf
Senior ACE Engineer
„Onboarding or updating pipelines
is manual & error prone!‘“
9
Confidential
Problem #3
“Our SRE teams can’t scale as resiliency testing and enforcing SLO
analysis is hard to automate!”
Confidential 10
Too much manual expert workon test execution & SLO Analysis
~90%of test reruns
9:1 ratio
script maintenancevs creation
only 10% projects
performance tested
Test Result Analysis
MonitoringConfiguration
~ 80%time
spent in manual...
Scripts Creation
SLO Report Generation
15-20
tests /
year
< 5
Apps
„We are limited in scaling SRE
due to manual expert tasks!‘“
Roman Ferstl
Managing Director
Confidential 11
3 examples of how Keptn solves these problems ...
Sumit Nagal
Principal Engineer
Roman Ferstl
Managing Director
ChristianHeckelmann
Senior DevOps Engineer
50+
Continuous SLO Validations
15x
Perf tests
10x
tested apps
~60
Services
14
Stages
Tool stack
SLO-based Quality Gate
Automation
Performance & Resiliance
Test Automation
Scaling SRE through
Delivery Automation
Confidential 12
And some nice testimonials fromour community ...
Confidential 13
Keptn: Data-DrivenDelivery& Operations Automation
You
(Dev/Ops/SRE)
bring your configuration
pick your use case
SLO-Quality
Gates
Progressive
Delivery
Auto-
Remediation
Declaration GitOps SLOs Standards
SLI/SLO shipyard runbook
SRE
Automation
workload
Monitoring Delivery Reliability Remediation
automates configuration and provides self-service for
through event-driven process orchestrationbased on
connect your tools
14
Confidential
Keptn Architecture
Driven by new requirements in the cloud native world
Confidential 15
We lookedat the fundamential problems our users have
Mixed information about
• Process (build,deploy,test,evaluate, …)
• Target platform (k8s, …)
• Environments(dev, hardening,…)
• Tools (Terraform,Helm, hey, …)
No clear separation of concerns
• Developers
• Define which artifact to use
• Want fast feedback on their code
• DevOps Engineers
• Define which tools to use
• Ensure tools are properly configured
• Site ReliabilityEngineers
• Define delivery processes
• Define operations workflows
Confidential 16
Solution: Remove hard dependencies and integrations
Build
Prepare
Deploy
Test
Notify
Rollback
Config Mgmt.
Deploy
Test
Monitoring
ChatOps
Rollback
Confidential 17
Solution: Remove hard dependencies and integrations
Build
Prepare
Deploy
Test
Notify
Rollback
Config Mgmt.
Deploy
Test
Monitoring
ChatOps
Rollback
Eventing
Event:Deploy
Artifact:container1
Stage:Dev
Strategy:Blue/Green
which events to generate ➔ Process Definition who consumesevents ➔ Tool Definition
Confidential 18
Eventing
Solution:Keptn is built on an architecture that supports this paradigm
Application Plane (=Process Definition)
Define overall process for delivery and operations
Control Plane
Follow application logic and communicate/configure required services
API
Site Reliability
Engineer
DevOps
Developer
uniform.yaml
config-change*: helm
deploy*: JMeter
deploy-finish: Lighthouse
problem*: Remediation
all: Slack, Dynatrace
Execution Plane (=Tool Definition)
Deploy Service
(Helm, Jenkins …)
Test Service
(JMeter, Neotys, ..)
Validation Service
(Keptn Lighthouse …)
Remediation Service
(Keptn Remediation, SNOW …)
Config Service
(Git, …)
Monitoring Service
(Prometheus,
Dynatrace, …)
Artifact /
Microservice
config.change:artifact:x.y deploy.finished:http://service1 tests.finished:OK evaluation.done:98% Score problem.open:HighFailure
shipyard.yaml
- dev: direct, functional, SLO
- staging: B/G, perf, SLO
- prod: canary, real-user, SLA
remediation.yaml
- high-failure-rate:
- scaleup, rollback
- full-disk:
- cleandir;adjustlog-level
Confidential 19
Demo #1: SLO-Driven Progressive Delivery and Operations Automation with Keptn
Developer
Pull Request
(1)Deploy & Test
(2)Evaluate SLOs
(3)Auto-Promote
(4)Deploy & Test
(5)Evaluate SLOs
(7)Deploy Blue / Green
(8)Evaluate SLOs
(1)Action 1: Scale Up
(2)Evaluate SLOs
(3)Action 2: Roll Back
(6)Promote? (9)Toggle Blue/Green
#1 - IncreasedSpeedand Quality of ProgressiveDelivery through Automation #2 - Automate Operations
(10)Re-Evaluate SLOs (4)Evaluate SLOs
Closed-Loop Remediation
Observability
Confidential 21
User Example: ProgressiveDelivery with Keptn
Patrick Hofmann
Sr. Consultant
CI
CD
22
Confidential
Data Driven Automation
SLOs to connect Ops with Dev
Confidential 23
RootCause: Lengthy manual approval
Build
Deployto
„Test“
Run Test
In „Test“ Manual Approval
Promote to
„Staging“
Functional: Test Result Trend Not Enough Performance: ManualComparison Is Slow Monitoring: Too much unstructed data
~30-60min
Which metrics are important
and which build is therefore better
Which data comesfrom my test
and is relevantfor business transactions
Is this regression impacting
key business use cases
Confidential 24
Learning from Google‘s SRE Practices
• Service Level Indicators(SLIs)
• Definition:Measurable Metrics as the base for evaluation
• Example:ErrorRate of Login Requests
• Service Level Objectives (SLOs)
• Definition:Bindingtargets for Service Level Indicators
• Example:Login Error Rate must be less than 2% over a 30 day period
• Service Level Agreements (SLAs)
• Definition:Business Agreement between consumer and provider typicallybasedon SLO
• Example:Logins must be reliable & fast (Error Rate, Response Time, Throughput)99% within a 30 daywindow
• Google Cloud YouTube Video
• SLIs, SLOs, SLAs, oh my! (class SRE implements DevOps): https://www.youtube.com/watch?v=tEylFyxbDLE
SLIs drive SLOs which inform SLAs
Confidential 25
Applying SRE Best Practices across the lifecycle
Authentication Service
0.89s
0.5%
May 2020 June 2020
0.61s
2.5%
1000/s 1600/s
Service X
xxs
xx% yys
yy%
xx/s yy/s
Production
Shift-Left
Continuous Delivery
Authentication Service
Commit
#1
Commit
#2
Commit
#3
Commit
#4
Service X
Quality Gates
Confidential 26
Explainer on SLI/SLO Validation as part of Continuous Delivery with Dynatrace & Keptn!
Overall Failure Rate
Query: builtin:service.errors.total
Test Step LOGIN Response Time
Query: calc:service.teststeprt:filter(Test, LOGIN)
Test Step LOGIN # Service Calls
Query: calc:service.testsvc:filter(tx, LOGIN)
<= 5%
<= 2%
<=150ms & <=+10%
<= 400ms
<= +0%
Build 1
0%
80ms
100ms
SLO: Overall Score Goal 90% 75%
Response Time 95th Perc
Query: builtin:service.responsetime(p95)
<=100ms
<= 250ms
SLO
SLIs (Service Level Indicators) warn
pass
1
100%
Build 2
4%
120ms
90ms
1
75%
Build 3
1%
90ms
120ms
2
62.5%
Build 4
0%
95ms
95ms
1
100%
Build 1 Build 2 Build 3 Build 4
$ keptn send event start-evaluation myproject myservice starttime=build1_deploy endtime=build1_testsdone
$ keptn send event start-evaluation myproject myservice starttime=build2_deploy endtime=build2_testsdone
$ keptn send event start-evaluation myproject myservice starttime=build3_deploy endtime=build3_testsdone
$ keptn send event start-evaluation myproject myservice starttime=build4_teststart endtime=build4_testsend
DevOps
Confidential 27
SLI/SLO-based evaluationimplementationin Keptn
SLIs defined per SLI Provider as YAML
SLI Provider specific queries,e.g: DynatraceMetrics Query
Quality Gates
...
Dynatrace Prometheus Neoload
Scores SLIs
Queries SLI
Providers with
SLI Definitions &
Timeframe
SLOs defined on Keptn Service Level as YAML
List of objectives with fixed or relative pass & warn criteria
indicators:
error_rate: "builtin:service.errors.total.count:merge(0):avg"
count_dbcalls: "calc:service.toptestdbcalls:merge(0):sum"
jvm_memory: "builtin:tech.jvm.memory.pool.committed:merge(0):sum"
objectives:
- sli: error_rate
pass:
- criteria:
- "<=1“ # We expect a max error rate of 1%
- sli: jvm_memory
- sli: count_dbcalls
pass:
- criteria:
- "=+2%" # We allow a 2% increase in DB Calls to previous runs
warning:
- criteria:
- "<=10" # We expect no more than 10 DB Calls per TX
total_score:
pass: "90%"
warning: "75%"
0.5 1.0 0.0 info
7/8
(87.5%)
4/8
(50%)
$ keptn start-evaluation 30m myservice sli.yaml slo.yaml
5 DB Calls 360MB 4.3% 123
SLI Value:
SLI Score:
Total Score
2
3
4
Tool X
1
Confidential 28
Demo: Automated SLI/SLO Validationbased on Dynatrace Dashboards
You: Just build a dashboard! : Automates the analysis!
15.5/16
(97%)
8/16
(50%)
Confidential 29
User Example: Automating Build Approvals using Keptn‘s SLIs/SLOs in GitLab
Christian Heckelmann
Senior Systems Engineer
87.5%: passed
AutomatedSLI/SLO based Quality Gates
Trigger Evaluation
30
Confidential
Data Driven Operations
Remediation focused on your SLOs
Confidential 31
Keptn – Closed-LoopRemediation
version: 0.2.0
kind: Remediation
metadata:
name: remediation-ecommerce
spec:
remediations:
- problemType: Conversion Rate Dropped
actionsOnOpen:
- name: Scaling ReplicaSet by 1
action: scaling
values:
increment: +1
- name: Stop Ad Campaign
action: googleadtoggle
values:
enable: off
campaign: $campaignid
Problem: Conversion Rate Dropped
Root Cause: CPU Pressure
Get
remediation
action(s)
Execute
remediation
action(s)
Re-validate
SLO/BLO
Escalate
Scale Up
Stop
Campaign
1 2
1 2
1
2
Confidential 32
Too risky? Start in Pre-Prod leveraging Chaos Engineering to define & test Auto-Remediation
version: 0.2.0
kind: Remediation
metadata:
name: remediation-ecommerce
spec:
remediations:
- problemType: High CPU on ReportGen
actionsOnOpen:
- name: Stop Traffic
action: configureLoadBalancer
values:
action: stopTraffic
ip: $problem.hostIp
- name: Restart Process
action: executeAnsible
values:
script: restartProcess
process: $problem.processID
Problem: Slow ReportGen Service
Root Cause: High CPU on host
Get
remediation
action(s)
Execute
remediation
action(s)
Re-validate
SLO/BLO
Escalate
Stop
Traffic
Restart
Process
1 2
1 2
1
2
33
Confidential
Let‘s wrap it up!
Confidential 34
Keptn: Data-DrivenDelivery& Operations Automation
You
(Dev/Ops/SRE)
bring your configuration
pick your use case
SLO-Quality
Gates
Progressive
Delivery
Auto-
Remediation
Declaration GitOps SLOs Standards
shipyard SLI/SLO runbook
SRE
Automation
workload
Monitoring Delivery Reliability Remediation
automates configuration and provides self-service for
through event-driven process orchestrationbased on
connect your tools
Confidential 35
Keptn 0.8 was just releasedearly March2021 and opens new use cases
• Multi-Cluster Setup
Separationof Control & Execution Plane
D
E
V
I
N
T
P
R
D
Confidential 36
Keptn 0.8 was just releasedearly March2021 and opens new use cases
• Shipyard v0.2.0: Explicit vs opinionated sequences
Confidential 37
Get started with our tutorials: tutorials.keptn.sh
Questions & Answers
Automating SRE from “Hello World”
to Enterprise Scale with
Andreas Grabner
DevOps Activist at Dynatrace
DevRel for Keptn
@grabnerandi, https://www.linkedin.com/in/grabnerandi
Star us @ https://github.com/keptn/keptn
Followus @keptnProject
Slack Us @ https://slack.keptn.sh
Visit us @ https://keptn.sh

More Related Content

What's hot

What manufacturing teaches about DevOps
What manufacturing teaches about DevOpsWhat manufacturing teaches about DevOps
What manufacturing teaches about DevOps
Gordon Haff
 
Chef for DevOps - an Introduction
Chef for DevOps - an IntroductionChef for DevOps - an Introduction
Chef for DevOps - an Introduction
Sanjeev Sharma
 
Journée DevOps : De l'intégration continue au déploiement continu avec Jenkins
Journée DevOps : De l'intégration continue au déploiement continu avec JenkinsJournée DevOps : De l'intégration continue au déploiement continu avec Jenkins
Journée DevOps : De l'intégration continue au déploiement continu avec Jenkins
Publicis Sapient Engineering
 
Continuously serving the developer community with Continuous Integration and...
Continuously serving the developer community with  Continuous Integration and...Continuously serving the developer community with  Continuous Integration and...
Continuously serving the developer community with Continuous Integration and...
Thoughtworks
 
Performance monitoring - Adoniram Mishra, Rupesh Dubey, ThoughtWorks
Performance monitoring - Adoniram Mishra, Rupesh Dubey, ThoughtWorksPerformance monitoring - Adoniram Mishra, Rupesh Dubey, ThoughtWorks
Performance monitoring - Adoniram Mishra, Rupesh Dubey, ThoughtWorks
Thoughtworks
 
PuppetConf 2016: Continuous Delivery and DevOps with Jenkins and Puppet Enter...
PuppetConf 2016: Continuous Delivery and DevOps with Jenkins and Puppet Enter...PuppetConf 2016: Continuous Delivery and DevOps with Jenkins and Puppet Enter...
PuppetConf 2016: Continuous Delivery and DevOps with Jenkins and Puppet Enter...
Puppet
 
Database CI/CD Pipeline
Database CI/CD PipelineDatabase CI/CD Pipeline
Database CI/CD Pipeline
muhammadhashir57
 
JavaOne 2015 - Swimming upstream in the container revolution
JavaOne 2015 - Swimming upstream in the container revolutionJavaOne 2015 - Swimming upstream in the container revolution
JavaOne 2015 - Swimming upstream in the container revolution
Bert Jan Schrijver
 
Building an SRE Organization @ Squarespace
Building an SRE Organization @ SquarespaceBuilding an SRE Organization @ Squarespace
Building an SRE Organization @ Squarespace
Franklin Angulo
 
Using Habitat to Unify Dev to CI to Production - Configmgmt Camp Feb/2018 Gent
Using Habitat to Unify Dev to CI to Production - Configmgmt Camp Feb/2018 GentUsing Habitat to Unify Dev to CI to Production - Configmgmt Camp Feb/2018 Gent
Using Habitat to Unify Dev to CI to Production - Configmgmt Camp Feb/2018 Gent
Salim Afiune Maya
 
DevOps, Common use cases, Architectures, Best Practices
DevOps, Common use cases, Architectures, Best PracticesDevOps, Common use cases, Architectures, Best Practices
DevOps, Common use cases, Architectures, Best Practices
Shiva Narayanaswamy
 
Infrastructure and Compliance Delight with Chef Automate
Infrastructure and Compliance Delight with Chef AutomateInfrastructure and Compliance Delight with Chef Automate
Infrastructure and Compliance Delight with Chef Automate
Matt Ray
 
How To Be a Java Automated Testing Superstar
How To Be a Java Automated Testing SuperstarHow To Be a Java Automated Testing Superstar
How To Be a Java Automated Testing Superstar
VMware Tanzu
 
Cloud native development without the toil
Cloud native development without the toilCloud native development without the toil
Cloud native development without the toil
Ambassador Labs
 
JavaOne 2015: Top Performance Patterns Deep Dive
JavaOne 2015: Top Performance Patterns Deep DiveJavaOne 2015: Top Performance Patterns Deep Dive
JavaOne 2015: Top Performance Patterns Deep Dive
Andreas Grabner
 
Metrics-driven Continuous Delivery
Metrics-driven Continuous DeliveryMetrics-driven Continuous Delivery
Metrics-driven Continuous Delivery
Andrew Phillips
 
DevOps Days Toronto: From 6 Months Waterfall to 1 hour Code Deploys
DevOps Days Toronto: From 6 Months Waterfall to 1 hour Code DeploysDevOps Days Toronto: From 6 Months Waterfall to 1 hour Code Deploys
DevOps Days Toronto: From 6 Months Waterfall to 1 hour Code Deploys
Andreas Grabner
 
Jeffrey Snover - Empowering DevOps with Azure Stack
Jeffrey Snover - Empowering DevOps with Azure StackJeffrey Snover - Empowering DevOps with Azure Stack
Jeffrey Snover - Empowering DevOps with Azure Stack
WinOps Conf
 
Release Readiness Validation with Keptn for Austrian Online Banking Software
Release Readiness Validation with Keptn for Austrian Online Banking SoftwareRelease Readiness Validation with Keptn for Austrian Online Banking Software
Release Readiness Validation with Keptn for Austrian Online Banking Software
Andreas Grabner
 
Compliance Automation with InSpec
Compliance Automation with InSpecCompliance Automation with InSpec
Compliance Automation with InSpec
Nathen Harvey
 

What's hot (20)

What manufacturing teaches about DevOps
What manufacturing teaches about DevOpsWhat manufacturing teaches about DevOps
What manufacturing teaches about DevOps
 
Chef for DevOps - an Introduction
Chef for DevOps - an IntroductionChef for DevOps - an Introduction
Chef for DevOps - an Introduction
 
Journée DevOps : De l'intégration continue au déploiement continu avec Jenkins
Journée DevOps : De l'intégration continue au déploiement continu avec JenkinsJournée DevOps : De l'intégration continue au déploiement continu avec Jenkins
Journée DevOps : De l'intégration continue au déploiement continu avec Jenkins
 
Continuously serving the developer community with Continuous Integration and...
Continuously serving the developer community with  Continuous Integration and...Continuously serving the developer community with  Continuous Integration and...
Continuously serving the developer community with Continuous Integration and...
 
Performance monitoring - Adoniram Mishra, Rupesh Dubey, ThoughtWorks
Performance monitoring - Adoniram Mishra, Rupesh Dubey, ThoughtWorksPerformance monitoring - Adoniram Mishra, Rupesh Dubey, ThoughtWorks
Performance monitoring - Adoniram Mishra, Rupesh Dubey, ThoughtWorks
 
PuppetConf 2016: Continuous Delivery and DevOps with Jenkins and Puppet Enter...
PuppetConf 2016: Continuous Delivery and DevOps with Jenkins and Puppet Enter...PuppetConf 2016: Continuous Delivery and DevOps with Jenkins and Puppet Enter...
PuppetConf 2016: Continuous Delivery and DevOps with Jenkins and Puppet Enter...
 
Database CI/CD Pipeline
Database CI/CD PipelineDatabase CI/CD Pipeline
Database CI/CD Pipeline
 
JavaOne 2015 - Swimming upstream in the container revolution
JavaOne 2015 - Swimming upstream in the container revolutionJavaOne 2015 - Swimming upstream in the container revolution
JavaOne 2015 - Swimming upstream in the container revolution
 
Building an SRE Organization @ Squarespace
Building an SRE Organization @ SquarespaceBuilding an SRE Organization @ Squarespace
Building an SRE Organization @ Squarespace
 
Using Habitat to Unify Dev to CI to Production - Configmgmt Camp Feb/2018 Gent
Using Habitat to Unify Dev to CI to Production - Configmgmt Camp Feb/2018 GentUsing Habitat to Unify Dev to CI to Production - Configmgmt Camp Feb/2018 Gent
Using Habitat to Unify Dev to CI to Production - Configmgmt Camp Feb/2018 Gent
 
DevOps, Common use cases, Architectures, Best Practices
DevOps, Common use cases, Architectures, Best PracticesDevOps, Common use cases, Architectures, Best Practices
DevOps, Common use cases, Architectures, Best Practices
 
Infrastructure and Compliance Delight with Chef Automate
Infrastructure and Compliance Delight with Chef AutomateInfrastructure and Compliance Delight with Chef Automate
Infrastructure and Compliance Delight with Chef Automate
 
How To Be a Java Automated Testing Superstar
How To Be a Java Automated Testing SuperstarHow To Be a Java Automated Testing Superstar
How To Be a Java Automated Testing Superstar
 
Cloud native development without the toil
Cloud native development without the toilCloud native development without the toil
Cloud native development without the toil
 
JavaOne 2015: Top Performance Patterns Deep Dive
JavaOne 2015: Top Performance Patterns Deep DiveJavaOne 2015: Top Performance Patterns Deep Dive
JavaOne 2015: Top Performance Patterns Deep Dive
 
Metrics-driven Continuous Delivery
Metrics-driven Continuous DeliveryMetrics-driven Continuous Delivery
Metrics-driven Continuous Delivery
 
DevOps Days Toronto: From 6 Months Waterfall to 1 hour Code Deploys
DevOps Days Toronto: From 6 Months Waterfall to 1 hour Code DeploysDevOps Days Toronto: From 6 Months Waterfall to 1 hour Code Deploys
DevOps Days Toronto: From 6 Months Waterfall to 1 hour Code Deploys
 
Jeffrey Snover - Empowering DevOps with Azure Stack
Jeffrey Snover - Empowering DevOps with Azure StackJeffrey Snover - Empowering DevOps with Azure Stack
Jeffrey Snover - Empowering DevOps with Azure Stack
 
Release Readiness Validation with Keptn for Austrian Online Banking Software
Release Readiness Validation with Keptn for Austrian Online Banking SoftwareRelease Readiness Validation with Keptn for Austrian Online Banking Software
Release Readiness Validation with Keptn for Austrian Online Banking Software
 
Compliance Automation with InSpec
Compliance Automation with InSpecCompliance Automation with InSpec
Compliance Automation with InSpec
 

Similar to Auto sre with keptn

A Guide to Event-Driven SRE-inspired DevOps
A Guide to Event-Driven SRE-inspired DevOpsA Guide to Event-Driven SRE-inspired DevOps
A Guide to Event-Driven SRE-inspired DevOps
Andreas Grabner
 
Using SLOs for Continuous Performance Optimizations of Your k8s Workloads
Using SLOs for Continuous Performance Optimizations of Your k8s WorkloadsUsing SLOs for Continuous Performance Optimizations of Your k8s Workloads
Using SLOs for Continuous Performance Optimizations of Your k8s Workloads
ScyllaDB
 
Performance Engineering Masterclass: Efficient Automation with the Help of SR...
Performance Engineering Masterclass: Efficient Automation with the Help of SR...Performance Engineering Masterclass: Efficient Automation with the Help of SR...
Performance Engineering Masterclass: Efficient Automation with the Help of SR...
ScyllaDB
 
Jenkins Online Meetup - Automated SLI based Build Validation with Keptn
Jenkins Online Meetup - Automated SLI based Build Validation with KeptnJenkins Online Meetup - Automated SLI based Build Validation with Keptn
Jenkins Online Meetup - Automated SLI based Build Validation with Keptn
Andreas Grabner
 
Chicago DevOps Meetup Nov2019
Chicago DevOps Meetup Nov2019Chicago DevOps Meetup Nov2019
Chicago DevOps Meetup Nov2019
Mike Villiger
 
Metrics driven dev ops 2017
Metrics driven dev ops 2017Metrics driven dev ops 2017
Metrics driven dev ops 2017
Jerry Tan
 
Don't Deploy Into the Dark: DORA Metrics for your K8s GitOps Deployments
Don't Deploy Into the Dark: DORA Metrics for your K8s GitOps DeploymentsDon't Deploy Into the Dark: DORA Metrics for your K8s GitOps Deployments
Don't Deploy Into the Dark: DORA Metrics for your K8s GitOps Deployments
Andreas Grabner
 
Overcoming (organizational) scalability issues in your Prometheus ecosystem
Overcoming (organizational) scalability issues in your Prometheus ecosystemOvercoming (organizational) scalability issues in your Prometheus ecosystem
Overcoming (organizational) scalability issues in your Prometheus ecosystem
QAware GmbH
 
Overcoming scalability issues in your prometheus ecosystem
Overcoming scalability issues in your prometheus ecosystemOvercoming scalability issues in your prometheus ecosystem
Overcoming scalability issues in your prometheus ecosystem
Nebulaworks
 
Building Autonomous Operations for Kubernetes with keptn
Building Autonomous Operations for Kubernetes with keptnBuilding Autonomous Operations for Kubernetes with keptn
Building Autonomous Operations for Kubernetes with keptn
Johannes Bräuer
 
Adding Security to your SLO-based Release Validation with Keptn
Adding Security to your SLO-based Release Validation with KeptnAdding Security to your SLO-based Release Validation with Keptn
Adding Security to your SLO-based Release Validation with Keptn
Andreas Grabner
 
Continuous Testing
Continuous TestingContinuous Testing
Continuous Testing
Karim Fanadka
 
Continuous Testing 2016
Continuous Testing 2016Continuous Testing 2016
Continuous Testing 2016
Karim Fanadka
 
Keptn: Unbreakable Continuous Delivery - Berlin CI/CD Meetup
Keptn: Unbreakable Continuous Delivery - Berlin CI/CD MeetupKeptn: Unbreakable Continuous Delivery - Berlin CI/CD Meetup
Keptn: Unbreakable Continuous Delivery - Berlin CI/CD Meetup
Jürgen Etzlstorfer
 
Surviving the Script-apocalypse
Surviving the Script-apocalypseSurviving the Script-apocalypse
Surviving the Script-apocalypse
DevOps.com
 
Continuous Delivery: How RightScale Releases Weekly
Continuous Delivery: How RightScale Releases WeeklyContinuous Delivery: How RightScale Releases Weekly
Continuous Delivery: How RightScale Releases Weekly
RightScale
 
Devops is all greek
Devops is all greekDevops is all greek
Devops is all greek
Lori MacVittie
 
When Web Services Go Bad
When Web Services Go BadWhen Web Services Go Bad
When Web Services Go Bad
Steve Loughran
 
Neotys PAC - Stijn Schepers
Neotys PAC - Stijn SchepersNeotys PAC - Stijn Schepers
Neotys PAC - Stijn Schepers
Neotys_Partner
 
Karim Fanadka
Karim FanadkaKarim Fanadka
Karim Fanadka
CodeFest
 

Similar to Auto sre with keptn (20)

A Guide to Event-Driven SRE-inspired DevOps
A Guide to Event-Driven SRE-inspired DevOpsA Guide to Event-Driven SRE-inspired DevOps
A Guide to Event-Driven SRE-inspired DevOps
 
Using SLOs for Continuous Performance Optimizations of Your k8s Workloads
Using SLOs for Continuous Performance Optimizations of Your k8s WorkloadsUsing SLOs for Continuous Performance Optimizations of Your k8s Workloads
Using SLOs for Continuous Performance Optimizations of Your k8s Workloads
 
Performance Engineering Masterclass: Efficient Automation with the Help of SR...
Performance Engineering Masterclass: Efficient Automation with the Help of SR...Performance Engineering Masterclass: Efficient Automation with the Help of SR...
Performance Engineering Masterclass: Efficient Automation with the Help of SR...
 
Jenkins Online Meetup - Automated SLI based Build Validation with Keptn
Jenkins Online Meetup - Automated SLI based Build Validation with KeptnJenkins Online Meetup - Automated SLI based Build Validation with Keptn
Jenkins Online Meetup - Automated SLI based Build Validation with Keptn
 
Chicago DevOps Meetup Nov2019
Chicago DevOps Meetup Nov2019Chicago DevOps Meetup Nov2019
Chicago DevOps Meetup Nov2019
 
Metrics driven dev ops 2017
Metrics driven dev ops 2017Metrics driven dev ops 2017
Metrics driven dev ops 2017
 
Don't Deploy Into the Dark: DORA Metrics for your K8s GitOps Deployments
Don't Deploy Into the Dark: DORA Metrics for your K8s GitOps DeploymentsDon't Deploy Into the Dark: DORA Metrics for your K8s GitOps Deployments
Don't Deploy Into the Dark: DORA Metrics for your K8s GitOps Deployments
 
Overcoming (organizational) scalability issues in your Prometheus ecosystem
Overcoming (organizational) scalability issues in your Prometheus ecosystemOvercoming (organizational) scalability issues in your Prometheus ecosystem
Overcoming (organizational) scalability issues in your Prometheus ecosystem
 
Overcoming scalability issues in your prometheus ecosystem
Overcoming scalability issues in your prometheus ecosystemOvercoming scalability issues in your prometheus ecosystem
Overcoming scalability issues in your prometheus ecosystem
 
Building Autonomous Operations for Kubernetes with keptn
Building Autonomous Operations for Kubernetes with keptnBuilding Autonomous Operations for Kubernetes with keptn
Building Autonomous Operations for Kubernetes with keptn
 
Adding Security to your SLO-based Release Validation with Keptn
Adding Security to your SLO-based Release Validation with KeptnAdding Security to your SLO-based Release Validation with Keptn
Adding Security to your SLO-based Release Validation with Keptn
 
Continuous Testing
Continuous TestingContinuous Testing
Continuous Testing
 
Continuous Testing 2016
Continuous Testing 2016Continuous Testing 2016
Continuous Testing 2016
 
Keptn: Unbreakable Continuous Delivery - Berlin CI/CD Meetup
Keptn: Unbreakable Continuous Delivery - Berlin CI/CD MeetupKeptn: Unbreakable Continuous Delivery - Berlin CI/CD Meetup
Keptn: Unbreakable Continuous Delivery - Berlin CI/CD Meetup
 
Surviving the Script-apocalypse
Surviving the Script-apocalypseSurviving the Script-apocalypse
Surviving the Script-apocalypse
 
Continuous Delivery: How RightScale Releases Weekly
Continuous Delivery: How RightScale Releases WeeklyContinuous Delivery: How RightScale Releases Weekly
Continuous Delivery: How RightScale Releases Weekly
 
Devops is all greek
Devops is all greekDevops is all greek
Devops is all greek
 
When Web Services Go Bad
When Web Services Go BadWhen Web Services Go Bad
When Web Services Go Bad
 
Neotys PAC - Stijn Schepers
Neotys PAC - Stijn SchepersNeotys PAC - Stijn Schepers
Neotys PAC - Stijn Schepers
 
Karim Fanadka
Karim FanadkaKarim Fanadka
Karim Fanadka
 

More from LibbySchulze

Running distributed tests with k6.pdf
Running distributed tests with k6.pdfRunning distributed tests with k6.pdf
Running distributed tests with k6.pdf
LibbySchulze
 
Extending Kubectl.pptx
Extending Kubectl.pptxExtending Kubectl.pptx
Extending Kubectl.pptx
LibbySchulze
 
Enhancing Data Protection Workflows with Kanister And Argo Workflows
Enhancing Data Protection Workflows with Kanister And Argo WorkflowsEnhancing Data Protection Workflows with Kanister And Argo Workflows
Enhancing Data Protection Workflows with Kanister And Argo Workflows
LibbySchulze
 
Fallacies in Platform Engineering.pdf
Fallacies in Platform Engineering.pdfFallacies in Platform Engineering.pdf
Fallacies in Platform Engineering.pdf
LibbySchulze
 
Intro to Fluvio.pptx.pdf
Intro to Fluvio.pptx.pdfIntro to Fluvio.pptx.pdf
Intro to Fluvio.pptx.pdf
LibbySchulze
 
Enhance your Kafka Infrastructure with Fluvio.pptx
Enhance your Kafka Infrastructure with Fluvio.pptxEnhance your Kafka Infrastructure with Fluvio.pptx
Enhance your Kafka Infrastructure with Fluvio.pptx
LibbySchulze
 
CNCF On-Demand Webinar_ LitmusChaos Project Updates.pdf
CNCF On-Demand Webinar_ LitmusChaos Project Updates.pdfCNCF On-Demand Webinar_ LitmusChaos Project Updates.pdf
CNCF On-Demand Webinar_ LitmusChaos Project Updates.pdf
LibbySchulze
 
Oh The Places You'll Sign.pdf
Oh The Places You'll Sign.pdfOh The Places You'll Sign.pdf
Oh The Places You'll Sign.pdf
LibbySchulze
 
Rancher MasterClass - Avoiding-configuration-drift.pptx
Rancher  MasterClass - Avoiding-configuration-drift.pptxRancher  MasterClass - Avoiding-configuration-drift.pptx
Rancher MasterClass - Avoiding-configuration-drift.pptx
LibbySchulze
 
vFunction Konveyor Meetup - Why App Modernization Projects Fail - Aug 2022.pptx
vFunction Konveyor Meetup - Why App Modernization Projects Fail - Aug 2022.pptxvFunction Konveyor Meetup - Why App Modernization Projects Fail - Aug 2022.pptx
vFunction Konveyor Meetup - Why App Modernization Projects Fail - Aug 2022.pptx
LibbySchulze
 
CNCF Live Webinar: Low Footprint Java Containers with GraalVM
CNCF Live Webinar: Low Footprint Java Containers with GraalVMCNCF Live Webinar: Low Footprint Java Containers with GraalVM
CNCF Live Webinar: Low Footprint Java Containers with GraalVM
LibbySchulze
 
EnRoute-OPA-Integration.pdf
EnRoute-OPA-Integration.pdfEnRoute-OPA-Integration.pdf
EnRoute-OPA-Integration.pdf
LibbySchulze
 
AirGap_zusammen_neu.pdf
AirGap_zusammen_neu.pdfAirGap_zusammen_neu.pdf
AirGap_zusammen_neu.pdf
LibbySchulze
 
Copy of OTel Me All About OpenTelemetry The Current & Future State, Navigatin...
Copy of OTel Me All About OpenTelemetry The Current & Future State, Navigatin...Copy of OTel Me All About OpenTelemetry The Current & Future State, Navigatin...
Copy of OTel Me All About OpenTelemetry The Current & Future State, Navigatin...
LibbySchulze
 
OTel Me All About OpenTelemetry The Current & Future State, Navigating the Pr...
OTel Me All About OpenTelemetry The Current & Future State, Navigating the Pr...OTel Me All About OpenTelemetry The Current & Future State, Navigating the Pr...
OTel Me All About OpenTelemetry The Current & Future State, Navigating the Pr...
LibbySchulze
 
CNCF_ A step to step guide to platforming your delivery setup.pdf
CNCF_ A step to step guide to platforming your delivery setup.pdfCNCF_ A step to step guide to platforming your delivery setup.pdf
CNCF_ A step to step guide to platforming your delivery setup.pdf
LibbySchulze
 
CNCF Online - Data Protection Guardrails using Open Policy Agent (OPA).pdf
CNCF Online - Data Protection Guardrails using Open Policy Agent (OPA).pdfCNCF Online - Data Protection Guardrails using Open Policy Agent (OPA).pdf
CNCF Online - Data Protection Guardrails using Open Policy Agent (OPA).pdf
LibbySchulze
 
Securing Windows workloads.pdf
Securing Windows workloads.pdfSecuring Windows workloads.pdf
Securing Windows workloads.pdf
LibbySchulze
 
Securing Windows workloads.pdf
Securing Windows workloads.pdfSecuring Windows workloads.pdf
Securing Windows workloads.pdf
LibbySchulze
 
Advancements in Kubernetes Workload Identity for Azure
Advancements in Kubernetes Workload Identity for AzureAdvancements in Kubernetes Workload Identity for Azure
Advancements in Kubernetes Workload Identity for Azure
LibbySchulze
 

More from LibbySchulze (20)

Running distributed tests with k6.pdf
Running distributed tests with k6.pdfRunning distributed tests with k6.pdf
Running distributed tests with k6.pdf
 
Extending Kubectl.pptx
Extending Kubectl.pptxExtending Kubectl.pptx
Extending Kubectl.pptx
 
Enhancing Data Protection Workflows with Kanister And Argo Workflows
Enhancing Data Protection Workflows with Kanister And Argo WorkflowsEnhancing Data Protection Workflows with Kanister And Argo Workflows
Enhancing Data Protection Workflows with Kanister And Argo Workflows
 
Fallacies in Platform Engineering.pdf
Fallacies in Platform Engineering.pdfFallacies in Platform Engineering.pdf
Fallacies in Platform Engineering.pdf
 
Intro to Fluvio.pptx.pdf
Intro to Fluvio.pptx.pdfIntro to Fluvio.pptx.pdf
Intro to Fluvio.pptx.pdf
 
Enhance your Kafka Infrastructure with Fluvio.pptx
Enhance your Kafka Infrastructure with Fluvio.pptxEnhance your Kafka Infrastructure with Fluvio.pptx
Enhance your Kafka Infrastructure with Fluvio.pptx
 
CNCF On-Demand Webinar_ LitmusChaos Project Updates.pdf
CNCF On-Demand Webinar_ LitmusChaos Project Updates.pdfCNCF On-Demand Webinar_ LitmusChaos Project Updates.pdf
CNCF On-Demand Webinar_ LitmusChaos Project Updates.pdf
 
Oh The Places You'll Sign.pdf
Oh The Places You'll Sign.pdfOh The Places You'll Sign.pdf
Oh The Places You'll Sign.pdf
 
Rancher MasterClass - Avoiding-configuration-drift.pptx
Rancher  MasterClass - Avoiding-configuration-drift.pptxRancher  MasterClass - Avoiding-configuration-drift.pptx
Rancher MasterClass - Avoiding-configuration-drift.pptx
 
vFunction Konveyor Meetup - Why App Modernization Projects Fail - Aug 2022.pptx
vFunction Konveyor Meetup - Why App Modernization Projects Fail - Aug 2022.pptxvFunction Konveyor Meetup - Why App Modernization Projects Fail - Aug 2022.pptx
vFunction Konveyor Meetup - Why App Modernization Projects Fail - Aug 2022.pptx
 
CNCF Live Webinar: Low Footprint Java Containers with GraalVM
CNCF Live Webinar: Low Footprint Java Containers with GraalVMCNCF Live Webinar: Low Footprint Java Containers with GraalVM
CNCF Live Webinar: Low Footprint Java Containers with GraalVM
 
EnRoute-OPA-Integration.pdf
EnRoute-OPA-Integration.pdfEnRoute-OPA-Integration.pdf
EnRoute-OPA-Integration.pdf
 
AirGap_zusammen_neu.pdf
AirGap_zusammen_neu.pdfAirGap_zusammen_neu.pdf
AirGap_zusammen_neu.pdf
 
Copy of OTel Me All About OpenTelemetry The Current & Future State, Navigatin...
Copy of OTel Me All About OpenTelemetry The Current & Future State, Navigatin...Copy of OTel Me All About OpenTelemetry The Current & Future State, Navigatin...
Copy of OTel Me All About OpenTelemetry The Current & Future State, Navigatin...
 
OTel Me All About OpenTelemetry The Current & Future State, Navigating the Pr...
OTel Me All About OpenTelemetry The Current & Future State, Navigating the Pr...OTel Me All About OpenTelemetry The Current & Future State, Navigating the Pr...
OTel Me All About OpenTelemetry The Current & Future State, Navigating the Pr...
 
CNCF_ A step to step guide to platforming your delivery setup.pdf
CNCF_ A step to step guide to platforming your delivery setup.pdfCNCF_ A step to step guide to platforming your delivery setup.pdf
CNCF_ A step to step guide to platforming your delivery setup.pdf
 
CNCF Online - Data Protection Guardrails using Open Policy Agent (OPA).pdf
CNCF Online - Data Protection Guardrails using Open Policy Agent (OPA).pdfCNCF Online - Data Protection Guardrails using Open Policy Agent (OPA).pdf
CNCF Online - Data Protection Guardrails using Open Policy Agent (OPA).pdf
 
Securing Windows workloads.pdf
Securing Windows workloads.pdfSecuring Windows workloads.pdf
Securing Windows workloads.pdf
 
Securing Windows workloads.pdf
Securing Windows workloads.pdfSecuring Windows workloads.pdf
Securing Windows workloads.pdf
 
Advancements in Kubernetes Workload Identity for Azure
Advancements in Kubernetes Workload Identity for AzureAdvancements in Kubernetes Workload Identity for Azure
Advancements in Kubernetes Workload Identity for Azure
 

Recently uploaded

How to make a complaint to the police for Social Media Fraud.pdf
How to make a complaint to the police for Social Media Fraud.pdfHow to make a complaint to the police for Social Media Fraud.pdf
How to make a complaint to the police for Social Media Fraud.pdf
Infosec train
 
Honeypots Unveiled: Proactive Defense Tactics for Cyber Security, Phoenix Sum...
Honeypots Unveiled: Proactive Defense Tactics for Cyber Security, Phoenix Sum...Honeypots Unveiled: Proactive Defense Tactics for Cyber Security, Phoenix Sum...
Honeypots Unveiled: Proactive Defense Tactics for Cyber Security, Phoenix Sum...
APNIC
 
快速办理(Vic毕业证书)惠灵顿维多利亚大学毕业证完成信一模一样
快速办理(Vic毕业证书)惠灵顿维多利亚大学毕业证完成信一模一样快速办理(Vic毕业证书)惠灵顿维多利亚大学毕业证完成信一模一样
快速办理(Vic毕业证书)惠灵顿维多利亚大学毕业证完成信一模一样
3a0sd7z3
 
一比一原版新西兰林肯大学毕业证(Lincoln毕业证书)学历如何办理
一比一原版新西兰林肯大学毕业证(Lincoln毕业证书)学历如何办理一比一原版新西兰林肯大学毕业证(Lincoln毕业证书)学历如何办理
一比一原版新西兰林肯大学毕业证(Lincoln毕业证书)学历如何办理
thezot
 
Securing BGP: Operational Strategies and Best Practices for Network Defenders...
Securing BGP: Operational Strategies and Best Practices for Network Defenders...Securing BGP: Operational Strategies and Best Practices for Network Defenders...
Securing BGP: Operational Strategies and Best Practices for Network Defenders...
APNIC
 
HijackLoader Evolution: Interactive Process Hollowing
HijackLoader Evolution: Interactive Process HollowingHijackLoader Evolution: Interactive Process Hollowing
HijackLoader Evolution: Interactive Process Hollowing
Donato Onofri
 
cyber crime.pptx..........................
cyber crime.pptx..........................cyber crime.pptx..........................
cyber crime.pptx..........................
GNAMBIKARAO
 
快速办理(新加坡SMU毕业证书)新加坡管理大学毕业证文凭证书一模一样
快速办理(新加坡SMU毕业证书)新加坡管理大学毕业证文凭证书一模一样快速办理(新加坡SMU毕业证书)新加坡管理大学毕业证文凭证书一模一样
快速办理(新加坡SMU毕业证书)新加坡管理大学毕业证文凭证书一模一样
3a0sd7z3
 
怎么办理(umiami毕业证书)美国迈阿密大学毕业证文凭证书实拍图原版一模一样
怎么办理(umiami毕业证书)美国迈阿密大学毕业证文凭证书实拍图原版一模一样怎么办理(umiami毕业证书)美国迈阿密大学毕业证文凭证书实拍图原版一模一样
怎么办理(umiami毕业证书)美国迈阿密大学毕业证文凭证书实拍图原版一模一样
rtunex8r
 
一比一原版(uc毕业证书)加拿大卡尔加里大学毕业证如何办理
一比一原版(uc毕业证书)加拿大卡尔加里大学毕业证如何办理一比一原版(uc毕业证书)加拿大卡尔加里大学毕业证如何办理
一比一原版(uc毕业证书)加拿大卡尔加里大学毕业证如何办理
dtagbe
 
Bengaluru Dreamin' 24 - Personal Branding
Bengaluru Dreamin' 24 - Personal BrandingBengaluru Dreamin' 24 - Personal Branding
Bengaluru Dreamin' 24 - Personal Branding
Tarandeep Singh
 

Recently uploaded (11)

How to make a complaint to the police for Social Media Fraud.pdf
How to make a complaint to the police for Social Media Fraud.pdfHow to make a complaint to the police for Social Media Fraud.pdf
How to make a complaint to the police for Social Media Fraud.pdf
 
Honeypots Unveiled: Proactive Defense Tactics for Cyber Security, Phoenix Sum...
Honeypots Unveiled: Proactive Defense Tactics for Cyber Security, Phoenix Sum...Honeypots Unveiled: Proactive Defense Tactics for Cyber Security, Phoenix Sum...
Honeypots Unveiled: Proactive Defense Tactics for Cyber Security, Phoenix Sum...
 
快速办理(Vic毕业证书)惠灵顿维多利亚大学毕业证完成信一模一样
快速办理(Vic毕业证书)惠灵顿维多利亚大学毕业证完成信一模一样快速办理(Vic毕业证书)惠灵顿维多利亚大学毕业证完成信一模一样
快速办理(Vic毕业证书)惠灵顿维多利亚大学毕业证完成信一模一样
 
一比一原版新西兰林肯大学毕业证(Lincoln毕业证书)学历如何办理
一比一原版新西兰林肯大学毕业证(Lincoln毕业证书)学历如何办理一比一原版新西兰林肯大学毕业证(Lincoln毕业证书)学历如何办理
一比一原版新西兰林肯大学毕业证(Lincoln毕业证书)学历如何办理
 
Securing BGP: Operational Strategies and Best Practices for Network Defenders...
Securing BGP: Operational Strategies and Best Practices for Network Defenders...Securing BGP: Operational Strategies and Best Practices for Network Defenders...
Securing BGP: Operational Strategies and Best Practices for Network Defenders...
 
HijackLoader Evolution: Interactive Process Hollowing
HijackLoader Evolution: Interactive Process HollowingHijackLoader Evolution: Interactive Process Hollowing
HijackLoader Evolution: Interactive Process Hollowing
 
cyber crime.pptx..........................
cyber crime.pptx..........................cyber crime.pptx..........................
cyber crime.pptx..........................
 
快速办理(新加坡SMU毕业证书)新加坡管理大学毕业证文凭证书一模一样
快速办理(新加坡SMU毕业证书)新加坡管理大学毕业证文凭证书一模一样快速办理(新加坡SMU毕业证书)新加坡管理大学毕业证文凭证书一模一样
快速办理(新加坡SMU毕业证书)新加坡管理大学毕业证文凭证书一模一样
 
怎么办理(umiami毕业证书)美国迈阿密大学毕业证文凭证书实拍图原版一模一样
怎么办理(umiami毕业证书)美国迈阿密大学毕业证文凭证书实拍图原版一模一样怎么办理(umiami毕业证书)美国迈阿密大学毕业证文凭证书实拍图原版一模一样
怎么办理(umiami毕业证书)美国迈阿密大学毕业证文凭证书实拍图原版一模一样
 
一比一原版(uc毕业证书)加拿大卡尔加里大学毕业证如何办理
一比一原版(uc毕业证书)加拿大卡尔加里大学毕业证如何办理一比一原版(uc毕业证书)加拿大卡尔加里大学毕业证如何办理
一比一原版(uc毕业证书)加拿大卡尔加里大学毕业证如何办理
 
Bengaluru Dreamin' 24 - Personal Branding
Bengaluru Dreamin' 24 - Personal BrandingBengaluru Dreamin' 24 - Personal Branding
Bengaluru Dreamin' 24 - Personal Branding
 

Auto sre with keptn

  • 1. Automating SRE from “Hello World” to Enterprise Scale with Andreas Grabner DevOps Activist at Dynatrace DevRel for Keptn @grabnerandi, https://www.linkedin.com/in/grabnerandi Star us @ https://github.com/keptn/keptn Followus @keptnProject Slack Us @ https://slack.keptn.sh Visit us @ https://keptn.sh
  • 2. 5 Confidential Problem #1 „Our DevOps teams is building monolithic automation that becomes more complex than the microservices we deploy!“
  • 3. Confidential 6 Too complexautomationpipelines are too hard to maintain! ChristianHeckelmann Senior DevOps Engineer 2800 projects 966 CI/CDs „I am constantlyreacting to ‚Pipeline Broken – please fix!‘“
  • 4. 7 Confidential Problem #2 „Our DevOps team builds and then copy/pastes tool integrations across all our automation code due to lack of standards!“
  • 5. Confidential 8 Duplicated Pipeline Code hard to maintain, extend & optimize pipeline { stages { stage ('Depl oy to dev na mespac e') { ste ps { c ontain er(‘he lm’) { } } } stage ('Run tests' ) { ste ps { c ontain er(‘jm eter’) { } } } stage (‘Eval uate p erform ance’) { ste ps { c ontain er(‘cu rl’) { } } } if (e valuat ion.pa ssed) { sta ge('De ploy t o stag ing') { s teps { conta iner(‘ helm’) { } } } } } } pipeline { stages { stage ('Depl oy to dev na mespac e') { ste ps { c ontain er(‘ku stomiz e’) { } } } stage ('Run tests' ) { ste ps { c ontain er(‘jm eter’) { } } } stage (‘Eval uate p erform ance’) { ste ps { c ontain er(‘cu rl’) { } } } if (e valuat ion.pa ssed) { sta ge('De ploy t o stag ing') { s teps { conta iner(‘ helm’) { } } } } } } 1 Service = 1 Pipeline 1 Project = x Pipelines n Teams = n*x Pipelines pipeline { stages { stage ('Depl oy to dev na mespac e') { ste ps { c ontain er(‘he lm’) { } } } stage ('Run tests' ) { ste ps { c ontain er(‘jm eter’) { } } } stage (‘Eval uate p erform ance’) { ste ps { c ontain er(‘cu rl’) { } } } if (e valuat ion.pa ssed) { sta ge('De ploy t o stag ing') { s teps { conta iner(‘ helm’) { } } } } } } 25 services 96 workloads Dieter Ladenhauf Senior ACE Engineer „Onboarding or updating pipelines is manual & error prone!‘“
  • 6. 9 Confidential Problem #3 “Our SRE teams can’t scale as resiliency testing and enforcing SLO analysis is hard to automate!”
  • 7. Confidential 10 Too much manual expert workon test execution & SLO Analysis ~90%of test reruns 9:1 ratio script maintenancevs creation only 10% projects performance tested Test Result Analysis MonitoringConfiguration ~ 80%time spent in manual... Scripts Creation SLO Report Generation 15-20 tests / year < 5 Apps „We are limited in scaling SRE due to manual expert tasks!‘“ Roman Ferstl Managing Director
  • 8. Confidential 11 3 examples of how Keptn solves these problems ... Sumit Nagal Principal Engineer Roman Ferstl Managing Director ChristianHeckelmann Senior DevOps Engineer 50+ Continuous SLO Validations 15x Perf tests 10x tested apps ~60 Services 14 Stages Tool stack SLO-based Quality Gate Automation Performance & Resiliance Test Automation Scaling SRE through Delivery Automation
  • 9. Confidential 12 And some nice testimonials fromour community ...
  • 10. Confidential 13 Keptn: Data-DrivenDelivery& Operations Automation You (Dev/Ops/SRE) bring your configuration pick your use case SLO-Quality Gates Progressive Delivery Auto- Remediation Declaration GitOps SLOs Standards SLI/SLO shipyard runbook SRE Automation workload Monitoring Delivery Reliability Remediation automates configuration and provides self-service for through event-driven process orchestrationbased on connect your tools
  • 11. 14 Confidential Keptn Architecture Driven by new requirements in the cloud native world
  • 12. Confidential 15 We lookedat the fundamential problems our users have Mixed information about • Process (build,deploy,test,evaluate, …) • Target platform (k8s, …) • Environments(dev, hardening,…) • Tools (Terraform,Helm, hey, …) No clear separation of concerns • Developers • Define which artifact to use • Want fast feedback on their code • DevOps Engineers • Define which tools to use • Ensure tools are properly configured • Site ReliabilityEngineers • Define delivery processes • Define operations workflows
  • 13. Confidential 16 Solution: Remove hard dependencies and integrations Build Prepare Deploy Test Notify Rollback Config Mgmt. Deploy Test Monitoring ChatOps Rollback
  • 14. Confidential 17 Solution: Remove hard dependencies and integrations Build Prepare Deploy Test Notify Rollback Config Mgmt. Deploy Test Monitoring ChatOps Rollback Eventing Event:Deploy Artifact:container1 Stage:Dev Strategy:Blue/Green which events to generate ➔ Process Definition who consumesevents ➔ Tool Definition
  • 15. Confidential 18 Eventing Solution:Keptn is built on an architecture that supports this paradigm Application Plane (=Process Definition) Define overall process for delivery and operations Control Plane Follow application logic and communicate/configure required services API Site Reliability Engineer DevOps Developer uniform.yaml config-change*: helm deploy*: JMeter deploy-finish: Lighthouse problem*: Remediation all: Slack, Dynatrace Execution Plane (=Tool Definition) Deploy Service (Helm, Jenkins …) Test Service (JMeter, Neotys, ..) Validation Service (Keptn Lighthouse …) Remediation Service (Keptn Remediation, SNOW …) Config Service (Git, …) Monitoring Service (Prometheus, Dynatrace, …) Artifact / Microservice config.change:artifact:x.y deploy.finished:http://service1 tests.finished:OK evaluation.done:98% Score problem.open:HighFailure shipyard.yaml - dev: direct, functional, SLO - staging: B/G, perf, SLO - prod: canary, real-user, SLA remediation.yaml - high-failure-rate: - scaleup, rollback - full-disk: - cleandir;adjustlog-level
  • 16. Confidential 19 Demo #1: SLO-Driven Progressive Delivery and Operations Automation with Keptn Developer Pull Request (1)Deploy & Test (2)Evaluate SLOs (3)Auto-Promote (4)Deploy & Test (5)Evaluate SLOs (7)Deploy Blue / Green (8)Evaluate SLOs (1)Action 1: Scale Up (2)Evaluate SLOs (3)Action 2: Roll Back (6)Promote? (9)Toggle Blue/Green #1 - IncreasedSpeedand Quality of ProgressiveDelivery through Automation #2 - Automate Operations (10)Re-Evaluate SLOs (4)Evaluate SLOs Closed-Loop Remediation Observability
  • 17. Confidential 21 User Example: ProgressiveDelivery with Keptn Patrick Hofmann Sr. Consultant CI CD
  • 19. Confidential 23 RootCause: Lengthy manual approval Build Deployto „Test“ Run Test In „Test“ Manual Approval Promote to „Staging“ Functional: Test Result Trend Not Enough Performance: ManualComparison Is Slow Monitoring: Too much unstructed data ~30-60min Which metrics are important and which build is therefore better Which data comesfrom my test and is relevantfor business transactions Is this regression impacting key business use cases
  • 20. Confidential 24 Learning from Google‘s SRE Practices • Service Level Indicators(SLIs) • Definition:Measurable Metrics as the base for evaluation • Example:ErrorRate of Login Requests • Service Level Objectives (SLOs) • Definition:Bindingtargets for Service Level Indicators • Example:Login Error Rate must be less than 2% over a 30 day period • Service Level Agreements (SLAs) • Definition:Business Agreement between consumer and provider typicallybasedon SLO • Example:Logins must be reliable & fast (Error Rate, Response Time, Throughput)99% within a 30 daywindow • Google Cloud YouTube Video • SLIs, SLOs, SLAs, oh my! (class SRE implements DevOps): https://www.youtube.com/watch?v=tEylFyxbDLE SLIs drive SLOs which inform SLAs
  • 21. Confidential 25 Applying SRE Best Practices across the lifecycle Authentication Service 0.89s 0.5% May 2020 June 2020 0.61s 2.5% 1000/s 1600/s Service X xxs xx% yys yy% xx/s yy/s Production Shift-Left Continuous Delivery Authentication Service Commit #1 Commit #2 Commit #3 Commit #4 Service X Quality Gates
  • 22. Confidential 26 Explainer on SLI/SLO Validation as part of Continuous Delivery with Dynatrace & Keptn! Overall Failure Rate Query: builtin:service.errors.total Test Step LOGIN Response Time Query: calc:service.teststeprt:filter(Test, LOGIN) Test Step LOGIN # Service Calls Query: calc:service.testsvc:filter(tx, LOGIN) <= 5% <= 2% <=150ms & <=+10% <= 400ms <= +0% Build 1 0% 80ms 100ms SLO: Overall Score Goal 90% 75% Response Time 95th Perc Query: builtin:service.responsetime(p95) <=100ms <= 250ms SLO SLIs (Service Level Indicators) warn pass 1 100% Build 2 4% 120ms 90ms 1 75% Build 3 1% 90ms 120ms 2 62.5% Build 4 0% 95ms 95ms 1 100% Build 1 Build 2 Build 3 Build 4 $ keptn send event start-evaluation myproject myservice starttime=build1_deploy endtime=build1_testsdone $ keptn send event start-evaluation myproject myservice starttime=build2_deploy endtime=build2_testsdone $ keptn send event start-evaluation myproject myservice starttime=build3_deploy endtime=build3_testsdone $ keptn send event start-evaluation myproject myservice starttime=build4_teststart endtime=build4_testsend DevOps
  • 23. Confidential 27 SLI/SLO-based evaluationimplementationin Keptn SLIs defined per SLI Provider as YAML SLI Provider specific queries,e.g: DynatraceMetrics Query Quality Gates ... Dynatrace Prometheus Neoload Scores SLIs Queries SLI Providers with SLI Definitions & Timeframe SLOs defined on Keptn Service Level as YAML List of objectives with fixed or relative pass & warn criteria indicators: error_rate: "builtin:service.errors.total.count:merge(0):avg" count_dbcalls: "calc:service.toptestdbcalls:merge(0):sum" jvm_memory: "builtin:tech.jvm.memory.pool.committed:merge(0):sum" objectives: - sli: error_rate pass: - criteria: - "<=1“ # We expect a max error rate of 1% - sli: jvm_memory - sli: count_dbcalls pass: - criteria: - "=+2%" # We allow a 2% increase in DB Calls to previous runs warning: - criteria: - "<=10" # We expect no more than 10 DB Calls per TX total_score: pass: "90%" warning: "75%" 0.5 1.0 0.0 info 7/8 (87.5%) 4/8 (50%) $ keptn start-evaluation 30m myservice sli.yaml slo.yaml 5 DB Calls 360MB 4.3% 123 SLI Value: SLI Score: Total Score 2 3 4 Tool X 1
  • 24. Confidential 28 Demo: Automated SLI/SLO Validationbased on Dynatrace Dashboards You: Just build a dashboard! : Automates the analysis! 15.5/16 (97%) 8/16 (50%)
  • 25. Confidential 29 User Example: Automating Build Approvals using Keptn‘s SLIs/SLOs in GitLab Christian Heckelmann Senior Systems Engineer 87.5%: passed AutomatedSLI/SLO based Quality Gates Trigger Evaluation
  • 27. Confidential 31 Keptn – Closed-LoopRemediation version: 0.2.0 kind: Remediation metadata: name: remediation-ecommerce spec: remediations: - problemType: Conversion Rate Dropped actionsOnOpen: - name: Scaling ReplicaSet by 1 action: scaling values: increment: +1 - name: Stop Ad Campaign action: googleadtoggle values: enable: off campaign: $campaignid Problem: Conversion Rate Dropped Root Cause: CPU Pressure Get remediation action(s) Execute remediation action(s) Re-validate SLO/BLO Escalate Scale Up Stop Campaign 1 2 1 2 1 2
  • 28. Confidential 32 Too risky? Start in Pre-Prod leveraging Chaos Engineering to define & test Auto-Remediation version: 0.2.0 kind: Remediation metadata: name: remediation-ecommerce spec: remediations: - problemType: High CPU on ReportGen actionsOnOpen: - name: Stop Traffic action: configureLoadBalancer values: action: stopTraffic ip: $problem.hostIp - name: Restart Process action: executeAnsible values: script: restartProcess process: $problem.processID Problem: Slow ReportGen Service Root Cause: High CPU on host Get remediation action(s) Execute remediation action(s) Re-validate SLO/BLO Escalate Stop Traffic Restart Process 1 2 1 2 1 2
  • 30. Confidential 34 Keptn: Data-DrivenDelivery& Operations Automation You (Dev/Ops/SRE) bring your configuration pick your use case SLO-Quality Gates Progressive Delivery Auto- Remediation Declaration GitOps SLOs Standards shipyard SLI/SLO runbook SRE Automation workload Monitoring Delivery Reliability Remediation automates configuration and provides self-service for through event-driven process orchestrationbased on connect your tools
  • 31. Confidential 35 Keptn 0.8 was just releasedearly March2021 and opens new use cases • Multi-Cluster Setup Separationof Control & Execution Plane D E V I N T P R D
  • 32. Confidential 36 Keptn 0.8 was just releasedearly March2021 and opens new use cases • Shipyard v0.2.0: Explicit vs opinionated sequences
  • 33. Confidential 37 Get started with our tutorials: tutorials.keptn.sh
  • 34. Questions & Answers Automating SRE from “Hello World” to Enterprise Scale with Andreas Grabner DevOps Activist at Dynatrace DevRel for Keptn @grabnerandi, https://www.linkedin.com/in/grabnerandi Star us @ https://github.com/keptn/keptn Followus @keptnProject Slack Us @ https://slack.keptn.sh Visit us @ https://keptn.sh