SRE 101
Diego Pacheco
@diego_pacheco
ā Cat's Father
ā Principal Software Architect
ā Agile Coach
ā SOA/Microservices Expert
ā DevOps Practitioner
ā Speaker
ā Author
diegopacheco
http://diego-pacheco.blogspot.com.br/
About me...
https://diegopacheco.github.io/
Reference books
What is SRE
"It's what happens when you ask a software
engineer to design an operations function."
-- Ben Treynor Sloss. 2000
SRE Culture
MTTD, MTTR
Error Budget and SLA/SLOs
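
To make the error budget idea concrete: an availability SLO leaves a fixed amount of allowed downtime per window, and that allowance is the budget. The sketch below is illustrative only. The function names, the 30-day window, and the 99.9% target are assumptions, not values from these slides.

```python
def error_budget_minutes(slo: float, window_days: int = 30) -> float:
    """Allowed downtime in minutes for an availability SLO over a window.

    slo is a fraction, e.g. 0.999 for "three nines" (assumed example value).
    """
    if not 0.0 < slo <= 1.0:
        raise ValueError("slo must be in (0, 1]")
    total_minutes = window_days * 24 * 60
    return (1.0 - slo) * total_minutes


def budget_remaining(slo: float, window_days: int, downtime_minutes: float) -> float:
    """Budget left after the downtime already spent; negative means the SLO is blown."""
    return error_budget_minutes(slo, window_days) - downtime_minutes


if __name__ == "__main__":
    # 99.9% over 30 days leaves roughly 43.2 minutes of downtime.
    print(round(error_budget_minutes(0.999, 30), 1))
    # After a 30-minute incident, about 13.2 minutes remain.
    print(round(budget_remaining(0.999, 30, 30.0), 1))
```

A common use of this number is as a release gate: while budget remains, teams ship; once it is spent, work shifts to reliability.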
Jitter, Retries, Exponential Back-off
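
A minimal sketch of how the three ideas combine: retry a failing call, double the wait ceiling on each attempt, and randomize the actual wait so many clients do not retry in lockstep. The `retry` helper, its parameters, and the "full jitter" variant (uniform between zero and the ceiling) are assumptions chosen for illustration.

```python
import random
import time


def retry(operation, attempts=5, base=0.1, cap=5.0, sleep=time.sleep):
    """Call operation() until it succeeds or attempts run out.

    The wait before retry n is uniform in [0, min(cap, base * 2**n)]:
    exponential back-off with full jitter. `sleep` is injectable for tests.
    """
    for attempt in range(attempts):
        try:
            return operation()
        except Exception:
            if attempt == attempts - 1:
                raise  # out of attempts: surface the last error
            ceiling = min(cap, base * 2 ** attempt)
            sleep(random.uniform(0, ceiling))


if __name__ == "__main__":
    calls = {"n": 0}

    def flaky():
        # Hypothetical operation that fails twice, then succeeds.
        calls["n"] += 1
        if calls["n"] < 3:
            raise ConnectionError("transient failure")
        return "ok"

    print(retry(flaky))
```

The cap keeps a long outage from producing multi-minute waits, and the bounded attempt count keeps retries from amplifying load indefinitely.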
SRE and DevOps
SV Culture: You build it, you run it.
Observability
Actionable Alerts
Production Ready Checklists
ā— Unit + Integration Tests
ā— Chaos Testing
ā— Stress Testing
ā— Automated DB Scripts
ā— Metrics
ā— Logs
ā— Deploy Automation
ā— Security
ā— Docker / Kubernetes yamls
ā— Build Hygiene
ā— Architecture Review
Automation
Incident Drill Downs
Throttling, Caps and Limits
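
One common way to implement throttling is a token bucket: requests spend tokens, tokens refill at a fixed rate, and the bucket size caps bursts. This is a sketch under assumptions. The `TokenBucket` class and its parameters are hypothetical, and the slides do not prescribe a specific algorithm.

```python
import time


class TokenBucket:
    """Allow up to `capacity` requests in a burst, refilling `rate` tokens per second."""

    def __init__(self, rate, capacity, clock=time.monotonic):
        self.rate = rate
        self.capacity = capacity
        self.clock = clock  # injectable so tests can control time
        self.tokens = float(capacity)
        self.last = clock()

    def allow(self, cost=1.0):
        """Return True and spend tokens if the request fits, else False."""
        now = self.clock()
        # Refill for the elapsed time, never exceeding the cap.
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False


if __name__ == "__main__":
    bucket = TokenBucket(rate=1.0, capacity=2)
    # Two requests pass immediately; the third is throttled until tokens refill.
    print([bucket.allow() for _ in range(3)])
```

A rejected request would typically map to an HTTP 429 or a queued retry, depending on the caller.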
No Unbounded
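
Reading this slide as "no unbounded queues or buffers" (an interpretation, since the slide gives only the title), a minimal sketch is a work queue with a hard size limit that sheds load when full instead of growing until memory runs out. The `submit` helper and the limit of 2 are hypothetical.

```python
import queue


def submit(work_queue: queue.Queue, item) -> bool:
    """Try to enqueue without blocking; return False (item shed) when the queue is full."""
    try:
        work_queue.put_nowait(item)
        return True
    except queue.Full:
        return False


if __name__ == "__main__":
    # maxsize makes the queue bounded; maxsize=0 would mean unbounded.
    jobs = queue.Queue(maxsize=2)
    print([submit(jobs, n) for n in range(3)])
```

Rejecting early gives callers a clear signal to back off, which tends to be easier to operate than a queue that silently grows.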
Avoid Cold Starts and Cold Caches
Education