Many things can go awry on the journey from opening a pull request (PR) to merging it to deploying to production. Issues can arise from the application code, layers of YAML configuration, the underlying infrastructure, or the pipeline logic itself. How can distributed tracing and trace-derived metrics bring developers and operators together for troubleshooting paradise? I’ll unpack a deploy gone bad from both vantage points, building empathy for the engineer who needs to deploy their changes and for the ops engineer who is responsible for keeping the system up and running. With signals from OpenTelemetry, I will show how increasing the observability of your deploy system can facilitate better collaboration and quicker troubleshooting.
11. @paigerduty
Dev and Ops have distinct perspectives and data on the CI/CD platform. It's not always straightforward to know who is responsible for an issue.
28. @paigerduty
a Path to Production
1. PR checks
2. PR Review
3. Merge PR
4. Deploy to Integration Env
5. Deploy to Staging Env
6. Deploy to Production Env
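
One way to picture these six steps as a single trace is to treat each step as a span that shares one trace ID, so a failed deploy can be located by stage. The sketch below is plain Python using only the standard library, not the OpenTelemetry SDK; `Span`, `PipelineTrace`, `run_pipeline`, and the simulated failure are hypothetical names introduced for illustration, and only the stage names come from the slide.

```python
import time
import uuid
from contextlib import contextmanager
from dataclasses import dataclass, field

# Stage names follow the slide; everything else is a hypothetical stand-in.
STAGES = [
    "PR checks",
    "PR Review",
    "Merge PR",
    "Deploy to Integration Env",
    "Deploy to Staging Env",
    "Deploy to Production Env",
]


@dataclass
class Span:
    """One timed unit of work belonging to a trace."""
    trace_id: str
    name: str
    start: float
    end: float = 0.0
    status: str = "ok"
    attributes: dict = field(default_factory=dict)

    @property
    def duration(self):
        return self.end - self.start


class PipelineTrace:
    """Collects the spans of one pipeline run under a shared trace ID."""

    def __init__(self):
        self.trace_id = uuid.uuid4().hex
        self.spans = []

    @contextmanager
    def stage(self, name):
        span = Span(self.trace_id, name, time.monotonic())
        self.spans.append(span)
        try:
            yield span
        except Exception as exc:
            # Record the failure on the span, then let it propagate.
            span.status = "error"
            span.attributes["error.message"] = str(exc)
            raise
        finally:
            span.end = time.monotonic()

    def first_failure(self):
        """Return the first span that errored, or None if all succeeded."""
        return next((s for s in self.spans if s.status == "error"), None)


def run_pipeline(failing_stage=None):
    """Run the stages in order, stopping at a simulated failure if given."""
    trace = PipelineTrace()
    try:
        for name in STAGES:
            with trace.stage(name):
                if name == failing_stage:
                    raise RuntimeError(f"{name} failed")
    except RuntimeError:
        pass  # the failure is already recorded on the span
    return trace
```

Calling `run_pipeline("Deploy to Staging Env")` and then `first_failure()` on the result returns the staging span, which gives both the developer and the operator the same starting point for the investigation. In a real system the spans would more likely be emitted through an OpenTelemetry SDK and exporter rather than collected in a list.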