SlideShare a Scribd company logo
AA261
     DevOps lessons in
collaborative maintenance
Lindsay Holmwood
    @auxesis
Software Manager
          @
Bulletproof Networks
Trigger warning: death
January 31, 2000




Puerto Vallarta
Seattle
Departed PVR at 13.37 PST
Ascended to 31,000ft
2 hours into flight:
Jammed horizontal stabiliser
No trim control
Redirected to LAX
Pilots unjammed
horizontal stabilisers
2 pilots
3 crew
83 passengers
This is a maintenance accident. Alaska
Airlines' maintenance and inspection of its
horizontal stabilizer activation system was
poorly conceived and woefully executed. The
failure was compounded by poor oversight...
had any of the managers, mechanics,
inspectors, supervisors or FAA overseers
whose job it was to protect this mechanism
done their job conscientiously, this accident
cannot happen.
                  -- John J. Goglia, NTSB Board Member
hindsight != foresight
[hindsight] converts a once
vague, unlikely future into an
   immediate, certain past
                   -- Sidney Dekker
This is a maintenance accident. Alaska
Airlines' maintenance and inspection of its
horizontal stabilizer activation system was
poorly conceived and woefully executed. The
failure was compounded by poor oversight...
had any of the managers, mechanics,
inspectors, supervisors or FAA overseers
whose job it was to protect this mechanism
done their job conscientiously, this accident
cannot happen.
                  -- John J. Goglia, NTSB Board Member
This is a maintenance accident. Alaska
Airlines' maintenance and inspection of its
horizontal stabilizer activation system was
poorly conceived and woefully executed. The
failure was compounded by poor oversight...
had any of the managers, mechanics,
inspectors, supervisors or FAA overseers
whose job it was to protect this mechanism
done their job conscientiously, this accident
cannot happen.
                  -- John J. Goglia, NTSB Board Member
“poorly conceived
       and
woefully executed”
DC-9 -> MD-80 -> MD-83
Evolutionary
product development
Appropriated
maintenance schedules
Jackscrew
lubrication interval
1965   every 300-350 hours           launch of DC-9


1985   every 700 hours               industry deregulation


1987   every 1000 hours              industry standardisation


1991   every 1200 hours              industry standardisation


1994   every 1600 hours              industry standardisation


1996   every 8 months (2550 hours)   Alaska Airlines policy change
1965   every 300-350 hours           launch of DC-9


1985   every 700 hours               industry deregulation


1987   every 1000 hours              industry standardisation


1991   every 1200 hours              industry standardisation


1994   every 1600 hours              industry standardisation


1996   every 8 months (2550 hours)   Alaska Airlines policy change
1965   every 300-350 hours           launch of DC-9


1985   every 700 hours               industry deregulation


1987   every 1000 hours              industry standardisation


1991   every 1200 hours              industry standardisation


1994   every 1600 hours              industry standardisation


1996   every 8 months (2550 hours)   Alaska Airlines policy change
1965   every 300-350 hours           launch of DC-9


1985   every 700 hours               industry deregulation


1987   every 1000 hours              industry standardisation


1991   every 1200 hours              industry standardisation


1994   every 1600 hours              industry standardisation


1996   every 8 months (2550 hours)   Alaska Airlines policy change
1965   every 300-350 hours           launch of DC-9


1985   every 700 hours               industry deregulation


1987   every 1000 hours              industry standardisation


1991   every 1200 hours              industry standardisation


1994   every 1600 hours              industry standardisation


1996   every 8 months (2550 hours)   Alaska Airlines policy change
1965   every 300-350 hours           launch of DC-9


1985   every 700 hours               industry deregulation


1987   every 1000 hours              industry standardisation


1991   every 1200 hours              industry standardisation


1994   every 1600 hours              industry standardisation


1996   every 8 months (2550 hours)   Alaska Airlines policy change
1965   every 300-350 hours           launch of DC-9


1985   every 700 hours               industry deregulation


1987   every 1000 hours              industry standardisation


1991   every 1200 hours              industry standardisation


1994   every 1600 hours              industry standardisation


1996   every 8 months (2550 hours)   Alaska Airlines policy change
Decrementalism
Complex system constraints
     Jens Rasmussen
wo
  rkl
     oad
wo
            rkl
               oad


economy
wo
            rkl
               oad


economy        saf
           ety
tim
   e
e
tim



      cost
ty
     ali
qu            cost
e
 tim
wo
            rkl
               oad


economy        saf
           ety
wo
            rkl
               oad


economy        saf
           ety
wo
            rkl
               oad


economy        saf
           ety
wo
            rkl
               oad


economy        saf
           ety
wo
            rkl
               oad


economy        saf
           ety
wo
            rkl
               oad


economy        saf
           ety
wo
            rkl
               oad


economy        saf
           ety
wo
            rkl
               oad


economy        saf
           ety
wo
            rkl
               oad


economy        saf
          ety
outside: failure of foresight




   oad



                           saf
rkl




                                ety
wo



         economy
outside: failure of foresight




   oad



                           saf
rkl            inside:




                                ety
             trade-offs
wo
          in direction of
         greater efficiency



         economy
trade-offs
 in direction of
greater efficiency
trade-offs
 in direction of
greater efficiency
Constraints on knowledge
Why would they make bad
decisions intentionally?
Decisions seemed rational
Local rationalisation
“people make what they
  consider to be the best
decision based on available
 knowledge at the time”
This is a maintenance accident. Alaska
Airlines' maintenance and inspection of its
horizontal stabilizer activation system was
poorly conceived and woefully executed. The
failure was compounded by poor oversight...
had any of the managers, mechanics,
inspectors, supervisors or FAA overseers
whose job it was to protect this mechanism
done their job conscientiously, this accident
cannot happen.
                  -- John J. Goglia, NTSB Board Member
wo
            rkl
               oad


economy        saf
          ety
ty
     ali
qu            cost
e
 tim
Devops constraints
“God, our ops team are arseholes. I just want
to deploy this change and go home!”
“God, our ops team are arseholes. I just want
to deploy this change and go home!”
        oad




                        saf
        rkl




                         ety
      wo




              economy
“God, our ops team are arseholes. I just want
to deploy this change and go home!”
        oad




                                 oad
                        saf




                                                 saf
        rkl




                                 rkl
                         ety




                                                  ety
      wo




                               wo
              economy                  economy
What are the circumstances?
Where are the tensions?
Have ops been burnt before?
Is there deployment friction?
            Why?
Is deployment high-risk?
Is deployment time consuming?
Is deployment important
    to the business?
“It’s 3am an the pager has gone off again. Why
can’t these devs just write code that works?”
“It’s 3am an the pager has gone off again. Why
can’t these devs just write code that works?”
        oad




                        saf
        rkl




                         ety
      wo




              economy
“It’s 3am an the pager has gone off again. Why
can’t these devs just write code that works?”
        oad




                                 oad
                        saf




                                                 saf
        rkl




                                 rkl
                         ety




                                                  ety
      wo




                               wo
              economy                  economy
[hindsight] converts a once
vague, unlikely future into an
   immediate, certain past
                   -- Sidney Dekker
What are the circumstances?
Where are the tensions?
Why didn’t the dev know the
 code would fail like this?
Why weren’t you involved
when the code was written?
How is code reviewed?
Is the infrastructure anti-fragile?
Is the code anti-fragile?
Hindsight bias
[hindsight] converts a once
vague, unlikely future into an
   immediate, certain past
                   -- Sidney Dekker
What are the motivations?
“amoral actors”
wo
            rkl
               oad


economy
               saf
           ety
wo
            rkl
               oad


economy
               saf
           ety
“root cause” is simply the
point you stop looking
                    -- Sidney Dekker
What are the circumstances?
Where are the tensions?
Thank you!
Thank you!
Liked the talk? Let @auxesis know!
Sidney Dekker [books]
Field Guide to Understand Human Error
Drift Into Failure
Just Culture

Dan Manges [blog]
How incidents affect infrastructure priorities

More Related Content

Similar to AA261: DevOps lessons in collaborative maintenance

AGME Expansion Project Final 122104
AGME Expansion Project Final 122104AGME Expansion Project Final 122104
AGME Expansion Project Final 122104
Richard Houdlette
 
Disruptive Business Model
Disruptive Business ModelDisruptive Business Model
Disruptive Business Model
Yves Pigneur
 
Integrated Rig Stacking Solutions LLC
Integrated Rig Stacking Solutions LLCIntegrated Rig Stacking Solutions LLC
Integrated Rig Stacking Solutions LLC
Chip Keener
 
Compressed air repair implementation
Compressed air repair implementationCompressed air repair implementation
Compressed air repair implementation
TNenergy
 
Ppt for automatic plant irrigation system
Ppt for automatic plant irrigation systemPpt for automatic plant irrigation system
Ppt for automatic plant irrigation system
stk25
 

Similar to AA261: DevOps lessons in collaborative maintenance (11)

Ips connect 2015 marc mueller stoffels
Ips connect 2015 marc mueller stoffelsIps connect 2015 marc mueller stoffels
Ips connect 2015 marc mueller stoffels
 
AGME Expansion Project Final 122104
AGME Expansion Project Final 122104AGME Expansion Project Final 122104
AGME Expansion Project Final 122104
 
Disruptive Business Model
Disruptive Business ModelDisruptive Business Model
Disruptive Business Model
 
Future of the U.S. Energy Grid
Future of the U.S. Energy GridFuture of the U.S. Energy Grid
Future of the U.S. Energy Grid
 
Integrated Rig Stacking Solutions LLC
Integrated Rig Stacking Solutions LLCIntegrated Rig Stacking Solutions LLC
Integrated Rig Stacking Solutions LLC
 
Reliability
ReliabilityReliability
Reliability
 
Helicopter Aviation: Human Factors
Helicopter Aviation: Human FactorsHelicopter Aviation: Human Factors
Helicopter Aviation: Human Factors
 
Compressed air repair implementation
Compressed air repair implementationCompressed air repair implementation
Compressed air repair implementation
 
Wave energy
Wave energyWave energy
Wave energy
 
Shimpo high precision catalog
Shimpo high precision catalogShimpo high precision catalog
Shimpo high precision catalog
 
Ppt for automatic plant irrigation system
Ppt for automatic plant irrigation systemPpt for automatic plant irrigation system
Ppt for automatic plant irrigation system
 

More from Lindsay Holmwood

Behaviour driven infrastructure
Behaviour driven infrastructureBehaviour driven infrastructure
Behaviour driven infrastructure
Lindsay Holmwood
 
Your own (little) gem: building an online business with Ruby
Your own (little) gem: building an online business with RubyYour own (little) gem: building an online business with Ruby
Your own (little) gem: building an online business with Ruby
Lindsay Holmwood
 

More from Lindsay Holmwood (12)

Escalating complexity: DevOps learnings from Air France 447
Escalating complexity: DevOps learnings from Air France 447Escalating complexity: DevOps learnings from Air France 447
Escalating complexity: DevOps learnings from Air France 447
 
Islands: Puppet at Bulletproof Networks
Islands: Puppet at Bulletproof NetworksIslands: Puppet at Bulletproof Networks
Islands: Puppet at Bulletproof Networks
 
Load testing with Blitz
Load testing with BlitzLoad testing with Blitz
Load testing with Blitz
 
Latency: The Silent Monitoring System Killer
Latency: The Silent Monitoring System KillerLatency: The Silent Monitoring System Killer
Latency: The Silent Monitoring System Killer
 
Rump - making Puppetmaster-less Puppet meaty
Rump - making Puppetmaster-less Puppet meatyRump - making Puppetmaster-less Puppet meaty
Rump - making Puppetmaster-less Puppet meaty
 
Behaviour driven infrastructure
Behaviour driven infrastructureBehaviour driven infrastructure
Behaviour driven infrastructure
 
Burn down the silos! Helping dev and ops gel on high availability websites
Burn down the silos! Helping dev and ops gel on high availability websitesBurn down the silos! Helping dev and ops gel on high availability websites
Burn down the silos! Helping dev and ops gel on high availability websites
 
Behaviour Driven Monitoring with cucumber-nagios
Behaviour Driven Monitoring with cucumber-nagiosBehaviour Driven Monitoring with cucumber-nagios
Behaviour Driven Monitoring with cucumber-nagios
 
Flapjack: rethinking monitoring for the cloud
Flapjack: rethinking monitoring for the cloudFlapjack: rethinking monitoring for the cloud
Flapjack: rethinking monitoring for the cloud
 
Monitoring web application behaviour with cucumber-nagios
Monitoring web application behaviour with cucumber-nagiosMonitoring web application behaviour with cucumber-nagios
Monitoring web application behaviour with cucumber-nagios
 
Your own (little) gem: building an online business with Ruby
Your own (little) gem: building an online business with RubyYour own (little) gem: building an online business with Ruby
Your own (little) gem: building an online business with Ruby
 
Deploying Merb
Deploying MerbDeploying Merb
Deploying Merb
 

Recently uploaded

Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 

Recently uploaded (20)

Optimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through ObservabilityOptimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through Observability
 
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeFree and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
 
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya HalderCustom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and Planning
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
 
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John Staveley
 
UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptxWSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
Introduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG EvaluationIntroduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG Evaluation
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
In-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT ProfessionalsIn-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT Professionals
 

AA261: DevOps lessons in collaborative maintenance

  • 1. AA261 DevOps lessons in collaborative maintenance
  • 2. Lindsay Holmwood @auxesis
  • 3. Software Manager @ Bulletproof Networks
  • 4.
  • 8. Departed PVR at 13.37 PST
  • 10. 2 hours into flight: Jammed horizontal stabiliser
  • 13.
  • 15.
  • 16.
  • 17. 2 pilots 3 crew 83 passengers
  • 18.
  • 19.
  • 20.
  • 21.
  • 22. This is a maintenance accident. Alaska Airlines' maintenance and inspection of its horizontal stabilizer activation system was poorly conceived and woefully executed. The failure was compounded by poor oversight... had any of the managers, mechanics, inspectors, supervisors or FAA overseers whose job it was to protect this mechanism done their job conscientiously, this accident cannot happen. -- John J. Goglia, NTSB Board Member
  • 24. [hindsight] converts a once vague, unlikely future into an immediate, certain past -- Sidney Dekker
  • 25. This is a maintenance accident. Alaska Airlines' maintenance and inspection of its horizontal stabilizer activation system was poorly conceived and woefully executed. The failure was compounded by poor oversight... had any of the managers, mechanics, inspectors, supervisors or FAA overseers whose job it was to protect this mechanism done their job conscientiously, this accident cannot happen. -- John J. Goglia, NTSB Board Member
  • 26. This is a maintenance accident. Alaska Airlines' maintenance and inspection of its horizontal stabilizer activation system was poorly conceived and woefully executed. The failure was compounded by poor oversight... had any of the managers, mechanics, inspectors, supervisors or FAA overseers whose job it was to protect this mechanism done their job conscientiously, this accident cannot happen. -- John J. Goglia, NTSB Board Member
  • 27.
  • 28. “poorly conceived and woefully executed”
  • 29.
  • 30.
  • 31.
  • 32. DC-9 -> MD-80 -> MD-83
  • 33.
  • 37.
  • 38. 1965 every 300-350 hours launch of DC-9 1985 every 700 hours industry deregulation 1987 every 1000 hours industry standardisation 1991 every 1200 hours industry standardisation 1994 every 1600 hours industry standardisation 1996 every 8 months (2550 hours) Alaska Airlines policy change
  • 39. 1965 every 300-350 hours launch of DC-9 1985 every 700 hours industry deregulation 1987 every 1000 hours industry standardisation 1991 every 1200 hours industry standardisation 1994 every 1600 hours industry standardisation 1996 every 8 months (2550 hours) Alaska Airlines policy change
  • 40. 1965 every 300-350 hours launch of DC-9 1985 every 700 hours industry deregulation 1987 every 1000 hours industry standardisation 1991 every 1200 hours industry standardisation 1994 every 1600 hours industry standardisation 1996 every 8 months (2550 hours) Alaska Airlines policy change
  • 41. 1965 every 300-350 hours launch of DC-9 1985 every 700 hours industry deregulation 1987 every 1000 hours industry standardisation 1991 every 1200 hours industry standardisation 1994 every 1600 hours industry standardisation 1996 every 8 months (2550 hours) Alaska Airlines policy change
  • 42. 1965 every 300-350 hours launch of DC-9 1985 every 700 hours industry deregulation 1987 every 1000 hours industry standardisation 1991 every 1200 hours industry standardisation 1994 every 1600 hours industry standardisation 1996 every 8 months (2550 hours) Alaska Airlines policy change
  • 43. 1965 every 300-350 hours launch of DC-9 1985 every 700 hours industry deregulation 1987 every 1000 hours industry standardisation 1991 every 1200 hours industry standardisation 1994 every 1600 hours industry standardisation 1996 every 8 months (2550 hours) Alaska Airlines policy change
  • 44. 1965 every 300-350 hours launch of DC-9 1985 every 700 hours industry deregulation 1987 every 1000 hours industry standardisation 1991 every 1200 hours industry standardisation 1994 every 1600 hours industry standardisation 1996 every 8 months (2550 hours) Alaska Airlines policy change
  • 46. Complex system constraints Jens Rasmussen
  • 47.
  • 48. wo rkl oad
  • 49. wo rkl oad economy
  • 50. wo rkl oad economy saf ety
  • 51.
  • 52. tim e
  • 53. e tim cost
  • 54. ty ali qu cost e tim
  • 55. wo rkl oad economy saf ety
  • 56. wo rkl oad economy saf ety
  • 57. wo rkl oad economy saf ety
  • 58. wo rkl oad economy saf ety
  • 59. wo rkl oad economy saf ety
  • 60. wo rkl oad economy saf ety
  • 61. wo rkl oad economy saf ety
  • 62. wo rkl oad economy saf ety
  • 63. wo rkl oad economy saf ety
  • 64. outside: failure of foresight oad saf rkl ety wo economy
  • 65. outside: failure of foresight oad saf rkl inside: ety trade-offs wo in direction of greater efficiency economy
  • 66. trade-offs in direction of greater efficiency
  • 67. trade-offs in direction of greater efficiency
  • 69. Why would they make bad decisions intentionally?
  • 72. “people make what they consider to be the best decision based on available knowledge at the time”
  • 73. This is a maintenance accident. Alaska Airlines' maintenance and inspection of its horizontal stabilizer activation system was poorly conceived and woefully executed. The failure was compounded by poor oversight... had any of the managers, mechanics, inspectors, supervisors or FAA overseers whose job it was to protect this mechanism done their job conscientiously, this accident cannot happen. -- John J. Goglia, NTSB Board Member
  • 74. wo rkl oad economy saf ety
  • 75. ty ali qu cost e tim
  • 76.
  • 78. “God, our ops team are arseholes. I just want to deploy this change and go home!”
  • 79. “God, our ops team are arseholes. I just want to deploy this change and go home!” oad saf rkl ety wo economy
  • 80. “God, our ops team are arseholes. I just want to deploy this change and go home!” oad oad saf saf rkl rkl ety ety wo wo economy economy
  • 81. What are the circumstances?
  • 82. Where are the tensions?
  • 83. Have ops been burnt before?
  • 84. Is there deployment friction? Why?
  • 86. Is deployment time consuming?
  • 87. Is deployment important to the business?
  • 88.
  • 89. “It’s 3am an the pager has gone off again. Why can’t these devs just write code that works?”
  • 90. “It’s 3am an the pager has gone off again. Why can’t these devs just write code that works?” oad saf rkl ety wo economy
  • 91. “It’s 3am an the pager has gone off again. Why can’t these devs just write code that works?” oad oad saf saf rkl rkl ety ety wo wo economy economy
  • 92. [hindsight] converts a once vague, unlikely future into an immediate, certain past -- Sidney Dekker
  • 93. What are the circumstances?
  • 94. Where are the tensions?
  • 95. Why didn’t the dev know the code would fail like this?
  • 96. Why weren’t you involved when the code was written?
  • 97. How is code reviewed?
  • 98. Is the infrastructure anti-fragile?
  • 99. Is the code anti-fragile?
  • 100.
  • 102. [hindsight] converts a once vague, unlikely future into an immediate, certain past -- Sidney Dekker
  • 103.
  • 104. What are the motivations?
  • 106. wo rkl oad economy saf ety
  • 107. wo rkl oad economy saf ety
  • 108. “root cause” is simply the point you stop looking -- Sidney Dekker
  • 109. What are the circumstances?
  • 110. Where are the tensions?
  • 112. Thank you! Liked the talk? Let @auxesis know!
  • 113. Sidney Dekker [books] Field Guide to Understand Human Error Drift Into Failure Just Culture Dan Manges [blog] How incidents affect infrastructure priorities