SlideShare a Scribd company logo
Organisational Failure

                                         Prof Ian Sommerville


                                                      Video link



Organisational Failure, York EngD Course in LSCITS, 2012           Slide 1
Organisational failure


  •       Why and how organisational factors can contribute to
          system failures




Organisational Failure, York EngD Course in LSCITS, 2012   Slide 2
Why organisations matter?
  •       Organisations have multiple, inter-related, potentially
          conflicting goals:
        –       Efficient resource utilisation
        –       Timely delivery of products/services
        –       Customer satisfaction
        –       Owner satisfaction
        –       Regulatory compliance
        –       Safety and dependability
        –       Maintenance of reputation/brand
        –       Future development


Organisational Failure, York EngD Course in LSCITS, 2012     Slide 3
Decision making
  •       Organisational decision making involves taking all of
          these into account
        –       Inevitably, this sometimes means making compromises that
                affect the safety and dependability of a system
  •       These compromises lead to vulnerabilities and
          hazards that may then compromise the safety or
          dependability of the system
  •       In complex organisations, there are competing
          priorities in different parts of the organisation
        –       Shifting power and authority in an organisation affects
                decision making
        –       May be deliberate lack of communications across the
                organisation
Organisational Failure, York EngD Course in LSCITS, 2012                  Slide 4
NASA Challenger disaster
  •       Space shuttle exploded shortly after take-off
  •       The cause was the failure of rubber seals (O-rings)
          that allowed hot gas to escape and make contact with
          fuel tanks which then exploded
  •       Subsequent enquiry showed that O-ring failure was
          due to brittleness at low temperatures
  •       Arguably, decision makers were complacent because
        –       Redundant (primary and secondary) O-rings in the system
        –       Damage to primary O-rings had been tolerated in previous
                launches

Organisational Failure, York EngD Course in LSCITS, 2012             Slide 5
Organisational failure
  •       Engineers were concerned about launching in low
          temperatures and advised against launch
  •       But goals other than safety and dependability took
          precedence and engineers were overruled
        –         „Owner‟ satisfaction
              •      already several delays to flight
        –         Future planning
              •      NASA wanted a success to support budget negotiations
        –         Resource utilisation
              •      Reluctance to address known problem with O-rings because of
                     costs
Organisational Failure, York EngD Course in LSCITS, 2012                    Slide 6
Normal accidents
  •       Developed by Charles Perrow who conducted a study of a
          nuclear accident in the USA (Three Mile Island)
  •       Official conclusion was that the problems were due to
          “human error”
  •       Perrow disagreed with this and argues that failures are
          „normal‟ and inevitable in complex systems which have:
        –         Interactive complexity
              •      The presence of unfamiliar, unplanned and unexpected sequences
                     of events in a system that are not visible or immediately
                     comprehensible
        –         Tight coupling
              •      The presence of interdependent components.
              •      Tight coupling will make a system more prone to cascading errors.

Organisational Failure, York EngD Course in LSCITS, 2012                         Slide 7
Organisational Failure, York EngD Course in LSCITS, 2012   Slide 8
Redundancy
  •       The use of redundancy is a fundamental technique in
          achieving system safety
        –       Primary and secondary O-rings on space shuttle
        –       Quintuple redundancy in Airbus FCS

  •       Failure of primary system can be tolerated
  •       Perrow argues that redundancy can decrease rather
          than increase safety:
        –       Increases complexity and coupling in the system
        –       Provides reassurance that system faults can be tolerated



Organisational Failure, York EngD Course in LSCITS, 2012               Slide 9
Failures or successes
  •       Normal accident theory is based on extensive studies
          of system failures
  •       It argues that failure is systemic and an inherent
          characteristic of the system itself
  •       Alternative perspective is based on studies of
          success
        –       Why are there some areas that are apparently complex (e.g.
                air traffic management) where failures are relatively
                uncommon?

  •       Led to the notion of high-reliability organisations

Organisational Failure, York EngD Course in LSCITS, 2012             Slide 10
Failure-free organisations?
  •       High-reliability organisation (HRO) researchers
          disagree that complex, highly interdependent
          systems will inevitably have accidents
        –       They believe organisations are able to compensate for
                technical shortcomings through their methods of operation, in
                essence they argue that organisations can be ‘failure free’.

  •       Based on studies of „reliable‟ organisations
        –       Aircraft carriers
        –       Air traffic control
        –       Nuclear power stations
        –       Intensive care units

Organisational Failure, York EngD Course in LSCITS, 2012               Slide 11
Aircraft carrier flight operations




Organisational Failure, York EngD Course in LSCITS, 2012   Slide 12
Nuclear powered carriers
  •       Complex systems
        –       Carriers are 24 stories high and carry enough fuel for 15
                years. 2000 telephones. 3,360 compartments and spaces
        –       Multiple software intensive systems (command systems,
                aircraft software)
        –       Dangerous objects (aircraft, fuel, and explosives) in close
                proximity.
        –       Aircraft taking off and landing in 48-60 second intervals.
        –       6000 crew. Several different kinds of aircraft, multiple
                squadrons.
        –       All work interdependently and must be coordinated.


Organisational Failure, York EngD Course in LSCITS, 2012                     Slide 13
Nuclear powered carriers
 •   High risk
     –   Nuclear reactor accidents
     –   Fire, flooding, grounding, collision
     –   Fuel and weapons explosions
     –   Mistaken identification of friends and foes
     –   High risks both to crew and a much larger public

 •   High reliability
     –   Low “crunch rates”
     –   comparatively few major accidents

 •        High reliability achieved through organisational
          design
Organisational Failure, York EngD Course in LSCITS, 2012     Slide 14
High Reliability Organisations
  •       High Reliability Organisations (HROs) have particular
          qualities
        –       Reliability takes precedence over efficiency
        –       Preoccupation with failure, not success
        –       Share the big picture
        –       Focus on details
        –       Migrate decisions




Organisational Failure, York EngD Course in LSCITS, 2012       Slide 15
Reliability over Efficiency
        –       Reliability comes before efficiency but cannot replace it
        –       Decisions are made on the grounds of reliability first and then
                efficiency
        –       Efficiency initiatives are treated with scepticism
        –       Managers regularly talk to and familiarise themselves with
                staff about how they do their work and why. This stops
                managers focusing just on figures.
        –       Organisations develop safety measures as well as financial
                measures, and include these in employee evaluations
        –       Organisations assign value to the avoidance of accidents
        –       High redundancy despite cost
        –       Cautious actions when necessary despite cost
Organisational Failure, York EngD Course in LSCITS, 2012                    Slide 16
Preoccupation with Failure
  •       HROs recognise that:
        –       Workers need to be heedful to the possibility of failure
        –       Failures are normal but accidents should be avoided
        –       Acknowledge there can be unexpected failure modes, even
                in common activities

  •       HROs address failure by:
        –       Constant training of all people (simulations, apprenticing,
                practice)
        –       Using incident reporting
        –       Designing in extensive redundancy
        –       Maintaining contingencies for critical operations
        –       Requiring proofs that something is safe, not that it is unsafe
Organisational Failure, York EngD Course in LSCITS, 2012                   Slide 17
Carrier operations
        –         There is constant tracking of issues around malfunctioning,
                  defective and substandard equipment.
              •      They act on these by training crew how to overcome problems
                     and pressuring vendors to make improvements
        –         Extensive redundancy (overlapping jobs, multiple channels
                  and centres of communications, spare parts, multiple sources
                  for decision making).
              •      Example: if an aircrafts landing gear warning light comes on, the
                     spotter, commander and pilot all work together to establish what
                     the issues is.
        –         Multiple contingencies are maintained
              •      Example: There will always be multiple options for how to land
                     the plane (or for the pilot to escape).
Organisational Failure, York EngD Course in LSCITS, 2012                        Slide 18
Sharing the Big Picture
  •       HROs recognise that:
        –       If people are narrowly focused they will act only in their own
                interest
        –       People need to maintain awareness of other people and
                events around the organisation

  •       HROs
        –       Train people broadly
        –       Educate people about overarching objectives, and set
                statements of purpose
        –       Give people access to information on what is happening
                elsewhere
        –       Clearly specify how people and teams fit into the whole
Organisational Failure, York EngD Course in LSCITS, 2012                  Slide 19
Reluctance to Simplify
  •       HROS are reluctant to simplify
              •      All organisations have to simplify and abstract, to filter out
                     unnecessary information (particularly for getting “big pictures”)

  •       Rather, HROs
        –         Use labels and categories as little as possible as they stop
                  you from looking further into details and events.
        –         Continually rework labels and categories
        –         Listen to wisdom, but with skepticism
        –         Do not focus on information that supports expectations, but
                  focus on that which doesn‟t fit or disconfirms desires


Organisational Failure, York EngD Course in LSCITS, 2012                          Slide 20
Migration of decision making
  •       HROs migrate decision making as far down the
          organisation as possible
        –       Decisions are not made by one central authority

  •       HROs recognise:
        –       Decisions need to be made where there is expertise
        –       Decisions often need to be made quickly
        –       People must be trained in making decisions and are given
                the right resources to do so
        –       Skill levels and legitimacy through the organisation and
                people are trusted


Organisational Failure, York EngD Course in LSCITS, 2012                   Slide 21
HROs and Normal Accidents
  •       HRO theory is sometimes presented as conflicting
          with Normal Accidents
        –       HRO proponents may argue that accidents are not „normal‟
        –       Leveson critiques work on HROs and argues that they are
                not based on concerns of tightly coupled systems

  •       Arguably, an HRO is an organisation that has taken
          active steps to:
        –       reduce coupling and
        –       reduce interactions
        –       Once that has been achieved, the driver for HRO‟s is
                perhaps a strong „safety culture‟ to promote safety across the
                organisation
Organisational Failure, York EngD Course in LSCITS, 2012                Slide 22
Organisational vulnerabilities

  •       Organisational vulnerabilities are characteristics of an
          organisation that weaken defensive layers and so
          may lead to system failure.
  •       Examples of organisational vulnerabilities
        –       Over-reliance on process to achieve safety/dependability
        –       Responsibility failures
        –       Weak safety/dependability culture
        –       Under-resourcing of safety




Organisational Failure, York EngD Course in LSCITS, 2012               Slide 23
Over-reliance on process
  •       Quality standards such as ISO 9000 place great
          emphasis on process and process assurance
        –       Implication of these standards is that process is paramount

  •       This tends to promote a belief that focusing on
          process is the way to achieve safety and
          dependability
  •       However, processes are never isolated and have to
          be enacted in a dynamic context
  •       Sometimes necessary to deviate from the „normal‟
          process to achieve safety and dependability

Organisational Failure, York EngD Course in LSCITS, 2012                Slide 24
Responsibility failures
  •       System failures are often a consequence of
          responsibility failures
        –       Unassigned responsibility
        –       Misassigned responsibility
        –       Misunderstood responsibility
        –       Duplicated responsibilities
        –       Responsibility overload
        –       Responsibility fragility

  •       Responsibility failures may be a consequence of poor
          communications and/or under-resourcing

Organisational Failure, York EngD Course in LSCITS, 2012   Slide 25
Organisational culture
  •       “The way that we do things around here”
  •       Culture may conflict with public statements of
          priorities
        –       “The patient comes first”
        –       “Safety is our goal”

  •       Investment banking
        –       High risk, high reward
        –       Lack of regulation or weak compliance with regulations
        –       Large-scale failures


Organisational Failure, York EngD Course in LSCITS, 2012                 Slide 26
Safety culture
  •       Some organisations have developed a strong safety
          culture where safety is seen as a priority by all
          members of the organisation
  •       Safety culture (UK HSE)
        –       “The product of individual and group values, attitudes,
                perceptions, competencies, and patterns of behaviour that
                determine the commitment to, and the style and proficiency
                of, an organization‟s health and safety management”




Organisational Failure, York EngD Course in LSCITS, 2012              Slide 27
Safety culture (Reason)




Organisational Failure, York EngD Course in LSCITS, 2012   Slide 28
Safety maturity




Organisational Failure, York EngD Course in LSCITS, 2012   Slide 29
Under-resourcing
  •       If operations are under-resourced then safety and
          dependability are often sacrificed
  •       Organisational priorities focus on optimising resource
          utilisation to continue service delivery
        –         Safety and dependability may be seen as an avoidable
                  overhead

  •       Example
        –         Cleaning services in hospital outsourced to save money
        –         Competitive tender
        –         Under-resourced so quality of service reduced
              •      Consequent increase in hospital acquired infections
Organisational Failure, York EngD Course in LSCITS, 2012                   Slide 30
Complex systems
  •       Complexity = Coupling + Interaction
  •       Lesson for LSCITS
        –       Increasing complexity will lead to unpredictable system failure
        –       Strive to build LSITS rather than LSCITS

  •       Improve safety by
        –       Reducing coupling
        –       Reducing interactions
        –       Redundancy may not improve safety as it increases complexity in
                the system

  •       Address problems at organisational as well as the system level


Organisational Failure, York EngD Course in LSCITS, 2012                          Slide 31
Key points
  •       Organisational decisions, influenced by structure and
          culture, often have a major impact on safety and
          dependability
  •       Normal Accident Theory postulates that accidents are
          inevitable in complex, tightly coupled systems
  •       High-reliability organisations aim to achieve safety
          through a set of practices that aim to reduce failures
  •       Organisational vulnerabilities include over-reliance on
          process, responsibility failures, poor safety culture
          and under-resourcing

Organisational Failure, York EngD Course in LSCITS, 2012     Slide 32

More Related Content

What's hot

Entrepreneurship
EntrepreneurshipEntrepreneurship
Entrepreneurship
Seta Wicaksana
 
Peter senge,Management,Five disipline
Peter senge,Management,Five disiplinePeter senge,Management,Five disipline
Peter senge,Management,Five disiplineUzair4292
 
EMPOWERING WOMEN IN LEADERSHIP: HOW TO DEVELOP FEMALE TALENT
EMPOWERING WOMEN IN LEADERSHIP: HOW TO DEVELOP FEMALE TALENTEMPOWERING WOMEN IN LEADERSHIP: HOW TO DEVELOP FEMALE TALENT
EMPOWERING WOMEN IN LEADERSHIP: HOW TO DEVELOP FEMALE TALENT
Human Capital Media
 
ORGANISATIONAL CHANGE & STRESS MANAGEMENT
ORGANISATIONAL CHANGE & STRESS MANAGEMENTORGANISATIONAL CHANGE & STRESS MANAGEMENT
ORGANISATIONAL CHANGE & STRESS MANAGEMENT
Shilpi Panchal
 
Theories of-entrepreneurship
Theories of-entrepreneurshipTheories of-entrepreneurship
Theories of-entrepreneurship
shrinivas kulkarni
 
The 8 Biggest Business Trends In 2022
The 8 Biggest Business Trends In 2022 The 8 Biggest Business Trends In 2022
The 8 Biggest Business Trends In 2022
Bernard Marr
 
The Future of Learning & Development
The Future of Learning & DevelopmentThe Future of Learning & Development
The Future of Learning & Development
futurethink
 
Communication-Robbins&Judge-Team11
Communication-Robbins&Judge-Team11Communication-Robbins&Judge-Team11
Communication-Robbins&Judge-Team11
Bao Nguyen
 
Learning and organizational reward system
Learning and organizational reward systemLearning and organizational reward system
Learning and organizational reward system
Anurag Priyadarshi
 
Perception and Individual Decision Making
Perception and Individual Decision MakingPerception and Individual Decision Making
Perception and Individual Decision Making
Curtin University, Perth, Australia
 
Diversity at workplace
Diversity at workplaceDiversity at workplace
Diversity at workplace
Imran Alam
 
Test your Unconscious Bias
Test your Unconscious BiasTest your Unconscious Bias
Test your Unconscious Bias
Kwintessential Consulting Ltd.
 
Entrepreneurship Ecosystems: Design & build an "ecosystem-in-a-box"
Entrepreneurship Ecosystems: Design & build an "ecosystem-in-a-box"Entrepreneurship Ecosystems: Design & build an "ecosystem-in-a-box"
Entrepreneurship Ecosystems: Design & build an "ecosystem-in-a-box"
Laura IH Bennett
 
Organizational Behavior
Organizational BehaviorOrganizational Behavior
Organizational Behavior
ahmad bassiouny
 
Perception and individual decision making
Perception and individual decision makingPerception and individual decision making
Perception and individual decision making
Nitasha Dhawan
 
Whistleblowing Presentation
Whistleblowing PresentationWhistleblowing Presentation
Whistleblowing Presentation
katlyntrzaska
 
Corporate Entrepreneurship: How?
Corporate Entrepreneurship: How?Corporate Entrepreneurship: How?
Corporate Entrepreneurship: How?
Kelvin Koh Tong Weng
 
LEADERSHIP ENTERPRENEURSHIP AND
 LEADERSHIP ENTERPRENEURSHIP AND LEADERSHIP ENTERPRENEURSHIP AND
LEADERSHIP ENTERPRENEURSHIP AND
Arushi Arora
 
Note of Organizational Behavior HRMT 5210
Note of Organizational Behavior HRMT 5210Note of Organizational Behavior HRMT 5210
Note of Organizational Behavior HRMT 5210
Rashna Maharjan
 
Chapter 6 performance management
Chapter  6 performance management Chapter  6 performance management
Chapter 6 performance management
Preeti Bhaskar
 

What's hot (20)

Entrepreneurship
EntrepreneurshipEntrepreneurship
Entrepreneurship
 
Peter senge,Management,Five disipline
Peter senge,Management,Five disiplinePeter senge,Management,Five disipline
Peter senge,Management,Five disipline
 
EMPOWERING WOMEN IN LEADERSHIP: HOW TO DEVELOP FEMALE TALENT
EMPOWERING WOMEN IN LEADERSHIP: HOW TO DEVELOP FEMALE TALENTEMPOWERING WOMEN IN LEADERSHIP: HOW TO DEVELOP FEMALE TALENT
EMPOWERING WOMEN IN LEADERSHIP: HOW TO DEVELOP FEMALE TALENT
 
ORGANISATIONAL CHANGE & STRESS MANAGEMENT
ORGANISATIONAL CHANGE & STRESS MANAGEMENTORGANISATIONAL CHANGE & STRESS MANAGEMENT
ORGANISATIONAL CHANGE & STRESS MANAGEMENT
 
Theories of-entrepreneurship
Theories of-entrepreneurshipTheories of-entrepreneurship
Theories of-entrepreneurship
 
The 8 Biggest Business Trends In 2022
The 8 Biggest Business Trends In 2022 The 8 Biggest Business Trends In 2022
The 8 Biggest Business Trends In 2022
 
The Future of Learning & Development
The Future of Learning & DevelopmentThe Future of Learning & Development
The Future of Learning & Development
 
Communication-Robbins&Judge-Team11
Communication-Robbins&Judge-Team11Communication-Robbins&Judge-Team11
Communication-Robbins&Judge-Team11
 
Learning and organizational reward system
Learning and organizational reward systemLearning and organizational reward system
Learning and organizational reward system
 
Perception and Individual Decision Making
Perception and Individual Decision MakingPerception and Individual Decision Making
Perception and Individual Decision Making
 
Diversity at workplace
Diversity at workplaceDiversity at workplace
Diversity at workplace
 
Test your Unconscious Bias
Test your Unconscious BiasTest your Unconscious Bias
Test your Unconscious Bias
 
Entrepreneurship Ecosystems: Design & build an "ecosystem-in-a-box"
Entrepreneurship Ecosystems: Design & build an "ecosystem-in-a-box"Entrepreneurship Ecosystems: Design & build an "ecosystem-in-a-box"
Entrepreneurship Ecosystems: Design & build an "ecosystem-in-a-box"
 
Organizational Behavior
Organizational BehaviorOrganizational Behavior
Organizational Behavior
 
Perception and individual decision making
Perception and individual decision makingPerception and individual decision making
Perception and individual decision making
 
Whistleblowing Presentation
Whistleblowing PresentationWhistleblowing Presentation
Whistleblowing Presentation
 
Corporate Entrepreneurship: How?
Corporate Entrepreneurship: How?Corporate Entrepreneurship: How?
Corporate Entrepreneurship: How?
 
LEADERSHIP ENTERPRENEURSHIP AND
 LEADERSHIP ENTERPRENEURSHIP AND LEADERSHIP ENTERPRENEURSHIP AND
LEADERSHIP ENTERPRENEURSHIP AND
 
Note of Organizational Behavior HRMT 5210
Note of Organizational Behavior HRMT 5210Note of Organizational Behavior HRMT 5210
Note of Organizational Behavior HRMT 5210
 
Chapter 6 performance management
Chapter  6 performance management Chapter  6 performance management
Chapter 6 performance management
 

Similar to Organizational Failure (LSCITS EngD 2012)

Sociotechnical systems resilience
Sociotechnical systems resilienceSociotechnical systems resilience
Sociotechnical systems resilienceJean-René RUAULT
 
Fuller.david
Fuller.davidFuller.david
Fuller.davidNASAPMC
 
CS5032 Lecture 13: organisations and failure
CS5032 Lecture 13: organisations and failureCS5032 Lecture 13: organisations and failure
CS5032 Lecture 13: organisations and failureJohn Rooksby
 
3 Lessons from High Reliability Organizations
3 Lessons from High Reliability Organizations3 Lessons from High Reliability Organizations
3 Lessons from High Reliability Organizations
Wilson Perumal and Company
 
Designing Complex Systems for Recovery (LSCITS EngD 2011)
Designing Complex Systems for Recovery (LSCITS EngD 2011)Designing Complex Systems for Recovery (LSCITS EngD 2011)
Designing Complex Systems for Recovery (LSCITS EngD 2011)
Ian Sommerville
 
Aviation Training, Safety Management System
Aviation Training, Safety Management SystemAviation Training, Safety Management System
Aviation Training, Safety Management System
pghclearingsolutions
 
DARWIN Webinar 'The sharp end' by Anders Ellerstrand
DARWIN Webinar 'The sharp end' by Anders EllerstrandDARWIN Webinar 'The sharp end' by Anders Ellerstrand
DARWIN Webinar 'The sharp end' by Anders Ellerstrand
Peter O'Leary
 
Navigating the risks in implementing Hybrid Cloud, Agile and Project Manageme...
Navigating the risks in implementing Hybrid Cloud, Agile and Project Manageme...Navigating the risks in implementing Hybrid Cloud, Agile and Project Manageme...
Navigating the risks in implementing Hybrid Cloud, Agile and Project Manageme...
Livingstone Advisory
 
Testing Hyper-Complex Systems: What Can We Know? What Can We Claim?
Testing Hyper-Complex Systems: What Can We Know? What Can We Claim?Testing Hyper-Complex Systems: What Can We Know? What Can We Claim?
Testing Hyper-Complex Systems: What Can We Know? What Can We Claim?
TechWell
 
Improving the safety culture at VTTI by using serious gaming
Improving the safety culture at VTTI by using serious gamingImproving the safety culture at VTTI by using serious gaming
Improving the safety culture at VTTI by using serious gamingjhjsmits
 
Improving the safety culture at terminals using serious gaming
Improving the safety culture at terminals using serious gamingImproving the safety culture at terminals using serious gaming
Improving the safety culture at terminals using serious gamingjhjsmits
 
CS 5032 L4 requirements engineering 2013
CS 5032 L4 requirements engineering 2013CS 5032 L4 requirements engineering 2013
CS 5032 L4 requirements engineering 2013Ian Sommerville
 
The importance of considering human factors
The importance of considering human factorsThe importance of considering human factors
The importance of considering human factors
NSW Environment and Planning
 
Reducing Accident in OG Industry.pdf
Reducing Accident in OG Industry.pdfReducing Accident in OG Industry.pdf
Reducing Accident in OG Industry.pdf
DianValarbi
 
Process Safety Awareness | PSM | Gaurav Singh Rajput
Process Safety Awareness | PSM | Gaurav Singh RajputProcess Safety Awareness | PSM | Gaurav Singh Rajput
Process Safety Awareness | PSM | Gaurav Singh Rajput
Gaurav Singh Rajput
 
Responsibility Modelling
Responsibility ModellingResponsibility Modelling
Responsibility Modelling
Ian Sommerville
 
Dependability requirements for LSCITS
Dependability requirements for LSCITSDependability requirements for LSCITS
Dependability requirements for LSCITS
Ian Sommerville
 
Resp modellingintro
Resp modellingintroResp modellingintro
Resp modellingintro
Ian Sommerville
 
Modes of project resilience APM event 9th February 2016
Modes of project resilience APM event 9th February 2016Modes of project resilience APM event 9th February 2016
Modes of project resilience APM event 9th February 2016
Association for Project Management
 
Safety and security in distributed systems
Safety and security in distributed systems Safety and security in distributed systems
Safety and security in distributed systems
Einar Landre
 

Similar to Organizational Failure (LSCITS EngD 2012) (20)

Sociotechnical systems resilience
Sociotechnical systems resilienceSociotechnical systems resilience
Sociotechnical systems resilience
 
Fuller.david
Fuller.davidFuller.david
Fuller.david
 
CS5032 Lecture 13: organisations and failure
CS5032 Lecture 13: organisations and failureCS5032 Lecture 13: organisations and failure
CS5032 Lecture 13: organisations and failure
 
3 Lessons from High Reliability Organizations
3 Lessons from High Reliability Organizations3 Lessons from High Reliability Organizations
3 Lessons from High Reliability Organizations
 
Designing Complex Systems for Recovery (LSCITS EngD 2011)
Designing Complex Systems for Recovery (LSCITS EngD 2011)Designing Complex Systems for Recovery (LSCITS EngD 2011)
Designing Complex Systems for Recovery (LSCITS EngD 2011)
 
Aviation Training, Safety Management System
Aviation Training, Safety Management SystemAviation Training, Safety Management System
Aviation Training, Safety Management System
 
DARWIN Webinar 'The sharp end' by Anders Ellerstrand
DARWIN Webinar 'The sharp end' by Anders EllerstrandDARWIN Webinar 'The sharp end' by Anders Ellerstrand
DARWIN Webinar 'The sharp end' by Anders Ellerstrand
 
Navigating the risks in implementing Hybrid Cloud, Agile and Project Manageme...
Navigating the risks in implementing Hybrid Cloud, Agile and Project Manageme...Navigating the risks in implementing Hybrid Cloud, Agile and Project Manageme...
Navigating the risks in implementing Hybrid Cloud, Agile and Project Manageme...
 
Testing Hyper-Complex Systems: What Can We Know? What Can We Claim?
Testing Hyper-Complex Systems: What Can We Know? What Can We Claim?Testing Hyper-Complex Systems: What Can We Know? What Can We Claim?
Testing Hyper-Complex Systems: What Can We Know? What Can We Claim?
 
Improving the safety culture at VTTI by using serious gaming
Improving the safety culture at VTTI by using serious gamingImproving the safety culture at VTTI by using serious gaming
Improving the safety culture at VTTI by using serious gaming
 
Improving the safety culture at terminals using serious gaming
Improving the safety culture at terminals using serious gamingImproving the safety culture at terminals using serious gaming
Improving the safety culture at terminals using serious gaming
 
CS 5032 L4 requirements engineering 2013
CS 5032 L4 requirements engineering 2013CS 5032 L4 requirements engineering 2013
CS 5032 L4 requirements engineering 2013
 
The importance of considering human factors
The importance of considering human factorsThe importance of considering human factors
The importance of considering human factors
 
Reducing Accident in OG Industry.pdf
Reducing Accident in OG Industry.pdfReducing Accident in OG Industry.pdf
Reducing Accident in OG Industry.pdf
 
Process Safety Awareness | PSM | Gaurav Singh Rajput
Process Safety Awareness | PSM | Gaurav Singh RajputProcess Safety Awareness | PSM | Gaurav Singh Rajput
Process Safety Awareness | PSM | Gaurav Singh Rajput
 
Responsibility Modelling
Responsibility ModellingResponsibility Modelling
Responsibility Modelling
 
Dependability requirements for LSCITS
Dependability requirements for LSCITSDependability requirements for LSCITS
Dependability requirements for LSCITS
 
Resp modellingintro
Resp modellingintroResp modellingintro
Resp modellingintro
 
Modes of project resilience APM event 9th February 2016
Modes of project resilience APM event 9th February 2016Modes of project resilience APM event 9th February 2016
Modes of project resilience APM event 9th February 2016
 
Safety and security in distributed systems
Safety and security in distributed systems Safety and security in distributed systems
Safety and security in distributed systems
 

More from Ian Sommerville

Ultra Large Scale Systems
Ultra Large Scale SystemsUltra Large Scale Systems
Ultra Large Scale Systems
Ian Sommerville
 
Resilience and recovery
Resilience and recoveryResilience and recovery
Resilience and recovery
Ian Sommerville
 
LSCITS-engineering
LSCITS-engineeringLSCITS-engineering
LSCITS-engineering
Ian Sommerville
 
Requirements reality
Requirements realityRequirements reality
Requirements reality
Ian Sommerville
 
Conceptual systems design
Conceptual systems designConceptual systems design
Conceptual systems design
Ian Sommerville
 
Requirements Engineering for LSCITS
Requirements Engineering for LSCITSRequirements Engineering for LSCITS
Requirements Engineering for LSCITS
Ian Sommerville
 
An introduction to LSCITS
An introduction to LSCITSAn introduction to LSCITS
An introduction to LSCITS
Ian Sommerville
 
Internet worm-case-study
Internet worm-case-studyInternet worm-case-study
Internet worm-case-study
Ian Sommerville
 
Designing software for a million users
Designing software for a million usersDesigning software for a million users
Designing software for a million users
Ian Sommerville
 
Security case buffer overflow
Security case buffer overflowSecurity case buffer overflow
Security case buffer overflowIan Sommerville
 
CS5032 Case study Ariane 5 launcher failure
CS5032 Case study Ariane 5 launcher failureCS5032 Case study Ariane 5 launcher failure
CS5032 Case study Ariane 5 launcher failureIan Sommerville
 
CS5032 Case study Kegworth air disaster
CS5032 Case study Kegworth air disasterCS5032 Case study Kegworth air disaster
CS5032 Case study Kegworth air disasterIan Sommerville
 
CS5032 L19 cybersecurity 1
CS5032 L19 cybersecurity 1CS5032 L19 cybersecurity 1
CS5032 L19 cybersecurity 1Ian Sommerville
 
CS5032 L20 cybersecurity 2
CS5032 L20 cybersecurity 2CS5032 L20 cybersecurity 2
CS5032 L20 cybersecurity 2Ian Sommerville
 
L17 CS5032 critical infrastructure
L17 CS5032 critical infrastructureL17 CS5032 critical infrastructure
L17 CS5032 critical infrastructureIan Sommerville
 
CS5032 Case study Maroochy water breach
CS5032 Case study Maroochy water breachCS5032 Case study Maroochy water breach
CS5032 Case study Maroochy water breachIan Sommerville
 
CS 5032 L18 Critical infrastructure 2: SCADA systems
CS 5032 L18 Critical infrastructure 2: SCADA systemsCS 5032 L18 Critical infrastructure 2: SCADA systems
CS 5032 L18 Critical infrastructure 2: SCADA systemsIan Sommerville
 
CS5032 L9 security engineering 1 2013
CS5032 L9 security engineering 1 2013CS5032 L9 security engineering 1 2013
CS5032 L9 security engineering 1 2013Ian Sommerville
 
CS5032 L10 security engineering 2 2013
CS5032 L10 security engineering 2 2013CS5032 L10 security engineering 2 2013
CS5032 L10 security engineering 2 2013Ian Sommerville
 
CS5032 L11 validation and reliability testing 2013
CS5032 L11 validation and reliability testing 2013CS5032 L11 validation and reliability testing 2013
CS5032 L11 validation and reliability testing 2013Ian Sommerville
 

More from Ian Sommerville (20)

Ultra Large Scale Systems
Ultra Large Scale SystemsUltra Large Scale Systems
Ultra Large Scale Systems
 
Resilience and recovery
Resilience and recoveryResilience and recovery
Resilience and recovery
 
LSCITS-engineering
LSCITS-engineeringLSCITS-engineering
LSCITS-engineering
 
Requirements reality
Requirements realityRequirements reality
Requirements reality
 
Conceptual systems design
Conceptual systems designConceptual systems design
Conceptual systems design
 
Requirements Engineering for LSCITS
Requirements Engineering for LSCITSRequirements Engineering for LSCITS
Requirements Engineering for LSCITS
 
An introduction to LSCITS
An introduction to LSCITSAn introduction to LSCITS
An introduction to LSCITS
 
Internet worm-case-study
Internet worm-case-studyInternet worm-case-study
Internet worm-case-study
 
Designing software for a million users
Designing software for a million usersDesigning software for a million users
Designing software for a million users
 
Security case buffer overflow
Security case buffer overflowSecurity case buffer overflow
Security case buffer overflow
 
CS5032 Case study Ariane 5 launcher failure
CS5032 Case study Ariane 5 launcher failureCS5032 Case study Ariane 5 launcher failure
CS5032 Case study Ariane 5 launcher failure
 
CS5032 Case study Kegworth air disaster
CS5032 Case study Kegworth air disasterCS5032 Case study Kegworth air disaster
CS5032 Case study Kegworth air disaster
 
CS5032 L19 cybersecurity 1
CS5032 L19 cybersecurity 1CS5032 L19 cybersecurity 1
CS5032 L19 cybersecurity 1
 
CS5032 L20 cybersecurity 2
CS5032 L20 cybersecurity 2CS5032 L20 cybersecurity 2
CS5032 L20 cybersecurity 2
 
L17 CS5032 critical infrastructure
L17 CS5032 critical infrastructureL17 CS5032 critical infrastructure
L17 CS5032 critical infrastructure
 
CS5032 Case study Maroochy water breach
CS5032 Case study Maroochy water breachCS5032 Case study Maroochy water breach
CS5032 Case study Maroochy water breach
 
CS 5032 L18 Critical infrastructure 2: SCADA systems
CS 5032 L18 Critical infrastructure 2: SCADA systemsCS 5032 L18 Critical infrastructure 2: SCADA systems
CS 5032 L18 Critical infrastructure 2: SCADA systems
 
CS5032 L9 security engineering 1 2013
CS5032 L9 security engineering 1 2013CS5032 L9 security engineering 1 2013
CS5032 L9 security engineering 1 2013
 
CS5032 L10 security engineering 2 2013
CS5032 L10 security engineering 2 2013CS5032 L10 security engineering 2 2013
CS5032 L10 security engineering 2 2013
 
CS5032 L11 validation and reliability testing 2013
CS5032 L11 validation and reliability testing 2013CS5032 L11 validation and reliability testing 2013
CS5032 L11 validation and reliability testing 2013
 

Recently uploaded

FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
ThomasParaiso2
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 

Recently uploaded (20)

FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 

Organizational Failure (LSCITS EngD 2012)

  • 1. Organisational Failure Prof Ian Sommerville Video link Organisational Failure, York EngD Course in LSCITS, 2012 Slide 1
  • 2. Organisational failure • Why and how organisational factors can contribute to system failures Organisational Failure, York EngD Course in LSCITS, 2012 Slide 2
  • 3. Why organisations matter? • Organisations have multiple, inter-related, potentially conflicting goals: – Efficient resource utilisation – Timely delivery of products/services – Customer satisfaction – Owner satisfaction – Regulatory compliance – Safety and dependability – Maintenance of reputation/brand – Future development Organisational Failure, York EngD Course in LSCITS, 2012 Slide 3
  • 4. Decision making • Organisational decision making involves taking all of these into account – Inevitably, this sometimes means making compromises that affect the safety and dependability of a system • These compromises lead to vulnerabilities and hazards that may then compromise the safety or dependability of the system • In complex organisations, there are competing priorities in different parts of the organisation – Shifting power and authority in an organisation affects decision making – May be deliberate lack of communications across the organisation Organisational Failure, York EngD Course in LSCITS, 2012 Slide 4
  • 5. NASA Challenger disaster • Space shuttle exploded shortly after take-off • The cause was the failure of rubber seals (O-rings) that allowed hot gas to escape and make contact with fuel tanks which then exploded • Subsequent enquiry showed that O-ring failure was due to brittleness at low temperatures • Arguably, decision makers were complacent because – Redundant (primary and secondary) O-rings in the system – Damage to primary O-rings had been tolerated in previous launches Organisational Failure, York EngD Course in LSCITS, 2012 Slide 5
  • 6. Organisational failure • Engineers were concerned about launching in low temperatures and advised against launch • But goals other than safety and dependability took precedence and engineers were overruled – „Owner‟ satisfaction • already several delays to flight – Future planning • NASA wanted a success to support budget negotiations – Resource utilisation • Reluctance to address known problem with O-rings because of costs Organisational Failure, York EngD Course in LSCITS, 2012 Slide 6
  • 7. Normal accidents • Developed by Charles Perrow who conducted a study of a nuclear accident in the USA (Three Mile Island) • Official conclusion was that the problems were due to “human error” • Perrow disagreed with this and argues that failures are „normal‟ and inevitable in complex systems which have: – Interactive complexity • The presence of unfamiliar, unplanned and unexpected sequences of events in a system that are not visible or immediately comprehensible – Tight coupling • The presence of interdependent components. • Tight coupling will make a system more prone to cascading errors. Organisational Failure, York EngD Course in LSCITS, 2012 Slide 7
  • 8. Organisational Failure, York EngD Course in LSCITS, 2012 Slide 8
  • 9. Redundancy • The use of redundancy is a fundamental technique in achieving system safety – Primary and secondary O-rings on space shuttle – Quintuple redundancy in Airbus FCS • Failure of primary system can be tolerated • Perrow argues that redundancy can decrease rather than increase safety: – Increases complexity and coupling in the system – Provides reassurance that system faults can be tolerated Organisational Failure, York EngD Course in LSCITS, 2012 Slide 9
  • 10. Failures or successes • Normal accident theory is based on extensive studies of system failures • It argues that failure is systemic and an inherent characteristic of the system itself • Alternative perspective is based on studies of success – Why are there some areas that are apparently complex (e.g. air traffic management) where failures are relatively uncommon? • Led to the notion of high-reliability organisations Organisational Failure, York EngD Course in LSCITS, 2012 Slide 10
  • 11. Failure-free organisations? • High-reliability organisation (HRO) researchers disagree that complex, highly interdependent systems will inevitably have accidents – They believe organisations are able to compensate for technical shortcomings through their methods of operation, in essence they argue that organisations can be ‘failure free’. • Based on studies of „reliable‟ organisations – Aircraft carriers – Air traffic control – Nuclear power stations – Intensive care units Organisational Failure, York EngD Course in LSCITS, 2012 Slide 11
  • 12. Aircraft carrier flight operations Organisational Failure, York EngD Course in LSCITS, 2012 Slide 12
  • 13. Nuclear powered carriers • Complex systems – Carriers are 24 stories high and carry enough fuel for 15 years. 2000 telephones. 3,360 compartments and spaces – Multiple software intensive systems (command systems, aircraft software) – Dangerous objects (aircraft, fuel, and explosives) in close proximity. – Aircraft taking off and landing in 48-60 second intervals. – 6000 crew. Several different kinds of aircraft, multiple squadrons. – All work interdependently and must be coordinated. Organisational Failure, York EngD Course in LSCITS, 2012 Slide 13
  • 14. Nuclear powered carriers • High risk – Nuclear reactor accidents – Fire, flooding, grounding, collision – Fuel and weapons explosions – Mistaken identification of friends and foes – High risks both to crew and a much larger public • High reliability – Low “crunch rates” – comparatively few major accidents • High reliability achieved through organisational design Organisational Failure, York EngD Course in LSCITS, 2012 Slide 14
  • 15. High Reliability Organisations • High Reliability Organisations (HROs) have particular qualities – Reliability takes precedence over efficiency – Preoccupation with failure, not success – Share the big picture – Focus on details – Migrate decisions Organisational Failure, York EngD Course in LSCITS, 2012 Slide 15
  • 16. Reliability over Efficiency – Reliability comes before efficiency but cannot replace it – Decisions are made on the grounds of reliability first and then efficiency – Efficiency initiatives are treated with scepticism – Managers regularly talk to and familiarise themselves with staff about how they do their work and why. This stops managers focusing just on figures. – Organisations develop safety measures as well as financial measures, and include these in employee evaluations – Organisations assign value to the avoidance of accidents – High redundancy despite cost – Cautious actions when necessary despite cost Organisational Failure, York EngD Course in LSCITS, 2012 Slide 16
  • 17. Preoccupation with Failure • HROs recognise that: – Workers need to be heedful to the possibility of failure – Failures are normal but accidents should be avoided – Acknowledge there can be unexpected failure modes, even in common activities • HROs address failure by: – Constant training of all people (simulations, apprenticing, practice) – Using incident reporting – Designing in extensive redundancy – Maintaining contingencies for critical operations – Requiring proofs that something is safe, not that it is unsafe Organisational Failure, York EngD Course in LSCITS, 2012 Slide 17
  • 18. Carrier operations – There is constant tracking of issues around malfunctioning, defective and substandard equipment. • They act on these by training crew how to overcome problems and pressuring vendors to make improvements – Extensive redundancy (overlapping jobs, multiple channels and centres of communications, spare parts, multiple sources for decision making). • Example: if an aircrafts landing gear warning light comes on, the spotter, commander and pilot all work together to establish what the issues is. – Multiple contingencies are maintained • Example: There will always be multiple options for how to land the plane (or for the pilot to escape). Organisational Failure, York EngD Course in LSCITS, 2012 Slide 18
  • 19. Sharing the Big Picture • HROs recognise that: – If people are narrowly focused they will act only in their own interest – People need to maintain awareness of other people and events around the organisation • HROs – Train people broadly – Educate people about overarching objectives, and set statements of purpose – Give people access to information on what is happening elsewhere – Clearly specify how people and teams fit into the whole Organisational Failure, York EngD Course in LSCITS, 2012 Slide 19
  • 20. Reluctance to Simplify • HROS are reluctant to simplify • All organisations have to simplify and abstract, to filter out unnecessary information (particularly for getting “big pictures”) • Rather, HROs – Use labels and categories as little as possible as they stop you from looking further into details and events. – Continually rework labels and categories – Listen to wisdom, but with skepticism – Do not focus on information that supports expectations, but focus on that which doesn‟t fit or disconfirms desires Organisational Failure, York EngD Course in LSCITS, 2012 Slide 20
  • 21. Migration of decision making • HROs migrate decision making as far down the organisation as possible – Decisions are not made by one central authority • HROs recognise: – Decisions need to be made where there is expertise – Decisions often need to be made quickly – People must be trained in making decisions and are given the right resources to do so – Skill levels and legitimacy through the organisation and people are trusted Organisational Failure, York EngD Course in LSCITS, 2012 Slide 21
  • 22. HROs and Normal Accidents • HRO theory is sometimes presented as conflicting with Normal Accidents – HRO proponents may argue that accidents are not „normal‟ – Leveson critiques work on HROs and argues that they are not based on concerns of tightly coupled systems • Arguably, an HRO is an organisation that has taken active steps to: – reduce coupling and – reduce interactions – Once that has been achieved, the driver for HRO‟s is perhaps a strong „safety culture‟ to promote safety across the organisation Organisational Failure, York EngD Course in LSCITS, 2012 Slide 22
  • 23. Organisational vulnerabilities • Organisational vulnerabilities are characteristics of an organisation that weaken defensive layers and so may lead to system failure. • Examples of organisational vulnerabilities – Over-reliance on process to achieve safety/dependability – Responsibility failures – Weak safety/dependability culture – Under-resourcing of safety Organisational Failure, York EngD Course in LSCITS, 2012 Slide 23
  • 24. Over-reliance on process • Quality standards such as ISO 9000 place great emphasis on process and process assurance – Implication of these standards is that process is paramount • This tends to promote a belief that focusing on process is the way to achieve safety and dependability • However, processes are never isolated and have to be enacted in a dynamic context • Sometimes necessary to deviate from the „normal‟ process to achieve safety and dependability Organisational Failure, York EngD Course in LSCITS, 2012 Slide 24
  • 25. Responsibility failures • System failures are often a consequence of responsibility failures – Unassigned responsibility – Misassigned responsibility – Misunderstood responsibility – Duplicated responsibilities – Responsibility overload – Responsibility fragility • Responsibility failures may be a consequence of poor communications and/or under-resourcing Organisational Failure, York EngD Course in LSCITS, 2012 Slide 25
  • 26. Organisational culture • “The way that we do things around here” • Culture may conflict with public statements of priorities – “The patient comes first” – “Safety is our goal” • Investment banking – High risk, high reward – Lack of regulation or weak compliance with regulations – Large-scale failures Organisational Failure, York EngD Course in LSCITS, 2012 Slide 26
  • 27. Safety culture • Some organisations have developed a strong safety culture where safety is seen as a priority by all members of the organisation • Safety culture (UK HSE) – “The product of individual and group values, attitudes, perceptions, competencies, and patterns of behaviour that determine the commitment to, and the style and proficiency of, an organization‟s health and safety management” Organisational Failure, York EngD Course in LSCITS, 2012 Slide 27
  • 28. Safety culture (Reason) Organisational Failure, York EngD Course in LSCITS, 2012 Slide 28
  • 29. Safety maturity Organisational Failure, York EngD Course in LSCITS, 2012 Slide 29
  • 30. Under-resourcing • If operations are under-resourced then safety and dependability are often sacrificed • Organisational priorities focus on optimising resource utilisation to continue service delivery – Safety and dependability may be seen as an avoidable overhead • Example – Cleaning services in hospital outsourced to save money – Competitive tender – Under-resourced so quality of service reduced • Consequent increase in hospital acquired infections Organisational Failure, York EngD Course in LSCITS, 2012 Slide 30
  • 31. Complex systems • Complexity = Coupling + Interaction • Lesson for LSCITS – Increasing complexity will lead to unpredictable system failure – Strive to build LSITS rather than LSCITS • Improve safety by – Reducing coupling – Reducing interactions – Redundancy may not improve safety as it increases complexity in the system • Address problems at organisational as well as the system level Organisational Failure, York EngD Course in LSCITS, 2012 Slide 31
  • 32. Key points • Organisational decisions, influenced by structure and culture, often have a major impact on safety and dependability • Normal Accident Theory postulates that accidents are inevitable in complex, tightly coupled systems • High-reliability organisations aim to achieve safety through a set of practices that aim to reduce failures • Organisational vulnerabilities include over-reliance on process, responsibility failures, poor safety culture and under-resourcing Organisational Failure, York EngD Course in LSCITS, 2012 Slide 32