SlideShare a Scribd company logo
INFRASTRUCTURE
RELIABILITY AND
RISK
ASSESSMENTS
        Steven Shapiro, P.E., ATD
        Mission Critical Practice Lead
        Morrison Hershfield
        Mission Critical



                  Morrison Hershfield Mission Critical
WHAT YOU NEED TO KNOW
AGENDA


• RISK ASSESSMENT

• INFRASTRUCTURE RELIABILITY
                 COOLING                          POWER




          Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RISK ASSESSMENTS



• WHY

• SITE EVALUATION

• METRICS



             Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Causes of Critical Failures

  •    Location
  •    Design
  •    Redundancy level
  •    Construction
  •    Quality of equipment
  •    Age                                   Lurking Vulnerabilities
  •    Operations & Maintenance program
  •    Personnel training
  •    Level of operator coverage
  •    Thoroughness of the commissioning program




                                       5
                            Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
 WHY
Causes of Critical Failures

• Equipment failure
• Operator error
• Natural disaster
• Design error
• Installation error
• Commissioning or test deficiency
• Maintenance oversight
• Equipment design




 WHY                      Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Causes of Critical Failures


• Root cause not always easy to ascertain
• Combination of factors (Cascading Failures)
• Latent failures
• Most occur during change of state events
• More maintenance does not necessarily mean higher availability
• Non-Fault tolerant systems




  WHY
  FILURES                  Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Causes of Critical Failures
                                     Commissioning or
                                      Test Deficiency
                                            4%

                 System Design                               Equipment
  Natural Disaster    20%                                      Design
        3%                                                      13%
   Maintenance
    Oversight
       4%
                                                                         Equipment Failure
                                                                               28%
    Installation Error
           10%           Human Error
                            18%




 WHY                             Morrison Hershfield Mission Critical – Infrastructure and Risk Assessment
WHY DO RISK ASSESSMENT

• Alignment of business mission and facility performance expectation

• Quantifies the risk and exposure of the critical facilities to failure

• Identifies vulnerabilities and single points of failure

• First step in creating an action plan for site hardening

• Benchmark against the industry

• Assists in developing business case for capital expenditures




 RISK ASSESSMENT              Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
SITE EVALUATION

STEP 1

• Quantify reliability expectations
• Develop resiliency metrics




 RISK ASSESSMENT      Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
SITE EVALUATION

STEP 2
   •   Develop PRA model     (Probabilistic Risk Assessment)




   •   Identify Single Points of Failure within critical systems
   •   Evaluate redundancy of critical systems
   •   Capacity and expendability analysis
   •   Adequacy of Engineered Systems
   •   Operation and maintenance policies, practices and procedures
   •   Adequacy of maintenance and testing programs
   •   Evaluate risks associated with site location
   •   Overall Risk Analysis
   •   Evaluate the adequacy of operations and maintenance programs


 RISK ASSESSMENT            Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
SITE EVALUATION


STEP 2 cont.
• Harmonics analysis
• EMF studies
• Short circuit & coordination studies
• Air flow modeling-CFD




 RISK ASSESSMENT            Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
SITE EVALUATION

STEP 3
   • Perform gap analysis
STEP 4
   • Recommendations for upgrade/alteration to optimize facility
      performance
   • Budget and schedule development
   • Assess risk during implementation
   • Benchmark findings with industry standards




 RISK ASSESSMENT         Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RISK ASSESSMENT METRICS

   • Probability of Failure/Reliability
   • Availability
   • MTTF
   • MTTR
   • Susceptibility to natural disasters
   • Fault tolerance
   • Single Points of Failure
   • Maintainability
   • Operational readiness
   • Maintenance program

 RISK ASSESSMENT            Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
INFRASTRUCTURE RELIABILITY



 • RELIABILITY / AVAILABLITY

 • RELIABILITY MODELING

 • RELIABILITY CONSIDERATIONS




 RELIABILITY    Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RELIABILITY


• “Reliability” is used as an umbrella definition

• May Refer to Availability, Durability, Quality

• Five 9’s ????

• Reliability = Probability of Successful Operation




 RELIABILITY                Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RELIABILITY AND AVAILABILITY



•     Reliability predicts how likely is the system to fail.

•     Availability is a measure (or a future prediction) of what percentage
      of the time the system will operating properly




    RELIABILITY                 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
AVAILABILITY

Five 9’s refers to Availability

Availability (A) = Average fraction of time Something is in service
and performing intended function.

99.999% availability means:
    • 5.3 minutes of downtime each year
                       or
    • 1.77 hours of downtime every 20 years

Availability does not specify how often an outage occurs



 RELIABILITY                  Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
AVAILABILITY


Availability (A) = MTBF/(MTBF + MTTR)

  MTTF: Mean Time To Failure
  MTBF: Mean Time Between Failures
  MTTR: Mean Time to Repair or Downtime
  MTBF=MTTF+MTTR




 RELIABILITY            Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RELIABILITY BATHTUB CURVE

      Failure Rate




                     early                                                                   wear-out
                     life                        useful life                                 period

                             0.5
                                       Time (t) Years YEARS                       12 14

 RELIABILITY                       Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RELIABILITY MODELING


•      Used to compare system designs and assist in the evaluation of
       risk versus the cost to mitigate the risk.

•      Failure and Repair data comes from IEEE 493, Recommended
       Practice for Design of Reliable Industrial and Commercial Power
       Systems (IEEE Gold Book)




    RELIABILITY              Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RELIABILITY MODELING

Components used for reliability modeling of the electrical system shown
here:

•   Utility power
•   Generator
•   Circuit breakers
•   Switchboards
•   Cables
•   Automatic Transfer Switch
•   UPS module
•   Battery
•   Static Bypass Switch
•   Rack Power



 RELIABILITY              Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RELIABILITY MODELING




                                         Reliability Block 
                                         Diagram (RBD)


 RELIABILITY   Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RELIABILITY MODELING

Shown below are the results of the calculations




                         Hours         Hours




 RELIABILITY              Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
THE TRADITIONAL CLASSIFICATION SYSTEM
           The Uptime Institute
Tier 1 – Basic Non-Redundant Data Center
         Single path for power and cooling distribution without redundant
         components

Tier 2 – Basic Redundant Data Center
         Single path for power and cooling distribution with redundant
         components

Tier 3 – Concurrently Maintainable Data Center
         Multiple paths for power and cooling distribution with only one path
         active and with redundant components

Tier 4 – Fault Tolerant Data Center
         Multiple active power and cooling distribution paths with redundant
         components and fault tolerant


RELIABILITY                   Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Tier Definitions


                             TIER REQUIREMENTS
                                  Tier I Tier II                Tier III    Tier IV
                                                               1 Active
Number of Delivery Paths             1              1                      2 Active
                                                              1 Passive
Redundancy                         N             N+1             N+1     2N Minimum
Compartmentalization               No            No               No          Yes
Concurrent Maintainability         No            No              Yes          Yes
Fault Tolerance                    No            No               No          Yes
Availability                     99.67          99.75          99.982       99.95
Downtime in Hr/Yr                 28.8           22              1.6          0.4




  RELIABILITY                 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Data Center Cost

From the UI

• Tier I - $10,000 US/kW of Useable UPS Power Output

• Tier II - $11,000 US/kW of Useable UPS Power Output

• Tier III - $20,000 US/kW of Useable UPS Power Output

• Tier IV - $22,000 US/kW of Useable UPS Power Output

• Plus $225 US/SF of Computer Room




 RELIABILITY                Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
HOW MUCH REDUNDANCY IS ENOUGH?




RELIABILITY   Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Reliability Considerations

Assumptions

• Various configurations examined for single or dual utility feeders, UPS,
  Generators, STS’s, single or dual cords

• Compare Reliability at 2000 KW and 4000 KW Load

• 5 Year Probability of Failure




 RELIABILITY                  Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Single utility feeder, parallel redundant UPS and
     generators, single cord IT equipment
2N UPS, N+1 Generators, ASTSs, Dual Cord Rack
Two Utility Feeders, 2(N+1) UPS, 2(N+1) Generators,
               ASTSs, Dual Cord Rack
Distributed Redundant UPS, N+2 Generators, Two
   Utility Feeders, ASTSs and Dual Cord Rack
Reliability Considerations




RELIABILITY         Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Reliability Considerations




 RELIABILITY        Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Reliability Considerations




 RELIABILITY        Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Reliability Considerations




 RELIABILITY        Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Reliability Considerations




 RELIABILITY        Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Reliability Considerations
Emergency Diesel Generators

                                       fail to start


                                       fail after ½ hour



                                        fail after 8 hours



                                        fail after 24 hours


Study Performed by Idaho National Engineering Laboratory – February 1996 at Nuclear Power Plants



  RELIABILITY                                  Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Reliability Considerations


• 2(N+1) UPS/Generator with dual utility feeders - most reliable
  topology
• 2(N+1) UPS > 2N UPS by small margin
• 2N > Distributed Redundant by small margin
• Significant improvement if a second utility feeder
  is provided
• N+2 and/or 2N generator systems are more reliable than N+1
• Hybrid configuration in a hybrid facility is sometimes the best solution




 RELIABILITY               Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Reliability Considerations


•   Assess the condition of the mechanical plant in conjunction with the
    electrical system
•   The facility reliability will be driven by the least reliable component
    (typically the electrical infrastructure)




 RELIABILITY                Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
System Reliability Block




                Electrical System                                      Electrical          Mechanical




         Electrical systempow    ering the                          Mechanical systemsupporting critical
                   critical load                                                    load




 RELIABILITY                                 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
System Reliability Block
                                                  MTBF                 Availability              Pf (3 years)
Electrical system
alone                                           330,184                  0.99999                       8.10%
Mechanical system
alone                                           178,611                 0.999943                       11.70%
Electrical system
supporting mechanical                           108,500                 0.999985                       21.40%
Overall mechanical
system                                           70,087                 0.999931                       29.20%
Combined electrical
mechanical system                                57,819                 0.999922                       36.90%


                  Electrical System                               Electrical           Mechanical




            Electrical system powering the                     Mechanical system supporting critical
                      critical load                                            load


  RELIABILITY                                Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
The Cost of Reliability
 Reliability


 99.9999

  99.999


  99.99

  99.9


  99.0

  .9
               $   $$   $$$      $$$$      $$$$$

 RELIABILITY             Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Key Takeaways – Risk Assessment

 • What Reliability Level Do you Really Need Based on Your Business
   Case?

 • Minimize Single Points of Failure

 • Concurrent Maintainability?

 • Fault Tolerance?

 • Ensure Adequacy of Operations, Maintenance and Testing Programs

 • How to justify the cost to upgrade from present state?




 RISK ASSESSMENT             Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Key Takeaways – Reliability

•    Design objective – find optimum compromise between cost and reliability
•    Size matters – larger facilities yield lower reliability
•    System architecture and design implementation is more important role
     than equipment selection
•    Segregate system in independent blocks
•    Eliminate common source components to minimize fault propagation (i.e.
     LBS, hot-tie, manual bus ties)
•    Move single points of failures as close to the load as possible
•    Always maintain two independent sources of power to the critical load
•    Optimize the design of monitoring and controls circuits
•    Keep it simple/minimize human intervention/Utilize Automation


    RELIABILITY                    Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Thank you and please feel
QUESTIONS?                                            free to contact me

Steven Shapiro, PE, ATD
SShapiro@MorrisonHershfield.com
914.420.3213
http://www.linkedin.com/in/stevenshapirope
References:
Uptime Institute White Papers:
Tier Myths and Misconceptions
Data Center Site Infrastructure Tier Standard: Topology
Building Areas/Systems Reviewed

‫׀‬   General Construction
‫׀‬   Electrical
‫׀‬   Mechanical
‫׀‬   Plumbing And Fire Protection
‫׀‬   Operation and Maintenance
‫׀‬   Security 
‫׀‬   Load Density

                                 48
                      Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
    RISK ASSESSMENT
Site Reliability
•   Is Project Compatible With Zoning
•   Natural Environment Issues
‫׀‬   Seismic Zone
‫׀‬   Geo Technical Reports
‫׀‬   Sub Surface Conditions
‫׀‬   Tornado/hurricane Risk
‫׀‬   Site Flood Potential
‫׀‬   Fire Potential
‫׀‬   Site Topography
‫׀‬   Weather Extremes
•   Man‐Made Environment Issues
‫׀‬   Power/Data and Communication/Water Supply/Sanitary Sewer Availability
‫׀‬   ISP Connectivity to Mirror and DR Sites
‫׀‬   Proximity of Hazardous Operational Facilities, i.e. Nuclear Power Plants, Military Bases, 
    Chemical Plants, Tank Farms, Water/Sewage Treatment Plants, Dams/Reservoirs, Gas 
    Stations, etc.
‫׀‬   Distance to Airports & Freeways
‫׀‬   Distance to Emergency Services, i.e. Fire and Police Departments, Hospital 


                                                49
                                     Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
    RISK ASSESSMENT
Building Areas/Systems Reviewed
Building Utilities and Physical Issues
‫ ׀‬General building systems and area characteristics
‫ ׀‬Life safety and environmental
Electrical Systems
‫ ׀‬Utility feeders
‫ ׀‬Service entry
‫ ׀‬Base building electrical distribution system including busways, step‐down 
    transformers, switchgear and distribution panels
‫׀‬   Uninterruptible power supply (UPS) systems
‫׀‬   Battery systems
‫׀‬   Power Distribution System including the critical computer rooms
‫׀‬   Emergency/standby generator and fuel system
‫׀‬   Normal/standby power transfer switchgear
‫׀‬   Grounding
‫׀‬   Emergency Power Off Systems
‫׀‬   Lightning protection system
‫׀‬   Fire alarm and smoke detection systems


                                            50
                                 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
    RISK ASSESSMENT
Building Areas/Systems Reviewed
•   Mechanical Systems
‫׀‬   Critical Systems Chilled Water Plant:  Chillers, pumps, piping distribution system, 
    controls, etc
‫׀‬   Critical Systems Condenser Water System:  Cooling towers, pumps, piping, etc
‫׀‬   Critical Systems Air Handling Systems
‫׀‬   Critical Systems Air Distribution
‫׀‬   Critical Systems Secondary Chilled Water Loop
‫׀‬   Fuel Oil Systems
‫׀‬   Boiler Systems
‫׀‬   Compressed Air Systems
•   Plumbing Systems
‫׀‬   Domestic Water Systems
‫׀‬   Natural Gas Systems
‫׀‬   Fire Suppression Systems (Water and Gaseous)
•   Operation and Maintenance of the Critical Support Systems
‫׀‬   Maintenance procedures and programs
‫׀‬   Normal operating procedures
‫׀‬   Emergency operating procedures
‫׀‬   Training programs and methods
‫׀‬   Spare parts



                                                51
                                     Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
    RISK ASSESSMENT
Building Areas/Systems Reviewed
•   Building Automation
‫׀‬   Building Automation Systems.
‫׀‬   Physical Security Systems.
‫׀‬   Access control
‫׀‬   Intrusion detection
‫׀‬   CCTV systems
‫׀‬   ID badging systems
‫׀‬   Intercom systems
‫׀‬   Smoke Purge Systems
•   Technology Systems
‫׀‬   Entrance Facility Feeds.
‫׀‬   Telephone Company Services.
•   Systems Integration:
‫׀‬   The integration, compatibility and interaction of the above systems with each 
    other, as well as with the other building elements will be reviewed to ensure that 
    the systems are compatible and fully integrated.
                                              52
                                   Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
    RISK ASSESSMENT

More Related Content

What's hot

Introduction to it auditing
Introduction to it auditingIntroduction to it auditing
Introduction to it auditing
Damilola Mosaku
 
Privacy Trends: Key practical steps on ISO/IEC 27701:2019 implementation
Privacy Trends: Key practical steps on ISO/IEC 27701:2019 implementationPrivacy Trends: Key practical steps on ISO/IEC 27701:2019 implementation
Privacy Trends: Key practical steps on ISO/IEC 27701:2019 implementation
PECB
 
Assess Your Business Continuity Management Process
Assess Your Business Continuity Management ProcessAssess Your Business Continuity Management Process
Assess Your Business Continuity Management Process
Anand Subramaniam
 
Security audit
Security auditSecurity audit
Security audit
Rosaria Dee
 
Information Security and the SDLC
Information Security and the SDLCInformation Security and the SDLC
Business Continuity And Disaster Recovery Notes
Business Continuity And Disaster Recovery NotesBusiness Continuity And Disaster Recovery Notes
Business Continuity And Disaster Recovery Notes
Alan McSweeney
 
Information security
Information securityInformation security
Information security
avinashbalakrishnan2
 
CISSP Chapter 1 BCP
CISSP Chapter 1 BCPCISSP Chapter 1 BCP
CISSP Chapter 1 BCP
Karthikeyan Dhayalan
 
Governance, Risk & Compliance Management Solution
Governance, Risk & Compliance Management SolutionGovernance, Risk & Compliance Management Solution
Governance, Risk & Compliance Management Solution
Rishabh Software
 
Business Impact Analysis
Business Impact AnalysisBusiness Impact Analysis
Business Impact Analysis
dlfrench
 
IT Audit methodologies
IT Audit methodologiesIT Audit methodologies
IT Audit methodologies
genetics
 
Business Continuity Management
Business Continuity ManagementBusiness Continuity Management
Business Continuity Management
ECC International
 
The Security Vulnerability Assessment Process & Best Practices
The Security Vulnerability Assessment Process & Best PracticesThe Security Vulnerability Assessment Process & Best Practices
The Security Vulnerability Assessment Process & Best Practices
Kellep Charles
 
Business Continuity Planning
Business Continuity PlanningBusiness Continuity Planning
Business Continuity Planning
alanlund
 
Disaster Recovery Plan for IT
Disaster Recovery Plan for ITDisaster Recovery Plan for IT
Disaster Recovery Plan for IT
hhuihhui
 
How to Effectively Audit your IT Infrastructure
How to Effectively Audit your IT InfrastructureHow to Effectively Audit your IT Infrastructure
How to Effectively Audit your IT Infrastructure
Netwrix Corporation
 
Information security
Information securityInformation security
Information security
LJ PROJECTS
 
BUSINESS-CONTINUITY-AND-DISASTER-RECOVERY.pptx
BUSINESS-CONTINUITY-AND-DISASTER-RECOVERY.pptxBUSINESS-CONTINUITY-AND-DISASTER-RECOVERY.pptx
BUSINESS-CONTINUITY-AND-DISASTER-RECOVERY.pptx
JayLloyd8
 
Presentation on iso 27001-2013, Internal Auditing and BCM
Presentation on iso 27001-2013, Internal Auditing and BCMPresentation on iso 27001-2013, Internal Auditing and BCM
Presentation on iso 27001-2013, Internal Auditing and BCM
Shantanu Rai
 
How to Steer Cyber Security with Only One KPI: The Cyber Risk Resilience
How to Steer Cyber Security with Only One KPI: The Cyber Risk ResilienceHow to Steer Cyber Security with Only One KPI: The Cyber Risk Resilience
How to Steer Cyber Security with Only One KPI: The Cyber Risk Resilience
Priyanka Aash
 

What's hot (20)

Introduction to it auditing
Introduction to it auditingIntroduction to it auditing
Introduction to it auditing
 
Privacy Trends: Key practical steps on ISO/IEC 27701:2019 implementation
Privacy Trends: Key practical steps on ISO/IEC 27701:2019 implementationPrivacy Trends: Key practical steps on ISO/IEC 27701:2019 implementation
Privacy Trends: Key practical steps on ISO/IEC 27701:2019 implementation
 
Assess Your Business Continuity Management Process
Assess Your Business Continuity Management ProcessAssess Your Business Continuity Management Process
Assess Your Business Continuity Management Process
 
Security audit
Security auditSecurity audit
Security audit
 
Information Security and the SDLC
Information Security and the SDLCInformation Security and the SDLC
Information Security and the SDLC
 
Business Continuity And Disaster Recovery Notes
Business Continuity And Disaster Recovery NotesBusiness Continuity And Disaster Recovery Notes
Business Continuity And Disaster Recovery Notes
 
Information security
Information securityInformation security
Information security
 
CISSP Chapter 1 BCP
CISSP Chapter 1 BCPCISSP Chapter 1 BCP
CISSP Chapter 1 BCP
 
Governance, Risk & Compliance Management Solution
Governance, Risk & Compliance Management SolutionGovernance, Risk & Compliance Management Solution
Governance, Risk & Compliance Management Solution
 
Business Impact Analysis
Business Impact AnalysisBusiness Impact Analysis
Business Impact Analysis
 
IT Audit methodologies
IT Audit methodologiesIT Audit methodologies
IT Audit methodologies
 
Business Continuity Management
Business Continuity ManagementBusiness Continuity Management
Business Continuity Management
 
The Security Vulnerability Assessment Process & Best Practices
The Security Vulnerability Assessment Process & Best PracticesThe Security Vulnerability Assessment Process & Best Practices
The Security Vulnerability Assessment Process & Best Practices
 
Business Continuity Planning
Business Continuity PlanningBusiness Continuity Planning
Business Continuity Planning
 
Disaster Recovery Plan for IT
Disaster Recovery Plan for ITDisaster Recovery Plan for IT
Disaster Recovery Plan for IT
 
How to Effectively Audit your IT Infrastructure
How to Effectively Audit your IT InfrastructureHow to Effectively Audit your IT Infrastructure
How to Effectively Audit your IT Infrastructure
 
Information security
Information securityInformation security
Information security
 
BUSINESS-CONTINUITY-AND-DISASTER-RECOVERY.pptx
BUSINESS-CONTINUITY-AND-DISASTER-RECOVERY.pptxBUSINESS-CONTINUITY-AND-DISASTER-RECOVERY.pptx
BUSINESS-CONTINUITY-AND-DISASTER-RECOVERY.pptx
 
Presentation on iso 27001-2013, Internal Auditing and BCM
Presentation on iso 27001-2013, Internal Auditing and BCMPresentation on iso 27001-2013, Internal Auditing and BCM
Presentation on iso 27001-2013, Internal Auditing and BCM
 
How to Steer Cyber Security with Only One KPI: The Cyber Risk Resilience
How to Steer Cyber Security with Only One KPI: The Cyber Risk ResilienceHow to Steer Cyber Security with Only One KPI: The Cyber Risk Resilience
How to Steer Cyber Security with Only One KPI: The Cyber Risk Resilience
 

Similar to Risk Assessments and Reliability, What You Need To Know

Reliability Maintenance Engineering 2 - 2 Reliability Techniques
Reliability Maintenance Engineering 2 - 2 Reliability TechniquesReliability Maintenance Engineering 2 - 2 Reliability Techniques
Reliability Maintenance Engineering 2 - 2 Reliability Techniques
Accendo Reliability
 
BAHAN PRESENTASI RCM.pptx
BAHAN PRESENTASI RCM.pptxBAHAN PRESENTASI RCM.pptx
BAHAN PRESENTASI RCM.pptx
QienKing
 
Reducing Product Development Risk with Reliability Engineering Methods
Reducing Product Development Risk with Reliability Engineering MethodsReducing Product Development Risk with Reliability Engineering Methods
Reducing Product Development Risk with Reliability Engineering Methods
Wilde Analysis Ltd.
 
PyBay 2018: Production-Ready Python Applications
PyBay 2018: Production-Ready Python ApplicationsPyBay 2018: Production-Ready Python Applications
PyBay 2018: Production-Ready Python Applications
Michael Kehoe
 
Failure Mode and Effect Analysis
Failure Mode and Effect AnalysisFailure Mode and Effect Analysis
Failure Mode and Effect Analysis
tulasiva
 
Operational Excellence in Oil and Gas Loss Prevention
Operational Excellence in Oil and Gas Loss PreventionOperational Excellence in Oil and Gas Loss Prevention
Operational Excellence in Oil and Gas Loss Prevention
Michael Marshall, PE
 
MERC Capabilities Briefing
MERC Capabilities BriefingMERC Capabilities Briefing
MERC Capabilities Briefing
Herb MacMillan
 
Reliability Program Approval Presentation_
Reliability Program Approval Presentation_Reliability Program Approval Presentation_
Reliability Program Approval Presentation_
Chad Broussard
 
Managing your OnStream Inspection Program and External vs Internal inspections
Managing your OnStream Inspection Program and External vs Internal inspectionsManaging your OnStream Inspection Program and External vs Internal inspections
Managing your OnStream Inspection Program and External vs Internal inspections
Edwin A Merrick
 
Maintenance types
Maintenance typesMaintenance types
Maintenance types
Motasem Ash
 
Introduction to The Augustus Group and RBI
Introduction to The Augustus Group and RBIIntroduction to The Augustus Group and RBI
Introduction to The Augustus Group and RBI
Edwin A Merrick
 
Reliability Engineering in Biomanufacturing - Presentation by Michael Andrews
Reliability Engineering in Biomanufacturing - Presentation by Michael AndrewsReliability Engineering in Biomanufacturing - Presentation by Michael Andrews
Reliability Engineering in Biomanufacturing - Presentation by Michael Andrews
WPICPE
 
Rbi
RbiRbi
Mechanical Integrity.pdf
Mechanical Integrity.pdfMechanical Integrity.pdf
Mechanical Integrity.pdf
aashir14
 
Reliability Engineering 101 : Tonex Training
Reliability Engineering 101 : Tonex TrainingReliability Engineering 101 : Tonex Training
Reliability Engineering 101 : Tonex Training
Bryan Len
 
Res Technical recruitment & training profile
Res Technical recruitment & training profile Res Technical recruitment & training profile
Res Technical recruitment & training profile
Alaa Thabet
 
Turner.john
Turner.johnTurner.john
Turner.john
NASAPMC
 
Turner.john
Turner.johnTurner.john
Turner.john
NASAPMC
 
FMEA.pptx
FMEA.pptxFMEA.pptx
FMEA.pptx
tulasiva
 
Risk leadership perspectives Risk Manager of the Year
Risk leadership perspectives Risk Manager of the YearRisk leadership perspectives Risk Manager of the Year
Risk leadership perspectives Risk Manager of the Year
Karl Davey
 

Similar to Risk Assessments and Reliability, What You Need To Know (20)

Reliability Maintenance Engineering 2 - 2 Reliability Techniques
Reliability Maintenance Engineering 2 - 2 Reliability TechniquesReliability Maintenance Engineering 2 - 2 Reliability Techniques
Reliability Maintenance Engineering 2 - 2 Reliability Techniques
 
BAHAN PRESENTASI RCM.pptx
BAHAN PRESENTASI RCM.pptxBAHAN PRESENTASI RCM.pptx
BAHAN PRESENTASI RCM.pptx
 
Reducing Product Development Risk with Reliability Engineering Methods
Reducing Product Development Risk with Reliability Engineering MethodsReducing Product Development Risk with Reliability Engineering Methods
Reducing Product Development Risk with Reliability Engineering Methods
 
PyBay 2018: Production-Ready Python Applications
PyBay 2018: Production-Ready Python ApplicationsPyBay 2018: Production-Ready Python Applications
PyBay 2018: Production-Ready Python Applications
 
Failure Mode and Effect Analysis
Failure Mode and Effect AnalysisFailure Mode and Effect Analysis
Failure Mode and Effect Analysis
 
Operational Excellence in Oil and Gas Loss Prevention
Operational Excellence in Oil and Gas Loss PreventionOperational Excellence in Oil and Gas Loss Prevention
Operational Excellence in Oil and Gas Loss Prevention
 
MERC Capabilities Briefing
MERC Capabilities BriefingMERC Capabilities Briefing
MERC Capabilities Briefing
 
Reliability Program Approval Presentation_
Reliability Program Approval Presentation_Reliability Program Approval Presentation_
Reliability Program Approval Presentation_
 
Managing your OnStream Inspection Program and External vs Internal inspections
Managing your OnStream Inspection Program and External vs Internal inspectionsManaging your OnStream Inspection Program and External vs Internal inspections
Managing your OnStream Inspection Program and External vs Internal inspections
 
Maintenance types
Maintenance typesMaintenance types
Maintenance types
 
Introduction to The Augustus Group and RBI
Introduction to The Augustus Group and RBIIntroduction to The Augustus Group and RBI
Introduction to The Augustus Group and RBI
 
Reliability Engineering in Biomanufacturing - Presentation by Michael Andrews
Reliability Engineering in Biomanufacturing - Presentation by Michael AndrewsReliability Engineering in Biomanufacturing - Presentation by Michael Andrews
Reliability Engineering in Biomanufacturing - Presentation by Michael Andrews
 
Rbi
RbiRbi
Rbi
 
Mechanical Integrity.pdf
Mechanical Integrity.pdfMechanical Integrity.pdf
Mechanical Integrity.pdf
 
Reliability Engineering 101 : Tonex Training
Reliability Engineering 101 : Tonex TrainingReliability Engineering 101 : Tonex Training
Reliability Engineering 101 : Tonex Training
 
Res Technical recruitment & training profile
Res Technical recruitment & training profile Res Technical recruitment & training profile
Res Technical recruitment & training profile
 
Turner.john
Turner.johnTurner.john
Turner.john
 
Turner.john
Turner.johnTurner.john
Turner.john
 
FMEA.pptx
FMEA.pptxFMEA.pptx
FMEA.pptx
 
Risk leadership perspectives Risk Manager of the Year
Risk leadership perspectives Risk Manager of the YearRisk leadership perspectives Risk Manager of the Year
Risk leadership perspectives Risk Manager of the Year
 

Recently uploaded

Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansBiomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Neo4j
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving
 
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectorsConnector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
DianaGray10
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
Neo4j
 
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge GraphGraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
Neo4j
 
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
saastr
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
Principle of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptxPrinciple of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptx
BibashShahi
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
Edge AI and Vision Alliance
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
c5vrf27qcz
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
DanBrown980551
 
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their MainframeDigital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Precisely
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
Apps Break Data
Apps Break DataApps Break Data
Apps Break Data
Ivo Velitchkov
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
DianaGray10
 

Recently uploaded (20)

Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansBiomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
 
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectorsConnector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
 
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge GraphGraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
 
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
Principle of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptxPrinciple of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptx
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
 
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their MainframeDigital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
Apps Break Data
Apps Break DataApps Break Data
Apps Break Data
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
 

Risk Assessments and Reliability, What You Need To Know

  • 1. INFRASTRUCTURE RELIABILITY AND RISK ASSESSMENTS Steven Shapiro, P.E., ATD Mission Critical Practice Lead Morrison Hershfield Mission Critical Morrison Hershfield Mission Critical
  • 2. WHAT YOU NEED TO KNOW AGENDA • RISK ASSESSMENT • INFRASTRUCTURE RELIABILITY COOLING POWER Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 3. RISK ASSESSMENTS • WHY • SITE EVALUATION • METRICS Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 4. Causes of Critical Failures • Location • Design • Redundancy level • Construction • Quality of equipment • Age Lurking Vulnerabilities • Operations & Maintenance program • Personnel training • Level of operator coverage • Thoroughness of the commissioning program 5 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments WHY
  • 5. Causes of Critical Failures • Equipment failure • Operator error • Natural disaster • Design error • Installation error • Commissioning or test deficiency • Maintenance oversight • Equipment design WHY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 6. Causes of Critical Failures • Root cause not always easy to ascertain • Combination of factors (Cascading Failures) • Latent failures • Most occur during change of state events • More maintenance does not necessarily mean higher availability • Non-Fault tolerant systems WHY FILURES Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 7. Causes of Critical Failures Commissioning or Test Deficiency 4% System Design Equipment Natural Disaster 20% Design 3% 13% Maintenance Oversight 4% Equipment Failure 28% Installation Error 10% Human Error 18% WHY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessment
  • 8. WHY DO RISK ASSESSMENT • Alignment of business mission and facility performance expectation • Quantifies the risk and exposure of the critical facilities to failure • Identifies vulnerabilities and single points of failure • First step in creating an action plan for site hardening • Benchmark against the industry • Assists in developing business case for capital expenditures RISK ASSESSMENT Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 9. SITE EVALUATION STEP 1 • Quantify reliability expectations • Develop resiliency metrics RISK ASSESSMENT Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 10. SITE EVALUATION STEP 2 • Develop PRA model (Probabilistic Risk Assessment) • Identify Single Points of Failure within critical systems • Evaluate redundancy of critical systems • Capacity and expendability analysis • Adequacy of Engineered Systems • Operation and maintenance policies, practices and procedures • Adequacy of maintenance and testing programs • Evaluate risks associated with site location • Overall Risk Analysis • Evaluate the adequacy of operations and maintenance programs RISK ASSESSMENT Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 11. SITE EVALUATION STEP 2 cont. • Harmonics analysis • EMF studies • Short circuit & coordination studies • Air flow modeling-CFD RISK ASSESSMENT Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 12. SITE EVALUATION STEP 3 • Perform gap analysis STEP 4 • Recommendations for upgrade/alteration to optimize facility performance • Budget and schedule development • Assess risk during implementation • Benchmark findings with industry standards RISK ASSESSMENT Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 13. RISK ASSESSMENT METRICS • Probability of Failure/Reliability • Availability • MTTF • MTTR • Susceptibility to natural disasters • Fault tolerance • Single Points of Failure • Maintainability • Operational readiness • Maintenance program RISK ASSESSMENT Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 14. INFRASTRUCTURE RELIABILITY • RELIABILITY / AVAILABLITY • RELIABILITY MODELING • RELIABILITY CONSIDERATIONS RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 15. RELIABILITY • “Reliability” is used as an umbrella definition • May Refer to Availability, Durability, Quality • Five 9’s ???? • Reliability = Probability of Successful Operation RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 16. RELIABILITY AND AVAILABILITY • Reliability predicts how likely is the system to fail. • Availability is a measure (or a future prediction) of what percentage of the time the system will operating properly RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 17. AVAILABILITY Five 9’s refers to Availability Availability (A) = Average fraction of time Something is in service and performing intended function. 99.999% availability means: • 5.3 minutes of downtime each year or • 1.77 hours of downtime every 20 years Availability does not specify how often an outage occurs RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 18. AVAILABILITY Availability (A) = MTBF/(MTBF + MTTR) MTTF: Mean Time To Failure MTBF: Mean Time Between Failures MTTR: Mean Time to Repair or Downtime MTBF=MTTF+MTTR RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 19. RELIABILITY BATHTUB CURVE Failure Rate early wear-out life useful life period 0.5 Time (t) Years YEARS 12 14 RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 20. RELIABILITY MODELING • Used to compare system designs and assist in the evaluation of risk versus the cost to mitigate the risk. • Failure and Repair data comes from IEEE 493, Recommended Practice for Design of Reliable Industrial and Commercial Power Systems (IEEE Gold Book) RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 21. RELIABILITY MODELING Components used for reliability modeling of the electrical system shown here: • Utility power • Generator • Circuit breakers • Switchboards • Cables • Automatic Transfer Switch • UPS module • Battery • Static Bypass Switch • Rack Power RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 22. RELIABILITY MODELING Reliability Block  Diagram (RBD) RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 23. RELIABILITY MODELING Shown below are the results of the calculations Hours Hours RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 24. THE TRADITIONAL CLASSIFICATION SYSTEM The Uptime Institute Tier 1 – Basic Non-Redundant Data Center Single path for power and cooling distribution without redundant components Tier 2 – Basic Redundant Data Center Single path for power and cooling distribution with redundant components Tier 3 – Concurrently Maintainable Data Center Multiple paths for power and cooling distribution with only one path active and with redundant components Tier 4 – Fault Tolerant Data Center Multiple active power and cooling distribution paths with redundant components and fault tolerant RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 25. Tier Definitions TIER REQUIREMENTS Tier I Tier II Tier III Tier IV 1 Active Number of Delivery Paths 1 1 2 Active 1 Passive Redundancy N N+1 N+1 2N Minimum Compartmentalization No No No Yes Concurrent Maintainability No No Yes Yes Fault Tolerance No No No Yes Availability 99.67 99.75 99.982 99.95 Downtime in Hr/Yr 28.8 22 1.6 0.4 RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 26. Data Center Cost From the UI • Tier I - $10,000 US/kW of Useable UPS Power Output • Tier II - $11,000 US/kW of Useable UPS Power Output • Tier III - $20,000 US/kW of Useable UPS Power Output • Tier IV - $22,000 US/kW of Useable UPS Power Output • Plus $225 US/SF of Computer Room RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 27. HOW MUCH REDUNDANCY IS ENOUGH? RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 28. Reliability Considerations Assumptions • Various configurations examined for single or dual utility feeders, UPS, Generators, STS’s, single or dual cords • Compare Reliability at 2000 KW and 4000 KW Load • 5 Year Probability of Failure RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 29. Single utility feeder, parallel redundant UPS and generators, single cord IT equipment
  • 30. 2N UPS, N+1 Generators, ASTSs, Dual Cord Rack
  • 31. Two Utility Feeders, 2(N+1) UPS, 2(N+1) Generators, ASTSs, Dual Cord Rack
  • 32. Distributed Redundant UPS, N+2 Generators, Two Utility Feeders, ASTSs and Dual Cord Rack
  • 33. Reliability Considerations RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 34. Reliability Considerations RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 35. Reliability Considerations RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 36. Reliability Considerations RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 37. Reliability Considerations RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 38. Reliability Considerations Emergency Diesel Generators fail to start fail after ½ hour fail after 8 hours fail after 24 hours Study Performed by Idaho National Engineering Laboratory – February 1996 at Nuclear Power Plants RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 39. Reliability Considerations • 2(N+1) UPS/Generator with dual utility feeders - most reliable topology • 2(N+1) UPS > 2N UPS by small margin • 2N > Distributed Redundant by small margin • Significant improvement if a second utility feeder is provided • N+2 and/or 2N generator systems are more reliable than N+1 • Hybrid configuration in a hybrid facility is sometimes the best solution RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 40. Reliability Considerations • Assess the condition of the mechanical plant in conjunction with the electrical system • The facility reliability will be driven by the least reliable component (typically the electrical infrastructure) RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 41. System Reliability Block Electrical System Electrical Mechanical Electrical systempow ering the Mechanical systemsupporting critical critical load load RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 42. System Reliability Block MTBF Availability Pf (3 years) Electrical system alone 330,184 0.99999 8.10% Mechanical system alone 178,611 0.999943 11.70% Electrical system supporting mechanical 108,500 0.999985 21.40% Overall mechanical system 70,087 0.999931 29.20% Combined electrical mechanical system 57,819 0.999922 36.90% Electrical System Electrical Mechanical Electrical system powering the Mechanical system supporting critical critical load load RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 43. The Cost of Reliability Reliability 99.9999 99.999 99.99 99.9 99.0 .9 $ $$ $$$ $$$$ $$$$$ RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 44. Key Takeaways – Risk Assessment • What Reliability Level Do you Really Need Based on Your Business Case? • Minimize Single Points of Failure • Concurrent Maintainability? • Fault Tolerance? • Ensure Adequacy of Operations, Maintenance and Testing Programs • How to justify the cost to upgrade from present state? RISK ASSESSMENT Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 45. Key Takeaways – Reliability • Design objective – find optimum compromise between cost and reliability • Size matters – larger facilities yield lower reliability • System architecture and design implementation is more important role than equipment selection • Segregate system in independent blocks • Eliminate common source components to minimize fault propagation (i.e. LBS, hot-tie, manual bus ties) • Move single points of failures as close to the load as possible • Always maintain two independent sources of power to the critical load • Optimize the design of monitoring and controls circuits • Keep it simple/minimize human intervention/Utilize Automation RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 46. Thank you and please feel QUESTIONS? free to contact me Steven Shapiro, PE, ATD SShapiro@MorrisonHershfield.com 914.420.3213 http://www.linkedin.com/in/stevenshapirope References: Uptime Institute White Papers: Tier Myths and Misconceptions Data Center Site Infrastructure Tier Standard: Topology
  • 47. Building Areas/Systems Reviewed ‫׀‬ General Construction ‫׀‬ Electrical ‫׀‬ Mechanical ‫׀‬ Plumbing And Fire Protection ‫׀‬ Operation and Maintenance ‫׀‬ Security  ‫׀‬ Load Density 48 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments RISK ASSESSMENT
  • 48. Site Reliability • Is Project Compatible With Zoning • Natural Environment Issues ‫׀‬ Seismic Zone ‫׀‬ Geo Technical Reports ‫׀‬ Sub Surface Conditions ‫׀‬ Tornado/hurricane Risk ‫׀‬ Site Flood Potential ‫׀‬ Fire Potential ‫׀‬ Site Topography ‫׀‬ Weather Extremes • Man‐Made Environment Issues ‫׀‬ Power/Data and Communication/Water Supply/Sanitary Sewer Availability ‫׀‬ ISP Connectivity to Mirror and DR Sites ‫׀‬ Proximity of Hazardous Operational Facilities, i.e. Nuclear Power Plants, Military Bases,  Chemical Plants, Tank Farms, Water/Sewage Treatment Plants, Dams/Reservoirs, Gas  Stations, etc. ‫׀‬ Distance to Airports & Freeways ‫׀‬ Distance to Emergency Services, i.e. Fire and Police Departments, Hospital  49 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments RISK ASSESSMENT
  • 49. Building Areas/Systems Reviewed Building Utilities and Physical Issues ‫ ׀‬General building systems and area characteristics ‫ ׀‬Life safety and environmental Electrical Systems ‫ ׀‬Utility feeders ‫ ׀‬Service entry ‫ ׀‬Base building electrical distribution system including busways, step‐down  transformers, switchgear and distribution panels ‫׀‬ Uninterruptible power supply (UPS) systems ‫׀‬ Battery systems ‫׀‬ Power Distribution System including the critical computer rooms ‫׀‬ Emergency/standby generator and fuel system ‫׀‬ Normal/standby power transfer switchgear ‫׀‬ Grounding ‫׀‬ Emergency Power Off Systems ‫׀‬ Lightning protection system ‫׀‬ Fire alarm and smoke detection systems 50 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments RISK ASSESSMENT
  • 50. Building Areas/Systems Reviewed • Mechanical Systems ‫׀‬ Critical Systems Chilled Water Plant:  Chillers, pumps, piping distribution system,  controls, etc ‫׀‬ Critical Systems Condenser Water System:  Cooling towers, pumps, piping, etc ‫׀‬ Critical Systems Air Handling Systems ‫׀‬ Critical Systems Air Distribution ‫׀‬ Critical Systems Secondary Chilled Water Loop ‫׀‬ Fuel Oil Systems ‫׀‬ Boiler Systems ‫׀‬ Compressed Air Systems • Plumbing Systems ‫׀‬ Domestic Water Systems ‫׀‬ Natural Gas Systems ‫׀‬ Fire Suppression Systems (Water and Gaseous) • Operation and Maintenance of the Critical Support Systems ‫׀‬ Maintenance procedures and programs ‫׀‬ Normal operating procedures ‫׀‬ Emergency operating procedures ‫׀‬ Training programs and methods ‫׀‬ Spare parts 51 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments RISK ASSESSMENT
  • 51. Building Areas/Systems Reviewed • Building Automation ‫׀‬ Building Automation Systems. ‫׀‬ Physical Security Systems. ‫׀‬ Access control ‫׀‬ Intrusion detection ‫׀‬ CCTV systems ‫׀‬ ID badging systems ‫׀‬ Intercom systems ‫׀‬ Smoke Purge Systems • Technology Systems ‫׀‬ Entrance Facility Feeds. ‫׀‬ Telephone Company Services. • Systems Integration: ‫׀‬ The integration, compatibility and interaction of the above systems with each  other, as well as with the other building elements will be reviewed to ensure that  the systems are compatible and fully integrated. 52 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments RISK ASSESSMENT