SlideShare a Scribd company logo
1 of 51
Download to read offline
INFRASTRUCTURE
RELIABILITY AND
RISK
ASSESSMENTS
        Steven Shapiro, P.E., ATD
        Mission Critical Practice Lead
        Morrison Hershfield
        Mission Critical



                  Morrison Hershfield Mission Critical
WHAT YOU NEED TO KNOW
AGENDA


• RISK ASSESSMENT

• INFRASTRUCTURE RELIABILITY
                 COOLING                          POWER




          Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RISK ASSESSMENTS



• WHY

• SITE EVALUATION

• METRICS



             Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Causes of Critical Failures

  •    Location
  •    Design
  •    Redundancy level
  •    Construction
  •    Quality of equipment
  •    Age                                   Lurking Vulnerabilities
  •    Operations & Maintenance program
  •    Personnel training
  •    Level of operator coverage
  •    Thoroughness of the commissioning program




                                       5
                            Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
 WHY
Causes of Critical Failures

• Equipment failure
• Operator error
• Natural disaster
• Design error
• Installation error
• Commissioning or test deficiency
• Maintenance oversight
• Equipment design




 WHY                      Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Causes of Critical Failures


• Root cause not always easy to ascertain
• Combination of factors (Cascading Failures)
• Latent failures
• Most occur during change of state events
• More maintenance does not necessarily mean higher availability
• Non-Fault tolerant systems




  WHY
  FILURES                  Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Causes of Critical Failures
                                     Commissioning or
                                      Test Deficiency
                                            4%

                 System Design                               Equipment
  Natural Disaster    20%                                      Design
        3%                                                      13%
   Maintenance
    Oversight
       4%
                                                                         Equipment Failure
                                                                               28%
    Installation Error
           10%           Human Error
                            18%




 WHY                             Morrison Hershfield Mission Critical – Infrastructure and Risk Assessment
WHY DO RISK ASSESSMENT

• Alignment of business mission and facility performance expectation

• Quantifies the risk and exposure of the critical facilities to failure

• Identifies vulnerabilities and single points of failure

• First step in creating an action plan for site hardening

• Benchmark against the industry

• Assists in developing business case for capital expenditures




 RISK ASSESSMENT              Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
SITE EVALUATION

STEP 1

• Quantify reliability expectations
• Develop resiliency metrics




 RISK ASSESSMENT      Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
SITE EVALUATION

STEP 2
   •   Develop PRA model     (Probabilistic Risk Assessment)




   •   Identify Single Points of Failure within critical systems
   •   Evaluate redundancy of critical systems
   •   Capacity and expendability analysis
   •   Adequacy of Engineered Systems
   •   Operation and maintenance policies, practices and procedures
   •   Adequacy of maintenance and testing programs
   •   Evaluate risks associated with site location
   •   Overall Risk Analysis
   •   Evaluate the adequacy of operations and maintenance programs


 RISK ASSESSMENT            Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
SITE EVALUATION


STEP 2 cont.
• Harmonics analysis
• EMF studies
• Short circuit & coordination studies
• Air flow modeling-CFD




 RISK ASSESSMENT            Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
SITE EVALUATION

STEP 3
   • Perform gap analysis
STEP 4
   • Recommendations for upgrade/alteration to optimize facility
      performance
   • Budget and schedule development
   • Assess risk during implementation
   • Benchmark findings with industry standards




 RISK ASSESSMENT         Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RISK ASSESSMENT METRICS

   • Probability of Failure/Reliability
   • Availability
   • MTTF
   • MTTR
   • Susceptibility to natural disasters
   • Fault tolerance
   • Single Points of Failure
   • Maintainability
   • Operational readiness
   • Maintenance program

 RISK ASSESSMENT            Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
INFRASTRUCTURE RELIABILITY



 • RELIABILITY / AVAILABLITY

 • RELIABILITY MODELING

 • RELIABILITY CONSIDERATIONS




 RELIABILITY    Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RELIABILITY


• “Reliability” is used as an umbrella definition

• May Refer to Availability, Durability, Quality

• Five 9’s ????

• Reliability = Probability of Successful Operation




 RELIABILITY                Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RELIABILITY AND AVAILABILITY



•     Reliability predicts how likely is the system to fail.

•     Availability is a measure (or a future prediction) of what percentage
      of the time the system will operating properly




    RELIABILITY                 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
AVAILABILITY

Five 9’s refers to Availability

Availability (A) = Average fraction of time Something is in service
and performing intended function.

99.999% availability means:
    • 5.3 minutes of downtime each year
                       or
    • 1.77 hours of downtime every 20 years

Availability does not specify how often an outage occurs



 RELIABILITY                  Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
AVAILABILITY


Availability (A) = MTBF/(MTBF + MTTR)

  MTTF: Mean Time To Failure
  MTBF: Mean Time Between Failures
  MTTR: Mean Time to Repair or Downtime
  MTBF=MTTF+MTTR




 RELIABILITY            Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RELIABILITY BATHTUB CURVE

      Failure Rate




                     early                                                                   wear-out
                     life                        useful life                                 period

                             0.5
                                       Time (t) Years YEARS                       12 14

 RELIABILITY                       Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RELIABILITY MODELING


•      Used to compare system designs and assist in the evaluation of
       risk versus the cost to mitigate the risk.

•      Failure and Repair data comes from IEEE 493, Recommended
       Practice for Design of Reliable Industrial and Commercial Power
       Systems (IEEE Gold Book)




    RELIABILITY              Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RELIABILITY MODELING

Components used for reliability modeling of the electrical system shown
here:

•   Utility power
•   Generator
•   Circuit breakers
•   Switchboards
•   Cables
•   Automatic Transfer Switch
•   UPS module
•   Battery
•   Static Bypass Switch
•   Rack Power



 RELIABILITY              Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RELIABILITY MODELING




                                         Reliability Block 
                                         Diagram (RBD)


 RELIABILITY   Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
RELIABILITY MODELING

Shown below are the results of the calculations




                         Hours         Hours




 RELIABILITY              Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
THE TRADITIONAL CLASSIFICATION SYSTEM
           The Uptime Institute
Tier 1 – Basic Non-Redundant Data Center
         Single path for power and cooling distribution without redundant
         components

Tier 2 – Basic Redundant Data Center
         Single path for power and cooling distribution with redundant
         components

Tier 3 – Concurrently Maintainable Data Center
         Multiple paths for power and cooling distribution with only one path
         active and with redundant components

Tier 4 – Fault Tolerant Data Center
         Multiple active power and cooling distribution paths with redundant
         components and fault tolerant


RELIABILITY                   Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Tier Definitions


                             TIER REQUIREMENTS
                                  Tier I Tier II                Tier III    Tier IV
                                                               1 Active
Number of Delivery Paths             1              1                      2 Active
                                                              1 Passive
Redundancy                         N             N+1             N+1     2N Minimum
Compartmentalization               No            No               No          Yes
Concurrent Maintainability         No            No              Yes          Yes
Fault Tolerance                    No            No               No          Yes
Availability                     99.67          99.75          99.982       99.95
Downtime in Hr/Yr                 28.8           22              1.6          0.4




  RELIABILITY                 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Data Center Cost

From the UI

• Tier I - $10,000 US/kW of Useable UPS Power Output

• Tier II - $11,000 US/kW of Useable UPS Power Output

• Tier III - $20,000 US/kW of Useable UPS Power Output

• Tier IV - $22,000 US/kW of Useable UPS Power Output

• Plus $225 US/SF of Computer Room




 RELIABILITY                Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
HOW MUCH REDUNDANCY IS ENOUGH?




RELIABILITY   Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Reliability Considerations

Assumptions

• Various configurations examined for single or dual utility feeders, UPS,
  Generators, STS’s, single or dual cords

• Compare Reliability at 2000 KW and 4000 KW Load

• 5 Year Probability of Failure




 RELIABILITY                  Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Single utility feeder, parallel redundant UPS and
     generators, single cord IT equipment
2N UPS, N+1 Generators, ASTSs, Dual Cord Rack
Two Utility Feeders, 2(N+1) UPS, 2(N+1) Generators,
               ASTSs, Dual Cord Rack
Distributed Redundant UPS, N+2 Generators, Two
   Utility Feeders, ASTSs and Dual Cord Rack
Reliability Considerations




RELIABILITY         Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Reliability Considerations




 RELIABILITY        Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Reliability Considerations




 RELIABILITY        Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Reliability Considerations




 RELIABILITY        Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Reliability Considerations




 RELIABILITY        Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Reliability Considerations
Emergency Diesel Generators

                                       fail to start


                                       fail after ½ hour



                                        fail after 8 hours



                                        fail after 24 hours


Study Performed by Idaho National Engineering Laboratory – February 1996 at Nuclear Power Plants



  RELIABILITY                                  Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Reliability Considerations


• 2(N+1) UPS/Generator with dual utility feeders - most reliable
  topology
• 2(N+1) UPS > 2N UPS by small margin
• 2N > Distributed Redundant by small margin
• Significant improvement if a second utility feeder
  is provided
• N+2 and/or 2N generator systems are more reliable than N+1
• Hybrid configuration in a hybrid facility is sometimes the best solution




 RELIABILITY               Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Reliability Considerations


•   Assess the condition of the mechanical plant in conjunction with the
    electrical system
•   The facility reliability will be driven by the least reliable component
    (typically the electrical infrastructure)




 RELIABILITY                Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
System Reliability Block




                Electrical System                                      Electrical          Mechanical




         Electrical systempow    ering the                          Mechanical systemsupporting critical
                   critical load                                                    load




 RELIABILITY                                 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
System Reliability Block
                                                  MTBF                 Availability              Pf (3 years)
Electrical system
alone                                           330,184                  0.99999                       8.10%
Mechanical system
alone                                           178,611                 0.999943                       11.70%
Electrical system
supporting mechanical                           108,500                 0.999985                       21.40%
Overall mechanical
system                                           70,087                 0.999931                       29.20%
Combined electrical
mechanical system                                57,819                 0.999922                       36.90%


                  Electrical System                               Electrical           Mechanical




            Electrical system powering the                     Mechanical system supporting critical
                      critical load                                            load


  RELIABILITY                                Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
The Cost of Reliability
 Reliability


 99.9999

  99.999


  99.99

  99.9


  99.0

  .9
               $   $$   $$$      $$$$      $$$$$

 RELIABILITY             Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Key Takeaways – Risk Assessment

 • What Reliability Level Do you Really Need Based on Your Business
   Case?

 • Minimize Single Points of Failure

 • Concurrent Maintainability?

 • Fault Tolerance?

 • Ensure Adequacy of Operations, Maintenance and Testing Programs

 • How to justify the cost to upgrade from present state?




 RISK ASSESSMENT             Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Key Takeaways – Reliability

•    Design objective – find optimum compromise between cost and reliability
•    Size matters – larger facilities yield lower reliability
•    System architecture and design implementation is more important role
     than equipment selection
•    Segregate system in independent blocks
•    Eliminate common source components to minimize fault propagation (i.e.
     LBS, hot-tie, manual bus ties)
•    Move single points of failures as close to the load as possible
•    Always maintain two independent sources of power to the critical load
•    Optimize the design of monitoring and controls circuits
•    Keep it simple/minimize human intervention/Utilize Automation


    RELIABILITY                    Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
Thank you and please feel
QUESTIONS?                                            free to contact me

Steven Shapiro, PE, ATD
SShapiro@MorrisonHershfield.com
914.420.3213
http://www.linkedin.com/in/stevenshapirope
References:
Uptime Institute White Papers:
Tier Myths and Misconceptions
Data Center Site Infrastructure Tier Standard: Topology
Building Areas/Systems Reviewed

‫׀‬   General Construction
‫׀‬   Electrical
‫׀‬   Mechanical
‫׀‬   Plumbing And Fire Protection
‫׀‬   Operation and Maintenance
‫׀‬   Security 
‫׀‬   Load Density

                                 48
                      Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
    RISK ASSESSMENT
Site Reliability
•   Is Project Compatible With Zoning
•   Natural Environment Issues
‫׀‬   Seismic Zone
‫׀‬   Geo Technical Reports
‫׀‬   Sub Surface Conditions
‫׀‬   Tornado/hurricane Risk
‫׀‬   Site Flood Potential
‫׀‬   Fire Potential
‫׀‬   Site Topography
‫׀‬   Weather Extremes
•   Man‐Made Environment Issues
‫׀‬   Power/Data and Communication/Water Supply/Sanitary Sewer Availability
‫׀‬   ISP Connectivity to Mirror and DR Sites
‫׀‬   Proximity of Hazardous Operational Facilities, i.e. Nuclear Power Plants, Military Bases, 
    Chemical Plants, Tank Farms, Water/Sewage Treatment Plants, Dams/Reservoirs, Gas 
    Stations, etc.
‫׀‬   Distance to Airports & Freeways
‫׀‬   Distance to Emergency Services, i.e. Fire and Police Departments, Hospital 


                                                49
                                     Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
    RISK ASSESSMENT
Building Areas/Systems Reviewed
Building Utilities and Physical Issues
‫ ׀‬General building systems and area characteristics
‫ ׀‬Life safety and environmental
Electrical Systems
‫ ׀‬Utility feeders
‫ ׀‬Service entry
‫ ׀‬Base building electrical distribution system including busways, step‐down 
    transformers, switchgear and distribution panels
‫׀‬   Uninterruptible power supply (UPS) systems
‫׀‬   Battery systems
‫׀‬   Power Distribution System including the critical computer rooms
‫׀‬   Emergency/standby generator and fuel system
‫׀‬   Normal/standby power transfer switchgear
‫׀‬   Grounding
‫׀‬   Emergency Power Off Systems
‫׀‬   Lightning protection system
‫׀‬   Fire alarm and smoke detection systems


                                            50
                                 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
    RISK ASSESSMENT
Building Areas/Systems Reviewed
•   Mechanical Systems
‫׀‬   Critical Systems Chilled Water Plant:  Chillers, pumps, piping distribution system, 
    controls, etc
‫׀‬   Critical Systems Condenser Water System:  Cooling towers, pumps, piping, etc
‫׀‬   Critical Systems Air Handling Systems
‫׀‬   Critical Systems Air Distribution
‫׀‬   Critical Systems Secondary Chilled Water Loop
‫׀‬   Fuel Oil Systems
‫׀‬   Boiler Systems
‫׀‬   Compressed Air Systems
•   Plumbing Systems
‫׀‬   Domestic Water Systems
‫׀‬   Natural Gas Systems
‫׀‬   Fire Suppression Systems (Water and Gaseous)
•   Operation and Maintenance of the Critical Support Systems
‫׀‬   Maintenance procedures and programs
‫׀‬   Normal operating procedures
‫׀‬   Emergency operating procedures
‫׀‬   Training programs and methods
‫׀‬   Spare parts



                                                51
                                     Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
    RISK ASSESSMENT
Building Areas/Systems Reviewed
•   Building Automation
‫׀‬   Building Automation Systems.
‫׀‬   Physical Security Systems.
‫׀‬   Access control
‫׀‬   Intrusion detection
‫׀‬   CCTV systems
‫׀‬   ID badging systems
‫׀‬   Intercom systems
‫׀‬   Smoke Purge Systems
•   Technology Systems
‫׀‬   Entrance Facility Feeds.
‫׀‬   Telephone Company Services.
•   Systems Integration:
‫׀‬   The integration, compatibility and interaction of the above systems with each 
    other, as well as with the other building elements will be reviewed to ensure that 
    the systems are compatible and fully integrated.
                                              52
                                   Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
    RISK ASSESSMENT

More Related Content

What's hot

Failure mode effect_analysis_fmea
Failure mode effect_analysis_fmeaFailure mode effect_analysis_fmea
Failure mode effect_analysis_fmeaJitesh Gaurav
 
Maintenance planning systems
Maintenance planning  systemsMaintenance planning  systems
Maintenance planning systemsAnil Mohindru
 
Maintenance Strategy, Types of Maintenance
Maintenance Strategy, Types of MaintenanceMaintenance Strategy, Types of Maintenance
Maintenance Strategy, Types of MaintenanceDhanesh S
 
General Maintenance
General MaintenanceGeneral Maintenance
General Maintenancepradhyot05
 
SMRP (1997) Proactive Maintenance Slides by John Day - 1997
SMRP (1997) Proactive Maintenance Slides by John Day - 1997SMRP (1997) Proactive Maintenance Slides by John Day - 1997
SMRP (1997) Proactive Maintenance Slides by John Day - 1997Ricky Smith CMRP, CMRT
 
Maintenance Management (presentation)
Maintenance Management (presentation)Maintenance Management (presentation)
Maintenance Management (presentation)kabul university
 
Computerized maintenance management system (cmms)
Computerized maintenance management system (cmms)Computerized maintenance management system (cmms)
Computerized maintenance management system (cmms)SHIVAJI CHOUDHURY
 
Maintenance module1 ppt number 3
Maintenance module1 ppt number 3Maintenance module1 ppt number 3
Maintenance module1 ppt number 3Dhanesh S
 
Qualitative & Quantitative Analysis
Qualitative & Quantitative AnalysisQualitative & Quantitative Analysis
Qualitative & Quantitative AnalysisAlin Veronika
 
Risk Identification Process PowerPoint Presentation Slides
Risk Identification Process PowerPoint Presentation SlidesRisk Identification Process PowerPoint Presentation Slides
Risk Identification Process PowerPoint Presentation SlidesSlideTeam
 
Best Practices in Maintenance and Reliability
Best Practices in Maintenance and ReliabilityBest Practices in Maintenance and Reliability
Best Practices in Maintenance and ReliabilityRicky Smith CMRP, CMRT
 

What's hot (20)

Preventive maintenance
Preventive maintenancePreventive maintenance
Preventive maintenance
 
Failure mode effect_analysis_fmea
Failure mode effect_analysis_fmeaFailure mode effect_analysis_fmea
Failure mode effect_analysis_fmea
 
Value Engineering
Value EngineeringValue Engineering
Value Engineering
 
Maintenance planning systems
Maintenance planning  systemsMaintenance planning  systems
Maintenance planning systems
 
Maintenance Strategy, Types of Maintenance
Maintenance Strategy, Types of MaintenanceMaintenance Strategy, Types of Maintenance
Maintenance Strategy, Types of Maintenance
 
General Maintenance
General MaintenanceGeneral Maintenance
General Maintenance
 
Maintenance management
Maintenance managementMaintenance management
Maintenance management
 
Risk analysis
Risk analysis  Risk analysis
Risk analysis
 
SMRP (1997) Proactive Maintenance Slides by John Day - 1997
SMRP (1997) Proactive Maintenance Slides by John Day - 1997SMRP (1997) Proactive Maintenance Slides by John Day - 1997
SMRP (1997) Proactive Maintenance Slides by John Day - 1997
 
Autonomous Maintenance
Autonomous MaintenanceAutonomous Maintenance
Autonomous Maintenance
 
Maintenance Management (presentation)
Maintenance Management (presentation)Maintenance Management (presentation)
Maintenance Management (presentation)
 
The 12 Steps of TPM Implementation Poster
The 12 Steps of TPM Implementation PosterThe 12 Steps of TPM Implementation Poster
The 12 Steps of TPM Implementation Poster
 
Computerized maintenance management system (cmms)
Computerized maintenance management system (cmms)Computerized maintenance management system (cmms)
Computerized maintenance management system (cmms)
 
Maintenance module1 ppt number 3
Maintenance module1 ppt number 3Maintenance module1 ppt number 3
Maintenance module1 ppt number 3
 
Qualitative & Quantitative Analysis
Qualitative & Quantitative AnalysisQualitative & Quantitative Analysis
Qualitative & Quantitative Analysis
 
Risk Identification Process PowerPoint Presentation Slides
Risk Identification Process PowerPoint Presentation SlidesRisk Identification Process PowerPoint Presentation Slides
Risk Identification Process PowerPoint Presentation Slides
 
Best Practices in Maintenance and Reliability
Best Practices in Maintenance and ReliabilityBest Practices in Maintenance and Reliability
Best Practices in Maintenance and Reliability
 
Fmea
FmeaFmea
Fmea
 
The Journey Towards Asset Management Excellence
The Journey Towards Asset Management ExcellenceThe Journey Towards Asset Management Excellence
The Journey Towards Asset Management Excellence
 
Preventive maintenance
Preventive maintenance Preventive maintenance
Preventive maintenance
 

Similar to Risk Assessments and Reliability, What You Need To Know

Reliability Maintenance Engineering 2 - 2 Reliability Techniques
Reliability Maintenance Engineering 2 - 2 Reliability TechniquesReliability Maintenance Engineering 2 - 2 Reliability Techniques
Reliability Maintenance Engineering 2 - 2 Reliability TechniquesAccendo Reliability
 
BAHAN PRESENTASI RCM.pptx
BAHAN PRESENTASI RCM.pptxBAHAN PRESENTASI RCM.pptx
BAHAN PRESENTASI RCM.pptxQienKing
 
Reducing Product Development Risk with Reliability Engineering Methods
Reducing Product Development Risk with Reliability Engineering MethodsReducing Product Development Risk with Reliability Engineering Methods
Reducing Product Development Risk with Reliability Engineering MethodsWilde Analysis Ltd.
 
PyBay 2018: Production-Ready Python Applications
PyBay 2018: Production-Ready Python ApplicationsPyBay 2018: Production-Ready Python Applications
PyBay 2018: Production-Ready Python ApplicationsMichael Kehoe
 
Failure Mode and Effect Analysis
Failure Mode and Effect AnalysisFailure Mode and Effect Analysis
Failure Mode and Effect Analysistulasiva
 
Operational Excellence in Oil and Gas Loss Prevention
Operational Excellence in Oil and Gas Loss PreventionOperational Excellence in Oil and Gas Loss Prevention
Operational Excellence in Oil and Gas Loss PreventionMichael Marshall, PE
 
MERC Capabilities Briefing
MERC Capabilities BriefingMERC Capabilities Briefing
MERC Capabilities BriefingHerb MacMillan
 
Reliability Program Approval Presentation_
Reliability Program Approval Presentation_Reliability Program Approval Presentation_
Reliability Program Approval Presentation_Chad Broussard
 
Managing your OnStream Inspection Program and External vs Internal inspections
Managing your OnStream Inspection Program and External vs Internal inspectionsManaging your OnStream Inspection Program and External vs Internal inspections
Managing your OnStream Inspection Program and External vs Internal inspectionsEdwin A Merrick
 
Maintenance types
Maintenance typesMaintenance types
Maintenance typesMotasem Ash
 
Introduction to The Augustus Group and RBI
Introduction to The Augustus Group and RBIIntroduction to The Augustus Group and RBI
Introduction to The Augustus Group and RBIEdwin A Merrick
 
Reliability Engineering in Biomanufacturing - Presentation by Michael Andrews
Reliability Engineering in Biomanufacturing - Presentation by Michael AndrewsReliability Engineering in Biomanufacturing - Presentation by Michael Andrews
Reliability Engineering in Biomanufacturing - Presentation by Michael AndrewsWPICPE
 
Mechanical Integrity.pdf
Mechanical Integrity.pdfMechanical Integrity.pdf
Mechanical Integrity.pdfaashir14
 
Reliability Engineering 101 : Tonex Training
Reliability Engineering 101 : Tonex TrainingReliability Engineering 101 : Tonex Training
Reliability Engineering 101 : Tonex TrainingBryan Len
 
Res Technical recruitment & training profile
Res Technical recruitment & training profile Res Technical recruitment & training profile
Res Technical recruitment & training profile Alaa Thabet
 
Turner.john
Turner.johnTurner.john
Turner.johnNASAPMC
 
Turner.john
Turner.johnTurner.john
Turner.johnNASAPMC
 
Risk leadership perspectives Risk Manager of the Year
Risk leadership perspectives Risk Manager of the YearRisk leadership perspectives Risk Manager of the Year
Risk leadership perspectives Risk Manager of the YearKarl Davey
 

Similar to Risk Assessments and Reliability, What You Need To Know (20)

Reliability Maintenance Engineering 2 - 2 Reliability Techniques
Reliability Maintenance Engineering 2 - 2 Reliability TechniquesReliability Maintenance Engineering 2 - 2 Reliability Techniques
Reliability Maintenance Engineering 2 - 2 Reliability Techniques
 
BAHAN PRESENTASI RCM.pptx
BAHAN PRESENTASI RCM.pptxBAHAN PRESENTASI RCM.pptx
BAHAN PRESENTASI RCM.pptx
 
Reducing Product Development Risk with Reliability Engineering Methods
Reducing Product Development Risk with Reliability Engineering MethodsReducing Product Development Risk with Reliability Engineering Methods
Reducing Product Development Risk with Reliability Engineering Methods
 
PyBay 2018: Production-Ready Python Applications
PyBay 2018: Production-Ready Python ApplicationsPyBay 2018: Production-Ready Python Applications
PyBay 2018: Production-Ready Python Applications
 
Failure Mode and Effect Analysis
Failure Mode and Effect AnalysisFailure Mode and Effect Analysis
Failure Mode and Effect Analysis
 
Operational Excellence in Oil and Gas Loss Prevention
Operational Excellence in Oil and Gas Loss PreventionOperational Excellence in Oil and Gas Loss Prevention
Operational Excellence in Oil and Gas Loss Prevention
 
MERC Capabilities Briefing
MERC Capabilities BriefingMERC Capabilities Briefing
MERC Capabilities Briefing
 
Reliability Program Approval Presentation_
Reliability Program Approval Presentation_Reliability Program Approval Presentation_
Reliability Program Approval Presentation_
 
Managing your OnStream Inspection Program and External vs Internal inspections
Managing your OnStream Inspection Program and External vs Internal inspectionsManaging your OnStream Inspection Program and External vs Internal inspections
Managing your OnStream Inspection Program and External vs Internal inspections
 
Maintenance types
Maintenance typesMaintenance types
Maintenance types
 
Introduction to The Augustus Group and RBI
Introduction to The Augustus Group and RBIIntroduction to The Augustus Group and RBI
Introduction to The Augustus Group and RBI
 
Reliability Engineering in Biomanufacturing - Presentation by Michael Andrews
Reliability Engineering in Biomanufacturing - Presentation by Michael AndrewsReliability Engineering in Biomanufacturing - Presentation by Michael Andrews
Reliability Engineering in Biomanufacturing - Presentation by Michael Andrews
 
Rbi
RbiRbi
Rbi
 
Mechanical Integrity.pdf
Mechanical Integrity.pdfMechanical Integrity.pdf
Mechanical Integrity.pdf
 
Reliability Engineering 101 : Tonex Training
Reliability Engineering 101 : Tonex TrainingReliability Engineering 101 : Tonex Training
Reliability Engineering 101 : Tonex Training
 
Res Technical recruitment & training profile
Res Technical recruitment & training profile Res Technical recruitment & training profile
Res Technical recruitment & training profile
 
Turner.john
Turner.johnTurner.john
Turner.john
 
Turner.john
Turner.johnTurner.john
Turner.john
 
FMEA.pptx
FMEA.pptxFMEA.pptx
FMEA.pptx
 
Risk leadership perspectives Risk Manager of the Year
Risk leadership perspectives Risk Manager of the YearRisk leadership perspectives Risk Manager of the Year
Risk leadership perspectives Risk Manager of the Year
 

Recently uploaded

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 

Recently uploaded (20)

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 

Risk Assessments and Reliability, What You Need To Know

  • 1. INFRASTRUCTURE RELIABILITY AND RISK ASSESSMENTS Steven Shapiro, P.E., ATD Mission Critical Practice Lead Morrison Hershfield Mission Critical Morrison Hershfield Mission Critical
  • 2. WHAT YOU NEED TO KNOW AGENDA • RISK ASSESSMENT • INFRASTRUCTURE RELIABILITY COOLING POWER Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 3. RISK ASSESSMENTS • WHY • SITE EVALUATION • METRICS Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 4. Causes of Critical Failures • Location • Design • Redundancy level • Construction • Quality of equipment • Age Lurking Vulnerabilities • Operations & Maintenance program • Personnel training • Level of operator coverage • Thoroughness of the commissioning program 5 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments WHY
  • 5. Causes of Critical Failures • Equipment failure • Operator error • Natural disaster • Design error • Installation error • Commissioning or test deficiency • Maintenance oversight • Equipment design WHY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 6. Causes of Critical Failures • Root cause not always easy to ascertain • Combination of factors (Cascading Failures) • Latent failures • Most occur during change of state events • More maintenance does not necessarily mean higher availability • Non-Fault tolerant systems WHY FILURES Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 7. Causes of Critical Failures Commissioning or Test Deficiency 4% System Design Equipment Natural Disaster 20% Design 3% 13% Maintenance Oversight 4% Equipment Failure 28% Installation Error 10% Human Error 18% WHY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessment
  • 8. WHY DO RISK ASSESSMENT • Alignment of business mission and facility performance expectation • Quantifies the risk and exposure of the critical facilities to failure • Identifies vulnerabilities and single points of failure • First step in creating an action plan for site hardening • Benchmark against the industry • Assists in developing business case for capital expenditures RISK ASSESSMENT Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 9. SITE EVALUATION STEP 1 • Quantify reliability expectations • Develop resiliency metrics RISK ASSESSMENT Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 10. SITE EVALUATION STEP 2 • Develop PRA model (Probabilistic Risk Assessment) • Identify Single Points of Failure within critical systems • Evaluate redundancy of critical systems • Capacity and expendability analysis • Adequacy of Engineered Systems • Operation and maintenance policies, practices and procedures • Adequacy of maintenance and testing programs • Evaluate risks associated with site location • Overall Risk Analysis • Evaluate the adequacy of operations and maintenance programs RISK ASSESSMENT Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 11. SITE EVALUATION STEP 2 cont. • Harmonics analysis • EMF studies • Short circuit & coordination studies • Air flow modeling-CFD RISK ASSESSMENT Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 12. SITE EVALUATION STEP 3 • Perform gap analysis STEP 4 • Recommendations for upgrade/alteration to optimize facility performance • Budget and schedule development • Assess risk during implementation • Benchmark findings with industry standards RISK ASSESSMENT Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 13. RISK ASSESSMENT METRICS • Probability of Failure/Reliability • Availability • MTTF • MTTR • Susceptibility to natural disasters • Fault tolerance • Single Points of Failure • Maintainability • Operational readiness • Maintenance program RISK ASSESSMENT Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 14. INFRASTRUCTURE RELIABILITY • RELIABILITY / AVAILABLITY • RELIABILITY MODELING • RELIABILITY CONSIDERATIONS RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 15. RELIABILITY • “Reliability” is used as an umbrella definition • May Refer to Availability, Durability, Quality • Five 9’s ???? • Reliability = Probability of Successful Operation RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 16. RELIABILITY AND AVAILABILITY • Reliability predicts how likely is the system to fail. • Availability is a measure (or a future prediction) of what percentage of the time the system will operating properly RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 17. AVAILABILITY Five 9’s refers to Availability Availability (A) = Average fraction of time Something is in service and performing intended function. 99.999% availability means: • 5.3 minutes of downtime each year or • 1.77 hours of downtime every 20 years Availability does not specify how often an outage occurs RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 18. AVAILABILITY Availability (A) = MTBF/(MTBF + MTTR) MTTF: Mean Time To Failure MTBF: Mean Time Between Failures MTTR: Mean Time to Repair or Downtime MTBF=MTTF+MTTR RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 19. RELIABILITY BATHTUB CURVE Failure Rate early wear-out life useful life period 0.5 Time (t) Years YEARS 12 14 RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 20. RELIABILITY MODELING • Used to compare system designs and assist in the evaluation of risk versus the cost to mitigate the risk. • Failure and Repair data comes from IEEE 493, Recommended Practice for Design of Reliable Industrial and Commercial Power Systems (IEEE Gold Book) RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 21. RELIABILITY MODELING Components used for reliability modeling of the electrical system shown here: • Utility power • Generator • Circuit breakers • Switchboards • Cables • Automatic Transfer Switch • UPS module • Battery • Static Bypass Switch • Rack Power RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 22. RELIABILITY MODELING Reliability Block  Diagram (RBD) RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 23. RELIABILITY MODELING Shown below are the results of the calculations Hours Hours RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 24. THE TRADITIONAL CLASSIFICATION SYSTEM The Uptime Institute Tier 1 – Basic Non-Redundant Data Center Single path for power and cooling distribution without redundant components Tier 2 – Basic Redundant Data Center Single path for power and cooling distribution with redundant components Tier 3 – Concurrently Maintainable Data Center Multiple paths for power and cooling distribution with only one path active and with redundant components Tier 4 – Fault Tolerant Data Center Multiple active power and cooling distribution paths with redundant components and fault tolerant RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 25. Tier Definitions TIER REQUIREMENTS Tier I Tier II Tier III Tier IV 1 Active Number of Delivery Paths 1 1 2 Active 1 Passive Redundancy N N+1 N+1 2N Minimum Compartmentalization No No No Yes Concurrent Maintainability No No Yes Yes Fault Tolerance No No No Yes Availability 99.67 99.75 99.982 99.95 Downtime in Hr/Yr 28.8 22 1.6 0.4 RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 26. Data Center Cost From the UI • Tier I - $10,000 US/kW of Useable UPS Power Output • Tier II - $11,000 US/kW of Useable UPS Power Output • Tier III - $20,000 US/kW of Useable UPS Power Output • Tier IV - $22,000 US/kW of Useable UPS Power Output • Plus $225 US/SF of Computer Room RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 27. HOW MUCH REDUNDANCY IS ENOUGH? RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 28. Reliability Considerations Assumptions • Various configurations examined for single or dual utility feeders, UPS, Generators, STS’s, single or dual cords • Compare Reliability at 2000 KW and 4000 KW Load • 5 Year Probability of Failure RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 29. Single utility feeder, parallel redundant UPS and generators, single cord IT equipment
  • 30. 2N UPS, N+1 Generators, ASTSs, Dual Cord Rack
  • 31. Two Utility Feeders, 2(N+1) UPS, 2(N+1) Generators, ASTSs, Dual Cord Rack
  • 32. Distributed Redundant UPS, N+2 Generators, Two Utility Feeders, ASTSs and Dual Cord Rack
  • 33. Reliability Considerations RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 34. Reliability Considerations RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 35. Reliability Considerations RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 36. Reliability Considerations RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 37. Reliability Considerations RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 38. Reliability Considerations Emergency Diesel Generators fail to start fail after ½ hour fail after 8 hours fail after 24 hours Study Performed by Idaho National Engineering Laboratory – February 1996 at Nuclear Power Plants RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 39. Reliability Considerations • 2(N+1) UPS/Generator with dual utility feeders - most reliable topology • 2(N+1) UPS > 2N UPS by small margin • 2N > Distributed Redundant by small margin • Significant improvement if a second utility feeder is provided • N+2 and/or 2N generator systems are more reliable than N+1 • Hybrid configuration in a hybrid facility is sometimes the best solution RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 40. Reliability Considerations • Assess the condition of the mechanical plant in conjunction with the electrical system • The facility reliability will be driven by the least reliable component (typically the electrical infrastructure) RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 41. System Reliability Block Electrical System Electrical Mechanical Electrical systempow ering the Mechanical systemsupporting critical critical load load RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 42. System Reliability Block MTBF Availability Pf (3 years) Electrical system alone 330,184 0.99999 8.10% Mechanical system alone 178,611 0.999943 11.70% Electrical system supporting mechanical 108,500 0.999985 21.40% Overall mechanical system 70,087 0.999931 29.20% Combined electrical mechanical system 57,819 0.999922 36.90% Electrical System Electrical Mechanical Electrical system powering the Mechanical system supporting critical critical load load RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 43. The Cost of Reliability Reliability 99.9999 99.999 99.99 99.9 99.0 .9 $ $$ $$$ $$$$ $$$$$ RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 44. Key Takeaways – Risk Assessment • What Reliability Level Do you Really Need Based on Your Business Case? • Minimize Single Points of Failure • Concurrent Maintainability? • Fault Tolerance? • Ensure Adequacy of Operations, Maintenance and Testing Programs • How to justify the cost to upgrade from present state? RISK ASSESSMENT Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 45. Key Takeaways – Reliability • Design objective – find optimum compromise between cost and reliability • Size matters – larger facilities yield lower reliability • System architecture and design implementation is more important role than equipment selection • Segregate system in independent blocks • Eliminate common source components to minimize fault propagation (i.e. LBS, hot-tie, manual bus ties) • Move single points of failures as close to the load as possible • Always maintain two independent sources of power to the critical load • Optimize the design of monitoring and controls circuits • Keep it simple/minimize human intervention/Utilize Automation RELIABILITY Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments
  • 46. Thank you and please feel QUESTIONS? free to contact me Steven Shapiro, PE, ATD SShapiro@MorrisonHershfield.com 914.420.3213 http://www.linkedin.com/in/stevenshapirope References: Uptime Institute White Papers: Tier Myths and Misconceptions Data Center Site Infrastructure Tier Standard: Topology
  • 47. Building Areas/Systems Reviewed ‫׀‬ General Construction ‫׀‬ Electrical ‫׀‬ Mechanical ‫׀‬ Plumbing And Fire Protection ‫׀‬ Operation and Maintenance ‫׀‬ Security  ‫׀‬ Load Density 48 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments RISK ASSESSMENT
  • 48. Site Reliability • Is Project Compatible With Zoning • Natural Environment Issues ‫׀‬ Seismic Zone ‫׀‬ Geo Technical Reports ‫׀‬ Sub Surface Conditions ‫׀‬ Tornado/hurricane Risk ‫׀‬ Site Flood Potential ‫׀‬ Fire Potential ‫׀‬ Site Topography ‫׀‬ Weather Extremes • Man‐Made Environment Issues ‫׀‬ Power/Data and Communication/Water Supply/Sanitary Sewer Availability ‫׀‬ ISP Connectivity to Mirror and DR Sites ‫׀‬ Proximity of Hazardous Operational Facilities, i.e. Nuclear Power Plants, Military Bases,  Chemical Plants, Tank Farms, Water/Sewage Treatment Plants, Dams/Reservoirs, Gas  Stations, etc. ‫׀‬ Distance to Airports & Freeways ‫׀‬ Distance to Emergency Services, i.e. Fire and Police Departments, Hospital  49 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments RISK ASSESSMENT
  • 49. Building Areas/Systems Reviewed Building Utilities and Physical Issues ‫ ׀‬General building systems and area characteristics ‫ ׀‬Life safety and environmental Electrical Systems ‫ ׀‬Utility feeders ‫ ׀‬Service entry ‫ ׀‬Base building electrical distribution system including busways, step‐down  transformers, switchgear and distribution panels ‫׀‬ Uninterruptible power supply (UPS) systems ‫׀‬ Battery systems ‫׀‬ Power Distribution System including the critical computer rooms ‫׀‬ Emergency/standby generator and fuel system ‫׀‬ Normal/standby power transfer switchgear ‫׀‬ Grounding ‫׀‬ Emergency Power Off Systems ‫׀‬ Lightning protection system ‫׀‬ Fire alarm and smoke detection systems 50 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments RISK ASSESSMENT
  • 50. Building Areas/Systems Reviewed • Mechanical Systems ‫׀‬ Critical Systems Chilled Water Plant:  Chillers, pumps, piping distribution system,  controls, etc ‫׀‬ Critical Systems Condenser Water System:  Cooling towers, pumps, piping, etc ‫׀‬ Critical Systems Air Handling Systems ‫׀‬ Critical Systems Air Distribution ‫׀‬ Critical Systems Secondary Chilled Water Loop ‫׀‬ Fuel Oil Systems ‫׀‬ Boiler Systems ‫׀‬ Compressed Air Systems • Plumbing Systems ‫׀‬ Domestic Water Systems ‫׀‬ Natural Gas Systems ‫׀‬ Fire Suppression Systems (Water and Gaseous) • Operation and Maintenance of the Critical Support Systems ‫׀‬ Maintenance procedures and programs ‫׀‬ Normal operating procedures ‫׀‬ Emergency operating procedures ‫׀‬ Training programs and methods ‫׀‬ Spare parts 51 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments RISK ASSESSMENT
  • 51. Building Areas/Systems Reviewed • Building Automation ‫׀‬ Building Automation Systems. ‫׀‬ Physical Security Systems. ‫׀‬ Access control ‫׀‬ Intrusion detection ‫׀‬ CCTV systems ‫׀‬ ID badging systems ‫׀‬ Intercom systems ‫׀‬ Smoke Purge Systems • Technology Systems ‫׀‬ Entrance Facility Feeds. ‫׀‬ Telephone Company Services. • Systems Integration: ‫׀‬ The integration, compatibility and interaction of the above systems with each  other, as well as with the other building elements will be reviewed to ensure that  the systems are compatible and fully integrated. 52 Morrison Hershfield Mission Critical – Infrastructure and Risk Assessments RISK ASSESSMENT