SlideShare a Scribd company logo
MTBF / MTTR
                        Availability or recoverability?

                        Presented by 
                        Michael Richardson, Energized Work
                        21 March 2012




ENERGIZED WORK
25 MACKLIN STREET
LONDON WC2B 5NN
+44 (0)20 7691 8933
WWW.ENERGIZEDWORK.COM
Michael Richardson
                                                Twitter: @mr_spb
                                                




                                                Email: michael@energizedwork.com
                                                
                                                #ewtektalk
                                                




© 2012 Energized Work - www.energizedwork.com                                       2
So what is 
high availability?

•      Five nines?
•      No single point of failures?
•      Multiple data centres?
•      Fault tolerance?
•      Load balancing?
•      Uptime?




© 2012 Energized Work - www.energizedwork.com   3
Nines
of availability
                                                    9       9
                                                9
                             9
                                                        9
9
                                                9           9
© 2012 Energized Work - www.energizedwork.com                   4
Nines
of availability

                   Availability
                Downtime per Year
                   One nine (90%)
              36.5 days
                   Two nines (99%)
             3.65 days
                   Three nines (99.9%)
         8.76 hours
                   Four nines (99.99%)
         52.56 minutes
                   Five nines (99.999%)
        5.26 minutes




© 2012 Energized Work - www.energizedwork.com                        5
Problem with
the nines

•  What do they mean?
•  Guaranteed or just an SLA?
•  Multiplicity (99.9% * 99.9% * 99.9% = 99.7%)




© 2012 Energized Work - www.energizedwork.com      6
SLA availability numbers
just aim to provide a level of
confidence in a website’s service




© 2012 Energized Work - www.energizedwork.com   7
No single point of failure
(SPOF)




© 2012 Energized Work - www.energizedwork.com   8
Two of everything?




© 2012 Energized Work - www.energizedwork.com   9
Start with this

                                                 Users




                                                Index.html




© 2012 Energized Work - www.energizedwork.com                10
End with this
                                                     Users


                                       Firewall 1                   Firewall 2



                                        Switch 1                    Switch 2




               WEB1                   WEB2          APP1     APP2                DB1   DB2

© 2012 Energized Work - www.energizedwork.com                                                11
Problems with
eliminating SPOF

•      It’s expensive
•      Where do you draw the line?
•      Are failures independent?
•      Can you guarantee no SPOF?
•      Increased complexity




© 2012 Energized Work - www.energizedwork.com   12
Problem:
Data centres fail




© 2012 Energized Work - www.energizedwork.com   13
Solution:
Get a second data centre




© 2012 Energized Work - www.energizedwork.com   14
Hot – Hot
multisite

•      Full range of services available in multiple locations
•      Easy to automate failover of sites
•      Data consistency is hard
•      Capacity planning concerns



                                                       +


© 2012 Energized Work - www.energizedwork.com                    15
Hot – Warm
multisite

•  Simpler than hot – hot
•  Read / Write ratio dependent
•  Synchronously or asynchronously replicate data?




                                                 +


© 2012 Energized Work - www.energizedwork.com         16
Hot – Cold
multisite

•      Easy to setup
•      Will it work?
•      Can it be trusted?
•      Cold site rapidly becomes stale
•      Is it actually valuable?


                                                +


© 2012 Energized Work - www.energizedwork.com       17
DR multisite


•  Fingers crossed you never need it
•  How can / should you test it?
•  Cloud?




                                                +


© 2012 Energized Work - www.energizedwork.com       18
Problems
with multiple sites

•      It’s expensive
•      Managing more systems
•      Managing data consistency
•      Managing capacity
•      Is it still fail proof?
•      Unless you test it, it’s just a plan





© 2012 Energized Work - www.energizedwork.com   19
We now have
a complex system




© 2012 Energized Work - www.energizedwork.com   20
Complex systems


•  More redundancy and automation leads to more complexity
•  More complexity often adds more points of failure





© 2012 Energized Work - www.energizedwork.com                 21
How complex systems fail
 - Dr. Richard Cook


•  Catastrophe is always just around the corner
•  Human operators have dual roles
•  Change introduces new forms of failure





© 2012 Energized Work - www.energizedwork.com      22
Failure and recovery




© 2012 Energized Work - www.energizedwork.com   23
Questions
for the business

•  What is the cost of downtime?
•  What are the Recovery Time Objectives (RTO)
•  What are the Recovery Point Objectives (RPO)?




© 2012 Energized Work - www.energizedwork.com       24
Aggressive RTO and RPO
are expensive and have a
performance impact




© 2012 Energized Work - www.energizedwork.com   25
RTO / RPO
example

Problem:
•  Simple DB
•  Business can tolerate up to 15 minutes downtime
•  10-minute window of data loss




© 2012 Energized Work - www.energizedwork.com         26
RTO / RPO
example

Possible solution:
•  Continuously replicate data to second host
•  Continue with nightly backups and also copy DB transaction logs
   from the primary host to another system




© 2012 Energized Work - www.energizedwork.com                        27
So what is more important –
increasing availability
or reducing recovery time?





© 2012 Energized Work - www.energizedwork.com   28
MTBF or MTTR?


What about MTTD?




© 2012 Energized Work - www.energizedwork.com   29
The answer is:
It depends




© 2012 Energized Work - www.energizedwork.com   30
Failure
is inevitable




© 2012 Energized Work - www.energizedwork.com   31
Ask anyone




© 2012 Energized Work - www.energizedwork.com   32
License
This presentation is provided under the Creative Commons 
Attribution Share Alike 3.0 Unported License.

               You are free:
                 
               To share – to copy, distribute and transmit the work
               
               To remix – to adapt the work
               
               
               Under the following conditions:
               
               Attribution – You must attribute the work in the manner specified by 
               Energized Work (but not in any way that suggests that Energized Work 
               endorse you or your use of the work).
               
               Share Alike – If you alter, transform, or build upon this work, you may 
               distribute the resulting work only under the same or similar license to this 
               one. 

                                                                                                ENERGIZED WORK
                                                                                                25 MACKLIN STREET
                                                                                                LONDON WC2B 5NN
                                                                                                +44 (0)20 7691 8933
© 2012 Energized Work - www.energizedwork.com                                                   WWW.ENERGIZEDWORK.COM
                                                                                                                    33

More Related Content

What's hot

Muda,Mura & Muri
Muda,Mura & MuriMuda,Mura & Muri
Muda,Mura & Murimanijayjay
 
16 major losses tng
16 major losses tng16 major losses tng
16 major losses tng
Rajasekaran Murugan
 
TPM - tech talk
TPM - tech talk TPM - tech talk
tpm information board.pptx
tpm information board.pptxtpm information board.pptx
tpm information board.pptx
VinhCao22
 
Why Total Productive Maintenance (TPM) In Process Industry? - ADDVALUE - Nile...
Why Total Productive Maintenance (TPM) In Process Industry? - ADDVALUE - Nile...Why Total Productive Maintenance (TPM) In Process Industry? - ADDVALUE - Nile...
Why Total Productive Maintenance (TPM) In Process Industry? - ADDVALUE - Nile...
ADD VALUE CONSULTING Inc
 
[Partial Preview:] Waste-finding Checklists for Manufacturing Companies
[Partial Preview:] Waste-finding Checklists for Manufacturing Companies[Partial Preview:] Waste-finding Checklists for Manufacturing Companies
[Partial Preview:] Waste-finding Checklists for Manufacturing Companies
Operational Excellence Consulting
 
Preventive / Autonomous Maintenance - Beginners Guide - Tamil
Preventive / Autonomous Maintenance - Beginners Guide - TamilPreventive / Autonomous Maintenance - Beginners Guide - Tamil
Preventive / Autonomous Maintenance - Beginners Guide - Tamil
Ananth Palaniappan
 
Autonomous Maintenance
Autonomous MaintenanceAutonomous Maintenance
Autonomous Maintenance
Anand Subramaniam
 
TPM: Autonomous Maintenance
TPM: Autonomous MaintenanceTPM: Autonomous Maintenance
TPM: Autonomous Maintenance
Operational Excellence Consulting
 
Autonomous maintenance ppt
Autonomous maintenance pptAutonomous maintenance ppt
Autonomous maintenance ppt
Suraj1911
 
Muri-Mura-Muda
Muri-Mura-MudaMuri-Mura-Muda
Muri-Mura-Muda
Quality Steps Info
 
Tpmjishu
TpmjishuTpmjishu
Tpmjishu
Jitesh Gaurav
 
TPM for lean manufacturing chp4 step of “jlshu hozen “activities
TPM for lean manufacturing  chp4 step of “jlshu hozen “activitiesTPM for lean manufacturing  chp4 step of “jlshu hozen “activities
TPM for lean manufacturing chp4 step of “jlshu hozen “activities
博行 門眞
 
6 OEE overview
6 OEE overview6 OEE overview
6 OEE overviewWebseology
 
Dwm overview
Dwm overviewDwm overview
Dwm overview
Natarajan Gopalaswamy
 
TPM Overview and AM Workshop
TPM Overview and AM WorkshopTPM Overview and AM Workshop
TPM Overview and AM Workshop
John Gabrielle Cabradilla
 
Total productive maintenance(TPM)
Total productive maintenance(TPM)Total productive maintenance(TPM)
Total productive maintenance(TPM)
Md.Muzahid Khan
 
Tpm
TpmTpm
Jishu hozen
Jishu  hozenJishu  hozen
Jishu hozen
QC Labs
 

What's hot (20)

TPM Activity
TPM ActivityTPM Activity
TPM Activity
 
Muda,Mura & Muri
Muda,Mura & MuriMuda,Mura & Muri
Muda,Mura & Muri
 
16 major losses tng
16 major losses tng16 major losses tng
16 major losses tng
 
TPM - tech talk
TPM - tech talk TPM - tech talk
TPM - tech talk
 
tpm information board.pptx
tpm information board.pptxtpm information board.pptx
tpm information board.pptx
 
Why Total Productive Maintenance (TPM) In Process Industry? - ADDVALUE - Nile...
Why Total Productive Maintenance (TPM) In Process Industry? - ADDVALUE - Nile...Why Total Productive Maintenance (TPM) In Process Industry? - ADDVALUE - Nile...
Why Total Productive Maintenance (TPM) In Process Industry? - ADDVALUE - Nile...
 
[Partial Preview:] Waste-finding Checklists for Manufacturing Companies
[Partial Preview:] Waste-finding Checklists for Manufacturing Companies[Partial Preview:] Waste-finding Checklists for Manufacturing Companies
[Partial Preview:] Waste-finding Checklists for Manufacturing Companies
 
Preventive / Autonomous Maintenance - Beginners Guide - Tamil
Preventive / Autonomous Maintenance - Beginners Guide - TamilPreventive / Autonomous Maintenance - Beginners Guide - Tamil
Preventive / Autonomous Maintenance - Beginners Guide - Tamil
 
Autonomous Maintenance
Autonomous MaintenanceAutonomous Maintenance
Autonomous Maintenance
 
TPM: Autonomous Maintenance
TPM: Autonomous MaintenanceTPM: Autonomous Maintenance
TPM: Autonomous Maintenance
 
Autonomous maintenance ppt
Autonomous maintenance pptAutonomous maintenance ppt
Autonomous maintenance ppt
 
Muri-Mura-Muda
Muri-Mura-MudaMuri-Mura-Muda
Muri-Mura-Muda
 
Tpmjishu
TpmjishuTpmjishu
Tpmjishu
 
TPM for lean manufacturing chp4 step of “jlshu hozen “activities
TPM for lean manufacturing  chp4 step of “jlshu hozen “activitiesTPM for lean manufacturing  chp4 step of “jlshu hozen “activities
TPM for lean manufacturing chp4 step of “jlshu hozen “activities
 
6 OEE overview
6 OEE overview6 OEE overview
6 OEE overview
 
Dwm overview
Dwm overviewDwm overview
Dwm overview
 
TPM Overview and AM Workshop
TPM Overview and AM WorkshopTPM Overview and AM Workshop
TPM Overview and AM Workshop
 
Total productive maintenance(TPM)
Total productive maintenance(TPM)Total productive maintenance(TPM)
Total productive maintenance(TPM)
 
Tpm
TpmTpm
Tpm
 
Jishu hozen
Jishu  hozenJishu  hozen
Jishu hozen
 

Viewers also liked

Reliability - Availability
Reliability -  AvailabilityReliability -  Availability
Reliability - Availability
Tom Jacyszyn
 
Basics in Maintenance
Basics in MaintenanceBasics in Maintenance
Basics in Maintenanceraghuttam
 
Dev and Ops Collaboration and Awareness at Etsy and Flickr
Dev and Ops Collaboration and Awareness at Etsy and FlickrDev and Ops Collaboration and Awareness at Etsy and Flickr
Dev and Ops Collaboration and Awareness at Etsy and Flickr
John Allspaw
 
Reliability engineering ppt-Internship
Reliability engineering ppt-InternshipReliability engineering ppt-Internship
Reliability engineering ppt-Internship
Turbo Energy Limited(a unit of TVS group)
 
mtbf
mtbfmtbf
mttr
mttrmttr
Misuses of MTBF
Misuses of MTBFMisuses of MTBF
Misuses of MTBF
Accendo Reliability
 
Metastability,MTBF,synchronizer & synchronizer failure
Metastability,MTBF,synchronizer & synchronizer failureMetastability,MTBF,synchronizer & synchronizer failure
Metastability,MTBF,synchronizer & synchronizer failureprashant singh
 
A Proposal for an Alternative to MTBF/MTTF
A Proposal for an Alternative to MTBF/MTTFA Proposal for an Alternative to MTBF/MTTF
A Proposal for an Alternative to MTBF/MTTF
ASQ Reliability Division
 
Reducing MTTR and False Escalations: Event Correlation at LinkedIn
Reducing MTTR and False Escalations: Event Correlation at LinkedInReducing MTTR and False Escalations: Event Correlation at LinkedIn
Reducing MTTR and False Escalations: Event Correlation at LinkedIn
Michael Kehoe
 
Principles of RF Microwave Power Measurement
Principles of RF Microwave Power MeasurementPrinciples of RF Microwave Power Measurement
Principles of RF Microwave Power MeasurementRobert Kirchhoefer
 
Rf power measurement
Rf power measurement Rf power measurement
Rf power measurement ruwaghmare
 
Alternatives to MTBF
Alternatives to MTBF Alternatives to MTBF
Alternatives to MTBF
Craig Hillman
 
Trapped by MTBF
Trapped by MTBFTrapped by MTBF
Trapped by MTBF
Accendo Reliability
 
پروژه ویدئو کنفرانس شرکت پارس حیات
پروژه ویدئو کنفرانس شرکت پارس حیاتپروژه ویدئو کنفرانس شرکت پارس حیات
پروژه ویدئو کنفرانس شرکت پارس حیات
شرکت مهندسی نوآوران تحقیق
 
راه اندازی ویدئو پروژکتور در قدیمی ترین دبیرستان تهران - دبیرستان علوی
راه اندازی ویدئو پروژکتور در قدیمی ترین دبیرستان تهران - دبیرستان علویراه اندازی ویدئو پروژکتور در قدیمی ترین دبیرستان تهران - دبیرستان علوی
راه اندازی ویدئو پروژکتور در قدیمی ترین دبیرستان تهران - دبیرستان علوی
شرکت مهندسی نوآوران تحقیق
 

Viewers also liked (20)

Reliability - Availability
Reliability -  AvailabilityReliability -  Availability
Reliability - Availability
 
MTTR
MTTRMTTR
MTTR
 
Basics in Maintenance
Basics in MaintenanceBasics in Maintenance
Basics in Maintenance
 
Dev and Ops Collaboration and Awareness at Etsy and Flickr
Dev and Ops Collaboration and Awareness at Etsy and FlickrDev and Ops Collaboration and Awareness at Etsy and Flickr
Dev and Ops Collaboration and Awareness at Etsy and Flickr
 
Reliability engineering ppt-Internship
Reliability engineering ppt-InternshipReliability engineering ppt-Internship
Reliability engineering ppt-Internship
 
mtbf
mtbfmtbf
mtbf
 
mttr
mttrmttr
mttr
 
Misuses of MTBF
Misuses of MTBFMisuses of MTBF
Misuses of MTBF
 
Metastability,MTBF,synchronizer & synchronizer failure
Metastability,MTBF,synchronizer & synchronizer failureMetastability,MTBF,synchronizer & synchronizer failure
Metastability,MTBF,synchronizer & synchronizer failure
 
Overview and Basic Maintenance
Overview and Basic MaintenanceOverview and Basic Maintenance
Overview and Basic Maintenance
 
A Proposal for an Alternative to MTBF/MTTF
A Proposal for an Alternative to MTBF/MTTFA Proposal for an Alternative to MTBF/MTTF
A Proposal for an Alternative to MTBF/MTTF
 
Reducing MTTR and False Escalations: Event Correlation at LinkedIn
Reducing MTTR and False Escalations: Event Correlation at LinkedInReducing MTTR and False Escalations: Event Correlation at LinkedIn
Reducing MTTR and False Escalations: Event Correlation at LinkedIn
 
Principles of RF Microwave Power Measurement
Principles of RF Microwave Power MeasurementPrinciples of RF Microwave Power Measurement
Principles of RF Microwave Power Measurement
 
Rf power measurement
Rf power measurement Rf power measurement
Rf power measurement
 
Alternatives to MTBF
Alternatives to MTBF Alternatives to MTBF
Alternatives to MTBF
 
Trapped by MTBF
Trapped by MTBFTrapped by MTBF
Trapped by MTBF
 
پروژه ویدئو کنفرانس شرکت پارس حیات
پروژه ویدئو کنفرانس شرکت پارس حیاتپروژه ویدئو کنفرانس شرکت پارس حیات
پروژه ویدئو کنفرانس شرکت پارس حیات
 
Ltx 2003 q1_kpi
Ltx 2003 q1_kpiLtx 2003 q1_kpi
Ltx 2003 q1_kpi
 
راه اندازی ویدئو پروژکتور در قدیمی ترین دبیرستان تهران - دبیرستان علوی
راه اندازی ویدئو پروژکتور در قدیمی ترین دبیرستان تهران - دبیرستان علویراه اندازی ویدئو پروژکتور در قدیمی ترین دبیرستان تهران - دبیرستان علوی
راه اندازی ویدئو پروژکتور در قدیمی ترین دبیرستان تهران - دبیرستان علوی
 
Sf6 gas properties
Sf6 gas propertiesSf6 gas properties
Sf6 gas properties
 

Similar to MTBF / MTTR - Energized Work TekTalk, Mar 2012

System Availability Talk
System Availability TalkSystem Availability Talk
System Availability Talk
m_richardson
 
Disaster Recovery with MySQL and Tungsten
Disaster Recovery with MySQL and TungstenDisaster Recovery with MySQL and Tungsten
Disaster Recovery with MySQL and TungstenJeff Mace
 
Specifics of Managing Large, Complex Projects
Specifics of Managing Large, Complex ProjectsSpecifics of Managing Large, Complex Projects
Specifics of Managing Large, Complex Projects
Jeremie Averous
 
Getting Started Developing with Platform as a Service
Getting Started Developing with Platform as a ServiceGetting Started Developing with Platform as a Service
Getting Started Developing with Platform as a Service
CloudBees
 
Large Complex Projects (PMI-MY presentation Sept 2012)
Large Complex Projects (PMI-MY presentation Sept 2012)Large Complex Projects (PMI-MY presentation Sept 2012)
Large Complex Projects (PMI-MY presentation Sept 2012)
Jeremie Averous
 
Paremus Cloud and OSGi Beyond the VM - OSGi Cloud Workshop March 2012
Paremus Cloud and OSGi Beyond the VM - OSGi Cloud Workshop March 2012Paremus Cloud and OSGi Beyond the VM - OSGi Cloud Workshop March 2012
Paremus Cloud and OSGi Beyond the VM - OSGi Cloud Workshop March 2012
mfrancis
 
How to Plan and Budget for 2013 with Cloud in Mind
How to Plan and Budget for 2013 with Cloud in MindHow to Plan and Budget for 2013 with Cloud in Mind
How to Plan and Budget for 2013 with Cloud in Mind
Bluelock
 
MySQL vs NoSQL
MySQL vs NoSQLMySQL vs NoSQL
MySQL vs NoSQL
Tiago Franco
 
2012 Annual State of the Union for Mobile Ecommerce Performance [Velocity EU]
2012 Annual State of the Union for Mobile Ecommerce Performance [Velocity EU]2012 Annual State of the Union for Mobile Ecommerce Performance [Velocity EU]
2012 Annual State of the Union for Mobile Ecommerce Performance [Velocity EU]
Strangeloop
 
JVM Multitenancy (JavaOne 2012)
JVM Multitenancy (JavaOne 2012)JVM Multitenancy (JavaOne 2012)
JVM Multitenancy (JavaOne 2012)
Graeme_IBM
 
Diving Deeper into DevOps Deployments
Diving Deeper into DevOps DeploymentsDiving Deeper into DevOps Deployments
Diving Deeper into DevOps Deployments
Jules Pierre-Louis
 
Building Agile Data Warehouses with Ralph Hughes
Building Agile Data Warehouses with Ralph HughesBuilding Agile Data Warehouses with Ralph Hughes
Building Agile Data Warehouses with Ralph Hughes
Kalido
 
Get Loose! Microservices and Loosely Coupled Architectures
Get Loose! Microservices and Loosely Coupled Architectures Get Loose! Microservices and Loosely Coupled Architectures
Get Loose! Microservices and Loosely Coupled Architectures
DevOps.com
 
Get Loose! Microservices and Loosely Coupled Architectures
Get Loose! Microservices and Loosely Coupled ArchitecturesGet Loose! Microservices and Loosely Coupled Architectures
Get Loose! Microservices and Loosely Coupled Architectures
Deborah Schalm
 
Virtual Worlds: A Future History
Virtual Worlds: A Future HistoryVirtual Worlds: A Future History
Virtual Worlds: A Future History
Robin Teigland
 
Why the Cloud matters for Encoding
Why the Cloud matters for EncodingWhy the Cloud matters for Encoding
Why the Cloud matters for EncodingBrightcove
 
Scaling mature systems
Scaling mature systemsScaling mature systems
Scaling mature systemsHanMorten
 
The 10 biggest metering and billing mistakes
The 10 biggest metering and billing mistakesThe 10 biggest metering and billing mistakes
The 10 biggest metering and billing mistakes
Flexiant
 
Micro frontends with react and redux dev day
Micro frontends with react and redux   dev dayMicro frontends with react and redux   dev day
Micro frontends with react and redux dev day
Prasanna Venkatesan
 
10 Do’s for DevOps!
 10 Do’s for DevOps!  10 Do’s for DevOps!
10 Do’s for DevOps!
DevOps.com
 

Similar to MTBF / MTTR - Energized Work TekTalk, Mar 2012 (20)

System Availability Talk
System Availability TalkSystem Availability Talk
System Availability Talk
 
Disaster Recovery with MySQL and Tungsten
Disaster Recovery with MySQL and TungstenDisaster Recovery with MySQL and Tungsten
Disaster Recovery with MySQL and Tungsten
 
Specifics of Managing Large, Complex Projects
Specifics of Managing Large, Complex ProjectsSpecifics of Managing Large, Complex Projects
Specifics of Managing Large, Complex Projects
 
Getting Started Developing with Platform as a Service
Getting Started Developing with Platform as a ServiceGetting Started Developing with Platform as a Service
Getting Started Developing with Platform as a Service
 
Large Complex Projects (PMI-MY presentation Sept 2012)
Large Complex Projects (PMI-MY presentation Sept 2012)Large Complex Projects (PMI-MY presentation Sept 2012)
Large Complex Projects (PMI-MY presentation Sept 2012)
 
Paremus Cloud and OSGi Beyond the VM - OSGi Cloud Workshop March 2012
Paremus Cloud and OSGi Beyond the VM - OSGi Cloud Workshop March 2012Paremus Cloud and OSGi Beyond the VM - OSGi Cloud Workshop March 2012
Paremus Cloud and OSGi Beyond the VM - OSGi Cloud Workshop March 2012
 
How to Plan and Budget for 2013 with Cloud in Mind
How to Plan and Budget for 2013 with Cloud in MindHow to Plan and Budget for 2013 with Cloud in Mind
How to Plan and Budget for 2013 with Cloud in Mind
 
MySQL vs NoSQL
MySQL vs NoSQLMySQL vs NoSQL
MySQL vs NoSQL
 
2012 Annual State of the Union for Mobile Ecommerce Performance [Velocity EU]
2012 Annual State of the Union for Mobile Ecommerce Performance [Velocity EU]2012 Annual State of the Union for Mobile Ecommerce Performance [Velocity EU]
2012 Annual State of the Union for Mobile Ecommerce Performance [Velocity EU]
 
JVM Multitenancy (JavaOne 2012)
JVM Multitenancy (JavaOne 2012)JVM Multitenancy (JavaOne 2012)
JVM Multitenancy (JavaOne 2012)
 
Diving Deeper into DevOps Deployments
Diving Deeper into DevOps DeploymentsDiving Deeper into DevOps Deployments
Diving Deeper into DevOps Deployments
 
Building Agile Data Warehouses with Ralph Hughes
Building Agile Data Warehouses with Ralph HughesBuilding Agile Data Warehouses with Ralph Hughes
Building Agile Data Warehouses with Ralph Hughes
 
Get Loose! Microservices and Loosely Coupled Architectures
Get Loose! Microservices and Loosely Coupled Architectures Get Loose! Microservices and Loosely Coupled Architectures
Get Loose! Microservices and Loosely Coupled Architectures
 
Get Loose! Microservices and Loosely Coupled Architectures
Get Loose! Microservices and Loosely Coupled ArchitecturesGet Loose! Microservices and Loosely Coupled Architectures
Get Loose! Microservices and Loosely Coupled Architectures
 
Virtual Worlds: A Future History
Virtual Worlds: A Future HistoryVirtual Worlds: A Future History
Virtual Worlds: A Future History
 
Why the Cloud matters for Encoding
Why the Cloud matters for EncodingWhy the Cloud matters for Encoding
Why the Cloud matters for Encoding
 
Scaling mature systems
Scaling mature systemsScaling mature systems
Scaling mature systems
 
The 10 biggest metering and billing mistakes
The 10 biggest metering and billing mistakesThe 10 biggest metering and billing mistakes
The 10 biggest metering and billing mistakes
 
Micro frontends with react and redux dev day
Micro frontends with react and redux   dev dayMicro frontends with react and redux   dev day
Micro frontends with react and redux dev day
 
10 Do’s for DevOps!
 10 Do’s for DevOps!  10 Do’s for DevOps!
10 Do’s for DevOps!
 

More from Energized Work

Agile Practitioners Feedback to improve teams
Agile Practitioners Feedback to improve teamsAgile Practitioners Feedback to improve teams
Agile Practitioners Feedback to improve teams
Energized Work
 
Surviving SOA
Surviving SOASurviving SOA
Surviving SOA
Energized Work
 
Experience report on agile tools for management teams
Experience report on agile tools for management teamsExperience report on agile tools for management teams
Experience report on agile tools for management teamsEnergized Work
 
Innovation Governance
Innovation GovernanceInnovation Governance
Innovation Governance
Energized Work
 
Business model innovation by experimentation
Business model innovation by experimentationBusiness model innovation by experimentation
Business model innovation by experimentation
Energized Work
 
Debugging Grails Database Performance
Debugging Grails Database PerformanceDebugging Grails Database Performance
Debugging Grails Database PerformanceEnergized Work
 
Governance - Friend or Foe?
Governance - Friend or Foe?Governance - Friend or Foe?
Governance - Friend or Foe?
Energized Work
 
Energized Work: Software that means business
Energized Work: Software that means businessEnergized Work: Software that means business
Energized Work: Software that means businessEnergized Work
 
Product Development in the Land of the Free - Energized Work Presentation
Product Development in the Land of the Free - Energized Work PresentationProduct Development in the Land of the Free - Energized Work Presentation
Product Development in the Land of the Free - Energized Work Presentation
Energized Work
 
Leaning - Energized Work Presentation
Leaning - Energized Work PresentationLeaning - Energized Work Presentation
Leaning - Energized Work PresentationEnergized Work
 
Concept to Cash - Energized Work Presentation
Concept to Cash - Energized Work PresentationConcept to Cash - Energized Work Presentation
Concept to Cash - Energized Work Presentation
Energized Work
 

More from Energized Work (11)

Agile Practitioners Feedback to improve teams
Agile Practitioners Feedback to improve teamsAgile Practitioners Feedback to improve teams
Agile Practitioners Feedback to improve teams
 
Surviving SOA
Surviving SOASurviving SOA
Surviving SOA
 
Experience report on agile tools for management teams
Experience report on agile tools for management teamsExperience report on agile tools for management teams
Experience report on agile tools for management teams
 
Innovation Governance
Innovation GovernanceInnovation Governance
Innovation Governance
 
Business model innovation by experimentation
Business model innovation by experimentationBusiness model innovation by experimentation
Business model innovation by experimentation
 
Debugging Grails Database Performance
Debugging Grails Database PerformanceDebugging Grails Database Performance
Debugging Grails Database Performance
 
Governance - Friend or Foe?
Governance - Friend or Foe?Governance - Friend or Foe?
Governance - Friend or Foe?
 
Energized Work: Software that means business
Energized Work: Software that means businessEnergized Work: Software that means business
Energized Work: Software that means business
 
Product Development in the Land of the Free - Energized Work Presentation
Product Development in the Land of the Free - Energized Work PresentationProduct Development in the Land of the Free - Energized Work Presentation
Product Development in the Land of the Free - Energized Work Presentation
 
Leaning - Energized Work Presentation
Leaning - Energized Work PresentationLeaning - Energized Work Presentation
Leaning - Energized Work Presentation
 
Concept to Cash - Energized Work Presentation
Concept to Cash - Energized Work PresentationConcept to Cash - Energized Work Presentation
Concept to Cash - Energized Work Presentation
 

Recently uploaded

Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Jeffrey Haguewood
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
Frank van Harmelen
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
DianaGray10
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Tobias Schneck
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
g2nightmarescribd
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Product School
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 

Recently uploaded (20)

Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 

MTBF / MTTR - Energized Work TekTalk, Mar 2012

  • 1. MTBF / MTTR Availability or recoverability? Presented by Michael Richardson, Energized Work 21 March 2012 ENERGIZED WORK 25 MACKLIN STREET LONDON WC2B 5NN +44 (0)20 7691 8933 WWW.ENERGIZEDWORK.COM
  • 2. Michael Richardson Twitter: @mr_spb Email: michael@energizedwork.com #ewtektalk © 2012 Energized Work - www.energizedwork.com 2
  • 3. So what is high availability? •  Five nines? •  No single point of failures? •  Multiple data centres? •  Fault tolerance? •  Load balancing? •  Uptime? © 2012 Energized Work - www.energizedwork.com 3
  • 4. Nines of availability 9 9 9 9 9 9 9 9 © 2012 Energized Work - www.energizedwork.com 4
  • 5. Nines of availability Availability Downtime per Year One nine (90%) 36.5 days Two nines (99%) 3.65 days Three nines (99.9%) 8.76 hours Four nines (99.99%) 52.56 minutes Five nines (99.999%) 5.26 minutes © 2012 Energized Work - www.energizedwork.com 5
  • 6. Problem with the nines •  What do they mean? •  Guaranteed or just an SLA? •  Multiplicity (99.9% * 99.9% * 99.9% = 99.7%) © 2012 Energized Work - www.energizedwork.com 6
  • 7. SLA availability numbers just aim to provide a level of confidence in a website’s service © 2012 Energized Work - www.energizedwork.com 7
  • 8. No single point of failure (SPOF) © 2012 Energized Work - www.energizedwork.com 8
  • 9. Two of everything? © 2012 Energized Work - www.energizedwork.com 9
  • 10. Start with this Users Index.html © 2012 Energized Work - www.energizedwork.com 10
  • 11. End with this Users Firewall 1 Firewall 2 Switch 1 Switch 2 WEB1 WEB2 APP1 APP2 DB1 DB2 © 2012 Energized Work - www.energizedwork.com 11
  • 12. Problems with eliminating SPOF •  It’s expensive •  Where do you draw the line? •  Are failures independent? •  Can you guarantee no SPOF? •  Increased complexity © 2012 Energized Work - www.energizedwork.com 12
  • 13. Problem: Data centres fail © 2012 Energized Work - www.energizedwork.com 13
  • 14. Solution: Get a second data centre © 2012 Energized Work - www.energizedwork.com 14
  • 15. Hot – Hot multisite •  Full range of services available in multiple locations •  Easy to automate failover of sites •  Data consistency is hard •  Capacity planning concerns + © 2012 Energized Work - www.energizedwork.com 15
  • 16. Hot – Warm multisite •  Simpler than hot – hot •  Read / Write ratio dependent •  Synchronously or asynchronously replicate data? + © 2012 Energized Work - www.energizedwork.com 16
  • 17. Hot – Cold multisite •  Easy to setup •  Will it work? •  Can it be trusted? •  Cold site rapidly becomes stale •  Is it actually valuable? + © 2012 Energized Work - www.energizedwork.com 17
  • 18. DR multisite •  Fingers crossed you never need it •  How can / should you test it? •  Cloud? + © 2012 Energized Work - www.energizedwork.com 18
  • 19. Problems with multiple sites •  It’s expensive •  Managing more systems •  Managing data consistency •  Managing capacity •  Is it still fail proof? •  Unless you test it, it’s just a plan © 2012 Energized Work - www.energizedwork.com 19
  • 20. We now have a complex system © 2012 Energized Work - www.energizedwork.com 20
  • 21. Complex systems •  More redundancy and automation leads to more complexity •  More complexity often adds more points of failure © 2012 Energized Work - www.energizedwork.com 21
  • 22. How complex systems fail - Dr. Richard Cook •  Catastrophe is always just around the corner •  Human operators have dual roles •  Change introduces new forms of failure © 2012 Energized Work - www.energizedwork.com 22
  • 23. Failure and recovery © 2012 Energized Work - www.energizedwork.com 23
  • 24. Questions for the business •  What is the cost of downtime? •  What are the Recovery Time Objectives (RTO) •  What are the Recovery Point Objectives (RPO)? © 2012 Energized Work - www.energizedwork.com 24
  • 25. Aggressive RTO and RPO are expensive and have a performance impact © 2012 Energized Work - www.energizedwork.com 25
  • 26. RTO / RPO example Problem: •  Simple DB •  Business can tolerate up to 15 minutes downtime •  10-minute window of data loss © 2012 Energized Work - www.energizedwork.com 26
  • 27. RTO / RPO example Possible solution: •  Continuously replicate data to second host •  Continue with nightly backups and also copy DB transaction logs from the primary host to another system © 2012 Energized Work - www.energizedwork.com 27
  • 28. So what is more important – increasing availability or reducing recovery time? © 2012 Energized Work - www.energizedwork.com 28
  • 29. MTBF or MTTR? What about MTTD? © 2012 Energized Work - www.energizedwork.com 29
  • 30. The answer is: It depends © 2012 Energized Work - www.energizedwork.com 30
  • 31. Failure is inevitable © 2012 Energized Work - www.energizedwork.com 31
  • 32. Ask anyone © 2012 Energized Work - www.energizedwork.com 32
  • 33. License This presentation is provided under the Creative Commons Attribution Share Alike 3.0 Unported License. You are free: To share – to copy, distribute and transmit the work To remix – to adapt the work Under the following conditions: Attribution – You must attribute the work in the manner specified by Energized Work (but not in any way that suggests that Energized Work endorse you or your use of the work). Share Alike – If you alter, transform, or build upon this work, you may distribute the resulting work only under the same or similar license to this one. ENERGIZED WORK 25 MACKLIN STREET LONDON WC2B 5NN +44 (0)20 7691 8933 © 2012 Energized Work - www.energizedwork.com WWW.ENERGIZEDWORK.COM 33