This document presents a maturity model to assess the readiness of an organization's IT disaster recovery program. It defines five levels of maturity for IT disaster recovery from initial/ad hoc to optimized/resilient. The model evaluates three components: strategy, implementation, and exercise/maintenance. Each component contains capabilities used to measure the organization's maturity level. The model can be used to evaluate risk, develop roadmaps, and identify areas for improvement. Results can be presented using milestone, dashboard, or per-component models to visualize the current status and guide further development of the disaster recovery program.
2. Maturity Model Purpose
īĩ The purpose of this model is to help the reader in evaluating the current
status of his/her Disaster Recovery program.
īĩ Beside that, below sample use-cases for this maturity model:
īĩ Evaluate the current Risk level on business continuity in event of disaster.
īĩ Anticipate a road-map or desire state for IT Disaster Recovery planning.
īĩ business case input for Disaster Recovery initiative.
īĩ Driver for Integration plan for multiple enterprise wide capabilities into
Disaster recovery planning.
īĩ Use as scale to evaluate multiple capabilities or line of businesses against
each other in term of disaster recovery readiness.
3. Maturity levels Definition
Level Definition Description
1 Initial or Ad hoc minimal or ineffective recovery capabilities in-place.
2 Established or
Reactive
disaster recovery strategy in-place but lack proper capabilities
and handled on best-efforts bases.
3 Prepared or business
enabled
disaster recovery strategy in-place and align with business
demands.
4 Managed or
Proactive
disaster recovery managed as program where dedicated team
manage/maintain/validate various components of disaster
recovery program.
5 Optimized or
resilient
disaster recovery program complement & seamlessly
integrate with various enterprise wide practices (IT/none-IT)
and address growth requirement.
This Model has five levels of readiness or maturity for Disaster Recovery âDRâ capabilities,
where level 0 âNon-existentâ removed due to easiness of determining the same âlack of
recovery strategy or capabilitiesâ.
4. Maturity Model Structure
īĩ I have structured this model into three (3) main ITDR components (21 ITDR
capabilities), each of them measure different aspects of DR capabilities:
īĩ ITDR Strategy (measure stakeholder engagement âreflected as Interest & Powerâ
in ITDR)
īĩ ITDR Implementation (measure IT alignment with business demands)
īĩ ITDR Exercise & Maintenance (measure the effectiveness âreflected on Success
rate & Credibility/Confidence levelâ of ITDR program)
īĩ The outcome of overall review of these three ITDR components, will be
Maturity/Readiness level for Enterpriseâs recovery capabilities in term of:
īĩ Recovery likelihood
īĩ Risk to business
5. Measuring Capabilities
īĩ For each ITDR component, set of measuring capabilities identified to help
differentiating different maturity level:
ITDR Strategy ITDR Implementation ITDR Exercise &
Maintenance
ITDR Plan Recovery site setup & connectivity Support/SLA for ITDR infrastructure
ITDR Processes & Procedures
(BIA/RA)
Data Backup/Restore & Replication Backup/Restore testing & validation
ITDR Team , Roles, & Responsibilities Configuration synchronization Infrastructure Monitoring
Backup/Recovery practice DR Facility & User connectivity ITDR Training
ITDR Budget & fund ITDR Run-book or Play-book ITDR Infrastructure/Application/OS
Patching Practice
ITDR integration with Enterprise wide
practices
ITDR Infrastructure ITDR testing exercise
ITDR Road-map ITDR Failover/Failback
Implementation
ITDR Audit , Report & document
review.
7. Level to capability Mapping
ITDRStrategy Level1 (Unaware) Level 2 (Neutral) Level 3 (Sponsor) Level 4 (Involved) Level 5 (Empower)
īˇ Basic Backup/Recovery
Procedure documented.
īˇ Lack of ITDR plan, processes
or procedures documented.
īˇ Lack of BIA/RA Performed to
identify IT Critical services &
its RTO/RPO demands .
īˇ Lack of ITDR Team structure
(best-efforts bases).
īˇ Lack of dedicated Budget.
īˇ Basic definition of critical
services documented (IT
point of view).
īˇ Backup/Recovery Procedure
documented.
īˇ One General Definition for
recovery strategy for all IT
services.
īˇ High-level or Basic ITDR Plan
documented (usually as part
of IT procedures).
īˇ Budget funded from other IT
initiatives.
īˇ Partial recovery team
structure, but roles
responsibilities not clearly
defined (lack of ownership).
īˇ Chaotic DR communication ,
declaration & escalation
structure ( lack of
governance structure)
īˇ Business RTO/RPO defined
(through Business Impact
Analysis âBIAâ).
īˇ Backup/Recovery Procedure
documented & align with
defined RTO/RPO.
īˇ ITDR Plan documented but
not reviewed or it is
reviewed but on best efforts
bases.
īˇ ITDR BIA/RA processes &
procedures documented.
īˇ Criticality-based recovery
strategy (ex. Replication for
critical services and restore
from backup for others).
īˇ ITDR recovery strategy cover
both Failover/Failback.
īˇ Lack of dedicated ITDR lead
identified but the âacting-
asâ defined.
īˇ Recovery team member
identified.
īˇ ITDR communication,
declaration & escalation
structure documented.
īˇ Level 3 (in addition to the
following).
īˇ ITDR Plan Reviewed in
Frequent bases.
īˇ KPI Defined for various IT DR
Plan components.
īˇ ITDR Road-Map defined.
īˇ ITDR Budget allocated.
īˇ Recovery Strategy Defined
Per-service.
īˇ ITDR Roles and
Responsibilities Defined.
īˇ ITDR process automation
(BIA/RA) in-place.
īˇ ITDR Plan cover full, partial
(multiple-services), &
service-level
Failover/Failback.
īˇ ITDR planned as part of IT
budget & infrastructure
capacity planning.
īˇ Level 4 (in addition to the
following).
īˇ ITDR integrated with change
management process.
īˇ ITDR Road-map integrated
with business plan/IT
strategy.
īˇ ITDR Processes and
procedures integrated in
enterprise architect as well
īˇ ITDR requirements collected
as part of business
requirements gathering for
any new IT initiative.
īˇ ITDR Integrated with both
business continuity plan
(BCP) and Crisis
Management Plan (CMP) for
the enterprise.
īˇ IT DR Roles &
Responsibilities integrated
as part of job description.
īˇ (in case of insurance) IT DR
integrated with Enterprise
insurance plan.
īˇ ITDR communication plan
align with enterprise
communication guidelines.
Indicator
Low (Power , Interest)
Stakeholder Engagement
High (Power, Interest )
9. Level to capability Mapping
ITDRImplementation
Level1 (Siloed) Level 2 (best-
efforts)
Level 3 Level 4 (Improve) Level 5 (Value
Creator)
īˇ Basic backup/restore
capability.
īˇ Cold recovery site
capabilities.
īˇ Lack of resiliency for DR site
connectivity.
īˇ Shared and/or Low
Bandwidth DR link.
īˇ Legacy or limited DR
infrastructure components
implement.
īˇ Lack of userâs connectivity
planned.
īˇ Advanced backup/restore
capability (restoration
testing capability in-place).
īˇ Basic data replication
capability in-place.
īˇ Warm recovery site
capabilities.
īˇ Active/standby connectivity
for DR site.
īˇ DR infrastructure
configuration is not up-to-
date (Manual Configuration
required in case of Disaster).
īˇ Recovery procedure
documented at high-level
(lack of run-book concept).
īˇ Most of userâs access
planned âin event of
disaster- based on remote
access (no dedicated DR
facility).
īˇ Both Replication and
Backup/restore capabilities
in-place.
īˇ Warm or hot recovery site
capabilities with highly
available connectivity.
īˇ Most of DR infrastructure
configuration up-to-date.
īˇ Per-service Disaster recovery
run-book in-place.
īˇ Both remote access and
onsite access planned (DR
Facility secured).
īˇ Technology support
recovery strategy in-place.
īˇ Technology implementation
cover both failover/failback
demands.
īˇ Both Main & Remote site
connectivity toward the
recovery site planned/
secured.
īˇ Level 3 (in addition to the
following).
īˇ Run-book maintained &
updated by owners.
īˇ DNS load-balancing
techniques (Global service
Load Balancing)
implemented for seamless
DR Failover.
īˇ Automated DR
Failover/Failback capability
in-place (software or
network based).
īˇ DR Implementation support
full, partial (multiple-
services), & service-level
Failover/Failback.
īˇ Level 4 (in addition to the
following).
īˇ Technology/capability to
align/sync configuration
from Main site to recovery
site in-place and support
real-time synchronization.
īˇ DR recovery site can expand
to permanent.
īˇ DR Run-book automated for
both failover/failback
īˇ capability to provide real-
time (RTO/RPO) calculation
for IT services.
īˇ capability to integrate/feed
companyâs BCP/CM platform
(in case of any).
īˇ Users Recovery facility used
for day to day operation by
set of actual users.
īˇ Tap backup/restore
capability implemented as
last restoration option.
īˇ Advanced user connectivity
techniques planned in case
of disaster (ex. VDI).
Indicator
Low (Alignment , Added Value)
IT alignment with business
High (Alignment , Added Value)
10. ITDR Exercise & Maintenance Maturity
LEVEL TO CAPABILITY MAPPING
11. Level to capability Mapping
ITDRExercise&Maintenance
Level1(Ineffective) Level 2 (Reactive) Level 3 (Effective) Level 4 (Proactive) Level 5 (Reliable)
īˇ Basic restoration testing
conducted (usually done by
IT and application owner not
in the picture).
īˇ Random testing cycle taken
place and cover only backup
restoration for critical
services.
īˇ Lack of monitoring for
backup jobs/disaster
recovery link/infrastructure.
īˇ Some DR infrastructure
components end-of-life/end-
of-support.
īˇ Backup Restoration
conducted for critical
services and application
owner engaged.
īˇ Support & maintenance
secured for DR
infrastructure.
īˇ Backup job monitoring in-
place, but lack DR
infrastructure monitoring.
īˇ Limited or Partial DR testing
exercise (usually due to
limitation DR infrastructure
capabilities or lack of
management approval).
īˇ DR testing taken place out of
working hours.
īˇ DR Application/OS patching
performed on best-efforts
bases.
īˇ Backup restoration testing
conducted in frequent bases.
īˇ Both Main & Recovery site
has same level of
monitoring.
īˇ Both Main & Recovery site
has same level of
Application/OS Patching.
īˇ DR testing include user-level
testing from DR facility.
īˇ DR testing perfumed in form
of partial testing (cover
business critical
applications) and cover both
failover/failback.
īˇ ITDR training program in-
place and individual aware
of their Roles &
Responsibilities.
īˇ Test results and lessons
learned captured within post
exercise report.
īˇ Conduct ITDR Training in
frequent bases.
īˇ Level 3 (in addition to the
following).
īˇ DR testing include run some
actual transactions from
recovery site.
īˇ Test results and lessons
learned used to update the
ITDR Plan as well as services
RTO/RPO values.
īˇ ITDR Plan, Policies &
Procedures reviewed in
frequent bases.
īˇ Different ITDR testing
exercises performed (full,
Partial, table-top &
simulation).
īˇ ITDR testing performed
during working hours.
īˇ Level 4 (in addition to the
following).
īˇ Unannounced ITDR testing
may take place during
business hours.
īˇ ITDR testing/exercise has a
defined KPI.
īˇ ITDR training is part of IT
employee induction
program.
īˇ Established Capability to
communicate DR individuals
Roles & Responsibility in
event of disaster
declaration.
īˇ Multiple DR communication
media capabilities.
īˇ ITDR focal participate
effectively in overall
enterprise BCP and CMP
exercise.
īˇ Regular audit performed for
ITDR program as well as
ITDR budget spending.
Indicator
Low (Success, Credible)
Program Effectiveness
High (Success, Credible)
15. Maturity level Calculation
īĩ To Provide Quantitative spin for the ITDR maturity readiness, ITDR measuring capabilities (refer
to slide 5) can be utilized for the same, where a weight assigned for each individual measuring
capability.
īĩ these weights used to calculate individual ITDR componentâs scoring, which in turn used to
calculate ITDR overall maturity scoring (as illustrated in next two slides).
Note: The assigned weight may differ depend on user point-of-view, however the overall weight distribution need to follow same logic (sum per ITDR component =100)
16. Maturity level Calculation (Cont.)
To add more granularity to the Maturity level calculation, another multiplier âcompleteness" drafted wherein
each ITDR capability evaluated/scored against the same Maturity levels used earlier (as shown below), then both
weight and completeness are used to calculate ITDR capability Score: Capability(n) Score = ((Capability(n)
Weight) X Completeness (%))
Then ITDR component scoring (ITDR Component Maturity) will be:
ITDR Component Score = Sum (Capability 1 score + Capability 2 score + âĻ + Capability 7 score)
17. Maturity level Calculation (Cont.)
Once individual ITDR component scored, ITDR overall Maturing scoring will be:
ITDR Maturity = (Sum of âITDR Strategy Capabilities Scoringâ + Sum of âITDR Implementation
capabilities Scoringâ + Sum of âITDR Exercise & Maintenance Capabilities Scoringâ )/3
Finally, we can map the maturity scoring to our 5 Risk levels as shown below:
19. ITDR Executive Presentation
īĩ There are a lot of methods to present the output(s) of any maturity model
depend on demands and audience, in coming slides I have added three models
for maturity profile representation:
īĩ Milestone or state-based Model (can be used for Road-Map presentation).
īĩ Maturity Dashboard (can be used to represent current state ITDR maturity).
īĩ Per-component Model (can be used to provide a quick visual gab analysis per ITDR
component as well as areas of improvement).
īĩ Other presentation method can be used as well to factor the risk level and
recovery likelihood, however In this presentation I am covering the previous
three only.
20. Milestone or state-based Model
2017
2018
2019
Green: available
Umber: Partially available
Red: Not available
23. â
â
Finally I would highlight that this effort is a human product,
and like any other human products, it is a reflection of
his/her creator experience & knowledge, and will not reach
perfection. So if you find it useful feel free to use it,
otherwise send me your inputs (my contact below).
Bashar Al-Khatib
Bashar_khatis83@yahoo.com
https://www.linkedin.com/in/bashar-alkhatib-59861518/