The document discusses the ascending levels of disaster recovery (DR) options on AWS: backup and restore, pilot light, warm standby, and multi-site. It gives an overview of each option, including suitable use cases, architecture, cost, and considerations, and summarizes the RTO, RPO, and level of protection and availability that each provides. It also covers lessons learned from customer DR experiences on AWS.
2. Session Agenda
Context: on-premises Disaster Recovery (DR) using AWS
Why AWS for recovery of on-premises IT infrastructure
The ascending levels of DR
DR/Continuity scenarios
Demo
3. Business Continuity
Business Continuity ensures that an
organization's critical business functions
continue to operate or recover quickly
despite serious incidents.
Terminology
Disaster Recovery
Disaster Recovery (DR) enables the
recovery or continuation of vital technology
infrastructure and systems following a
natural or human-induced disaster.
Recovery Time Objective
RTO is the targeted duration within which a
business process must be restored after a
disaster or disruption.
Recovery Point Objective
RPO is the maximum targeted period in
which data might be lost from an IT
service due to a major incident.
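The two definitions can be turned into a simple check. The sketch below is illustrative only: the function name, the nightly backup schedule, and the 6-hour restore time are hypothetical, and it assumes that worst-case data loss equals the interval between backups.

```python
from datetime import timedelta


def meets_objectives(backup_interval, restore_duration, rpo, rto):
    """Return (rpo_ok, rto_ok) for a simple periodic-backup setup.

    Assumption: worst-case data loss equals the time between two
    backups, and recovery time equals the measured restore duration.
    """
    rpo_ok = backup_interval <= rpo
    rto_ok = restore_duration <= rto
    return rpo_ok, rto_ok


# Hypothetical figures: nightly backups, a 6-hour restore, and
# targets of RPO = 24 hours and RTO = 4 hours.
result = meets_objectives(
    backup_interval=timedelta(hours=24),
    restore_duration=timedelta(hours=6),
    rpo=timedelta(hours=24),
    rto=timedelta(hours=4),
)
print(result)  # (True, False)
```

In this example the backup schedule satisfies the RPO, but the restore takes longer than the RTO allows, so a faster recovery approach would be needed.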
6. History of DR
There have been many challenges for traditional DR
for enterprises
Building and maintaining regional data centers
Failed DR tests
Not meeting RPO & RTO
High technical debt
7. AWS compared to traditional disaster recovery
Conventional
High cost to build disaster recovery sites or data centers (CAPEX)
High cost of storage, backup, archival and retrieval tools, and processes (OPEX)
Difficult planning, procurement and deployment
Challenging to verify DR plans
Single level of DR across the organization
AWS
Low-cost upfront investment (CAPEX)
On-demand costs (OPEX)
Consistent experience across AWS environments
Recovery automation
Separate levels of DR per application or business unit
8. DR topology map
Mapping from your data centers to AWS:
Load Balancers -> ELB/Appliance
Web/App Servers -> EC2/Auto Scaling
DNS -> Route 53
Database Servers -> DB failover nodes
AD/Authentication -> AD failover nodes
Data Centers -> Availability Zones
Disaster Recovery -> Multi-region
9. Ascending levels of DR options
Backup & Restore
Backup of on-premises data to AWS to use in a DR event
RPO / RTO: 24 hours / 24 hours
Cost: $
Pilot Light
Replicate data and minimal running services into AWS, ready to take over and flare up
RPO / RTO: 12 hours / 4 hours
Cost: $$
Warm Standby
Replicate data and services into AWS, ready to take over
RPO / RTO: 1-4 hours / 15 min
Cost: $$$
Multi-Site
Replicated and load balanced environments that are both actively taking production traffic
RPO / RTO: <15 min / 0-5 min
Cost: $$$$
Scale: from "Business continuity begins" (Backup & Restore) to "Uninterrupted business continuity" (Multi-Site)
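One way to use the figures on this slide is to pick the cheapest level that still meets a given requirement. The sketch below is an illustration, not part of the original deck. It assumes that the first figure for each level is the RPO and the second is the RTO, it represents ranges by their upper bound, and the function name is hypothetical.

```python
from datetime import timedelta

# (name, worst-case RPO, worst-case RTO, relative cost in "$" units)
# Assumptions: the first figure on the slide is the RPO, the second
# is the RTO, and ranges such as "1-4 hours" use their upper bound.
DR_LEVELS = [
    ("Backup & Restore", timedelta(hours=24), timedelta(hours=24), 1),
    ("Pilot Light", timedelta(hours=12), timedelta(hours=4), 2),
    ("Warm Standby", timedelta(hours=4), timedelta(minutes=15), 3),
    ("Multi-Site", timedelta(minutes=15), timedelta(minutes=5), 4),
]


def cheapest_level(required_rpo, required_rto):
    """Return the lowest-cost level whose RPO and RTO both fit, or None."""
    for name, rpo, rto, _cost in sorted(DR_LEVELS, key=lambda lvl: lvl[3]):
        if rpo <= required_rpo and rto <= required_rto:
            return name
    return None


print(cheapest_level(timedelta(hours=12), timedelta(hours=6)))     # Pilot Light
print(cheapest_level(timedelta(hours=1), timedelta(minutes=10)))   # Multi-Site
print(cheapest_level(timedelta(minutes=1), timedelta(minutes=1)))  # None
```

Because each application or business unit can have its own requirement, the same lookup can be run per workload rather than once for the whole organization.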
10. Services added through the levels of DR
Levels: Backup & Restore, Pilot Light, Warm Standby, Multi-Site
Storage: S3, Storage Gateway, Glacier, EBS Volumes
Networking: Route 53, Direct Connect, VPN, VPC, Multiple Direct Connect locations
Compute: EC2, ELB, Auto Scaling
Deployment/Management: CloudFormation, IAM
11. Backup and restore architecture
~USD $218 / Month in AP-SOUTHEAST-2 + VPN
Diagram components:
Internet traffic: www.example.com
Corporate data center (On-premises Active Production): Web Servers, App Servers, DB Server, 1TB Data Volume, Backup System, Storage Gateway (iSCSI)
AWS region (AWS DR failover): S3 / Bucket, S3 / Bucket, Glacier / Archive
Link: VPN Connection
Monthly cost breakdown:
S3 (1TB): $34/Month
S3 (1TB): $34/Month
Glacier (2TB): $25/Month
Storage Gateway: $125/Month
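The headline figure for this slide can be reproduced from its line items. The short sketch below sums them. It assumes that the two S3 (1TB) entries are separately billed items, which is consistent with the stated total of about $218.

```python
# Monthly line items as listed on the slide (USD, AP-SOUTHEAST-2).
# Assumption: the two S3 (1TB) entries are separate billed items.
backup_restore_items = [
    ("S3 (1TB)", 34),
    ("S3 (1TB)", 34),
    ("Glacier (2TB)", 25),
    ("Storage Gateway", 125),
]

total = sum(cost for _name, cost in backup_restore_items)
print(f"Backup and restore: ~${total}/month")  # Backup and restore: ~$218/month
```

The VPN is listed as an extra ("+VPN") and is not included in the sum.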
12. Backup and restore details
Suitable for:
• Solutions that can sustain higher technical debt
• Lower business-critical nature
• Low-cost DR option
Leverage existing investments in
• De-duplication
• Compression
• WAN Acceleration
13. Pilot light architecture
Diagram components:
Route 53: www.example.com
Corporate data center (On-premises Active Production): Web Servers, App Servers, DB Server, 1 TB Data Volume
AWS region (AWS Active Production): DB Server, 1TB Data Volume
Link: Direct Connect, Data Replication
14. Pilot light architecture
USD $373 / Month in AP-SOUTHEAST-2 + Direct Connect
Diagram components:
Route 53: www.example.com
Corporate data center (On-premises Active Production): Web Servers, App Servers, DB Server, 1 TB Data Volume
AWS region (AWS Active Production): ELB, Web Servers, App Servers, DB Server, 1TB Data Volume
Link: Direct Connect, Data Replication
Monthly cost breakdown:
EBS (GP2): $123/Month
EC2 (m4.xlarge): $246/Month
EC2 (t2.medium): $0/Month
ELB (100GB Data): $0/Month
EC2 (t2.small): $0/Month
ELB (100GB Data): $0/Month
R53 (1M Query): $4/Month
CloudFormation (no cost listed)
15. Pilot light details
Considerations
Suitable for:
Solutions that need lower RTO & RPO
Higher business-critical nature
Mid-range cost DR option
3rd Party & Marketplace
CloudEndure
Racemi
Zerto
Others
16. Warm standby architecture
USD $483 / Month in AP-SOUTHEAST-2 + Direct Connect
Diagram components:
Route 53: www.example.com
Corporate data center (On-premises Active Production): Web Servers, App Servers, DB Server, 1 TB Data Volume
AWS region (AWS Active Production): ELB, Web Servers, App Servers, DB Server, 1TB Data Volume
Link: Direct Connect, Data Replication
Monthly cost breakdown:
EBS (GP2): $123/Month
EC2 (m4.xlarge): $246/Month
EC2 (t2.medium): $59/Month
ELB (100GB Data): $11/Month
EC2 (t2.small): $29/Month
ELB (100GB Data): $11/Month
R53 (1M Query): $4/Month
CloudFormation (no cost listed)
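The diagram places Route 53 in front of both the on-premises and the AWS environment for www.example.com. One possible way to switch traffic in a DR event is Route 53 failover routing; the slides do not say which routing policy is used, so this is an assumption. The sketch below only builds the request payload in the shape expected by the Route 53 ChangeResourceRecordSets API. The helper name, IP addresses, set identifiers, and health check ID are hypothetical placeholders, and plain A records are used for simplicity where a real setup behind an ELB would more likely use alias records.

```python
def failover_change_batch(name, primary_ip, secondary_ip, health_check_id):
    """Build a ChangeBatch with a PRIMARY and a SECONDARY failover A record."""

    def record(role, ip, set_id, check_id=None):
        rrset = {
            "Name": name,
            "Type": "A",
            "SetIdentifier": set_id,
            "Failover": role,  # "PRIMARY" or "SECONDARY"
            "TTL": 60,
            "ResourceRecords": [{"Value": ip}],
        }
        if check_id is not None:
            # The primary record is tied to a health check so that
            # Route 53 can answer with the secondary when it fails.
            rrset["HealthCheckId"] = check_id
        return {"Action": "UPSERT", "ResourceRecordSet": rrset}

    return {
        "Comment": "DR failover records (illustrative)",
        "Changes": [
            record("PRIMARY", primary_ip, "on-premises", health_check_id),
            record("SECONDARY", secondary_ip, "aws-standby"),
        ],
    }


batch = failover_change_batch(
    name="www.example.com.",
    primary_ip="192.0.2.10",       # placeholder (documentation address range)
    secondary_ip="198.51.100.10",  # placeholder (documentation address range)
    health_check_id="hypothetical-health-check-id",
)
# With boto3, this dict could be passed as the ChangeBatch argument of
# boto3.client("route53").change_resource_record_sets(HostedZoneId=..., ChangeBatch=batch)
```

Keeping the payload construction separate from the API call makes it possible to review or test the records without AWS credentials.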
17. Multi-site architecture
USD $571 / Month in AP-SOUTHEAST-2 + Direct Connect
Diagram components:
Route 53: www.example.com
Corporate data center (On-premises Active Production): Web Servers, App Servers, DB Server, 1 TB Data Volume
AWS region (AWS Active Production): ELB, Web Servers, App Servers, DB Server, 1TB Data Volume
Link: Direct Connect, Data Replication
Monthly cost breakdown:
EBS (GP2): $123/Month
EC2 (m4.xlarge): $246/Month
EC2 (t2.medium): $118/Month
ELB (100GB Data): $11/Month
EC2 (t2.small): $58/Month
ELB (100GB Data): $11/Month
R53 (1M Query): $4/Month
CloudFormation (no cost listed)
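The pilot light, warm standby, and multi-site slides list the same line items at different prices, so the totals and the differences between levels can be checked directly. The sketch below uses the slide figures as given. The labels "ELB #1" and "ELB #2" are introduced here only to tell the two "ELB (100GB Data)" entries apart, and the helper name is hypothetical.

```python
# Monthly line items (USD, AP-SOUTHEAST-2) in the order the slides list them.
# "ELB #1" and "ELB #2" are labels added here for the two ELB entries.
ITEMS = ["EBS (GP2)", "EC2 (m4.xlarge)", "EC2 (t2.medium)", "ELB #1",
         "EC2 (t2.small)", "ELB #2", "R53 (1M Query)"]
COSTS = {
    "Pilot Light":  [123, 246, 0, 0, 0, 0, 4],
    "Warm Standby": [123, 246, 59, 11, 29, 11, 4],
    "Multi-Site":   [123, 246, 118, 11, 58, 11, 4],
}

totals = {level: sum(costs) for level, costs in COSTS.items()}
print(totals)  # {'Pilot Light': 373, 'Warm Standby': 483, 'Multi-Site': 571}


def changed_items(a, b):
    """Return the line items whose cost differs between levels a and b."""
    return {item: (x, y)
            for item, x, y in zip(ITEMS, COSTS[a], COSTS[b]) if x != y}


print(changed_items("Warm Standby", "Multi-Site"))
# {'EC2 (t2.medium)': (59, 118), 'EC2 (t2.small)': (29, 58)}
```

The sums match the headline figures on the three slides. Moving from warm standby to multi-site changes only the two smaller EC2 line items, each of which doubles, while storage, the m4.xlarge instance, the load balancers, and DNS stay the same.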
18. Warm standby and multi-site details
Considerations
Suitable for:
Solutions that require RTO & RPO in minutes
Core business-critical functions
Higher cost DR option
Partners
Partner ecosystem
19. Lessons Learned
3rd Party solutions
Partner engagement
Opportunity to automate technical debt
Customer experiences