Arranging business continuity for IT applications requires different measures than for traditional on-premises IT infrastructure. In this presentation I go into detail on how to protect your application (on Amazon in this example) against operation, blue moon and epic events.
2. www.weolcan.eu . twitter.com/weolcan . Business Continuity in the Cloud Era 14-okt-16 2
About:
@BartMVeldhuis
bart.veldhuis@weolcan.eu
blog.weolcan.eu
Bart M. Veldhuis
• (Certified) Cloud Architect – Weolcan
• Founder Cloud Architect Alliance
3. www.weolcan.eu . twitter.com/weolcan . Business Continuity in the Cloud Era 14-okt-16 3
Two sides of the story
Cloud as a solution for
Business Continuity
The Business Continuity
issues with cloud
4. www.weolcan.eu . twitter.com/weolcan . Business Continuity in the Cloud Era 14-okt-16 4
IT INFRASTRUCTURE
WORKSPACEBUSINESS PROCESSES
Scope: IT infrastructure
5. www.weolcan.eu . twitter.com/weolcan . Business Continuity in the Cloud Era 14-okt-16 5
Different types of events
SEVERITY OF DISASTER
Operational events Blue Moon events Epic events
Eventtype
Continuity by Design
Exit Strategy
Engineer for portability
Measures
misconfigurations
control plane failures
component failures
storms
power outages
natural disaster
commercial opportunity
failure to contract
bankruptcy
6. www.weolcan.eu . twitter.com/weolcan . Business Continuity in the Cloud Era 14-okt-16 6
Continuity by design
Operational events
Auto-scaling
BA
Multi-zone
loadbalancing
Self-healing
architecture
7. www.weolcan.eu . twitter.com/weolcan . Business Continuity in the Cloud Era 14-okt-16 7
Setting up Auto scaling
Identify the
constraining
resources
Find a way to track
constraining resource
Configure alarms and
policies to trigger the
right scaling actions
1
2
3
8. www.weolcan.eu . twitter.com/weolcan . Business Continuity in the Cloud Era 14-okt-16 8
Scalable Application Tiering
Web Tier Application Tier DB Tier
Current
Compute
Capacity
Capacity added
only when
needed
User
9. www.weolcan.eu . twitter.com/weolcan . Business Continuity in the Cloud Era 14-okt-16 9
Continuity by design
Operational events
Auto-scaling
BA
Multi-zone
loadbalancing
Self-healing
architecture
12. www.weolcan.eu . twitter.com/weolcan . Business Continuity in the Cloud Era 14-okt-16 12
Continuity by design
Operational events
Auto-scaling
BA
Multi-zone
loadbalancing
Self-healing
architecture
13. www.weolcan.eu . twitter.com/weolcan . Business Continuity in the Cloud Era 14-okt-16 13
Five layers of self-healing infrastructure
Network & datacenter
Virtual Machine & OS
Middleware
Application & data
consistency
Security
Changes (patches, application upgrades,
configuration changes etc.) are only made by
modifying a versioned service definition and
updating running services instances to the new
definition version.
Policy based
Continuous Delivery
Multi-zone Auto scaling
& load balancing
Automated Configuration
management
Automated Configuration
management
Event Monitoring &
Reacting
14. www.weolcan.eu . twitter.com/weolcan . Business Continuity in the Cloud Era 14-okt-16 14
Chaos Kong
Inspector monkey
Latency monkey
Janitor monkey
Conformity monkey
Chaos Gorilla
Chaos monkey
Doctor monkey
15. www.weolcan.eu . twitter.com/weolcan . Business Continuity in the Cloud Era 14-okt-16 15
Chaos monkey
Can simulate:
• Instance failure
• CPU burn
• Null route (network loss)
• Network corruption
• Network latency
• Disk burn (fill the root partition)
• Detach disk
16. www.weolcan.eu . twitter.com/weolcan . Business Continuity in the Cloud Era 14-okt-16 16
Backup and DR
scenarios
AA
Multi-region
loadbalancing
Continuity by design
Blue Moon events
18. www.weolcan.eu . twitter.com/weolcan . Business Continuity in the Cloud Era 14-okt-16 18
Backup and DR
scenarios
AA
Multi-zone
loadbalancing
Continuity by design
Blue Moon events
19. www.weolcan.eu . twitter.com/weolcan . Business Continuity in the Cloud Era 14-okt-16 19
It’s all about…RPO & RTO
02:00
Last backup
21:00
Service restored
RPO: 10 hours RTO: 9 hours
Disaster strikes!
12:00
RPO: Recovery Point Objective
RTO: Recovery Time Objective
20. www.weolcan.eu . twitter.com/weolcan . Business Continuity in the Cloud Era 14-okt-16 20
Backup and DR scenarios for IaaS
1
Daily backup
BACKUP STANDBY
No standby
RPO/RTO
24h/96h
2
Frequent backups
(6x/day)
Reserved cloud
capacity
04h/24h
3
Continuous Data
Protection
Cold standby 01h/04h
4
Active/Active
cloud design
Active 0h/0h
21. www.weolcan.eu . twitter.com/weolcan . Business Continuity in the Cloud Era 14-okt-16 21
Different types of events
SEVERITY OF DISASTER
Operational events Blue Moon events Epic events
Eventtype
Continuity by Design
Exit Strategy
Engineer for portability
Measures
24. www.weolcan.eu . twitter.com/weolcan . Business Continuity in the Cloud Era 14-okt-16 24
Exit Strategy and Engineer for portability
Epic events
Achieve portabilityExit Strategy
25. www.weolcan.eu . twitter.com/weolcan . Business Continuity in the Cloud Era 14-okt-16 25
IaaS: Zero RPO/RTO setup multi-cloud
Global
Load
Balancer
50%
50%
App
OS
App
OS
App
OS
App
OS
Application Data
?
Zero downtime / Zero data loss
26. www.weolcan.eu . twitter.com/weolcan . Business Continuity in the Cloud Era 14-okt-16 26
IaaS: Zero RPO/RTO setup: Data
• Database scale: if there are
only few joins in the queries,
try NOSql.
• Cache: do not touch database
if you can.
• Async: implement this where
possible.
App
OS
App
OS
App
OS
App
OS
DataApplication
M
S
Zero downtime / Zero data loss
27. www.weolcan.eu . twitter.com/weolcan . Business Continuity in the Cloud Era 14-okt-16 27
IaaS: Zero RPO/RTO setup Multi cloud
Global
Load
Balancer
DNS
failover
App
OS
App
OS
ELB
ELB
Azure
EU-W
AWS
EU-C
DB
DB
DB
DB
Elastic
scale
Failover
replica
…
App
OS
App
OS
…
MongoDB
replica set
Read only
replica
(near) Zero downtime / Zero data loss
29. www.weolcan.eu . twitter.com/weolcan . Business Continuity in the Cloud Era 14-okt-16 29
Exit Strategy and Engineer for portability
Epic events
Achieve portabilityExit Strategy
34. www.weolcan.eu . twitter.com/weolcan . Business Continuity in the Cloud Era 14-okt-16 34
What’s on the agenda for tomorrow?
Key takeaway:
– Operational events and Blue Moon events require ‘continuity by design’
– Epic events require upfront planning and strategy
Tomorrow:
• Utilize container technology where possible to improve portability
• Use a multi cloud strategy to enable application mobility
• Read our research on cloud management portals