Backup and Archiving in the AWS Cloud

1,012
-1

Published on

Learn how AWS customers save money, time, and effort by using AWS's backup and archive services. Organizations of all sizes rely on AWS's services to durably safeguard their data off-premises at a surprisingly low cost. This session will illustrate backup and archive architectures that AWS customers are benefitting from today.

Published in: Technology
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
1,012
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
79
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

Backup and Archiving in the AWS Cloud

  1. 1. © 2014 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc.© 2014 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc. Backup and Archiving in the AWS Cloud John Downey, Amazon Web Services July 10, 2014
  2. 2. Agenda • Why AWS for Backup and Archive? • AWS Global Infrastructure • Backup and Archive Concepts • Traditional vs. Cloud Approach • Cloud Integrated Backup and Archive Gateways • Data Ingestion Options • AWS Storage and Archive Options • Retrieve / Restore • TCO
  3. 3. Why AWS for Backup and Archive? Metered usage: Pay as you go No capital investment No commitment No risky capacity planning Avoid opex and risks of physical media handling Control your geographic locality for performance and compliance
  4. 4. AWS Global Infrastructure 10 Regions 26+ Availability Zones 50+ Edge locations
  5. 5. AWS Regions and Availability Zones Customer decides where applications and data reside Asia Pacific (Tokyo) US West (Oregon)EU (Ireland)US East (N. Virginia) US West (N. Cal) (Asia Pacific) Singapore AWS GovCloud (US) South America (Sao Paulo) China (Beijing – Limited Preview) Availability Zone Availability Zone Availability Zone Availability Zone Availability Zone Availability Zone Availability Zone Availability Zone Availability Zone Availability Zone Availability Zone Availability Zone Availability Zone Availability Zone Availability Zone Availability Zone Availability Zone Availability Zone Availability Zone Availability Zone Availability Zone Availability Zone Availability Zone Availability Zone Availability Zone Asia Pacific (Sydney)
  6. 6. Archive: Data retained for the long term, for compliance or research Backup: Data retained to support near-term business continuity Backup and Archive defined
  7. 7. Traditional Backup and Archive
  8. 8. Traditional Backup and Archive • Time: Long/slow recovery time • Money: Capital intensive with ongoing upgrades • Effort: Complex to manage • Quality: Low durability, Error Prone Days or Weeks
  9. 9. Traditional Backup and Archive • Backup Software • Edge to core topologies • Dedicated people • Tape Silos / Tape Libraries • Tape Drives (LTO-X / DLT / etc) • Virtual Tape Libraries (VTLs) • Tape out / Vaulting • Unified Backup Solutions • Deduplication • Compression • Encryption
  10. 10. Traditional Backup and Archive Tape Systems
  11. 11. Traditional Backup and Archive Virtual Tape Library
  12. 12. Cloud Backup and Archive
  13. 13. Cloud Backup and Archive • Backup Software • Edge to core topologies • Single pane of glass • Tape Silos / Tape Libraries • Tape Drives (LTO-X / DLT / etc) • Virtual Tape Libraries (VTLs) • Tape out • Unified Backup Solutions • Deduplication • Compression • Encryption
  14. 14. Cloud Backup and Archive Topologies 1. Branch office backup to cloud 2. Core data center backup to cloud 3. Cloud backup to cloud 4. Hybrid cloud backup
  15. 15. Branch office backup to cloud Considerations: - Backup Software - Storage / Caching Gateway - WAN or Internet - Deduplication - Compression - Encryption - WAN Acceleration
  16. 16. Core data center backup to cloud Considerations: - Backup Software - Storage / Caching gateway - Direct Connect or Internet - Telco Burst Models - Time Warner Telecom - Deduplication - Compression - Encryption - WAN Acceleration
  17. 17. Cloud backup to Cloud Applications running on EC2 backing up to S3 / Glacier Considerations: - Backup software - Encryption - Deduplication - Compression - Native S3 and Glacier integration - AMI Backup Appliance? - AMI = Golden Master - EBS Snaps / Scripting
  18. 18. Hybrid Cloud Backup VPC – Datacenter #4 Single GUI for Management
  19. 19. Cloud Integrated Backup and Archive Gateways
  20. 20. AWS Storage Gateway • On-premises, virtual iSCSI storage appliance • $125 / Month* • Local cache enables low latency access to data • Server Side Encryption (SSE) • 5 TB of throughput per day • Recover to Amazon EBS
  21. 21. AWS Storage Gateway-VTL (Virtual Tape Library) • On-premises, virtual tape library storage appliance • $125 / Month • 10 virtual tape drives / 1500 virtual tape slots • 150 TB local cache – VTL – virtual tape library • Restore in seconds from VTL – VTS – virtual tape shelf • Next Generation Offsite Vault • 24 hour retrieval from VTS • Encryption in transit and at rest • Gateway VTL-AMI
  22. 22. AWS partner backup and archive solutions Avere → S3 AWS SGW → S3 AWS VTL → S3-Glacier BridgeSTOR → S3-Glacier CA Arcserve → S3 CA Mainframe → S3-Glacier Commvault → S3-Glacier Ctera → S3 Druva → S3 Maginatics → S3 Oracle RMAN + OSB Module → S3 Panzura → S3 Riverbed Whitewater → S3-Glacier Sonian → S3 Veeam → S3-Glacier Zmanda → S3
  23. 23. Commvault • Unified platform integrates Backup, Archive, Replication, Analysis and Search, Alerting, Reporting, and Tracking of all data via a single common code base • Integrated with Amazon S3 and Amazon Glacier with deduplication & encryption support • Single console management Amazon S3 Amazon Glacier
  24. 24. Riverbed SteelStore • Local caching appliance • Presents NAS protocols – CIFS / NFS • Up to 30x deduplication • S3 and Glacier support • AMI Available
  25. 25. CA Cloud Storage for System z (Mainframe backup) CA provides the System z Connector De-duplication, compression & encryption Encrypted In-Flight Reduces dependency on storage hardware up to 60% Reduce data protection costs by up to 80% Slash Data Center Storage Costs to Pennies per GB Public Cloud Gateway Amazon S3 Glacier AWS
  26. 26. Ctera CTERA Multi-Tenant Services Delivery Platform Data Management, Provisioning, Monitoring and Metering + APIs Versatile Cloud Storage Services Edge Gateways, Endpoint and In-Cloud Clients Ft. Endpoint Encryption & Source-Based Data Optimization Virtual Private Cloud Files in S3 Metadata in EBS CTERA Portal in EC2 PC’s / Laptops Mobile Devices Servers Gateway
  27. 27. Maginatics • Create a globally accessible, NAS archive based on SW clients • NO HARDWARE REQUIRED • High throughput migration • Global Namespace, Deduplication, Locking • Purpose built cloud file system • POSIX / SMB compliant • Windows / Mac / Linux / iOS / Android Amazon S3
  28. 28. Data Ingestion Options
  29. 29. Data Ingestion Options AWS Direct Connect Dedicated bandwidth between your site and AWS Internet Transfer data in a secure SSL tunnel over the public Internet AWS Import/Export Physical transfer of media into and out of AWS
  30. 30. AWS Ingest Options Internet / One Common Theme: Parallel Uploads 1. Multipart upload 2. Request rate optimization 3. TCP window scaling 4. TCP selective acknowledgement AWS has customers that ingest roughly 1 PB per day
  31. 31. AWS Ingest Options AWS Direct Connect • Private connectivity to AWS – Physical connection – 1 Gbps or 10 Gbps port • Consistent network performance • Consider burst models on ingest • Reduces costs for bandwidth- heavy outbound workloads Locations • CoreSite 32 Avenue of the Americas, NY • CoreSite One Wilshire & 900 North Alameda, LA • Equinix DC1 – DC6 & DC10 - DC11, Ashburn, VA • Equinix SV1 & SV5, San Jose, CA • Equinix SE2 & SE3, Seattle, WA • Equinix SG2, Singapore • Equinix SY3, Sydney • Equinix TY2, Tokyo • Eircom, Clonshaugh • TelecityGroup Docklands, London • Terremark NAP do Brasil, Sao Paulo
  32. 32. AWS Ingest Options AWS Import/Export • Rapidly move data into and out of AWS • Portable storage device shipment to AWS • Supports – Amazon EBS – Amazon S3 – Amazon Glacier • Use cases – Initial data migration – Content distribution via portable devices – Disaster recovery
  33. 33. AWS Storage and Archive Options
  34. 34. AWS Storage and Archive Options Amazon Simple Storage Service (S3) Highly scalable object storage 1 byte to 5 TB in size 99.999999999% durability Amazon Elastic Block Store (EBS) High-performance block storage device 1 GB to 1 TB in size Mount as drives to instances with snapshot/cloning functionalities Amazon Glacier Long-term object archive Extremely low cost per gigabyte 99.999999999% durability
  35. 35. AWS Storage and Archive Options Amazon Elastic Block Store (EBS) • High I/O block storage for Amazon EC2 • Point-in-time snapshots to Amazon S3 • 99.999999999% Durability • Snapshot software is FREE • Point-in-time snapshots across regions
  36. 36. AWS Storage and Archive Options Amazon S3: Trillions of objects under management • New!: SSE-C • Time: Instant access, any time, any where • Money: Pay for what you store / 51% price reduction • Effort: Scales as you grow • Quality: 99.999999999% durability
  37. 37. AWS Storage and Archive Options Amazon Glacier • $0.01 per GB/mo, $120 per TB/yr • 3-5 hour data retrieval latency • Archives: single file or zipped files • Vaults: collection of archives • Infinite archival storage • 99.999999999% durability • Immutable, encrypted by default
  38. 38. AWS Storage and Archive Options Object Lifecycle Management: Amazon S3 → Amazon Glacier → • Seamlessly move data from Amazon S3 → Amazon Glacier • 3-5 hour asynchronous retrieval • Data lifecycle policies • $0.01 per GB for Amazon Glacier costs
  39. 39. Retrieve / Restore
  40. 40. Retrieve / Restore back to On Premises Data Centers Amazon Glacier Amazon S3 3-5 Hour Retrieval (staging) Immediate Restore *Storage Gateway mitigates restores from cloud
  41. 41. Retrieve / Restore in AWS $$ Hours Amazon Glacier Amazon S3 Amazon EC2 / HPC Amazon CloudFront Generating Business Value Mins / Seconds
  42. 42. Restore in AWS
  43. 43. Restore in AWS
  44. 44. Create instances from AMIs Restore data from backups Restore in AWS
  45. 45. TCO: On-Premises Cost Considerations 1. Primary storage hardware (primary / remote site) 2. DR / Remote site storage hardware 3. Raw to utilized storage (both primary and DR) 4. Storage growth (cost of upgrades) 5. Storage management software and 3rd party tools 6. Professional services 7. Hardware maintenance 8. Software maintenance 9. Backup software 10.Backup hardware (primary / remote site) 11.Offsite tape storage / vault 12.Archive software 13.Archive hardware 14.Power 15.Cooling 16.Space 17.Labor 18.Cost of capital 19.Training 20.Asset depreciation 21.Migration 22.Decommission / remove 23.Recycle
  46. 46. Customer Stories AWS Storage Gateway is used in a variety of ways Jollibee (JFC) is using the AWS Storage Gateway to backup and mirror their Oracle SQL server database from their on-premises data center to AWS. JFC is the largest fast food chain in the Philippines with revenues well over 2 Billion USD.. The Storage Gateway also provides us access to the same database snapshots for use in Amazon EC2, providing a cost-effective in- cloud DR solution. AWS Storage Gateway provided us the most cost effective way to backup our SAP workloads to AWS, it is helped us perform SAP System ‘refresh’ much faster and in a more convenient way, backing up to S3 has also helped us to prepare for DR & also run SAP Dev/QA restores easily on EC2 “Amazon Web Services and AWS Storage Gateway are great assets that help us scale fast, store data in an ultra-secure environment, spend more time on product development (rather than disaster recovery & backup) …By using AWS Storage Gateway, we went to just hours instead of days to restore from backup.” The large Japanese Retail chain uses AWS Storage Gateway to share & store files in S3 and drastically cut down it’s spend on premise NAS footprint.
  47. 47. • Digital marketing agency, What’s Up Interactive, offers managed hosting for high-availability websites • What’s Up Interactive wanted to expand this service to include affordable backup and disaster recovery (DR) • What’s Up Interactive’s backup and DR solution is anchored by Amazon S3, which is designed to provide 99.999999999% durability and 99.99% availability of objects over a given year AWS Allows What’s Up Interactive to Save Over $1M
  48. 48. AWS – Your Global Data Center for Backup and Archive • Choose the region that fits your business and compliance needs • 10 regions world wide – set up with a few clicks • Broad range of backup/archive tools that are AWS integrated • Low cost, reliable AWS Transport and Storage options • Enhance Security Posture • Increase Scalability • Significantly Higher Data Durability • All at a lower TCO
  49. 49. © 2014 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc.© 2014 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc. THANK YOU John Downey
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×