Amazon S3 and Amazon Glacier provide developers and IT teams with secure, durable, highly-scalable object storage with no minimum fees or setup costs. In this webcast, we will provide an introduction to each service, dive deep into key features of Amazon S3 and Amazon Glacier, and explore different use cases that these services optimize.
Learning Objectives:
• Business value of Amazon S3 and Amazon Glacier
• Leveraging S3 for web applications, media delivery, big data analytics and backup
• Leveraging Amazon Glacier to build cost effective archives
• Understand the life cycle management of AWS’s storage services
Who Should Attend:
• Developers, DevOps Engineers, Engineers and System Administrators
7. What is Amazon S3
Highly durable object storage for all types of data
Internet-scale storage
Grow without limits
Low price per GB
per month
No commitment
No up-front cost
Built-in redundancy
Designed for
99.999999999%
durability
Benefit from AWS’s
massive security
investments
8. Key Features of Amazon S3
Data Management
• Cost monitoring and controls
• Lifecycle management
Ease of use
• Programmatic access using AWS SDKs
• REST APIs
• Management Console, AWS CLI
Event Notifications
• Delivered using SQS, SNS, or Lambda
• Enable you to trigger workflows, alerts or
other processing
Data protection
• Versioning
• Cross-region replication
Security
• Multi-factor authentication delete
• Flexible access control mechanisms
• Time-limited access to object
• Access logs
• Multiple client and server-side
Encryption options
9. Cross-region
replication
Amazon CloudWatch
& AWS CloudTrail support
VPC endpoint
for Amazon S3
Read-after-write
consistency in all regions
Event
notifications
Amazon S3 bucket
limit increase
Innovation for Amazon S3
11. Active data Archive dataInfrequently accessed data
S3 - Standard S3 – Standard
Infrequent Access
Glacier
Choice of storage class on Amazon S3
12. L
i
f
e
c
y
c
l
e
Available
S3: 99.99%
S3-IA: 99.9%
Performant
Low Latency
High Throughput
Secure
SSE, client
encryption, IAM
integration
Event
Notifications
SQS, SNS, and
Lambda
Versioning
Keep multiple
copies
automatically
Cross Region
Replication
Common
Namespace
Define storage
class per object
Durable
99.999999999%
Scalable
Elastic capacity
No preset limits
“Hot” Data
Active and/or
Temporary Data
“Warm” Data
Infrequently
Accessed Data
“Cold” Data
Archive and
Compliance Data
S3-IA
Glacier
S3
Storage tiered to your requirements
13. L
i
f
e
c
y
c
l
e
Available
S3: 99.99%
S3-IA: 99.9%
Performant
Low Latency
High Throughput
≥ 30 Days≥ 128K
≥ 90 Days
Durable
99.999999999%
Scalable
Elastic capacity
No preset limits
> 0K$0.007/GB per month
$0.0125/GB per month
“Hot” Data
Active and/or
Temporary Data
“Warm” Data
Infrequently
Accessed Data
“Cold” Data
Archive and
Compliance Data
≥ 0 Days> 0K$0.03/GB per month
3 – 5 Hrs
$0.01/GB retrieval
$0.01/GB retrieval > 5%
S3-IA
Glacier
S3
Storage tiered to your requirements
14. S3
S3-IA
Glacier
10-% Reads 90+% Writes Use case
User files become dormant days after upload. The
access pattern is usually 90+% writes and 10-%
reads.
Benefits
Lower costs with minimal integration.
Assuming 90/10 access ratio:
$0.0125/GB + $0.001 (retrievals) = $0.0135/GB
User Generated Content Example
15. Active Archive Example
S3
S3-IA
Glacier
On-Demand Reads
L
i
f
e
c
y
c
l
e
Active Data
Deep Archive
Use case
Data reads from archive are infrequent but require
immediate response. Data is archived for future
reference or compliance and often resides on tape.
The optimal tier for deep archives is Glacier. S3-IA can
be an intermediate phase into Glacier.
Customer value
Improve access to valuable content, reduce costs and
improve durability.
Example applications
Digital media archives
Intermediate log archives for Big Data Analytics
16. S3-IA
Glacier
Active Backup
Long-term Backup
L
i
f
e
c
y
c
l
e
SGW
Use case
Backup and archive on-premises
data or EC2 data volumes to
AWS directly from backup
applications or through a
gateway.
Customer value
Reduce costs, simplify
management, infinite scale
compared to on-prem tape/disk
Enterprise Backup Example
17. • Preserve, retrieve, and restore every version
of every object stored in your bucket
• S3 automatically adds new versions and
preserves deleted objects with delete markers
• Easily control the number of versions kept by
using lifecycle expiration policies
• Easy to turn on in the AWS Management
Console
Key = photo.gif
ID = 121212
Key = photo.gif
ID = 111111
Versioning
Enabled
PUT
Key = photo.gif
Amazon S3 Versioning
18. Delivers notifications to Amazon SNS, Amazon SQS, or AWS
Lambda when events occur in S3
S3
Events
SNS topic
SQS queue
Lambda function
Notifications
Foo() {
…
}
Amazon S3 Event Notifications
19. Automated, fast, and reliable asynchronous replication of data across AWS regions
Source
(Virginia)
Destination
(Oregon)
• Only replicates new PUTs. Once
S3 is configured, all new uploads
into a source bucket will be
replicated
• Entire bucket or prefix based
• 1:1 replication between any 2
regions
• Versioning required
Use cases:
• Compliance—store data hundreds of miles apart
• Lower latency—distribute data to regional customers)
• Security—create remote replicas managed by separate AWS accounts
Amazon S3 Cross-region Replication
20. Amazon S3 Virtual Private Endpoint (VPCE)
Prior to S3 VPCE Using S3 VPCE
• Public IP on EC2 Instances and IGW
• Private IP on EC2 Instances and NAT
• Access S3 using S3 Private Endpoint (VPE)
without using NAT instances or Gateways
• Increased security
Amazon S3
Amazon S3
21. Client-side encryption use AWS SDKs
• You manage the encryption keys and never send them to AWS
Server-side encryption (SSE) with Amazon S3 managed keys
• “Check-the-box” to encrypt your data at rest. Keys managed by S3
SSE with customer provided keys
• You manage your encryption keys and provide them for PUTs and GETS
SSE with AWS Key Management Service managed keys
• Keys managed centrally in AWS KMS with permissions and auditing of usage
For more details – watch Encryption and Key Management in AWS:
https://www.youtube.com/watch?v=uhXalpNzPU4
Amazon S3 Data Encryption Options
22. Amazon S3 Availability & Usage
Amazon S3 holds trillions of objects and regularly peaks at millions of requests per
second.
Available in 14
regions today and
4 new regions
coming soon.
23. 1 PB raw storage
800 TB usable storage
600 TB allocated storage
400 TB application data
pay only for what you use!
Amazon S3
Amazon S3 Capacity Pricing
24. Pay only for what you use.
There is no minimum fee.
We charge less where our
costs are less, and prices
are based on the location
of your Amazon S3 bucket.
Estimate your monthly bill using the AWS Simple Monthly Calculator.
Amazon S3 Price
26. Archival storage for infrequently accessed data
Amazon Glacier
is optimized for
infrequent retrieval
Stop managing
physical media
Even lower cost than
Amazon S3;
Same high durability
3-5 hour retrieval latency
%5 free tier on retrievals
$0.007 per GB/month
$86 per TB/year
Replace tape libraries, VTLs
What is Amazon Glacier
27. Key Features of Amazon Glacier
Vault Inventory
• Inventory all archives
• Available as JSON or CSV
Ease of use
• Programmatic access using AWS SDKs
• REST APIs
• Management Console, AWS CLI
Data Retrieval Policies
• Define data retrieval limits and cost
ceiling
• Example: ”Free Tier Only”, “Max
Retrieval Rate”,
Access Controls
• Integrated with AWS IAM
• Supports MFA device access
Integrated Lifecycle Management
• Integrated with Amazon S3 Lifecycle
policies
• Establish auto-archive rules for
Amazon S3 objects
Tagging Support
• Tag vaults for cost management
• Filter cost reports based on tags
29. Three Ways to Ingest Data with Amazon Glacier
•Direct Glacier API/SDK
• Direct access to Glacier for deep archives
•S3 lifecycle integration
• Move older data to less expensive archive
tier
•Third party tools and gateways
• Integrate existing backup and archive
applications using an IT-friendly interface
33. AWS provides full-site,
or media asset, delivery
via a worldwide content
delivery network (CDN)
called Amazon CloudFront.
Amazon CloudFront Edge Locations
36. SoundCloud: Audio Transcoding
- World’s leading social sound platform
- Audio files must be transcoded and stored in
multiple formats
- Stores petabytes of data
- Transcoded files served from Amazon S3 via
Amazon CloudFront
- Originals moved to Amazon Glacier for cost
savings