The document discusses several Amazon Web Services (AWS) managed database options. It begins by explaining why companies choose managed database services over self-managed options, noting that AWS handles maintenance, backups, scaling and other tasks. It then summarizes the major AWS managed database services: Amazon Relational Database Service (RDS) for relational databases, Amazon DynamoDB for non-relational databases, Amazon ElastiCache for in-memory caching, and Amazon Redshift for data warehousing. For each service, it provides examples of common use cases and highlights features like automation, scalability, availability and pay-as-you-go pricing.
4. If you host your databases on premises
Power, HVAC, net
Rack and stack
Server maintenance
OS patches
DB software patches
Database backups
Scaling
High availability
DB software installs
OS installation
you
App optimization
5. If you host your databases on premises
Power, HVAC, net
Rack and stack
Server maintenance
OS patches
DB software patches
Database backups
Scaling
High availability
DB software installs
OS installation
you
App optimization
6. If you host your databases in Amazon EC2
Power, HVAC, net
Rack and stack
Server maintenance
OS patches
DB software patches
Database backups
Scaling
High availability
DB software installs
OS installation
you
App optimization
7. If you host your databases in EC2
OS patches
DB software patches
Database backups
Scaling
High availability
DB software installs
you
App optimization
Power, HVAC, net
Rack and stack
Server maintenance
OS installation
8. If you choose a managed DB service
Power, HVAC, net
Rack and stack
Server maintenance
OS patches
DB software patches
Database backups
App optimization
High availability
DB software installs
OS installation
you
Scaling
9. Quick summary of the options
Self-managed EC2 instances Fully managed
Corporate data
center
Database DB on EC2
instance
RDS
11. A managed service for each major DB type
Amazon
Redshift
Data
warehouse
Amazon
DynamoDB
Document
and key-
value store
Amazon
RDS
Relational
database
engines
Amazon
ElastiCache
In-memory
key-value
store
13. NoSQL vs. relational DB for a new app: How to choose?
• Schemaless, easy reads
and writes, simple data
model
• Scaling is easy
• Focus on performance and
availability at any scale
• Strong schema, complex
relationships,
transactions and joins
• Scaling is difficult
• Focus on consistency
over scale and availability
NoSQL SQL
19. Amazon Aurora: Fast, available, and MySQL-compatible
SQL
Trans-
actions
AZ 1 AZ 2 AZ 3
Caching
Amazon
S3
5x faster than MySQL on
same hardware
SysBench: 100 K writes/sec
and 500 K reads/sec
Designed for 99.99%
availability
6-way replicated storage
across 3 AZs
Scale to 64 TB and 15 Read
Replicas
20. RDS is simple and fast to scale
Database instance types
offer a range of CPU and
memory selections
Scale up or down among
instance types on demand
Database storage is
scalable on demand
21. RDS offers fast, predictable storage
General Purpose
(SSD) for most
workloads
Provisioned IOPS
(SSD) for OLTP
workloads up to
30,000 IOPS
Magnetic for small
workloads with
infrequent access
22. High availability multi-AZ deployments
Enterprise-grade fault tolerance solution for
production databases
23. Choose Read Replicas for scalability and enhanced
data locality
Relieve pressure on your master
node for supporting reads and
writes
Even faster recovery in the event
of disaster
Bring data close to your
customers
Promote to a master for easy
migration
24. Choose cross-region snapshot copy for even greater
durability, ease of migration
Copy a database snapshot
to a different AWS Region
Warm standby for disaster
recovery
Base for migration to a
different region
25. How do RDS backups work?
Automated backups
Restore your database to a point in
time
Enabled by default
Choose a retention period, up to 35
days
Manual snapshots
Build a new database instance from a
snapshot when needed
Initiated by you
Persist until you delete them
Stored in S3
26. Monthly
bill
= +
Further details at http://aws.amazon.com/rds/pricing/
You pay for the resources that you use
Storage
consumed
Duration for which DB
instances were used
(Price depends on
type of storage)
(Price depends on
type of DB instance)
Free Tier (for first 12 months)
• 750 micro DB instance hours
• 20 GB of DB storage
• 20 GB for backups
• 10 million I/O operations
GBN ×
30. DynamoDB: a managed document and key-
value store
• Simple and fast to deploy, easy to scale
• Data is automatically replicated
• Fast, predictable performance
• Helps provide security and control
• Fine-grained access control
• No cost to get started
• Pay only for what you consume
31. Popular use cases
Ad tech IoT Gaming
Mobile
& web
Ad serving, ID
lookup, user
profile
management,
session
tracking, real-
time bidding
Tracking state,
metadata and
readings from
millions of
devices,
real-time
notifications
Recording
game details,
leaderboards,
session
information,
usage history,
and logs
Storing user
profiles,
session details,
personalization
settings,
entity-specific
metadata
32. Writes
Replicated continuously to 3 AZs
Persisted to disk (custom SSD)
Reads
Strongly or eventually consistent
No latency trade-off
Automatic replication for rock-solid durability and
availability
33. DynamoDB: A schemaless database
Attributes
Schemaless
Schema is defined per item
Items
Table
Item
key
34. DynamoDB: What are capacity units?
One write capacity unit One read capacity unit
One strongly consistent
read per second for items
up to 4 KB
or
Two eventually consistent
reads per second
One write per
second for items
up to 1 KB
35. Simple app architecture with DynamoDB
Elastic Load
Balancing EC2 app
instances
Clients
DynamoDB
Business logic
36. You pay for the resources that you use
Monthly
bill = GB +
Pricing varies by region. Further details at http://aws.amazon.com/dynamodb/pricing/
Storage
consumed
Write
capacity
units
(WCUs)
+
Read
capacity
units
(RCUs)
Free Tier:
• Generous Free Tier of 25 GB, 25 WCUs, and 25 RCUs
• That is, you get over 60M read requests and 60M write requests for free in a
month
• The free tier is indefinite—you benefit from this every month
39. Amazon
Redshift
a lot faster
a lot cheaper
a whole lot simpler
Relational data warehouse
Massively parallel; petabyte scale
Fully managed
HDD and SSD platforms
$1,000/TB/year; starts at $0.25/hour
40. Popular use cases
10x cheaper
Easy to provision
Greater DBA productivity
Traditional
enterprises
10x faster
No programming
Easily leverage BI tools,
Hadoop, machine
learning, streaming
Companies
with big data
Analysis in line with
process flows
Pay as you go, grow as
you need
Managed availability and
disaster recovery
SaaS
companies
41. Amazon Redshift architecture
Leader node
• Simple SQL endpoint
• Stores metadata
• Optimizes query plan
• Coordinates query execution
Compute nodes
• Local columnar storage
• Parallel/distributed execution of all
queries, loads, backups, restores,
resizes
Start at just $0.25/hour, grow to 2 PB
(compressed)
• DC1: SSD; scale 160 GB–326 TB
• DS2: HDD; scale 2 TB–2 PB
10 GigE
(HPC)
Ingestion
Backup
Restore
JDBC/ODBC
42. Amazon Redshift is fast
Dramatically less I/O
Column storage
Data compression
Zone maps
Direct-attached storage
Large data block sizes
10 | 13 | 14 | 26 |…
… | 100 | 245 | 324
375 | 393 | 417…
… 512 | 549 | 623
637 | 712 | 809 …
… | 834 | 921 | 959
10
324
375
623
637
959
ID Age State Amount
123 20 CA 500
345 25 WA 250
678 40 FL 125
957 37 WA 375
43. Fully managed, continuous/incremental backups
Multiple copies within cluster
Continuous and incremental backups
to S3
Continuous and incremental backups
across regions
Streaming restore
S3
S3
Region 1
Region 2
44. Amazon Redshift offers rock-solid fault tolerance
S3
S3
Region 1
Region 2
Disk failures
Node failures
Network failure
Availability Zone–or region-level
disasters
45. Security is built in
• Load encrypted from S3
• SSL to secure data in transit
• Amazon VPC for network isolation
• Encryption to secure data at rest
• On-premises HSM and AWS
CloudHSM support
• SOC 1, 2, and 3; PCI-DSS;
FedRAMP; BAA
10 GigE
(HPC)
Ingestion
Backup
Restore
Customer VPC
Internal
VPC
JDBC/ODBC
46. You pay for what you use
Further details at https://aws.amazon.com/redshift/pricing/
Monthly
bill = N ×
Duration for which the
nodes were used
Number of nodes
(Price depends on type
of node)
2-month free trial
Leader node is free
No upfront costs, pay as you go
47. Amazon Redshift has a large ecosystem
Data Integration Systems IntegratorsBusiness Intelligence
54. How ElastiCache billing works
Monthly
bill = N ×
Further details at http://aws.amazon.com/elasticache/pricing/
Duration for which the
nodes were used
Number of nodes
(Price depends on type
of node)
Free Tier (for first 12 months)—750 micro cache node hours
57. Benefits of AWS managed database services
Pay only for what
you use
No upfront cost
Fully managed
services
AWS handles
installs, patching,
restarts
Easy to scale
Grow as you need
Designed for use
with other AWS
services
AWS
Data Pipeline
Amazon
EC2
Amazon
S3
Amazon
CloudWatch
Amazon
SNS
Amazon
VPC