2. AWS Data Services to Accelerate Your Move to the Cloud
Databases to Elevate your Apps
• Relational: RDS Open Source, RDS Commercial, Aurora
• Non-Relational & In-Memory: DynamoDB & DAX, ElastiCache
Migration for DB Freedom
• Database Migration, Schema Conversion
Analytics to Engage your Data (Inline, Data Warehousing, Reporting, Data Lake)
• EMR, Amazon Redshift, Redshift Spectrum, Athena, Elasticsearch Service, QuickSight, Glue
Amazon AI to Drive the Future
• Lex, Polly, Rekognition, Machine Learning, Deep Learning, MXNet
3. DynamoDB
Amazon DynamoDB is a fast and flexible NoSQL
database service for all applications that need
consistent, single-digit millisecond latency at any
scale.
4. Scalability, Performance, Security, Availability & Data Protection, Manageability, Development Platform
Scalability
• Millions of reads/sec from a single DAX cluster
• Unlimited items and storage
Performance
• Consistent, single-digit millisecond latency
• Optimized for analytics workloads with native indexing
• Microsecond response times with DynamoDB Accelerator (DAX)*
Security
• Control user access at the item and attribute level
• SOC, PCI, ISO, FedRAMP (Mod & High), HIPAA BAA
• Monitor with CloudWatch metrics & logging with CloudTrail
• Client-side encryption library
• Secure, private VPC endpoints*
Availability
• Designed for 99.99% high availability (HA)
• Built-in replication across 3 zones
Manageability
• Fully-managed
• Perpetual free tier
• Pay-as-you-grow for capacity and storage independently
• Track table-level spending with Tagging
• Purge data automatically (Time To Live)
• DMS connector for DynamoDB
Developer Platform
• Event-driven programming with Triggers & Lambda
• Advanced analytics with EMR & Amazon Redshift
• Full-text query support with Amazon Elasticsearch Service
• Real-time stream processing with Amazon Kinesis
5. NoSQL foundations
• NoSQL data models: Key Value, Graph, Document, Column-family
• [Diagram: sample items, e.g. 0000 {“Texas”}, 0001 {“Illinois”}, 0002 {“Oregon”}; 0000-0000-0000-0001, Game Heroes, Version 3.4, CRC ADE4]
• Timeline:
  Fall 2007: Dynamo (Amazon’s Highly Available Key-value Store)
  Late 2007: Amazon SimpleDB
  January 2012: Amazon DynamoDB
6. Scaling relational vs. non-relational databases
• Traditional SQL: scale up a single DB
• NoSQL: scale out to many shards across DB hosts 1, 2, 3, ... n (DynamoDB: partitions)
7. Scaling NoSQL
- A good sharding (partitioning) scheme distributes both data and
workload evenly as they grow
- Key concept: partition key
- Ideal scaling conditions:
1. Partition key is from a high cardinality set (that grows)
2. Requests are evenly spread over the key space
3. Requests are evenly spread over time
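The effect of partition key cardinality can be illustrated with a small sketch. It assumes a generic hash-then-modulo placement; DynamoDB's actual hash function and partition count are internal to the service, so `partition_for`, `NUM_PARTITIONS`, and the use of MD5 here are illustrative stand-ins only.

```python
import hashlib
from collections import Counter

NUM_PARTITIONS = 4  # hypothetical partition count, chosen for illustration


def partition_for(partition_key: str, num_partitions: int = NUM_PARTITIONS) -> int:
    """Map a partition key to a partition by hashing it (illustrative stand-in)."""
    digest = hashlib.md5(partition_key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_partitions


def load_per_partition(keys, num_partitions: int = NUM_PARTITIONS) -> Counter:
    """Count how many requests land on each partition."""
    return Counter(partition_for(k, num_partitions) for k in keys)


# High-cardinality keys (one per user) spread requests over all partitions.
high_cardinality = [f"user-{i}" for i in range(10_000)]
# A low-cardinality key (two status values) can reach at most two partitions.
low_cardinality = ["active" if i % 2 else "inactive" for i in range(10_000)]

even = load_per_partition(high_cardinality)
skewed = load_per_partition(low_cardinality)
print("high cardinality:", sorted(even.items()))
print("low cardinality: ", sorted(skewed.items()))
```

With only two distinct key values, every request lands on at most two partitions no matter how many partitions exist, which is why the slide asks for a high-cardinality key set.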
13. What is DAX?
The benefits of DAX:
• Latency: <1ms
• Throughput: Millions of RPS for a 3-node (highly
available) cluster
• Simplified caching: DynamoDB API, AWS integration
• Hot-key/cost savings: reduce over-provisioning for
frequently accessed data
17. How DAX works
DAX is API compatible with DynamoDB
• Read APIs: GetItem, BatchGetItem, Query, Scan
• Modify APIs: PutItem, UpdateItem, DeleteItem,
BatchWriteItem
• Control plane APIs: Not supported (CreateTable,
DeleteTable, etc.)
18. How DAX works
DAX has two caches:
– Item cache: GetItem, PutItem…
– Query cache: Query, Scan
For the query cache, the text of the Query or Scan request is
the “key” and the result set is stored as the “value”
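The two caches can be pictured with a toy read-through sketch. This is not the DAX implementation: `TwoCacheSketch` and `DictTable` are hypothetical names, the backend is a plain dictionary standing in for a table, and serializing the request parameters with `json.dumps` is just one assumed way to build the query-cache key.

```python
import json


class DictTable:
    """Hypothetical in-memory stand-in for a table; counts backend reads."""

    def __init__(self):
        self.rows = {}
        self.reads = 0

    def get_item(self, key):
        self.reads += 1
        return self.rows.get(key)

    def put_item(self, key, item):
        self.rows[key] = item

    def query(self, prefix):
        self.reads += 1
        return [v for k, v in sorted(self.rows.items()) if k.startswith(prefix)]


class TwoCacheSketch:
    """Toy cache with a separate item cache and query cache."""

    def __init__(self, backend):
        self.backend = backend
        self.item_cache = {}   # primary key -> item
        self.query_cache = {}  # serialized request -> result set

    def get_item(self, key):
        if key not in self.item_cache:
            self.item_cache[key] = self.backend.get_item(key)
        return self.item_cache[key]

    def put_item(self, key, item):
        # Assumption: writes go to the backend and refresh the item cache.
        self.backend.put_item(key, item)
        self.item_cache[key] = item

    def query(self, **params):
        # The request text is the key; the result set is the value.
        cache_key = json.dumps(params, sort_keys=True)
        if cache_key not in self.query_cache:
            self.query_cache[cache_key] = self.backend.query(**params)
        return self.query_cache[cache_key]


table = DictTable()
cache = TwoCacheSketch(table)
cache.put_item("user#1", {"name": "Ann"})
print(cache.get_item("user#1"), "backend reads:", table.reads)
print(cache.query(prefix="user#"), "backend reads:", table.reads)
print(cache.query(prefix="user#"), "backend reads:", table.reads)
```

In this sketch a write refreshes the item cache but leaves cached query results untouched, so a repeated query can return a stale result set until its entry is evicted or expires.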
21. Cache Eviction
[Diagram: Amazon VPC, EC2, App, DAX SDK]
Least Recently Used (LRU)
• Under memory pressure, evict the least recently used
items first
Time-to-live (TTL)
• Configurable
• Default 5 mins
• Per item/query
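How the two eviction rules combine can be sketched in a few lines. This is a minimal illustration, not DAX's internal logic: `LruTtlCache` is a hypothetical class, a fixed entry count stands in for memory pressure, and the clock is injectable so the example can advance time without sleeping. The 300-second TTL mirrors the 5-minute default on the slide.

```python
import time
from collections import OrderedDict


class LruTtlCache:
    """Toy cache that evicts by LRU when full and by TTL when entries expire."""

    def __init__(self, max_entries=2, ttl_seconds=300.0, clock=time.monotonic):
        self.max_entries = max_entries  # stand-in for memory pressure
        self.ttl_seconds = ttl_seconds
        self.clock = clock
        self._entries = OrderedDict()   # key -> (value, expires_at)

    def get(self, key):
        entry = self._entries.get(key)
        if entry is None:
            return None
        value, expires_at = entry
        if self.clock() >= expires_at:
            del self._entries[key]      # TTL expired: drop the entry
            return None
        self._entries.move_to_end(key)  # mark as most recently used
        return value

    def put(self, key, value):
        self._entries[key] = (value, self.clock() + self.ttl_seconds)
        self._entries.move_to_end(key)
        while len(self._entries) > self.max_entries:
            self._entries.popitem(last=False)  # evict least recently used


now = [0.0]  # fake clock so the demo does not need to sleep
cache = LruTtlCache(max_entries=2, ttl_seconds=300.0, clock=lambda: now[0])
cache.put("a", 1)
cache.put("b", 2)
cache.get("a")         # touch "a" so "b" becomes least recently used
cache.put("c", 3)      # over capacity: "b" is evicted
print(cache.get("b"))  # None (evicted by LRU)
now[0] = 301.0
print(cache.get("a"))  # None (expired by TTL)
```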
22. Configuration
A DAX cluster can be configured in many ways:
• one-to-one: Table 1
• one-to-many: Table 1, Table 2
• many-to-one: Table 1
• many-to-many: Table 1, Table 2
23. Fully Managed Service
• AWS runs all the database infrastructure for you
• You get all the benefits of a cache without the
operational overhead:
• High availability within a region
• VPC support (only)
• AWS console
• Integration with AWS Services including IAM, CloudFormation,
CloudWatch, CloudTrail