DynamoDB Deep Dive

Deep dive on DynamoDB to create scalable app
Eduardo Horai
AWS Solutions Architect

DynamoDB is a managed
NoSQL database service.
Store and retrieve any amount of data.

Serve any level of request traffic.

Without the operational burden.

Consistent, predictable performance.
Single digit millisecond latency.

Backed on solid-state drives.

Flexible data model.
Key/attribute pairs. No schema required.

Easy to create. Easy to adjust.

Seamless scalability.
No table size limits. Unlimited storage.

No downtime.

Durable.
Consistent, disk only writes.

Replication across data centers and availability zones.

Provisioned throughput.
Reserve IOPS for reads and writes.

Scale up for down at any time.

Pay per capacity unit.
Priced per hour of provisioned throughput.

Write throughput.
Size of item x writes per second
>= 1KB

Consistent writes.
Atomic increment and decrement.
Optimistic concurrency control: conditional writes.

Transactions.
Item level transactions only.
Puts, updates and deletes are ACID.

Strong or eventual consistency

Read throughput.


Read throughput.
Provisioned units = size of item x reads per second
>= 4KB


Read throughput.
Provisioned units = size of item x reads per second
2


Read throughput.
Same latency expectations.
Mix and match at ‘read time’.

Provisioned throughput is
managed by DynamoDB.

Data is partitioned and
managed by DynamoDB.

Partitioning
•  DynamoDB automatically partitions data by the hash key
–  Hash key spreads data & workload across partitions

•  Auto-Partitioning driven by:
–  Data set size
–  Provisioned Throughput

•  Tip: large number of unique hash keys and uniform
distribution of workload across hash keys lends well to
massive scale!

Indexed data storage.
Tiered bandwidth pricing:
aws.amazon.com/dynamodb/pricing

Reserved capacity.
Up to 53% for 1 year reservation.
Up to 76% for 3 year reservation.

Authentication.
Session based to minimize latency.
Uses the Amazon Security T
oken Service.

Handled by AWS SDKs.
Integrates with IAM.

Monitoring.
CloudWatch metrics:
latency, consumed read and write throughput,
errors and throttling.

Libraries, mappers and mocks.
ColdFusion, Django, Erlang, Java, .Net,
Node.js, Perl, PHP Python, Ruby
,

http://j.mp/dynamodb-libs

id = 100

date =
2012-05-16-09-00-10

total = 25.00

id = 101

date =
2012-05-15-15-00-11

total = 35.00

id = 101

date =
2012-05-16-12-00-10

total = 100.00

T
able

id = 100

date =
2012-05-16-09-00-10

total = 25.00

id = 101

date =
2012-05-15-15-00-11

total = 35.00

id = 101

date =
2012-05-16-12-00-10

total = 100.00

id = 100

date =
2012-05-16-09-00-10

total = 25.00

id = 101

date =
2012-05-15-15-00-11

total = 35.00

id = 101

date =
2012-05-16-12-00-10

total = 100.00

Item

date =
2012-05-16-09-00-10

total = 25.00

id = 101

date =
2012-05-15-15-00-11

total = 35.00

id = 101

date =
2012-05-16-12-00-10

total = 100.00

id = 100

Attribute

Where is the schema?
T
ables do not require a formal schema.

Items are an arbitrarily sized hash.

Indexing.
Items are indexed by primary and secondary keys.
Primary keys can be composite.
Secondary keys are local to the table.

Hash key

Range key

ID

Date

Composite primary key

T
otal

Hash key

Range key

Secondary range key

ID

Date

T
otal

Programming DynamoDB.
Small but perfectly formed API.

CreateTable

PutItem

UpdateTable

GetItem

DeleteTable

UpdateItem

DescribeTable

DeleteItem

ListTables
Query
Scan

BatchGetItem
BatchWriteItem

Conditional updates.
PutItem, UpdateItem, DeleteItem can take
optional conditions for operation.

UpdateItem performs atomic increments.

One API call, multiple items
BatchGet returns multiple items by key.
BatchWrite performs up to 25 put or delete operations.

Throughput is measured by IO, not API calls.

Query vs Scan
Query returns items by key.

Scan reads the whole table sequentially.

Query patterns
Retrieve all items by hash key.

Range key conditions:
==, <, >, >=, <=, begins with, between.

Counts. T and bottom n values.
op
Paged responses.

AmazonDynamoDBClient dynamoDB; = new AmazonDynamoDBClient(

new ClasspathPropertiesFileCredentialsProvider());

dynamoDB.setRegion(Region.getRegion(Regions. SA_EAST_1));

Players
user_id =
mza

location =
Cambridge

joined =
2011-07-04

user_id =
jeffbarr

location =
Seattle

joined =
2012-01-20

user_id =
werner

location =
Worldwide

joined =
2011-05-15

CreateTableRequest createPlayersTable =

new CreateTableRequest().withTableName("Players")

.withKeySchema(new KeySchemaElement().withAttributeName("user_id")

.withKeyType(KeyType.HASH))

.withAttributeDeﬁnitions(newAttributeDeﬁnition()

.withAttributeName("user_id").withAttributeType(ScalarAttributeType.S))

.withProvisionedThroughput(new ProvisionedThroughput()

.withReadCapacityUnits(10L)

.withWriteCapacityUnits(10L));

dynamoDB.createTable(createPlayersTable);

Players
user_id =
mza

location =
Cambridge

joined =
2011-07-04

user_id =
jeffbarr

location =
Seattle

joined =
2012-01-20

user_id =
werner

location =
Worldwide

joined =
2011-05-15

user_id =
mza

game =
angry-birds

score =
11,000

user_id =
mza

game =
tetris

score =
1,223,000

user_id =
werner

game =
bejewelled

score =
55,000

Scores

CreateTableRequest createScoresTable =

new CreateTableRequest().withTableName(”Scores")

.withKeySchema(new KeySchemaElement().withAttributeName("user_id")



.withAttributeName("user_id").withAttributeType(ScalarAttributeType.S))
.withKeySchema(new KeySchemaElement().withAttributeName(”game")

.withKeyType(KeyType.RANGE))


.withAttributeName(”game").withAttributeType(ScalarAttributeType.S))




Players
user_id =
mza

location =
Cambridge

joined =
2011-07-04

user_id =
jeffbarr

location =
Seattle

joined =
2012-01-20

user_id =
werner

location =
Worldwide

joined =
2011-05-15

Leader boards

Scores
user_id =
mza

game =
angry-birds

score =
11,000

game =
angry-birds

score =
11,000

user_id =
mza

user_id =
mza

game =
tetris

score =
1,223,000

game =
tetris

score =
1,223,000

user_id =
mza

user_id =
werner

game =
bejewelled

score =
55,000

game =
tetris

score =
9,000,000

user_id =
jeffbarr

CreateTableRequest createLeaderBoardsTable =

new CreateTableRequest().withTableName(”LeaderBoards")

.withKeySchema(new KeySchemaElement().withAttributeName(”game")



.withAttributeName(”game").withAttributeType(ScalarAttributeType.S))

.withKeySchema(new KeySchemaElement().withAttributeName(”score")

.withKeyType(KeyType.RANGE))


.withAttributeName(”score").withAttributeType(ScalarAttributeType.N))




Players
user_id =
mza

location =
Cambridge

joined =
2011-07-04

user_id =
jeffbarr

location =
Seattle

joined =
2012-01-20

user_id =
werner

location =
Worldwide

joined =
2011-05-15

Query for user

Leader boards

Scores
user_id =
mza

game =
angry-birds

score =
11,000

game =
angry-birds

score =
11,000

user_id =
mza

user_id =
mza

game =
tetris

score =
1,223,000

game =
tetris

score =
1,223,000

user_id =
mza

user_id =
werner

game =
bejewelled

score =
55,000

game =
tetris

score =
9,000,000

user_id =
jeffbarr

Map<String, Condition> keyConditions = new HashMap<String, Condition>();
keyConditions.put("user_id", new Condition()

.withComparisonOperator(ComparisonOperator.EQ.toString())

.withAttributeValueList(new AttributeValue().withS("mza")));

QueryRequest queryRequest = new QueryRequest()

.withTableName("Players")

.withKeyConditions(keyConditions);

QueryResult result = dynamoDB.query(queryRequest);

for (Map<String, AttributeValue> item : result.getItems()) {

printItem(item);

}

Players
user_id =
mza

location =
Cambridge

joined =
2011-07-04

user_id =
jeffbarr

location =
Seattle

joined =
2012-01-20

user_id =
werner

location =
Worldwide

joined =
2011-05-15

Query for scores
by user

Leader boards

Scores
user_id =
mza

game =
angry-birds

score =
11,000

game =
angry-birds

score =
11,000

user_id =
mza

user_id =
mza

game =
tetris

score =
1,223,000

game =
tetris

score =
1,223,000

user_id =
mza

user_id =
werner

game =
bejewelled

score =
55,000

game =
tetris

score =
9,000,000

user_id =
jeffbarr





.withTableName(”Scores")

.withAttributesToGet(”score”, “game”)




printItem(item);

}

Players
user_id =
mza

location =
Cambridge

joined =
2011-07-04

user_id =
jeffbarr

location =
Seattle

joined =
2012-01-20

user_id =
werner

location =
Worldwide

joined =
2011-05-15

Query for scores
by user, game

Leader boards

Scores
user_id =
mza

game =
angry-birds

score =
11,000

game =
angry-birds

score =
11,000

user_id =
mza

user_id =
mza

game =
tetris

score =
1,223,000

game =
tetris

score =
1,223,000

user_id =
mza

user_id =
werner

game =
bejewelled

score =
55,000

game =
tetris

score =
9,000,000

user_id =
jeffbarr




keyConditions.put(”game", new Condition()


.withAttributeValueList(new AttributeValue().withS(”tetris")));


.withTableName(”Scores")




printItem(item);

}

Players
user_id =
mza

location =
Cambridge

joined =
2011-07-04

user_id =
jeffbarr

location =
Seattle

joined =
2012-01-20

user_id =
werner

location =
Worldwide

joined =
2011-05-15

High scores by game

Leader boards

Scores
user_id =
mza

game =
angry-birds

score =
11,000

game =
angry-birds

score =
11,000

user_id =
mza

user_id =
mza

game =
tetris

score =
1,223,000

game =
tetris

score =
1,223,000

user_id =
mza

user_id =
werner

game =
bejewelled

score =
55,000

game =
tetris

score =
9,000,000

user_id =
jeffbarr

keyConditions.put(”game", new Condition()


.withAttributeValueList(new AttributeValue().withS(”tetris")));


.withTableName(”LeaderBoards")

.withKeyConditions(keyConditions)

. withScanIndexForward(false);



printItem(item);

}

Players
user_id =
mza

location =
Cambridge

joined =
2011-07-04

user_id =
jeffbarr

location =
Seattle

joined =
2012-01-20

user_id =
werner

location =
Worldwide

joined =
2011-05-15

Insert Players

Leader boards

Scores
user_id =
mza

game =
angry-birds

score =
11,000

game =
angry-birds

score =
11,000

user_id =
mza

user_id =
mza

game =
tetris

score =
1,223,000

game =
tetris

score =
1,223,000

user_id =
mza

user_id =
werner

game =
bejewelled

score =
55,000

game =
tetris

score =
9,000,000

user_id =
jeffbarr

Map<String, AttributeValue> itemPlayer =

new HashMap<String, AttributeValue>();

itemPlayer.put("user_id", new AttributeValue("eduardohorai"));

itemPlayer.put("location", new AttributeValue("Sao Paulo"));

itemPlayer.put("joined", new AttributeValue("27/01/2013"));

PutItemRequest putItemRequest =

new PutItemRequest("Players", itemPlayer);

PutItemResult putItemResult = dynamoDB.putItem(putItemRequest);

Players
user_id =
mza

location =
Cambridge

joined =
2011-07-04

user_id =
jeffbarr

location =
Seattle

joined =
2012-01-20

user_id =
werner

location =
Worldwide

joined =
2011-05-15

Increase writes/reads
on Scores!!!!!

Leader boards

Scores
user_id =
mza

game =
angry-birds

score =
11,000

game =
angry-birds

score =
11,000

user_id =
mza

user_id =
mza

game =
tetris

score =
1,223,000

game =
tetris

score =
1,223,000

user_id =
mza

user_id =
werner

game =
bejewelled

score =
55,000

game =
tetris

score =
9,000,000

user_id =
jeffbarr

UpdateTableRequest updateTableRequest = new UpdateTableRequest()

.withTableName("Scores")




UpdateTableResult result =

dynamoDB.updateTable(updateTableRequest);

Links
§  aws.amazon.com/dynamodb

§  aws.typepad.com/brasil/

§  aws.typepad.com

§  awshub.com.br

§  ehorai@amazon.com

Questions?
Learn More:
aws.amazon.com/dynamodb

Obrigado!
Learn More:
aws.amazon.com/dynamodb

DynamoDB Deep Dive

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to DynamoDB Deep Dive

Similar to DynamoDB Deep Dive (20)

More from Amazon Web Services LATAM

More from Amazon Web Services LATAM (20)

Recently uploaded

Recently uploaded (20)

DynamoDB Deep Dive