Dynamo is a highly available key-value storage system built by Amazon to power its e-commerce platform. It uses consistent hashing to partition data across nodes in a ring topology and achieves high availability for writes through techniques such as vector clocks, hinted handoff, and sloppy quorums. Dynamo provides a simple interface to store and retrieve data identified by unique keys at massive scale with low latency, remaining available despite failures through an eventually consistent model.
High Performance Stream Processing and Optimizations, by Farley Lai
Building scalable systems that process streams of data requires developers to take advantage of the parallelism offered by today's computer architectures. Existing imperative programming languages provide programmers with low-level primitives such as threads, locks, and semaphores. However, programs developed using these primitives tend to be plagued by race conditions and deadlocks, which make non-deterministic behaviors hard to understand and debug. Acknowledging these limitations, some programming language extensions and libraries (e.g., OpenMPI, OpenMP) have been proposed to simplify parallel programming. Nevertheless, all these options still burden the programmer with annotating parallelism and specifying data-sharing attributes to ensure data consistency.
In recent years, dataflows have attracted significant attention as a model for building highly parallel stream processing applications. According to this model, an application is defined as a graph of processing elements that are connected by communication channels. The processing elements may execute in parallel as long as they have sufficient data to process. A key feature of the dataflow model is that it explicitly captures parallelism and data dependencies between processing elements.
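As a minimal sketch of this model (all names here are hypothetical, not from any particular dataflow framework), processing elements connected by FIFO channels fire whenever their input channels hold data:

```python
from collections import deque

class Channel:
    """FIFO communication channel between two processing elements."""
    def __init__(self):
        self.queue = deque()
    def push(self, item):
        self.queue.append(item)
    def pop(self):
        return self.queue.popleft()
    def __len__(self):
        return len(self.queue)

class Element:
    """A processing element that fires once every input channel has data."""
    def __init__(self, func, inputs, outputs):
        self.func, self.inputs, self.outputs = func, inputs, outputs
    def ready(self):
        return all(len(ch) > 0 for ch in self.inputs)
    def fire(self):
        args = [ch.pop() for ch in self.inputs]
        result = self.func(*args)
        for ch in self.outputs:
            ch.push(result)

# Tiny pipeline: values flow through channel a, get doubled, land in channel b.
a, b = Channel(), Channel()
double = Element(lambda x: 2 * x, [a], [b])
for v in (1, 2, 3):
    a.push(v)
while double.ready():   # a real runtime could fire ready elements in parallel
    double.fire()
print(list(b.queue))  # [2, 4, 6]
```

Because each element's data dependencies are explicit in the graph, a scheduler can safely run any set of ready elements concurrently without locks in user code.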
Even though dataflows provide a simple computational model, using this model to build scalable systems is challenging, as naive implementations introduce unexpected runtime scheduling overhead, consume significant memory resources, and are not energy efficient. Consequently, our goal is to develop compiler optimizations and efficient runtime environments for scalable dataflow systems. In the following, we will go through the model of computation, memory optimizations, energy efficiency, and stream processing at the scale of cloud computing.
Static Memory Management for Efficient Mobile Sensing Applications, by Farley Lai
Memory management is a crucial aspect of mobile sensing applications that must process high-rate data streams in an energy-efficient manner. Our work is done in the context of synchronous data-flow models in which applications are implemented as a graph of components that exchange data at fixed and known rates over FIFO channels. In this paper, we show that it is feasible to leverage the restricted semantics of synchronous data-flow models to optimize memory management. Our memory optimization approach includes two components: (1) We use abstract interpretation to analyze the complete memory behavior of a mobile sensing application and identify data sharing opportunities across components according to the live ranges of exchanged samples. Experiments indicate that the static analysis is precise for a majority of considered stream applications whose control logic does not depend on input data. (2) We propose novel heuristics for memory allocation that leverage the graph structure of applications to optimize data exchanges between application components to achieve not only significantly lower memory footprints but also increased stream processing throughput.
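The buffer-sharing idea in (2) can be sketched with a simple greedy interval-partitioning heuristic: samples whose live ranges do not overlap may occupy the same physical buffer. The live ranges and names below are made up for illustration; this is a sketch of the idea, not the paper's actual algorithm:

```python
def assign_buffers(live_ranges):
    """Greedy interval partitioning: samples whose live ranges do not
    overlap are mapped onto the same physical buffer."""
    buffer_busy_until = []   # per buffer, the end of the live range last placed in it
    assignment = {}
    for name, (start, end) in sorted(live_ranges.items(), key=lambda kv: kv[1][0]):
        for i, busy_until in enumerate(buffer_busy_until):
            if start >= busy_until:          # this buffer is free again: reuse it
                buffer_busy_until[i] = end
                assignment[name] = i
                break
        else:                                # every buffer overlaps: allocate a new one
            buffer_busy_until.append(end)
            assignment[name] = len(buffer_busy_until) - 1
    return assignment, len(buffer_busy_until)

# Hypothetical live ranges: (firing step produced, firing step last consumed)
ranges = {"s1": (0, 2), "s2": (1, 3), "s3": (2, 4), "s4": (4, 5)}
assignment, num_buffers = assign_buffers(ranges)
print(assignment, num_buffers)  # {'s1': 0, 's2': 1, 's3': 0, 's4': 0} 2
```

Four exchanged samples fit in two buffers here because s3 and s4 reuse s1's buffer once its live range ends, which is the kind of footprint reduction the static analysis enables.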
Programming is hard. Programming correct C and C++ is particularly hard. Indeed, both in C and certainly in C++, it is uncommon to see a screenful containing only well defined and conforming code. Why do professional programmers write code like this? Because most programmers do not have a deep understanding of the language they are using. While they sometimes know that certain things are undefined or unspecified, they often do not know why it is so. In these slides we will study small code snippets in C and C++, and use them to discuss the fundamental building blocks, limitations and underlying design philosophies of these wonderful but dangerous programming languages.
This content has a CC license. Feel free to use it for whatever you want. You may download the original PDF file from: http://www.pvv.org/~oma/DeepC_slides_oct2012.pdf
Knockout is a JavaScript library that helps you create responsive displays (UIs).
It is based on the Model–View–ViewModel (MVVM) pattern.
It provides a simple two-way data-binding mechanism between your data model and UI.
It was first released on July 5, 2010 by Steve Sanderson, then a Microsoft employee, and is maintained as an open-source project.
Cassandra is an open-source, distributed, highly scalable, and fault-tolerant database. It is a strong choice for managing large amounts of structured, semi-structured, or unstructured data.
CouchDB has several features that help it stand out from the other databases in this rapidly growing field. Incremental map/reduce, peer to peer replication, mobile device synchronization, a realtime update feed, and the ability to host an application in the database itself (also known as a Couchapp) are just a few. See how companies such as the BBC, Radical Dynamic, Signal, and Incandescent Software are using CouchDB to solve their real world challenges.
The Google Chubby lock service for loosely-coupled distributed systems, by Romain Jacotin
The Google Chubby lock service presented in 2006 is the inspiration for Apache ZooKeeper: let's take a deep dive into Chubby to better understand ZooKeeper and distributed consensus.
Dynamo: Amazon’s Highly Available Key-value Store
Giuseppe DeCandia, Deniz Hastorun, Madan Jampani, Gunavardhan Kakulapati,
Avinash Lakshman, Alex Pilchin, Swaminathan Sivasubramanian, Peter Vosshall
and Werner Vogels
Amazon.com
ABSTRACT
Reliability at massive scale is one of the biggest challenges we face at Amazon.com, one of the largest e-commerce operations in the world; even the slightest outage has significant financial consequences and impacts customer trust. The Amazon.com platform, which provides services for many web sites worldwide, is implemented on top of an infrastructure of tens of thousands of servers and network components located in many datacenters around the world. At this scale, small and large components fail continuously, and the way persistent state is managed in the face of these failures drives the reliability and scalability of the software systems.
This paper presents the design and implementation of Dynamo, a highly available key-value storage system that some of Amazon’s core services use to provide an “always-on” experience. To achieve this level of availability, Dynamo sacrifices consistency under certain failure scenarios. It makes extensive use of object versioning and application-assisted conflict resolution in a manner that provides a novel interface for developers to use.
Categories and Subject Descriptors
D.4.2 [Operating Systems]: Storage Management; D.4.5 [Operating Systems]: Reliability; D.4.2 [Operating Systems]: Performance
General Terms
Algorithms, Management, Measurement, Performance, Design, Reliability.
1. INTRODUCTION
Amazon runs a world-wide e-commerce platform that serves tens of millions of customers at peak times using tens of thousands of servers located in many data centers around the world. There are strict operational requirements on Amazon’s platform in terms of performance, reliability, and efficiency, and to support continuous growth the platform needs to be highly scalable. Reliability is one of the most important requirements because even the slightest outage has significant financial consequences and impacts customer trust.
One of the lessons our organization has learned from operating Amazon’s platform is that the reliability and scalability of a system depend on how its application state is managed. Amazon uses a highly decentralized, loosely coupled, service-oriented architecture consisting of hundreds of services. In this environment there is a particular need for storage technologies that are always available. For example, customers should be able to view and add items to their shopping cart even if disks are failing, network routes are flapping, or data centers are being destroyed by tornados. Therefore, the service responsible for managing shopping carts requires that it can always write to and read from its data store, and ...
Moving Forward Faster: How Monash University Automated Data on AWS with Commvault, by Amazon Web Services
Monash University, supporting 70,000+ students across 150 fields with more than 1,000 IT professionals, sought to empower its central IT department and enhance organizational efficiency by modernizing its infrastructure. To do so, Monash set an aggressive timeline to move more than 150 terabytes of data from its on-premises datacenter – including 3,000+ virtual machines – to the AWS Cloud. Constrained by its existing data management solution, Monash University implemented Commvault on AWS for a simplified, comprehensive data management solution.
Graph Data: a New Data Management Frontier, by Demai Ni
Graph Data: a New Data Management Frontier -- Huawei’s view and Call for Collaboration by Demai Ni:
Huawei provides enterprise databases and is actively exploring the latest technology to provide an end-to-end data management solution on the cloud. We are looking to bridge classic RDBMS to graph databases on a distributed platform.
Critical Preflight Checks for Your EPM Applications, by Datavail
The environment which houses your business critical EPM applications is complex.
Maybe as complex as the cockpit of an aircraft. Just as a pilot might not be able to build or fix everything on their plane, you might be using applications but not know how to build or fix everything that’s being used. This shouldn’t stop you from doing a pre-flight check to ensure that all your Hyperion systems are running properly and set for you and your end users.
Let’s talk about some different strategies to achieve this and give you the confidence in your systems so that you can know when things are running well—or more importantly, when they need attention before takeoff.
These are slides of the session that Jim Liddle gave at GigaSpaces Cloud Crowd event in the UK on 11th November 2009.
These slides concentrate on GigaSpaces VMWARE integration and the value proposition for using GigaSpaces for Private Clouds.
Couchbase and Talena cohost a webinar covering a number of critical data management topics including:
- Key points to consider when securing Couchbase data assets against accidental data loss
- How to ensure compliance and security of PII and other sensitive data across replicated data sets
- Specific architectural considerations to ensure successful deployment and data management strategies in the Cloud
Computational Patterns of the Cloud - QCon NYC 2014, by Ines Sombra
The Cloud has undoubtedly changed the way we think about computing, IT operations, innovation, and entrepreneurship. But what are the computational patterns that have emerged from the pervasiveness of public clouds? What can we leverage to improve our organizations? And what are the challenges that we face going forward?
In this talk, I will introduce you to cloud computing’s paradigms and discuss their applications with practical examples from Engine Yard’s customers, peers, and partners. We will also cover antipatterns and myths. If you are curious about Cloud computing or want to improve your cloud strategy this talk is for you.
NOTE: Open an issue if you want me to explain something in more detail at the accompanying github repo: https://github.com/Randommood/QConNYC2014/
BluePi has done numerous migrations for large enterprises and SMBs alike. Based on this experience we have documented the considerations an organization needs to make before embarking on their journey to the cloud. Feel free to download - http://bluepiit.com/white-paper/
Connector Corner: Automate dynamic content and events by pushing a button, by DianaGray10
Here is something new! In our next Connector Corner webinar, we will demonstrate how you can use a single workflow to:
Create a campaign using Mailchimp with merge tags/fields
Send an interactive Slack channel message (using buttons)
Have the message received by managers and peers along with a test email for review
But there’s more:
In a second workflow supporting the same use case, you’ll see:
Your campaign sent to target colleagues for approval
If the “Approve” button is clicked, a Jira/Zendesk ticket is created for the marketing design team
But—if the “Reject” button is pushed, colleagues will be alerted via Slack message
Join us to learn more about this new, human-in-the-loop capability, brought to you by Integration Service connectors.
And...
Speakers:
Akshay Agnihotri, Product Manager
Charlie Greenberg, Host
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do..., by UiPathCommunity
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
DevOps and Testing slides at DASA Connect, by Kari Kakkonen
Slides from my and Rik Marselis' talk at the DASA Connect conference on 30 May 2024. We discuss what testing is, then what agile testing is, and finally what testing in DevOps looks like. We closed with a lively workshop in which participants explored different ways to think about quality and testing across the DevOps infinity loop.
Key Trends Shaping the Future of Infrastructure, by Cheryl Hung
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
The key trends across hardware, cloud and open-source; exploring how these areas are likely to mature and develop over the short and long-term, and then considering how organisations can position themselves to adapt and thrive.
Essentials of Automations: Optimizing FME Workflows with ParametersSafe Software
Are you looking to streamline your workflows and boost your projects’ efficiency? Do you find yourself searching for ways to add flexibility and control over your FME workflows? If so, you’re in the right place.
Join us for an insightful dive into the world of FME parameters, a critical element in optimizing workflow efficiency. This webinar marks the beginning of our three-part “Essentials of Automation” series. This first webinar is designed to equip you with the knowledge and skills to utilize parameters effectively: enhancing the flexibility, maintainability, and user control of your FME projects.
Here’s what you’ll gain:
- Essentials of FME Parameters: Understand the pivotal role of parameters, including Reader/Writer, Transformer, User, and FME Flow categories. Discover how they are the key to unlocking automation and optimization within your workflows.
- Practical Applications in FME Form: Delve into key user parameter types including choice, connections, and file URLs. Allow users to control how a workflow runs, making your workflows more reusable. Learn to import values and deliver the best user experience for your workflows while enhancing accuracy.
- Optimization Strategies in FME Flow: Explore the creation and strategic deployment of parameters in FME Flow, including the use of deployment and geometry parameters, to maximize workflow efficiency.
- Pro Tips for Success: Gain insights on parameterizing connections and leveraging new features like Conditional Visibility for clarity and simplicity.
We’ll wrap up with a glimpse into future webinars, followed by a Q&A session to address your specific questions surrounding this topic.
Don’t miss this opportunity to elevate your FME expertise and drive your projects to new heights of efficiency.
Accelerate your Kubernetes clusters with Varnish CachingThijs Feryn
A presentation about the usage and availability of Varnish on Kubernetes. This talk explores the capabilities of Varnish caching and shows how to use the Varnish Helm chart to deploy it to Kubernetes.
This presentation was delivered at K8SUG Singapore. See https://feryn.eu/presentations/accelerate-your-kubernetes-clusters-with-varnish-caching-k8sug-singapore-28-2024 for more details.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -..., by DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
Smart TV Buyer Insights Survey 2024, by 91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality, by Inflectra
In this insightful webinar, Inflectra explores how artificial intelligence (AI) is transforming software development and testing. Discover how AI-powered tools are revolutionizing every stage of the software development lifecycle (SDLC), from design and prototyping to testing, deployment, and monitoring.
Learn about:
• The Future of Testing: How AI is shifting testing towards verification, analysis, and higher-level skills, while reducing repetitive tasks.
• Test Automation: How AI-powered test case generation, optimization, and self-healing tests are making testing more efficient and effective.
• Visual Testing: Explore the emerging capabilities of AI in visual testing and how it's set to revolutionize UI verification.
• Inflectra's AI Solutions: See demonstrations of Inflectra's cutting-edge AI tools like the ChatGPT plugin and Azure Open AI platform, designed to streamline your testing process.
Whether you're a developer, tester, or QA professional, this webinar will give you valuable insights into how AI is shaping the future of software delivery.
Generating a custom Ruby SDK for your web service or Rails API using Smithy, by g2nightmarescribd
Have you ever wanted a Ruby client API to communicate with your web service? Smithy is a protocol-agnostic language for defining services and SDKs. Smithy Ruby is an implementation of Smithy that generates a Ruby SDK using a Smithy model. In this talk, we will explore Smithy and Smithy Ruby to learn how to generate custom feature-rich SDKs that can communicate with any web service, such as a Rails JSON API.
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Amazon Dynamo
1. Dynamo: Amazon’s Highly Available Key-Value Store
Farley Lai
University of Iowa
poyuan-lai@uiowa.edu
February 21, 2014
Farley Lai (UIOWA)
Amazon Dynamo (Big Data)
February 21, 2014
1 / 14
2. Motivation
MapReduce processes big data in a parallel and distributed fashion.
Dynamo forms the foundation of big data, namely, the storage layer.
Shopping Cart
Clients tend to insert and update items frequently but review the cart to check out only at the end. Is it fun for the system to always ask you to retry in a few minutes whenever an item is inserted or updated in the shopping cart?
3. SOA of Amazon’s Platform
4. Roles
Service Provider: Amazon
Service: Dynamo, the storage service
Customer: application/service vendors
Client: applications/services
User: human and/or bots
Service Level Agreements (SLA)
SLAs are contracts signed by service providers and customers, specifying the quality of service guaranteed for a given client access distribution.
Example: a service guaranteeing that it will provide a response within 300ms for 99.9% of its requests for a peak client load of 500 requests per second.
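Because the SLA targets the 99.9th percentile rather than the average, checking it means looking at the tail of the latency distribution. A rough sketch, with made-up latencies and the 300ms/99.9% thresholds from the example above:

```python
import math
import random

def percentile(samples, p):
    """p-th percentile of samples using the nearest-rank method."""
    ordered = sorted(samples)
    rank = max(1, math.ceil(p / 100 * len(ordered)))
    return ordered[rank - 1]

random.seed(7)
# Simulated request latencies in milliseconds: mostly ~120ms, a few slow outliers
latencies = [random.gauss(120, 30) for _ in range(10_000)] + [500.0] * 5
p999 = percentile(latencies, 99.9)
print(f"99.9th percentile: {p999:.1f} ms; within 300ms SLA: {p999 <= 300}")
```

A mean-based check would barely notice the five 500ms outliers; a percentile-based SLA forces the service to keep even its slowest requests under control.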
5. What is Dynamo?
A distributed key-value storage service built on a ring topology with
high availability for writes
eventual consistency
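Dynamo's actual partitioning scheme includes several refinements, but the basic idea of a consistent-hashing ring with virtual nodes can be sketched as follows (node names and parameters are illustrative, not Dynamo's real code):

```python
import hashlib
from bisect import bisect_right, insort

class HashRing:
    """Toy consistent-hashing ring with virtual nodes."""
    def __init__(self, nodes, vnodes=8):
        self.ring = []  # sorted list of (position, node); walked clockwise
        for node in nodes:
            for v in range(vnodes):
                insort(self.ring, (self._hash(f"{node}#{v}"), node))

    @staticmethod
    def _hash(key):
        return int(hashlib.md5(key.encode()).hexdigest(), 16)

    def preference_list(self, key, n=3):
        """First n distinct nodes encountered clockwise from the key's position."""
        i = bisect_right(self.ring, (self._hash(key),))
        nodes = []
        while len(nodes) < n:
            node = self.ring[i % len(self.ring)][1]
            if node not in nodes:
                nodes.append(node)
            i += 1
        return nodes

ring = HashRing(["A", "B", "C", "D"])
prefs = ring.preference_list("cart:12345", n=3)
print(prefs)  # three distinct replica nodes; order depends on the hash positions
```

Adding or removing a node only remaps the keys adjacent to its virtual-node positions, which is what gives the ring its incremental scalability.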
6. Requirements and Assumptions
Requirements
Simple read/write to data items identified by unique keys
ACID: atomicity, consistency, isolation, and durability
SLA: latency constraints on the 99.9th percentile of the
distribution
Assumptions
Trusted environment and machines without security concerns
7. Problems, Techniques and Advantages
Problem | Technique | Advantage
Partitioning | Consistent hashing | Incremental scalability
High write availability | Vector clocks with conflict resolution | Version size is decoupled from update rates
Temporary failures | Sloppy quorum, hinted handoff | High availability and durability guarantee despite some unavailable replicas
Permanent failures | Merkle trees | Fast replica synchronization
Membership | Gossip protocol | Decentralized registry for storing membership and liveness info
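The vector-clock technique for conflict resolution can be illustrated with a small sketch (node names hypothetical): two versions of a key conflict when neither clock causally descends from the other, and reconciliation merges the clocks element-wise.

```python
def vc_descends(a, b):
    """True if clock a is causally at or after clock b."""
    return all(a.get(node, 0) >= count for node, count in b.items())

def vc_merge(a, b):
    """Element-wise maximum: the clock of a reconciled version."""
    return {node: max(a.get(node, 0), b.get(node, 0)) for node in set(a) | set(b)}

# Two versions of the same key, written through different coordinator nodes
v1 = {"Sx": 2, "Sy": 1}   # handled by Sx twice, then by Sy
v2 = {"Sx": 2, "Sz": 1}   # handled by Sx twice, then by Sz

conflict = not vc_descends(v1, v2) and not vc_descends(v2, v1)
print(conflict)          # True: neither version subsumes the other
print(vc_merge(v1, v2))  # {'Sx': 2, 'Sy': 1, 'Sz': 1} (key order may vary)
```

When such a conflict is detected on read, Dynamo returns both versions and lets the application reconcile them (e.g., a shopping cart merges the item sets), tagging the result with the merged clock.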