An economic approach for scalable and highly-available distributed applications

An economic approach for
scalable and highly-available
distributed applications

Nicolas Bonvin, Thanasis Papaioannou, Karl Aberer

CLOUD 2010, July 5-10 2010, Miami, Florida, USA

nicolas.bonvin@epfl.ch
LSIR - EPFL

Introduction

● A distributed application = many (remote) components
● A component is
– A piece of software
– Loosely coupled
– Self-Contained
E
● e.g. a SOA-based application

B C D

A

2 EPFL – LSIR - Nicolas Bonvin

Placement: first problem

● Where should the components be placed to maximize the
application performance ?
E

B C D

A

?
1 2 3 4



application performance ?
E
– Random placement ?
B C D

A

1 2 3 4
A D B
C E

Bad resource utilization !



application performance ? E
– « Clever » random placement ?
B C D

A

1 2 3 4
A E D B
C

D and E should probably be hosted on the same server !

Not always optimal !


Even more components !

● High Availability: software, hardware, network failures
● Scalability: growing load, peaks, scaling down, ...

Replication !

E E

B B C C D D

A A A


Placement: second problem

application availability ?
E E

B B C C D D

A A A

?

Rack 1 Rack 2 Rack 3 Rack 4

Datacenter 1 Datacenter 2


Multi Objective Optimization Problem

● Maximize the geographical distance of replicas
– Greater availability
● Minimize the geographical distance between related
components
– Lower latency
● Balance the load (disk I/O, network I/O, CPU) between the
servers
– Better application performance

NP-Complete


Scarce:
a framework to build scalable cloud applications

Architecture overview

● An agent on each server
– starts/stops/monitors the components
– Takes decisions on behalf of the components
● An agent communicates with other agents
– Routing table
– Status of the server (resources usage)

Server Agent
Agent
A

B Agent GOSSIPING
+ BROADCAST
Agent
Agent
E

Agent


An economic approach

● Time is split into epochs (no synchronization between servers)
● Servers charge a virtual rent for hosting a component according to
– Current resource usage (I/O, CPU, ...) of the server
– Technical factors (HW, connectivity, ...)
– Non-technical factors (country stability, ....)

● Components
– Pay virtual rent at each epoch
– Gain virtual money by processing requests
– Take decisions based on balance ( = gain – rent )
● Replicate, migrate, suicide, stay

● Virtual rents are updated by gossiping (no centralized board)


Economic model

● Replication of a component
– If minimum availability is not reached
– If b' > 0 for last n epochs
● Migration/Suicide of a component
– If balance c < 0 for last n epochs


Availability (i)

● Increase availability by increasing geographical diversity
● Handled by replication
– Granularity: rack, room, datacenter, country, ...
– Label: NA-US-NY1-C01-R12-S02
● Each component must satisfy a minimum availability

● Si is the set of server hosting a replica of component i


Availability (ii)

● Similarity: computes the distance between 2 servers

● Diversity:

● Choosing a candidate server j

● gj : weight related to the proximity of the server location to the
geographical distribution of the client requests to the component


Summary

● High Availability: software, hardware, network failures
– Geographical aware placement (netbenef maximization)
– Minimum availability level per component

● Scalability: growing load, peaks, scaling down, ...
– Quick replication of busy components

● Load Balancing: load has to be shared by all available servers
– Replication of busy components
– Migration of less busy components
– Reach equilibrium when load is stable

● No synchronization, fully decentralized


Evaluation: Setup

● E-Ticketing application (print@home)

● 1 or 3 applications deployed in the cloud
● 7 or 15 servers (Intel Core i7 920, 2.67 GHz, 8GB, Linux 2.6.32-
trunk-amd64)
● Servers dedicated to the components: 4 or 10


Static vs Dynamic placement (i)


Static vs Dynamic placement (ii)


Adaptability to new resources

● 1500 concurrent users


Fairness between applications


Conclusion

● Framework for building cloud applications
● Maximize cloud resource utilization
● Maximize availability
● React to sudden load changes
● Elastic (add/remove resources)
● No synchronization
● Fully decentralized


An economic approach for scalable and highly-available distributed applications

More Related Content

Viewers also liked

Similar to An economic approach for scalable and highly-available distributed applications

Recently uploaded

An economic approach for scalable and highly-available distributed applications